Knowledge (XXG)

Information retrieval

Source đź“ť

606: 1001:, and eventually became associate director of the Center for Documentation and Communications Research. That same year, Kent and colleagues published a paper in American Documentation describing the precision and recall measures as well as detailing a proposed "framework" for evaluating an IR system which included statistical sampling methods for determining the number of relevant documents not retrieved. 621:
In order to effectively retrieve relevant documents by IR strategies, the documents are typically transformed into a suitable representation. Each retrieval strategy incorporates a specific model for its document representation purposes. The picture on the right illustrates the relationship of some
340:
there is ... a machine called the Univac ... whereby letters and figures are coded as a pattern of magnetic spots on a long steel tape. By this means the text of a document, preceded by its subject code symbol, can be recorded ... the machine ... automatically selects and types out those references
296:
An information retrieval process begins when a user enters a query into the system. Queries are formal statements of information needs, for example search strings in web search engines. In information retrieval, a query does not uniquely identify a single object in the collection. Instead, several
1061:
published early findings of the Cranfield studies, developing a model for IR system evaluation. See: Cyril W. Cleverdon, "Report on the Testing and Analysis of an Investigation into the Comparative Efficiency of Indexing Systems". Cranfield Collection of Aeronautics, Cranfield, England,
392:(TREC) as part of the TIPSTER text program. The aim of this was to look into the information retrieval community by supplying the infrastructure that was needed for evaluation of text retrieval methodologies on a very large text collection. This catalyzed research on methods that 815:
The evaluation of an information retrieval system' is the process of assessing how well a system meets the information needs of its users. In general, measurement considers a collection of documents to be searched and a search query. Traditional evaluation metrics, designed for
330:
Most IR systems compute a numeric score on how well each object in the database matches the query, and rank the objects according to this value. The top ranking objects are then shown to the user. The process may then be iterated if the user wishes to refine the query.
799:
allow a representation of interdependencies between terms, but they do not allege how the interdependency between two terms is defined. They rely on an external source for the degree of interdependency between two terms. (For example, a human or sophisticated
1615: â€“ ESSIR promotes research, innovation, and development of information access systems by educating junior and senior researchers, students, professionals, and developers on the latest developments in the field, both methodological and technological. 308:. User queries are matched against the database information. However, as opposed to classical SQL queries of a database, in information retrieval the results returned may or may not match the query, so results are typically ranked. This 1377:, Robert N. Oddy, and Helen M. Brooks proposed the ASK (Anomalous State of Knowledge) viewpoint for information retrieval. This was an important concept, though their automated analysis tool proved ultimately disappointing. 2205:
Doszkocs, T.E. & Rapp, B.A. (1979). "Searching MEDLINE in English: a Prototype User Interface with Natural Language Query, Ranked Output, and relevance feedback," In: Proceedings of the ASIS Annual Meeting, 16:
784:
allow a representation of interdependencies between terms. However the degree of the interdependency between two terms is defined by the model itself. It is usually directly or indirectly derived (e.g. by
365:
in the 1920s and 1930s – that searched for documents stored on film. The first description of a computer searching for information was described by Holmstrom in 1948, detailing an early mention of the
1124:
sponsored a symposium titled "Statistical Association Methods for Mechanized Documentation". Several highly significant papers, including G. Salton's first published reference (we believe) to the
610: 698:
treat the process of document retrieval as a probabilistic inference. Similarities are computed as probabilities that a document is relevant for a given query. Probabilistic theorems like
903:
submits patents for his "Statistical Machine", a document search engine that used photoelectric cells and pattern recognition to search the metadata on rolls of microfilmed documents.
1942: 1919:. Proceedings of the 5th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom'09). Washington, DC: IEEE. Archived from 1520: 757:
methods. Feature functions are arbitrary functions of document and query, and as such can easily incorporate almost any other retrieval model as just another feature.
1656: â€“ field of research that involves studying situations, motivations, and methods for people seeking and sharing information in participatory online social sites 1729: 1612: 1331: 1080:
Weinberg report "Science, Government and Information" gave a full articulation of the idea of a "crisis of scientific information". The report was named after Dr.
381:
such as the Cranfield collection (several thousand documents). Large-scale retrieval systems, such as the Lockheed Dialog system, came into use early in the 1970s.
661:
represent documents and queries usually as vectors, matrices, or tuples. The similarity of the query vector and document vector is represented as a scalar value.
385: 956:: Growing concern in the US for a "science gap" with the USSR motivated, encouraged funding and provided a backdrop for mechanized literature searching systems ( 1495:
implementation of many features formerly found only in experimental IR systems. Search engines become the most common and maybe best instantiation of IR models.
323:
or videos. Often the documents themselves are not kept or stored directly in the IR system, but are instead represented in the system by document surrogates or
225: 1514: 2118:
Perry, James W.; Kent, Allen; Berry, Madeline M. (1955). "Machine literature searching X. Machine language; factors underlying its design and development".
1832: 810: 188: 1216: 1621: 1007:: International Conference on Scientific Information Washington DC included consideration of IR systems as a solution to problems identified. See: 1526: 1088: 1049:
and John Lary Kuhns published "On relevance, probabilistic indexing, and information retrieval" in the Journal of the ACM 7(3):216–244, July 1960.
622:
common models. In the picture, the models are categorized according to two dimensions: the mathematical basis and the properties of the model.
2335: 2233: 2102: 1952: 284:. An IR system is a software system that provides access to books, journals and other documents; it also stores and manages those documents. 2146: 2413: 1591: 714: 579: 1958: 1328: 1354:
for MEDLINE at the National Library of Medicine. The CITE system supported free form query input, ranked output and relevance feedback.
1125: 828:
notion of relevance: every document is known to be either relevant or non-relevant to a particular query. In practice, queries may be
316: 1702: 1648: 1573: 775: 557: 298: 218: 261:. The information need can be specified in the form of a search query. In the case of document retrieval, queries can be based on 2281: 1679: 730: 523: 916:: The US military confronted problems of indexing and retrieval of wartime scientific research documents captured from Germans. 408:
Areas where information retrieval techniques are employed include (the entries are in alphabetical order within each category):
1684: 1351: 998: 369:
computer. Automated information retrieval systems were introduced in the 1950s: one even featured in the 1957 romantic comedy,
309: 178: 2408: 1741: 1664: 674: 669: 100: 90: 40: 1204:
John W. Sammon, Jr.'s RADC Tech report "Some Mathematics of Information Storage and Retrieval..." outlined the vector model.
948:(research engineer at IBM since 1941) began work on a mechanized punch card-based system for searching chemical compounds. 710: 1914: 1186:
completed evaluation studies of the MEDLARS system and published the first edition of his text on information retrieval.
1121: 567: 211: 1145:
Medical Literature Analysis and Retrieval System, the first major machine-readable database and batch-retrieval system.
1717: 735: 638:
of words or phrases. Similarities are usually derived from set-theoretic operations on those sets. Common models are:
537: 450: 198: 1560: 1653: 1272: 1829: 1454: 1314: 1114: 705: 389: 95: 688: 684: 574: 55: 2388: 2270: 2179:
N. Jardine, C.J. van Rijsbergen (December 1971). "The use of hierarchic clustering in information retrieval".
614: 1670: 584: 562: 2054: 1276: 1213: 987:: Philip Bagley conducted the earliest experiment in computerized document retrieval in a master thesis at 770:
treat different terms/words as independent. This fact is usually represented in vector space models by the
269:
of searching for information in a document, searching for documents themselves, and also searching for the
2362: 1877: 1723: 1631: 817: 679: 646: 641: 163: 138: 85: 65: 2374: 1640: â€“ Process or activity of attempting to obtain information in both human and technological contexts 17: 1585: 1550: 1285:: Three highly influential publications by Salton fully articulated his vector processing framework and 851: 422: 377:
at Cornell. By the 1970s several different retrieval techniques had been shown to perform well on small
351:
The idea of using computers to search for relevant pieces of information was popularized in the article
173: 143: 1106: 1413: 1367:: First international ACM SIGIR conference, joint with British Computer Society IR group in Cambridge. 1223:" (IEEE Transactions on Computers) was the first proposal for visualization interface to an IR system. 2354: 2055:
The Theory of Digital Handling of Non-numerical Information and its Implications to Machine Economics
2040:
The Royal Society Scientific Information Conference, 21 June-2 July 1948: Report and Papers Submitted
1183: 821: 532: 281: 168: 60: 1882: 1696: 1643: 1637: 1555: 1478: 1286: 786: 720: 493: 427: 250: 80: 75: 33: 361:
in 1945. It would appear that Bush was inspired by patents for a 'statistical machine' – filed by
312:
of results is a key difference of information retrieval searching compared to database searching.
2383: 1895: 1811: 1708: 1464: 1417: 1374: 1157: 1058: 883: 664: 594: 473: 254: 50: 2256: 1673: â€“ Set of techniques for creating images, diagrams, or animations to communicate a message 605: 2229: 2225: 2218: 2098: 2035: 1948: 1492: 1046: 753:) and seek the best way to combine these features into a single relevance score, typically by 699: 635: 503: 483: 417: 397: 285: 258: 2188: 2161: 2127: 2073: 2014: 1887: 1839:. Journal of the American Society for Information Sciences and Technology. 61(8), 1517-1534. 1801: 1793: 1735: 1268: 934: 900: 868:
invents an electro-mechanical data tabulator using punch cards as a machine readable medium.
865: 754: 488: 362: 262: 148: 1397:
publish: An Evaluation of Retrieval Effectiveness for a Full-Text Document-Retrieval System
2378: 2366: 2274: 2260: 1836: 1579: 1433: 1220: 1018: 968: 945: 928: 651: 542: 461: 440: 353: 1849:
Goodrum, Abby A. (2000). "Image Information Retrieval: An Overview of Current Research".
2254:
Modern Information Retrieval: The Concepts and Technology behind Search (second edition)
2289: 1830:
The Seventeen Theoretical Constructs of Information Searching and Information Retrieval
1606: 1429: 1246: 1081: 964: 887: 725: 589: 478: 468: 128: 2402: 2192: 1815: 1753: 1690: 1174:
was involved in studies at University of Chicago on Requirements for Future Catalogs.
1036: 978: 923: 875: 855: 790: 771: 498: 374: 358: 304:
An object is an entity that is represented by information in a content collection or
153: 123: 1899: 825: 553:
Methods/Techniques in which information retrieval techniques are employed include:
445: 378: 373:. In the 1960s, the first large information retrieval research group was formed by 2318: 1744: â€“ Area of research related to information retrieval centered on timeliness 1705: â€“ Measure of a document's applicability to a given subject or search query 1600: 1171: 393: 280:
Automated information retrieval systems are used to reduce what has been called
158: 2350: 2019: 2002: 1797: 1784:
Luk, R. W. P. (2022). "Why is information retrieval a scientific discipline?".
1747: 2165: 1920: 1759: 1394: 957: 133: 1984:
Bulletin of the IEEE Computer Society Technical Committee on Data Engineering
1091:
published text on information retrieval. Becker, Joseph; Hayes, Robert Mayo.
858:, the first machine to use punched cards to control a sequence of operations. 2078: 2061: 1250: 1243:
First online systems—NLM's AIM-TWX, MEDLINE; Lockheed's Dialog; SDC's ORBIT.
829: 320: 246: 2336:
BCS IRSG: British Computer Society – Information Retrieval Specialist Group
2131: 1009:
Proceedings of the International Conference on Scientific Information, 1958
2394:
Information retrieval performance evaluation tool @ Athena Research Centre
1891: 400:
has boosted the need for very large scale retrieval systems even further.
1762: â€“ Process of extracting and discovering patterns in large data sets 1603: â€“ Process of extracting and discovering patterns in large data sets 879: 370: 324: 305: 274: 270: 105: 341:
which have been coded in any desired way at a rate of 120 words a minute
1806: 1142: 266: 193: 1868:
Foote, Jonathan (1999). "An overview of audio information retrieval".
1409:: Key papers on and experimental systems for visualization interfaces. 1279:
in information retrieval", which articulated the "cluster hypothesis".
1720: â€“ A classification model in machine learning based on centroids 366: 319:
the data objects may be, for example, text documents, images, audio,
183: 2305: 1582: â€“ Computer component that stores information for immediate use 2267: 1976: 1532: 1304: 2304:
Christopher D. Manning, Prabhakar Raghavan, and Hinrich SchĂĽtze.
2268:
Information Retrieval: Implementing and Evaluating Search Engines
1471:
with emphasis on visualization and multi-reference point systems.
1403:: Efforts to develop end-user versions of commercial IR systems. 1021:
published "Auto-encoding of documents for information retrieval".
2345: 1485:
by Addison Wesley, the first book that attempts to cover all IR.
1437: 1916:
Information Retrieval On Mind Maps - What Could It Be Good For?
1509:
Conference on Research and Development in Information Retrieval
297:
objects may match the query, perhaps with different degrees of
27:
Obtaining information resources relevant to an information need
2315:
Behind the Search Box: Google and the Global Internet Industry
2266:
Stefan BĂĽttcher, Charles L. A. Clarke, and Gordon V. Cormack.
1538: 988: 265:
or other content-based indexing. Information retrieval is the
2147:"An Historical Note on the Origins of Probabilistic Indexing" 1644:
Information seeking § Compared to information retrieval
1093:
Information storage and retrieval: tools, elements, theories
774:
assumption of term vectors or in probabilistic models by an
2393: 2359: 1732: â€“ Subgroup of the Association for Computing Machinery 1539:
International Conference on Theory of Information Retrieval
2384:
TREC report on information retrieval evaluation techniques
2371: 1750: â€“ Estimate of the importance of a word in a document 2340: 2253: 2331:
ACM SIGIR: Information Retrieval Special Interest Group
1764:
Pages displaying short descriptions of redirect targets
1675:
Pages displaying short descriptions of redirect targets
1594: â€“ retrieval of Information in different languages 1344:(Butterworths). Heavy emphasis on probabilistic models. 1944:
Information Retrieval Data Structures & Algorithms
1301:
A Theory of Term Importance in Automatic Text Analysis
2330: 1687: â€“ Tools and systems for managing one's own data 384:
In 1992, the US Department of Defense along with the
1713:
Pages displaying wikidata descriptions as a fallback
1658:
Pages displaying wikidata descriptions as a fallback
1617:
Pages displaying wikidata descriptions as a fallback
1596:
Pages displaying wikidata descriptions as a fallback
1576: â€“ Information retrieval strategies in datasets 1508: 1387:(McGraw-Hill), with heavy emphasis on vector models. 1039:
began work on IR at Harvard, later moved to Cornell.
1011:(National Academy of Sciences, Washington, DC, 1959) 1913:Beel, Jöran; Gipp, Bela; Stiller, Jan-Olaf (2009). 2217: 1521:Conference on Information and Knowledge Management 2346:Forum for Information Retrieval Evaluation (FIRE) 1941:Frakes, William B.; Baeza-Yates, Ricardo (1992). 1667: â€“ Organization in Vienna, Austria 2006–2012 1634: â€“ Machine reading of unstructured documents 977:: The term "information retrieval" was coined by 1977:"Modern Information Retrieval: A Brief Overview" 1756: â€“ Content-based retrieval of XML documents 1297:(Society for Industrial and Applied Mathematics) 1199:Automatic Information Organization and Retrieval 832:and there may be different shades of relevance. 2317:(U of Illinois Press, 2023) ISBN 10:0252087127 2003:"The History of Information Retrieval Research" 1730:Special Interest Group on Information Retrieval 1613:European Summer School in Information Retrieval 1214:A nonlinear mapping for data structure analysis 797:Models with transcendent term interdependencies 338: 1420:, Matthew Chalmers, Anselm Spoerri and others. 386:National Institute of Standards and Technology 2062:"Automatic Retrieval of Recorded Information" 1970: 1968: 1738: â€“ Classifying a document by index terms 793:of those terms in the whole set of documents. 609:Categorization of IR-models (translated from 528:Information retrieval for chemical structures 219: 8: 2277:. MIT Press, Cambridge, Massachusetts, 2010. 2252:Ricardo Baeza-Yates, Berthier Ribeiro-Neto. 2058:(Zator Technical Bulletin No. 48), cited in 2001:Mark Sanderson & W. Bruce Croft (2012). 1515:European Conference on Information Retrieval 1385:Introduction to Modern Information Retrieval 1383:: Salton (and Michael J. McGill) published 1311:A Vector Space Model for Automatic Indexing 811:Evaluation measures (information retrieval) 782:Models with immanent term interdependencies 1609: â€“ Way to obtain data from a database 253:is the task of identifying and retrieving 226: 212: 29: 2286:Library & Information Science Network 2077: 2018: 1881: 1805: 762:Second dimension: properties of the model 675:(Enhanced) Topic-based Vector Space Model 18:Information storage and retrieval systems 1533:Conference on Web Search and Data Mining 604: 2036:"'Section III. Opening Plenary Session" 1776: 1527:International World Wide Web Conference 1141:National Library of Medicine developed 745:view documents as vectors of values of 114: 39: 32: 2093:Doyle, Lauren; Becker, Joseph (1975). 1588: â€“ Method of organizing knowledge 1350:: Tamas Doszkocs implemented the CITE 288:are the most visible IR applications. 2306:Introduction to Information Retrieval 2154:Information Processing and Management 1699: â€“ Search engine processing step 768:Models without term-interdependencies 396:to huge corpora. The introduction of 7: 2095:Information Retrieval and Processing 1622:Human–computer information retrieval 1592:Cross-language information retrieval 1111:Synonymy and Semantic Classification 805:Performance and correctness measures 2308:. Cambridge University Press, 2008. 626:First dimension: mathematical basis 2389:How eBay measures search relevance 1828:Jansen, B. J. and Rieh, S. (2010) 1726: â€“ Method for data management 1109:finished her thesis at Cambridge, 1067:Information Analysis and Retrieval 257:resources that are relevant to an 25: 2220:Information Storage and Retrieval 2181:Information Storage and Retrieval 1703:Relevance (information retrieval) 1649:Collaborative information seeking 1574:Adversarial information retrieval 1469:Information Storage and Retrieval 1340:: C. J. van Rijsbergen published 558:Adversarial information retrieval 2341:Text Retrieval Conference (TREC) 1680:Multimedia information retrieval 731:Divergence-from-randomness model 702:are often used in these models. 524:Geographic information retrieval 2288:. 24 April 2015. Archived from 1693: â€“ Type of search strategy 1685:Personal information management 1352:natural language user interface 999:Case Western Reserve University 179:Library and information science 2372:Information Retrieval Facility 2282:"Information Retrieval System" 1742:Temporal information retrieval 1665:Information Retrieval Facility 778:assumption for term variables. 743:Feature-based retrieval models 670:Generalized vector space model 634:models represent documents as 101:Science and technology studies 1: 2097:. Melville. pp. 410 pp. 711:Probabilistic relevance model 520:Genomic information retrieval 273:that describes data, and for 2216:Korfhage, Robert R. (1997). 2193:10.1016/0020-0271(71)90051-9 1483:Modern Information Retrieval 1481:and Berthier Ribeiro-Neto's 1122:National Bureau of Standards 820:or top-k retrieval, include 568:Multi-document summarization 512:Domain-specific applications 277:of texts, images or sounds. 115:Related fields and subfields 2414:Natural language processing 2263:. Addison-Wesley, UK, 2011. 1255:Computer Lib/Dream Machines 963:) and the invention of the 736:Latent Dirichlet allocation 538:Legal information retrieval 199:Quantum information science 2430: 2360:Information Retrieval Wiki 2060:Fairthorne, R. A. (1958). 2020:10.1109/jproc.2012.2189916 1798:10.1007/s10699-020-09685-x 1654:Social information seeking 1273:Cornelis J. van Rijsbergen 808: 2166:10.1016/j.ipm.2007.02.012 2145:Maron, Melvin E. (2008). 1115:computational linguistics 1095:. New York, Wiley (1963). 706:Binary Independence Model 531:Information retrieval in 390:Text Retrieval Conference 1711: â€“ type of feedback 1561:Karen Spärck Jones Award 1197:Gerard Salton published 1113:, and continued work on 824:. All measures assume a 689:latent semantic analysis 685:Latent semantic indexing 575:Compound term processing 388:(NIST), cosponsored the 2313:Yeo, ShinJoung. (2023) 2007:Proceedings of the IEEE 1671:Knowledge visualization 1162:Libraries of the Future 585:Document classification 580:Cross-lingual retrieval 563:Automatic summarization 549:Other retrieval methods 2132:10.1002/asi.5090060411 2120:American Documentation 1975:Singhal, Amit (2001). 1947:. Prentice-Hall, Inc. 1786:Foundations of Science 1724:Search engine indexing 1718:Rocchio classification 1632:Information extraction 1275:published "The use of 1148:Project Intrex at MIT. 713:on which is based the 680:Extended Boolean model 647:Extended Boolean model 642:Standard Boolean model 618: 349: 164:Information technology 86:Knowledge organization 2409:Information retrieval 2351:Information Retrieval 2079:10.1093/comjnl/1.1.36 2034:JE Holmstrom (1948). 1892:10.1007/s005300050106 1586:Controlled vocabulary 1551:Tony Kent Strix award 1342:Information Retrieval 1277:hierarchic clustering 1249:promoting concept of 852:Joseph Marie Jacquard 787:dimensional reduction 608: 517:Expert search finding 423:Information filtering 346:J. E. Holmstrom, 1948 239:Information retrieval 174:Intellectual property 144:Computer data storage 2355:C. J. van Rijsbergen 2066:The Computer Journal 1295:A Theory of Indexing 1184:F. Wilfrid Lancaster 1117:as it applies to IR. 997:: Allen Kent joined 886:used to process the 822:precision and recall 696:Probabilistic models 533:software engineering 412:General applications 282:information overload 169:Intellectual freedom 2052:Mooers, Calvin N.; 1697:Query understanding 1638:Information seeking 1556:Gerard Salton Award 1545:Awards in the field 1479:Ricardo Baeza-Yates 1287:term discrimination 721:Uncertain inference 428:Recommender systems 251:information science 34:Information science 2377:2008-05-22 at the 2365:2015-11-24 at the 2273:2020-10-05 at the 2259:2017-09-18 at the 2224:. Wiley. pp.  1870:Multimedia Systems 1835:2016-03-04 at the 1709:Relevance feedback 1493:Web search engines 1418:Robert R. Korfhage 1393:: David Blair and 1375:Nicholas J. Belkin 1219:2017-08-08 at the 1158:J. C. R. Licklider 1107:Karen Spärck Jones 1087:Joseph Becker and 1059:Cyril W. Cleverdon 717:relevance function 665:Vector space model 619: 613:, original source 595:Question answering 398:web search engines 286:Web search engines 255:information system 2353:(online book) by 2235:978-0-471-14338-3 2104:978-0-471-22151-7 1954:978-0-13-463837-9 1851:Informing Science 1502:Major conferences 1477:: Publication of 1463:: Publication of 1047:Melvin Earl Maron 818:Boolean retrieval 747:feature functions 484:Enterprise search 418:Digital libraries 315:Depending on the 236: 235: 16:(Redirected from 2421: 2301: 2299: 2297: 2240: 2239: 2223: 2213: 2207: 2203: 2197: 2196: 2176: 2170: 2169: 2151: 2142: 2136: 2135: 2115: 2109: 2108: 2090: 2084: 2083: 2081: 2050: 2044: 2043: 2031: 2025: 2024: 2022: 1998: 1992: 1991: 1981: 1972: 1963: 1962: 1957:. Archived from 1938: 1932: 1931: 1929: 1928: 1910: 1904: 1903: 1885: 1865: 1859: 1858: 1846: 1840: 1826: 1820: 1819: 1809: 1781: 1765: 1736:Subject indexing 1714: 1676: 1659: 1627: 1618: 1597: 1414:Donald B. Crouch 1269:Nicholas Jardine 935:Atlantic Monthly 901:Emanuel Goldberg 866:Herman Hollerith 755:learning to rank 659:Algebraic models 489:Federated search 458:Speech retrieval 363:Emanuel Goldberg 347: 259:information need 228: 221: 214: 149:Cultural studies 30: 21: 2429: 2428: 2424: 2423: 2422: 2420: 2419: 2418: 2399: 2398: 2379:Wayback Machine 2367:Wayback Machine 2327: 2295: 2293: 2280: 2275:Wayback Machine 2261:Wayback Machine 2249: 2247:Further reading 2244: 2243: 2236: 2215: 2214: 2210: 2204: 2200: 2178: 2177: 2173: 2149: 2144: 2143: 2139: 2117: 2116: 2112: 2105: 2092: 2091: 2087: 2059: 2051: 2047: 2033: 2032: 2028: 2000: 1999: 1995: 1979: 1974: 1973: 1966: 1955: 1940: 1939: 1935: 1926: 1924: 1912: 1911: 1907: 1867: 1866: 1862: 1848: 1847: 1843: 1837:Wayback Machine 1827: 1823: 1783: 1782: 1778: 1773: 1768: 1763: 1712: 1674: 1657: 1625: 1616: 1595: 1580:Computer memory 1569: 1547: 1504: 1434:Tim Berners-Lee 1221:Wayback Machine 1089:Robert M. Hayes 1065:Kent published 1019:Hans Peter Luhn 969:Eugene Garfield 946:Hans Peter Luhn 929:As We May Think 838: 813: 807: 764: 726:Language models 652:Fuzzy retrieval 628: 615:Dominik Kuropka 603: 551: 543:Vertical search 514: 462:Video retrieval 451:Music retrieval 441:Image retrieval 414: 406: 354:As We May Think 348: 345: 337: 294: 232: 203: 110: 41:General aspects 28: 23: 22: 15: 12: 11: 5: 2427: 2425: 2417: 2416: 2411: 2401: 2400: 2397: 2396: 2391: 2386: 2381: 2369: 2357: 2348: 2343: 2338: 2333: 2326: 2325:External links 2323: 2322: 2321: 2310: 2309: 2302: 2292:on 11 May 2020 2278: 2264: 2248: 2245: 2242: 2241: 2234: 2208: 2198: 2187:(5): 217–240. 2171: 2160:(2): 971–972. 2137: 2126:(4): 242–254. 2110: 2103: 2085: 2045: 2026: 1993: 1964: 1961:on 2013-09-28. 1953: 1933: 1905: 1883:10.1.1.39.6339 1860: 1841: 1821: 1792:(2): 427–453. 1775: 1774: 1772: 1769: 1767: 1766: 1757: 1751: 1745: 1739: 1733: 1727: 1721: 1715: 1706: 1700: 1694: 1688: 1682: 1677: 1668: 1662: 1661: 1660: 1651: 1646: 1635: 1629: 1619: 1610: 1607:Data retrieval 1604: 1598: 1589: 1583: 1577: 1570: 1568: 1565: 1564: 1563: 1558: 1553: 1546: 1543: 1542: 1541: 1535: 1529: 1523: 1517: 1511: 1503: 1500: 1499: 1498: 1497: 1496: 1486: 1472: 1458: 1443: 1442: 1441: 1430:World Wide Web 1423: 1422: 1421: 1410: 1398: 1388: 1378: 1368: 1357: 1356: 1355: 1345: 1335: 1322: 1321: 1320: 1319: 1318: 1308: 1298: 1280: 1262: 1261: 1260: 1259: 1258: 1247:Theodor Nelson 1244: 1228: 1227: 1226: 1225: 1224: 1206: 1205: 1202: 1194: 1193: 1177: 1176: 1175: 1165: 1151: 1150: 1149: 1146: 1131: 1130: 1129: 1118: 1098: 1097: 1096: 1085: 1082:Alvin Weinberg 1072: 1071: 1070: 1063: 1050: 1040: 1024: 1023: 1022: 1012: 1002: 992: 982: 972: 965:citation index 951: 950: 949: 939: 906: 905: 904: 893: 892: 891: 888:1890 US Census 869: 859: 837: 834: 809:Main article: 806: 803: 802: 801: 794: 779: 763: 760: 759: 758: 740: 739: 738: 733: 728: 723: 718: 708: 700:Bayes' theorem 693: 692: 691: 682: 677: 672: 667: 656: 655: 654: 649: 644: 627: 624: 602: 599: 598: 597: 592: 590:Spam filtering 587: 582: 577: 572: 571: 570: 560: 550: 547: 546: 545: 540: 535: 529: 526: 521: 518: 513: 510: 509: 508: 507: 506: 501: 496: 491: 486: 481: 479:Desktop search 476: 469:Search engines 466: 465: 464: 459: 456: 453: 448: 443: 438: 432: 431: 430: 420: 413: 410: 405: 402: 343: 336: 333: 293: 290: 234: 233: 231: 230: 223: 216: 208: 205: 204: 202: 201: 196: 191: 186: 181: 176: 171: 166: 161: 156: 151: 146: 141: 139:Classification 136: 131: 129:Categorization 126: 120: 117: 116: 112: 111: 109: 108: 103: 98: 93: 88: 83: 78: 73: 68: 63: 58: 53: 47: 44: 43: 37: 36: 26: 24: 14: 13: 10: 9: 6: 4: 3: 2: 2426: 2415: 2412: 2410: 2407: 2406: 2404: 2395: 2392: 2390: 2387: 2385: 2382: 2380: 2376: 2373: 2370: 2368: 2364: 2361: 2358: 2356: 2352: 2349: 2347: 2344: 2342: 2339: 2337: 2334: 2332: 2329: 2328: 2324: 2320: 2316: 2312: 2311: 2307: 2303: 2291: 2287: 2283: 2279: 2276: 2272: 2269: 2265: 2262: 2258: 2255: 2251: 2250: 2246: 2237: 2231: 2227: 2222: 2221: 2212: 2209: 2202: 2199: 2194: 2190: 2186: 2182: 2175: 2172: 2167: 2163: 2159: 2155: 2148: 2141: 2138: 2133: 2129: 2125: 2121: 2114: 2111: 2106: 2100: 2096: 2089: 2086: 2080: 2075: 2071: 2067: 2063: 2057: 2056: 2049: 2046: 2041: 2037: 2030: 2027: 2021: 2016: 2013:: 1444–1451. 2012: 2008: 2004: 1997: 1994: 1989: 1985: 1978: 1971: 1969: 1965: 1960: 1956: 1950: 1946: 1945: 1937: 1934: 1923:on 2011-05-13 1922: 1918: 1917: 1909: 1906: 1901: 1897: 1893: 1889: 1884: 1879: 1875: 1871: 1864: 1861: 1856: 1852: 1845: 1842: 1838: 1834: 1831: 1825: 1822: 1817: 1813: 1808: 1803: 1799: 1795: 1791: 1787: 1780: 1777: 1770: 1761: 1758: 1755: 1754:XML retrieval 1752: 1749: 1746: 1743: 1740: 1737: 1734: 1731: 1728: 1725: 1722: 1719: 1716: 1710: 1707: 1704: 1701: 1698: 1695: 1692: 1691:Pearl growing 1689: 1686: 1683: 1681: 1678: 1672: 1669: 1666: 1663: 1655: 1652: 1650: 1647: 1645: 1642: 1641: 1639: 1636: 1633: 1630: 1623: 1620: 1614: 1611: 1608: 1605: 1602: 1599: 1593: 1590: 1587: 1584: 1581: 1578: 1575: 1572: 1571: 1566: 1562: 1559: 1557: 1554: 1552: 1549: 1548: 1544: 1540: 1536: 1534: 1530: 1528: 1524: 1522: 1518: 1516: 1512: 1510: 1506: 1505: 1501: 1494: 1490: 1487: 1484: 1480: 1476: 1473: 1470: 1466: 1462: 1459: 1456: 1452: 1449: 1448: 1447: 1444: 1439: 1435: 1432:proposals by 1431: 1427: 1424: 1419: 1415: 1411: 1408: 1405: 1404: 1402: 1399: 1396: 1392: 1389: 1386: 1382: 1379: 1376: 1372: 1369: 1366: 1363: 1362: 1361: 1358: 1353: 1349: 1346: 1343: 1339: 1336: 1333: 1330: 1326: 1323: 1316: 1312: 1309: 1306: 1302: 1299: 1296: 1293: 1292: 1291: 1290: 1288: 1284: 1281: 1278: 1274: 1270: 1266: 1263: 1256: 1252: 1248: 1245: 1242: 1241: 1240: 1239: 1237: 1234: 1233: 1232: 1229: 1222: 1218: 1215: 1211: 1208: 1207: 1203: 1200: 1196: 1195: 1191: 1188: 1187: 1185: 1181: 1178: 1173: 1169: 1166: 1163: 1159: 1155: 1152: 1147: 1144: 1140: 1139: 1138: 1137: 1135: 1132: 1127: 1123: 1119: 1116: 1112: 1108: 1105: 1104: 1102: 1099: 1094: 1090: 1086: 1083: 1079: 1078: 1076: 1073: 1068: 1064: 1060: 1057: 1056: 1054: 1051: 1048: 1044: 1041: 1038: 1037:Gerard Salton 1034: 1031: 1030: 1028: 1025: 1020: 1016: 1013: 1010: 1006: 1003: 1000: 996: 993: 990: 986: 983: 980: 979:Calvin Mooers 976: 973: 970: 966: 962: 959: 955: 952: 947: 943: 940: 937: 936: 931: 930: 925: 924:Vannevar Bush 921: 918: 917: 915: 912: 911: 910: 907: 902: 899: 898: 897: 894: 889: 885: 881: 877: 873: 870: 867: 863: 860: 857: 856:Jacquard loom 853: 849: 846: 845: 844: 840: 839: 835: 833: 831: 827: 823: 819: 812: 804: 798: 795: 792: 791:co-occurrence 788: 783: 780: 777: 773: 772:orthogonality 769: 766: 765: 761: 756: 752: 748: 744: 741: 737: 734: 732: 729: 727: 724: 722: 719: 716: 712: 709: 707: 704: 703: 701: 697: 694: 690: 686: 683: 681: 678: 676: 673: 671: 668: 666: 663: 662: 660: 657: 653: 650: 648: 645: 643: 640: 639: 637: 633: 632:Set-theoretic 630: 629: 625: 623: 616: 612: 607: 600: 596: 593: 591: 588: 586: 583: 581: 578: 576: 573: 569: 566: 565: 564: 561: 559: 556: 555: 554: 548: 544: 541: 539: 536: 534: 530: 527: 525: 522: 519: 516: 515: 511: 505: 502: 500: 499:Social search 497: 495: 494:Mobile search 492: 490: 487: 485: 482: 480: 477: 475: 472: 471: 470: 467: 463: 460: 457: 454: 452: 449: 447: 444: 442: 439: 436: 435: 434:Media search 433: 429: 426: 425: 424: 421: 419: 416: 415: 411: 409: 403: 401: 399: 395: 391: 387: 382: 380: 376: 375:Gerard Salton 372: 368: 364: 360: 359:Vannevar Bush 356: 355: 342: 334: 332: 328: 326: 322: 318: 313: 311: 307: 302: 300: 291: 289: 287: 283: 278: 276: 272: 268: 264: 260: 256: 252: 248: 244: 240: 229: 224: 222: 217: 215: 210: 209: 207: 206: 200: 197: 195: 192: 190: 187: 185: 182: 180: 177: 175: 172: 170: 167: 165: 162: 160: 157: 155: 154:Data modeling 152: 150: 147: 145: 142: 140: 137: 135: 132: 130: 127: 125: 124:Bibliometrics 122: 121: 119: 118: 113: 107: 104: 102: 99: 97: 94: 92: 89: 87: 84: 82: 79: 77: 74: 72: 69: 67: 64: 62: 59: 57: 54: 52: 49: 48: 46: 45: 42: 38: 35: 31: 19: 2314: 2294:. Retrieved 2290:the original 2285: 2219: 2211: 2201: 2184: 2180: 2174: 2157: 2153: 2140: 2123: 2119: 2113: 2094: 2088: 2069: 2065: 2053: 2048: 2039: 2029: 2010: 2006: 1996: 1987: 1983: 1959:the original 1943: 1936: 1925:. Retrieved 1921:the original 1915: 1908: 1873: 1869: 1863: 1854: 1850: 1844: 1824: 1789: 1785: 1779: 1488: 1482: 1474: 1468: 1460: 1450: 1445: 1425: 1406: 1400: 1390: 1384: 1380: 1370: 1364: 1359: 1347: 1341: 1337: 1327:: The First 1324: 1310: 1300: 1294: 1282: 1264: 1254: 1253:, published 1235: 1230: 1212:: Sammon's " 1209: 1198: 1189: 1179: 1167: 1161: 1153: 1133: 1110: 1100: 1092: 1074: 1066: 1052: 1042: 1032: 1026: 1014: 1008: 1004: 994: 984: 974: 960: 953: 941: 933: 932:appeared in 927: 919: 913: 908: 895: 871: 861: 854:invents the 847: 842: 826:ground truth 814: 800:algorithms.) 796: 781: 776:independency 767: 750: 746: 742: 715:okapi (BM25) 695: 658: 631: 620: 611:German entry 552: 446:3D retrieval 407: 404:Applications 383: 379:text corpora 352: 350: 339: 329: 314: 303: 295: 279: 242: 238: 237: 189:Preservation 70: 56:Architecture 1990:(4): 35–43. 1807:10397/94873 1601:Data mining 1457:conference. 1334:conference. 1236:early 1970s 1172:Don Swanson 1033:early 1960s 909:1940s–1950s 896:1920s-1930s 841:Before the 789:) from the 601:Model types 474:Site search 455:News search 437:Blog search 317:application 159:Informatics 2403:Categories 1927:2012-03-13 1771:References 1760:Web mining 1489:late 1990s 1395:Bill Maron 1180:late 1960s 1160:published 958:Allen Kent 914:late 1940s 884:tabulators 880:keypunches 874:Hollerith 504:Web search 134:Censorship 96:Philosophy 66:Management 2072:(1): 37. 1878:CiteSeerX 1816:220506422 1407:1985–1993 1401:mid-1980s 1251:hypertext 1134:mid-1960s 830:ill-posed 749:(or just 321:mind maps 299:relevance 275:databases 263:full-text 247:computing 71:Retrieval 2375:Archived 2363:Archived 2271:Archived 2257:Archived 2206:131-139. 1876:: 2–10. 1833:Archived 1567:See also 1465:Korfhage 1453:: First 1428:: First 1412:Work by 1217:Archived 836:Timeline 751:features 371:Desk Set 344:—  325:metadata 306:database 292:Overview 271:metadata 106:Taxonomy 91:Ontology 61:Behavior 1900:2000641 1537:ICTIR: 1507:SIGIR: 1289:model: 1143:MEDLARS 1128:system. 687:a.k.a. 335:History 310:ranking 267:science 194:Privacy 81:Society 76:Seeking 2319:online 2232:  2226:368 pp 2101:  1951:  1898:  1880:  1814:  1748:tf–idf 1531:WSDM: 1519:CIKM: 1513:ECIR: 1317:18:11) 1307:v. 26) 961:et al. 367:Univac 184:Memory 51:Access 2296:3 May 2150:(PDF) 2042:: 85. 1980:(PDF) 1896:S2CID 1812:S2CID 1525:WWW: 1446:1990s 1360:1980s 1332:SIGIR 1305:JASIS 1231:1970s 1126:SMART 1062:1962. 1027:1960s 954:1950s 890:data. 876:cards 862:1880s 843:1900s 394:scale 245:) in 2298:2020 2230:ISBN 2099:ISBN 1949:ISBN 1857:(2). 1626:HCIR 1475:1999 1461:1997 1455:TREC 1451:1992 1438:CERN 1426:1989 1391:1985 1381:1983 1371:1982 1365:1980 1348:1979 1338:1979 1325:1978 1315:CACM 1283:1975 1271:and 1265:1971 1210:1969 1190:1968 1168:1966 1154:1965 1120:The 1101:1964 1075:1963 1053:1962 1043:1960 1015:1959 1005:1958 995:1955 985:1951 975:1950 942:1947 920:1945 882:and 872:1890 848:1801 636:sets 249:and 2189:doi 2162:doi 2128:doi 2074:doi 2015:doi 2011:100 1888:doi 1802:hdl 1794:doi 1467:'s 1436:at 1329:ACM 989:MIT 967:by 926:'s 357:by 2405:: 2284:. 2228:. 2183:. 2158:44 2156:. 2152:. 2122:. 2068:. 2064:. 2038:. 2009:. 2005:. 1988:24 1986:. 1982:. 1967:^ 1894:. 1886:. 1872:. 1853:. 1810:. 1800:. 1790:27 1788:. 1491:: 1416:, 1373:: 1267:: 1238:: 1182:: 1170:: 1156:: 1136:: 1103:: 1077:: 1055:: 1045:: 1035:: 1029:: 1017:: 944:: 922:: 878:, 864:: 850:: 327:. 301:. 243:IR 2300:. 2238:. 2195:. 2191:: 2185:7 2168:. 2164:: 2134:. 2130:: 2124:6 2107:. 2082:. 2076:: 2070:1 2023:. 2017:: 1930:. 1902:. 1890:: 1874:7 1855:3 1818:. 1804:: 1796:: 1628:) 1624:( 1440:. 1313:( 1303:( 1257:. 1201:. 1192:: 1164:. 1084:. 1069:. 991:. 981:. 971:. 938:. 617:) 241:( 227:e 220:t 213:v 20:)

Index

Information storage and retrieval systems
Information science
General aspects
Access
Architecture
Behavior
Management
Retrieval
Seeking
Society
Knowledge organization
Ontology
Philosophy
Science and technology studies
Taxonomy
Bibliometrics
Categorization
Censorship
Classification
Computer data storage
Cultural studies
Data modeling
Informatics
Information technology
Intellectual freedom
Intellectual property
Library and information science
Memory
Preservation
Privacy

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

↑