606:
1001:, and eventually became associate director of the Center for Documentation and Communications Research. That same year, Kent and colleagues published a paper in American Documentation describing the precision and recall measures as well as detailing a proposed "framework" for evaluating an IR system which included statistical sampling methods for determining the number of relevant documents not retrieved.
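The precision and recall measures mentioned above can be illustrated with a minimal sketch. It assumes binary relevance judgments and represents the retrieved and relevant documents as sets of identifiers; the function name `precision_recall` and the document IDs are hypothetical and chosen only for illustration.

```python
def precision_recall(retrieved: set[str], relevant: set[str]) -> tuple[float, float]:
    """Return (precision, recall) for one query under binary relevance."""
    hits = len(retrieved & relevant)  # relevant documents that were retrieved
    # Precision: fraction of retrieved documents that are relevant.
    precision = hits / len(retrieved) if retrieved else 0.0
    # Recall: fraction of relevant documents that were retrieved.
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical judgments for a single query.
retrieved = {"d1", "d2", "d3", "d4"}
relevant = {"d1", "d3", "d5"}
precision, recall = precision_recall(retrieved, relevant)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In practice the full set of relevant documents is rarely known, which is why sampling methods for estimating the number of relevant documents not retrieved matter.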
621:
In order to effectively retrieve relevant documents by IR strategies, the documents are typically transformed into a suitable representation. Each retrieval strategy incorporates a specific model for its document representation purposes. The picture on the right illustrates the relationship of some
340:
there is ... a machine called the Univac ... whereby letters and figures are coded as a pattern of magnetic spots on a long steel tape. By this means the text of a document, preceded by its subject code symbol, can be recorded ... the machine ... automatically selects and types out those references
296:
An information retrieval process begins when a user enters a query into the system. Queries are formal statements of information needs, for example search strings in web search engines. In information retrieval, a query does not uniquely identify a single object in the collection. Instead, several
1061:
published early findings of the
Cranfield studies, developing a model for IR system evaluation. See: Cyril W. Cleverdon, "Report on the Testing and Analysis of an Investigation into the Comparative Efficiency of Indexing Systems". Cranfield Collection of Aeronautics, Cranfield, England,
392:(TREC) as part of the TIPSTER text program. The aim was to support the information retrieval community by supplying the infrastructure needed to evaluate text retrieval methodologies on a very large text collection. This catalyzed research on methods that
815:
The evaluation of an information retrieval system is the process of assessing how well a system meets the information needs of its users. In general, measurement considers a collection of documents to be searched and a search query. Traditional evaluation metrics, designed for
330:
Most IR systems compute a numeric score on how well each object in the database matches the query, and rank the objects according to this value. The top ranking objects are then shown to the user. The process may then be iterated if the user wishes to refine the query.
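The score-then-rank loop described above can be sketched as follows. This is a toy illustration, not how any particular system works: the scoring function simply counts occurrences of query terms, and the names `score`, `rank`, and the sample documents are assumptions made for the example.

```python
from collections import Counter


def score(query: str, document: str) -> int:
    """Toy score: total occurrences of the distinct query terms in the document."""
    doc_counts = Counter(document.lower().split())
    return sum(doc_counts[term] for term in set(query.lower().split()))


def rank(query: str, documents: dict[str, str], top_k: int = 3) -> list[tuple[str, int]]:
    """Score every document against the query and return the top_k by score."""
    scored = [(doc_id, score(query, text)) for doc_id, text in documents.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)  # highest score first
    return scored[:top_k]


# Hypothetical three-document collection.
docs = {
    "d1": "information retrieval ranks documents",
    "d2": "database queries return exact matches",
    "d3": "retrieval of information from documents and more documents",
}
results = rank("information documents", docs)
print(results)
```

A user refining the query would simply call `rank` again with the new query string, which mirrors the iteration step in the text.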
799:
allow a representation of interdependencies between terms, but they do not specify how the interdependency between two terms is defined. They rely on an external source for the degree of interdependency between two terms. (For example, a human or sophisticated
1615: – ESSIR promotes research, innovation, and development of information access systems by educating junior and senior researchers, students, professionals, and developers on the latest developments in the field, both methodological and technological.
308:. User queries are matched against the database information. However, as opposed to classical SQL queries of a database, in information retrieval the results returned may or may not match the query, so results are typically ranked. This
1377:, Robert N. Oddy, and Helen M. Brooks proposed the ASK (Anomalous State of Knowledge) viewpoint for information retrieval. This was an important concept, though their automated analysis tool proved ultimately disappointing.
2205:
Doszkocs, T.E. & Rapp, B.A. (1979). "Searching MEDLINE in
English: a Prototype User Interface with Natural Language Query, Ranked Output, and relevance feedback," In: Proceedings of the ASIS Annual Meeting, 16:
784:
allow a representation of interdependencies between terms. However, the degree of the interdependency between two terms is defined by the model itself. It is usually directly or indirectly derived (e.g. by
365:
in the 1920s and 1930s – that searched for documents stored on film. The first description of a computer searching for information was given by
Holmstrom in 1948, detailing an early mention of the
1124:
sponsored a symposium titled "Statistical
Association Methods for Mechanized Documentation". Several highly significant papers, including G. Salton's first published reference (we believe) to the
610:
698:
treat the process of document retrieval as a probabilistic inference. Similarities are computed as probabilities that a document is relevant for a given query. Probabilistic theorems like
903:
submits patents for his "Statistical
Machine", a document search engine that used photoelectric cells and pattern recognition to search the metadata on rolls of microfilmed documents.
1942:
1919:. Proceedings of the 5th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom'09). Washington, DC: IEEE. Archived from
1520:
757:
methods. Feature functions are arbitrary functions of document and query, and as such can easily incorporate almost any other retrieval model as just another feature.
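A minimal sketch of the feature-based view: each feature function maps a (query, document) pair to a number, and a weighted sum combines them into one relevance score. The feature functions and the hand-set weights below are hypothetical; a learning-to-rank method would fit the weights from training data rather than fix them by hand.

```python
from typing import Callable, Sequence

# A feature function maps (query, document) to a real value.
Feature = Callable[[str, str], float]


def term_overlap(query: str, doc: str) -> float:
    """Number of distinct query terms that also appear in the document."""
    return float(len(set(query.lower().split()) & set(doc.lower().split())))


def brevity(query: str, doc: str) -> float:
    """Illustrative feature that favours shorter documents."""
    return 1.0 / (1 + len(doc.split()))


def relevance(query: str, doc: str,
              features: Sequence[Feature], weights: Sequence[float]) -> float:
    """Combine feature values into one score with a linear model."""
    return sum(w * f(query, doc) for f, w in zip(features, weights))


# Hand-set weights, used here only as an assumption for the example.
features = [term_overlap, brevity]
weights = [1.0, 0.5]
print(relevance("vector model", "the vector space model", features, weights))
```

Because any scoring function with this signature qualifies as a feature, the output of another retrieval model can be added to `features` without changing `relevance`.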
1656: – field of research that involves studying situations, motivations, and methods for people seeking and sharing information in participatory online social sites
1729:
1612:
1331:
1080:
Weinberg report "Science, Government and
Information" gave a full articulation of the idea of a "crisis of scientific information". The report was named after Dr.
381:
such as the
Cranfield collection (several thousand documents). Large-scale retrieval systems, such as the Lockheed Dialog system, came into use early in the 1970s.
661:
represent documents and queries usually as vectors, matrices, or tuples. The similarity of the query vector and document vector is represented as a scalar value.
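As a sketch of how a query vector and a document vector reduce to a single scalar, the example below uses cosine similarity over raw term counts. The choice of cosine similarity and of unweighted counts is an assumption made for illustration; real systems commonly apply term weighting, and the helper names are hypothetical.

```python
import math
from collections import Counter


def to_vector(text: str) -> Counter:
    """Represent text as a sparse term-count vector."""
    return Counter(text.lower().split())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two sparse vectors; 0.0 if either is empty."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


query = to_vector("vector space model")
document = to_vector("the vector space model represents documents as vectors")
print(cosine_similarity(query, document))
```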
385:
956:: Growing concern in the US for a "science gap" with the USSR motivated, encouraged funding for, and provided a backdrop for mechanized literature searching systems (
1495:
implementation of many features formerly found only in experimental IR systems. Search engines become the most common and perhaps the best instantiation of IR models.
323:
or videos. Often the documents themselves are not kept or stored directly in the IR system, but are instead represented in the system by document surrogates or
225:
1514:
2118:
Perry, James W.; Kent, Allen; Berry, Madeline M. (1955). "Machine literature searching X. Machine language; factors underlying its design and development".
1832:
810:
188:
1216:
1621:
1007:: International Conference on Scientific Information Washington DC included consideration of IR systems as a solution to problems identified. See:
1526:
1088:
1049:
and John Lary Kuhns published "On relevance, probabilistic indexing, and information retrieval" in the
Journal of the ACM 7(3):216–244, July 1960.
622:
common models. In the picture, the models are categorized according to two dimensions: the mathematical basis and the properties of the model.
2335:
2233:
2102:
1952:
284:. An IR system is a software system that provides access to books, journals and other documents; it also stores and manages those documents.
2146:
2413:
1591:
714:
579:
1958:
1328:
1354:
for MEDLINE at the
National Library of Medicine. The CITE system supported free form query input, ranked output and relevance feedback.
1125:
828:
notion of relevance: every document is known to be either relevant or non-relevant to a particular query. In practice, queries may be
316:
1702:
1648:
1573:
775:
557:
298:
218:
261:. The information need can be specified in the form of a search query. In the case of document retrieval, queries can be based on
2281:
1679:
730:
523:
916:: The US military confronted problems of indexing and retrieval of wartime scientific research documents captured from Germans.
408:
Areas where information retrieval techniques are employed include (the entries are in alphabetical order within each category):
1684:
1351:
998:
369:
computer. Automated information retrieval systems were introduced in the 1950s: one even featured in the 1957 romantic comedy,
309:
178:
2408:
1741:
1664:
674:
669:
100:
90:
40:
1204:
John W. Sammon, Jr.'s RADC Tech report "Some
Mathematics of Information Storage and Retrieval..." outlined the vector model.
948:(research engineer at IBM since 1941) began work on a mechanized punch card-based system for searching chemical compounds.
710:
1914:
1186:
completed evaluation studies of the MEDLARS system and published the first edition of his text on information retrieval.
1121:
567:
211:
1145:
Medical Literature Analysis and Retrieval System, the first major machine-readable database and batch-retrieval system.
1717:
735:
638:
of words or phrases. Similarities are usually derived from set-theoretic operations on those sets. Common models are:
537:
450:
198:
1560:
1653:
1272:
1829:
1454:
1314:
1114:
705:
389:
95:
688:
684:
574:
55:
2388:
2270:
2179:
N. Jardine, C.J. van Rijsbergen (December 1971). "The use of hierarchic clustering in information retrieval".
614:
1670:
584:
562:
2054:
1276:
1213:
987:: Philip Bagley conducted the earliest experiment in computerized document retrieval in a master's thesis at
770:
treat different terms/words as independent. This fact is usually represented in vector space models by the
269:
of searching for information in a document, searching for documents themselves, and also searching for the
2362:
1877:
1723:
1631:
817:
679:
646:
641:
163:
138:
85:
65:
2374:
1640: – Process or activity of attempting to obtain information in both human and technological contexts
17:
1585:
1550:
1285:: Three highly influential publications by Salton fully articulated his vector processing framework and
851:
422:
377:
at Cornell. By the 1970s several different retrieval techniques had been shown to perform well on small
351:
The idea of using computers to search for relevant pieces of information was popularized in the article
173:
143:
1106:
1413:
1367:: First international ACM SIGIR conference, joint with British Computer Society IR group in Cambridge.
1223:" (IEEE Transactions on Computers) was the first proposal for visualization interface to an IR system.
2354:
2055:
The Theory of Digital Handling of Non-numerical Information and its Implications to Machine Economics
2040:
The Royal Society Scientific Information Conference, 21 June-2 July 1948: Report and Papers Submitted
1183:
821:
532:
281:
168:
60:
1882:
1696:
1643:
1637:
1555:
1478:
1286:
786:
720:
493:
427:
250:
80:
75:
33:
361:
in 1945. It would appear that Bush was inspired by patents for a 'statistical machine' – filed by
312:
of results is a key difference of information retrieval searching compared to database searching.
2383:
1895:
1811:
1708:
1464:
1417:
1374:
1157:
1058:
883:
664:
594:
473:
254:
50:
2256:
1673: – Set of techniques for creating images, diagrams, or animations to communicate a message
605:
2229:
2225:
2218:
2098:
2035:
1948:
1492:
1046:
753:) and seek the best way to combine these features into a single relevance score, typically by
699:
635:
503:
483:
417:
397:
285:
258:
2188:
2161:
2127:
2073:
2014:
1887:
1839:. Journal of the American Society for Information Sciences and Technology. 61(8), 1517-1534.
1801:
1793:
1735:
1268:
934:
900:
868:
invents an electro-mechanical data tabulator using punch cards as a machine readable medium.
865:
754:
488:
362:
262:
148:
1397:
publish: An Evaluation of Retrieval Effectiveness for a Full-Text Document-Retrieval System
2378:
2366:
2274:
2260:
1836:
1579:
1433:
1220:
1018:
968:
945:
928:
651:
542:
461:
440:
353:
1849:
Goodrum, Abby A. (2000). "Image Information Retrieval: An Overview of Current Research".
2254:
Modern Information Retrieval: The Concepts and Technology behind Search (second edition)
2289:
1830:
The Seventeen Theoretical Constructs of Information Searching and Information Retrieval
1606:
1429:
1246:
1081:
964:
887:
725:
589:
478:
468:
128:
2402:
2192:
1815:
1753:
1690:
1174:
was involved in studies at University of Chicago on Requirements for Future Catalogs.
1036:
978:
923:
875:
855:
790:
771:
498:
374:
358:
304:
An object is an entity that is represented by information in a content collection or
153:
123:
1899:
825:
553:
Methods/Techniques in which information retrieval techniques are employed include:
445:
378:
373:. In the 1960s, the first large information retrieval research group was formed by
2318:
1744: – Area of research related to information retrieval centered on timeliness
1705: – Measure of a document's applicability to a given subject or search query
1600:
1171:
393:
280:
Automated information retrieval systems are used to reduce what has been called
158:
2350:
2019:
2002:
1797:
1784:
Luk, R. W. P. (2022). "Why is information retrieval a scientific discipline?".
1747:
2165:
1920:
1759:
1394:
957:
133:
1984:
Bulletin of the IEEE Computer Society Technical Committee on Data Engineering
1091:
published text on information retrieval. Becker, Joseph; Hayes, Robert Mayo.
858:, the first machine to use punched cards to control a sequence of operations.
2078:
2061:
1250:
1243:
First online systems—NLM's AIM-TWX, MEDLINE; Lockheed's Dialog; SDC's ORBIT.
829:
320:
246:
2336:
BCS IRSG: British Computer Society – Information Retrieval Specialist Group
2131:
1009:
Proceedings of the International Conference on Scientific Information, 1958
2394:
Information retrieval performance evaluation tool @ Athena Research Centre
1891:
400:
has boosted the need for very large scale retrieval systems even further.
1762: – Process of extracting and discovering patterns in large data sets
1603: – Process of extracting and discovering patterns in large data sets
879:
370:
324:
305:
274:
270:
105:
341:
which have been coded in any desired way at a rate of 120 words a minute
1806:
1142:
266:
193:
1868:
Foote, Jonathan (1999). "An overview of audio information retrieval".
1409:: Key papers on and experimental systems for visualization interfaces.
1279:
in information retrieval", which articulated the "cluster hypothesis".
1720: – A classification model in machine learning based on centroids
366:
319:
the data objects may be, for example, text documents, images, audio,
183:
2305:
1582: – Computer component that stores information for immediate use
2267:
1976:
1532:
1304:
2304:
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze.
2268:
Information Retrieval: Implementing and Evaluating Search Engines
1471:
with emphasis on visualization and multi-reference point systems.
1403:: Efforts to develop end-user versions of commercial IR systems.
1021:
published "Auto-encoding of documents for information retrieval".
2345:
1485:
by Addison Wesley, the first book that attempts to cover all IR.
1437:
1916:
Information Retrieval On Mind Maps - What Could It Be Good For?
1509:
Conference on Research and Development in Information Retrieval
297:
objects may match the query, perhaps with different degrees of
27:
Obtaining information resources relevant to an information need
2315:
Behind the Search Box: Google and the Global Internet Industry
2266:
Stefan Büttcher, Charles L. A. Clarke, and Gordon V. Cormack.
1538:
988:
265:
or other content-based indexing. Information retrieval is the
2147:"An Historical Note on the Origins of Probabilistic Indexing"
1644:
Information seeking § Compared to information retrieval
1093:
Information storage and retrieval: tools, elements, theories
774:
assumption of term vectors or in probabilistic models by an
2393:
2359:
1732: – Subgroup of the Association for Computing Machinery
1539:
International Conference on Theory of Information Retrieval
2384:
TREC report on information retrieval evaluation techniques
2371:
1750: – Estimate of the importance of a word in a document
2340:
2253:
2331:
ACM SIGIR: Information Retrieval Special Interest Group
1594: – retrieval of information in different languages
1344:(Butterworths). Heavy emphasis on probabilistic models.
1944:
Information Retrieval Data Structures & Algorithms
1301:
A Theory of Term Importance in Automatic Text Analysis
2330:
1687: – Tools and systems for managing one's own data
384:
In 1992, the US Department of Defense along with the
1576: – Information retrieval strategies in datasets
1508:
1387:(McGraw-Hill), with heavy emphasis on vector models.
1039:
began work on IR at Harvard, later moved to Cornell.
1011:(National Academy of Sciences, Washington, DC, 1959)
1913:Beel, Jöran; Gipp, Bela; Stiller, Jan-Olaf (2009).
2217:
1521:Conference on Information and Knowledge Management
2346:Forum for Information Retrieval Evaluation (FIRE)
1941:Frakes, William B.; Baeza-Yates, Ricardo (1992).
1667: – Organization in Vienna, Austria 2006–2012
1634: – Machine reading of unstructured documents
977:: The term "information retrieval" was coined by
1977:"Modern Information Retrieval: A Brief Overview"
1756: – Content-based retrieval of XML documents
1297:(Society for Industrial and Applied Mathematics)
1199:Automatic Information Organization and Retrieval
832:and there may be different shades of relevance.
2317:(U of Illinois Press, 2023) ISBN 10:0252087127
2003:"The History of Information Retrieval Research"
1730:Special Interest Group on Information Retrieval
1613:European Summer School in Information Retrieval
1214:A nonlinear mapping for data structure analysis
797:Models with transcendent term interdependencies
338:
1420:, Matthew Chalmers, Anselm Spoerri and others.
386:National Institute of Standards and Technology
2062:"Automatic Retrieval of Recorded Information"
1970:
1968:
1738: – Classifying a document by index terms
793:of those terms in the whole set of documents.
609:Categorization of IR-models (translated from
528:Information retrieval for chemical structures
219:
8:
2277:. MIT Press, Cambridge, Massachusetts, 2010.
2252:Ricardo Baeza-Yates, Berthier Ribeiro-Neto.
2058:(Zator Technical Bulletin No. 48), cited in
2001:Mark Sanderson & W. Bruce Croft (2012).
1515:European Conference on Information Retrieval
1385:Introduction to Modern Information Retrieval
1383:: Salton (and Michael J. McGill) published
1311:A Vector Space Model for Automatic Indexing
811:Evaluation measures (information retrieval)
782:Models with immanent term interdependencies
1609: – Way to obtain data from a database
253:is the task of identifying and retrieving
226:
212:
29:
2286:Library & Information Science Network
2077:
2018:
1881:
1805:
762:Second dimension: properties of the model
675:(Enhanced) Topic-based Vector Space Model
18:Information storage and retrieval systems
1533:Conference on Web Search and Data Mining
604:
2036:"Section III. Opening Plenary Session"
1776:
1527:International World Wide Web Conference
1141:National Library of Medicine developed
745:view documents as vectors of values of
114:
39:
32:
2093:Doyle, Lauren; Becker, Joseph (1975).
1588: – Method of organizing knowledge
1350:: Tamas Doszkocs implemented the CITE
288:are the most visible IR applications.
2306:Introduction to Information Retrieval
2154:Information Processing and Management
1699: – Search engine processing step
768:Models without term-interdependencies
396:to huge corpora. The introduction of
7:
2095:Information Retrieval and Processing
1622:Human–computer information retrieval
1592:Cross-language information retrieval
1111:Synonymy and Semantic Classification
805:Performance and correctness measures
2308:. Cambridge University Press, 2008.
626:First dimension: mathematical basis
2389:How eBay measures search relevance
1828:Jansen, B. J. and Rieh, S. (2010)
1726: – Method for data management
1109:finished her thesis at Cambridge,
1067:Information Analysis and Retrieval
257:resources that are relevant to an
25:
2220:Information Storage and Retrieval
2181:Information Storage and Retrieval
1703:Relevance (information retrieval)
1649:Collaborative information seeking
1574:Adversarial information retrieval
1469:Information Storage and Retrieval
1340:: C. J. van Rijsbergen published
558:Adversarial information retrieval
2341:Text Retrieval Conference (TREC)
1680:Multimedia information retrieval
731:Divergence-from-randomness model
702:are often used in these models.
524:Geographic information retrieval
2288:. 24 April 2015. Archived from
1693: – Type of search strategy
1685:Personal information management
1352:natural language user interface
999:Case Western Reserve University
179:Library and information science
2372:Information Retrieval Facility
2282:"Information Retrieval System"
1742:Temporal information retrieval
1665:Information Retrieval Facility
778:assumption for term variables.
743:Feature-based retrieval models
670:Generalized vector space model
634:models represent documents as
101:Science and technology studies
1:
2097:. Melville. pp. 410 pp.
711:Probabilistic relevance model
520:Genomic information retrieval
273:that describes data, and for
2216:Korfhage, Robert R. (1997).
2193:10.1016/0020-0271(71)90051-9
1483:Modern Information Retrieval
1481:and Berthier Ribeiro-Neto's
1122:National Bureau of Standards
820:or top-k retrieval, include
568:Multi-document summarization
512:Domain-specific applications
277:of texts, images or sounds.
115:Related fields and subfields
2414:Natural language processing
2263:. Addison-Wesley, UK, 2011.
1255:Computer Lib/Dream Machines
963:) and the invention of the
736:Latent Dirichlet allocation
538:Legal information retrieval
199:Quantum information science
2430:
2360:Information Retrieval Wiki
2060:Fairthorne, R. A. (1958).
2020:10.1109/jproc.2012.2189916
1798:10.1007/s10699-020-09685-x
1654:Social information seeking
1273:Cornelis J. van Rijsbergen
808:
2166:10.1016/j.ipm.2007.02.012
2145:Maron, Melvin E. (2008).
1115:computational linguistics
1095:. New York, Wiley (1963).
706:Binary Independence Model
531:Information retrieval in
390:Text Retrieval Conference
1711: – type of feedback
1561:Karen Spärck Jones Award
1197:Gerard Salton published
1113:, and continued work on
824:. All measures assume a
689:latent semantic analysis
685:Latent semantic indexing
575:Compound term processing
388:(NIST), cosponsored the
2313:Yeo, ShinJoung. (2023)
2007:Proceedings of the IEEE
1671:Knowledge visualization
1162:Libraries of the Future
585:Document classification
580:Cross-lingual retrieval
563:Automatic summarization
549:Other retrieval methods
2132:10.1002/asi.5090060411
2120:American Documentation
1975:Singhal, Amit (2001).
1947:. Prentice-Hall, Inc.
1786:Foundations of Science
1724:Search engine indexing
1718:Rocchio classification
1632:Information extraction
1275:published "The use of
1148:Project Intrex at MIT.
713:on which is based the
680:Extended Boolean model
647:Extended Boolean model
642:Standard Boolean model
618:
349:
164:Information technology
86:Knowledge organization
2409:Information retrieval
2351:Information Retrieval
2079:10.1093/comjnl/1.1.36
2034:JE Holmstrom (1948).
1892:10.1007/s005300050106
1586:Controlled vocabulary
1551:Tony Kent Strix award
1342:Information Retrieval
1277:hierarchic clustering
1249:promoting concept of
852:Joseph Marie Jacquard
787:dimensional reduction
608:
517:Expert search finding
423:Information filtering
346:J. E. Holmstrom, 1948
239:Information retrieval
174:Intellectual property
144:Computer data storage
2355:C. J. van Rijsbergen
2066:The Computer Journal
1295:A Theory of Indexing
1184:F. Wilfrid Lancaster
1117:as it applies to IR.
997:: Allen Kent joined
886:used to process the
822:precision and recall
696:Probabilistic models
533:software engineering
412:General applications
282:information overload
169:Intellectual freedom
2052:Mooers, Calvin N.;
1697:Query understanding
1638:Information seeking
1556:Gerard Salton Award
1545:Awards in the field
1479:Ricardo Baeza-Yates
1287:term discrimination
721:Uncertain inference
428:Recommender systems
251:information science
34:Information science
2377:2008-05-22 at the
2365:2015-11-24 at the
2273:2020-10-05 at the
2259:2017-09-18 at the
2224:. Wiley. pp.
1870:Multimedia Systems
1835:2016-03-04 at the
1709:Relevance feedback
1493:Web search engines
1418:Robert R. Korfhage
1393:: David Blair and
1375:Nicholas J. Belkin
1219:2017-08-08 at the
1158:J. C. R. Licklider
1107:Karen Spärck Jones
1087:Joseph Becker and
1059:Cyril W. Cleverdon
717:relevance function
665:Vector space model
619:
613:, original source
595:Question answering
398:web search engines
286:Web search engines
255:information system
2353:(online book) by
2235:978-0-471-14338-3
2104:978-0-471-22151-7
1954:978-0-13-463837-9
1851:Informing Science
1502:Major conferences
1477:: Publication of
1463:: Publication of
1047:Melvin Earl Maron
818:Boolean retrieval
747:feature functions
484:Enterprise search
418:Digital libraries
315:Depending on the
236:
235:
16:(Redirected from
1957:. Archived from
1938:
1932:
1931:
1929:
1928:
1910:
1904:
1903:
1885:
1865:
1859:
1858:
1846:
1840:
1826:
1820:
1819:
1809:
1781:
1765:
1736:Subject indexing
1714:
1676:
1659:
1627:
1618:
1597:
1414:Donald B. Crouch
1269:Nicholas Jardine
935:Atlantic Monthly
901:Emanuel Goldberg
866:Herman Hollerith
755:learning to rank
659:Algebraic models
489:Federated search
458:Speech retrieval
363:Emanuel Goldberg
347:
259:information need
228:
221:
214:
149:Cultural studies
30:
21:
2429:
2428:
2424:
2423:
2422:
2420:
2419:
2418:
2399:
2398:
2379:Wayback Machine
2367:Wayback Machine
2327:
2295:
2293:
2280:
2275:Wayback Machine
2261:Wayback Machine
2249:
2247:Further reading
2244:
2243:
2236:
2215:
2214:
2210:
2204:
2200:
2178:
2177:
2173:
2149:
2144:
2143:
2139:
2117:
2116:
2112:
2105:
2092:
2091:
2087:
2059:
2051:
2047:
2033:
2032:
2028:
2000:
1999:
1995:
1979:
1974:
1973:
1966:
1955:
1940:
1939:
1935:
1926:
1924:
1912:
1911:
1907:
1867:
1866:
1862:
1848:
1847:
1843:
1837:Wayback Machine
1827:
1823:
1783:
1782:
1778:
1773:
1768:
1763:
1712:
1674:
1657:
1625:
1616:
1595:
1580:Computer memory
1569:
1547:
1504:
1434:Tim Berners-Lee
1221:Wayback Machine
1089:Robert M. Hayes
1065:Kent published
1019:Hans Peter Luhn
969:Eugene Garfield
946:Hans Peter Luhn
929:As We May Think
838:
813:
807:
764:
726:Language models
652:Fuzzy retrieval
628:
615:Dominik Kuropka
603:
551:
543:Vertical search
514:
462:Video retrieval
451:Music retrieval
441:Image retrieval
414:
406:
354:As We May Think
348:
345:
337:
294:
232:
203:
110:
41:General aspects
28:
23:
22:
15:
12:
11:
5:
2325:External links
2323:
2322:
2321:
2310:
2309:
2302:
2292:on 11 May 2020
2278:
2264:
2248:
2245:
2242:
2241:
2234:
2208:
2198:
2187:(5): 217–240.
2171:
2160:(2): 971–972.
2137:
2126:(4): 242–254.
2110:
2103:
2085:
2045:
2026:
1993:
1964:
1961:on 2013-09-28.
1953:
1933:
1905:
1883:10.1.1.39.6339
1860:
1841:
1821:
1792:(2): 427–453.
1775:
1774:
1772:
1769:
1767:
1766:
1757:
1751:
1745:
1739:
1733:
1727:
1721:
1715:
1706:
1700:
1694:
1688:
1682:
1677:
1668:
1662:
1661:
1660:
1651:
1646:
1635:
1629:
1619:
1610:
1607:Data retrieval
1604:
1598:
1589:
1583:
1577:
1570:
1568:
1565:
1564:
1563:
1558:
1553:
1546:
1543:
1542:
1541:
1535:
1529:
1523:
1517:
1511:
1503:
1500:
1499:
1498:
1497:
1496:
1486:
1472:
1458:
1443:
1442:
1441:
1430:World Wide Web
1423:
1422:
1421:
1410:
1398:
1388:
1378:
1368:
1357:
1356:
1355:
1345:
1335:
1322:
1321:
1320:
1319:
1318:
1308:
1298:
1280:
1262:
1261:
1260:
1259:
1258:
1247:Theodor Nelson
1244:
1228:
1227:
1226:
1225:
1224:
1206:
1205:
1202:
1194:
1193:
1177:
1176:
1175:
1165:
1151:
1150:
1149:
1146:
1131:
1130:
1129:
1118:
1098:
1097:
1096:
1085:
1082:Alvin Weinberg
1072:
1071:
1070:
1063:
1050:
1040:
1024:
1023:
1022:
1012:
1002:
992:
982:
972:
965:citation index
951:
950:
949:
939:
906:
905:
904:
893:
892:
891:
888:1890 US Census
869:
859:
837:
834:
809:Main article:
806:
803:
802:
801:
794:
779:
763:
760:
759:
758:
740:
739:
738:
733:
728:
723:
718:
708:
700:Bayes' theorem
693:
692:
691:
682:
677:
672:
667:
656:
655:
654:
649:
644:
627:
624:
602:
599:
598:
597:
592:
590:Spam filtering
587:
582:
577:
572:
571:
570:
560:
550:
547:
546:
545:
540:
535:
529:
526:
521:
518:
513:
510:
509:
508:
507:
506:
501:
496:
491:
486:
481:
479:Desktop search
476:
469:Search engines
466:
465:
464:
459:
456:
453:
448:
443:
438:
432:
431:
430:
420:
413:
410:
405:
402:
343:
336:
333:
293:
290:
234:
233:
231:
230:
223:
216:
208:
205:
204:
202:
201:
196:
191:
186:
181:
176:
171:
166:
161:
156:
151:
146:
141:
139:Classification
136:
131:
129:Categorization
126:
120:
117:
116:
112:
111:
109:
108:
103:
98:
93:
88:
83:
78:
73:
68:
63:
58:
53:
47:
44:
43:
37:
36:
26:
24:
14:
13:
10:
9:
6:
4:
3:
2:
2013:: 1444–1451.
1923:on 2011-05-13
1754:XML retrieval
1691:Pearl growing
1432:proposals by
1431:
1427:
1424:
1419:
1415:
1411:
1408:
1405:
1404:
1402:
1399:
1396:
1392:
1389:
1386:
1382:
1379:
1376:
1372:
1369:
1366:
1363:
1362:
1361:
1358:
1353:
1349:
1346:
1343:
1339:
1336:
1333:
1330:
1326:
1323:
1316:
1312:
1309:
1306:
1302:
1299:
1296:
1293:
1292:
1291:
1290:
1288:
1284:
1281:
1278:
1274:
1270:
1266:
1263:
1256:
1252:
1248:
1245:
1242:
1241:
1240:
1239:
1237:
1234:
1233:
1232:
1229:
1222:
1218:
1215:
1211:
1208:
1207:
1203:
1200:
1196:
1195:
1191:
1188:
1187:
1185:
1181:
1178:
1173:
1169:
1166:
1163:
1159:
1155:
1152:
1147:
1144:
1140:
1139:
1138:
1137:
1135:
1132:
1127:
1123:
1119:
1116:
1112:
1108:
1105:
1104:
1102:
1099:
1094:
1090:
1086:
1083:
1079:
1078:
1076:
1073:
1068:
1064:
1060:
1057:
1056:
1054:
1051:
1048:
1044:
1041:
1038:
1037:Gerard Salton
1034:
1031:
1030:
1028:
1025:
1020:
1016:
1013:
1010:
1006:
1003:
1000:
996:
993:
990:
986:
983:
980:
979:Calvin Mooers
976:
973:
970:
966:
962:
959:
955:
952:
947:
943:
940:
937:
936:
931:
930:
925:
924:Vannevar Bush
921:
918:
917:
915:
912:
911:
910:
907:
902:
899:
898:
897:
894:
889:
885:
881:
877:
873:
870:
867:
863:
860:
857:
856:Jacquard loom
853:
849:
846:
845:
844:
840:
839:
835:
833:
831:
827:
823:
819:
812:
804:
798:
795:
792:
791:co-occurrence
788:
783:
780:
777:
773:
772:orthogonality
769:
766:
765:
761:
756:
752:
748:
744:
741:
737:
734:
732:
729:
727:
724:
722:
719:
716:
712:
709:
707:
704:
703:
701:
697:
694:
690:
686:
683:
681:
678:
676:
673:
671:
668:
666:
663:
662:
660:
657:
653:
650:
648:
645:
643:
640:
639:
637:
633:
632:Set-theoretic
630:
629:
625:
623:
616:
612:
607:
600:
596:
593:
591:
588:
586:
583:
581:
578:
576:
573:
569:
566:
565:
564:
561:
559:
556:
555:
554:
548:
544:
541:
539:
536:
534:
530:
527:
525:
522:
519:
516:
515:
511:
505:
502:
500:
499:Social search
497:
495:
494:Mobile search
492:
490:
487:
485:
482:
480:
477:
475:
472:
471:
470:
467:
463:
460:
457:
454:
452:
449:
447:
444:
442:
439:
436:
435:
434:Media search
433:
429:
426:
425:
424:
421:
419:
416:
415:
411:
409:
403:
401:
399:
395:
391:
387:
382:
380:
376:
375:Gerard Salton
372:
368:
364:
360:
359:Vannevar Bush
356:
355:
342:
334:
332:
328:
326:
322:
318:
313:
311:
307:
302:
300:
291:
289:
287:
283:
278:
276:
272:
268:
264:
260:
256:
252:
248:
244:
240:
229:
224:
222:
217:
215:
210:
209:
207:
206:
200:
197:
195:
192:
190:
187:
185:
182:
180:
177:
175:
172:
170:
167:
165:
162:
160:
157:
155:
154:Data modeling
152:
150:
147:
145:
142:
140:
137:
135:
132:
130:
127:
125:
124:Bibliometrics
122:
121:
119:
118:
113:
107:
104:
102:
99:
97:
94:
92:
89:
87:
84:
82:
79:
77:
74:
72:
69:
67:
64:
62:
59:
57:
54:
52:
49:
48:
46:
45:
42:
38:
35:
31:
19:
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.