135:, 1990) "scientific", computational, neo-Darwinist project of literary evolution, and the role of reading is downplayed by both Martindale and Moretti. According to Martindale, the principles of the evolution of art are based on statistic regularities rather than meaning, data or observation. "So far as the engines of history are concerned, meaning does not matter. In principle, one could study the history of a literary tradition without reading any of literature. ... the main virtue of the computerized content analysis methods I use is that they save one from actually having to read the literature" (p. 14).
183:. Moretti has described the concept of 'operationalizing' as "absolutely central to the new field of computational criticism" that includes distant reading. This principle, for Moretti, consists of "building a bridge from concepts to measurement, and then to the world" (104), underscoring the combined interests of empirical and quantitative study at its heart. In practice, distant reading has been undertaken with the aid of
22:
203:, but the specific example he isolates for critique is informed by his impression of distant reading methodology: "first you run the numbers, and then you see if they prompt an interpretive hypothesis. The method, if it can be called that, is dictated by the capability of the tool". In a similar vein,
274:
to argue for a "systemic concretization of language and fundamental change in the social spaces of the novel". Their analysis demonstrates a change in the way in which concrete detail is presented across the span of the nineteenth century, with an observable shift in the novel's narrative style "from
207:
focuses on the prospects for interpretation within the framework of computational literary analysis in an article which begins with the provocation, "ig data is coming for your books". Though he initially described distant reading as the "most promising path, at least on the surface" of a range of
246:
to address questions about the shortening of eighteenth-century novel titles, about the nature of very short novel titles, and about the relationship of novel titles to genres. For examples, in
Section I, he provides evidence of the decreasing length of titles across the time span, and links the
85:
While the term is collective, and is used to refer to a range of different computational methods of analysing literary data, similar approaches also include macroanalysis, cultural analytics, computational formalism, computational literary studies, quantitative literary studies, and algorithmic
114:
However, Moretti initially conceived distant reading for analysis of secondary literature as a roundabout way of getting to know more about primary literature: " will become 'second-hand': a patchwork of other people's research, without a single direct textual reading". Only later did the term
250:
In 'Why
Literary Time is Measured in Minutes" Ted Underwood asks " Why are short spans of time so central to our discipline? ... Why is experience measured in seconds or minutes more appropriately literary than experience measured in weeks or months?". Methodologically, Underwood supplements
118:
Despite the consensus about the origins of distant reading at the turn of the twenty-first century, Ted
Underwood has traced a longer genealogy of the method, arguing for its elision in current discourse about distant reading. He writes that "distant reading has a largely distinct genealogy
138:
This variety in the stated definitions and aims of distant reading is characteristic of its development since the turn of the twenty-first century, where it has come to encompass a variety of different methods and approaches, rather than representing a single or unified method of
119:
stretching back many decades before the advent of the internet—a genealogy that is not for the most part centrally concerned with computers". Underwood emphasises a social-scientific dimension in this prehistory of distant reading, referring to particular examples in the work of
298:, in her article, "The Image of Absence: Archival Silence, Data Visualization, and James Hemings", in order to present examples of how distant reading can uncover and illuminate "the silences endemic to the archive of American slavery". Searching for archival traces of
216:
is unconvinced about distant reading's claims to represent the perspectives of the "great unread", asking "hould our only ambition be to create authoritative totalizing patterns depending on untested statements by small groups of people treated as native informants?".
166:
Commonly, distant reading is performed at scale, using a large collection of texts. However, some scholars have adopted the principles of distant reading in the analysis of a small number of texts or an individual text. Distant reading often shares with the
102:. In the article, Moretti proposed a mode of reading which included works outside of established literary canons, which he variously termed "the great unread" and, elsewhere, "the Slaughterhouse of Literature". The innovation it proposed, as far as
237:
of these novels' texts, Moretti argues that "titles are still the best way to go beyond the 1 percent of novels that make up the canon, and catch a glimpse of the literary field as a whole". In the article, Moretti combines the results of
106:
was concerned, was that the method employed samples, statistics, paratexts, and other features not often considered within the ambit of literary analysis. Moretti also established a direct opposition to the theory and methods of
208:
Digital
Humanities methods he surveys, he concludes that the generalisations he perceives in the method are ineffective when "applied to literary questions proper". Additional critiques of distant reading have come from
111:: "One thing for sure: it cannot mean the very close reading of very few texts—secularized theology, really ('canon'!)—that has radiated from the cheerful town of New Haven over the whole field of literary studies".
187:
in the twenty-first century (though
Underwood has argued for prominent non-computational precursors); however, some works combining scale and literary study have been described as "distant-reading-by-hand".
267:, with the author suggesting that "I see close readings and statistical models not as competing epistemologies but as interlocking modes of interpretation that excel at different scales of analysis".
270:
In their
Literary Lab pamphlet, "A Quantitative Literary History of 2,958 Nineteenth-Century British Novels: The Semantic Cohort Method", Ryan Heuser and Long Le-Khac analyse word usage within their
1080:
1240:
40:
251:
theoretical ideas about the compression of fictional time with approaches from distant reading which model the average lengths of time described in 250-word portions of
275:
telling to showing" as the century develops. The findings tally with many literary-critical writings about the change in nineteenth-century narrative style from
1218:
233:
uses an early distant reading methodology to analyse certain changes in the titles of novels in the given period and country. In the absence of dedicated
1629:
1073:
1798:
629:
325:. It aims to create a network of researchers jointly developing the distant reading resources and methods necessary to change the way European
159:
can be written without necessarily resorting to the kind of careful, sustained reading encounter with individual texts that is fundamental to
115:
distant reading (via
Moretti and other scholars) come to become primarily identified with computational analysis of primary literary sources.
1539:
1230:
1066:
306:'s enslaved chef, Klein juxtaposes visualisations of his presence with Jefferson's own charts and tables as the basis for a discussion of
1793:
1400:
1554:
1385:
682:
612:
58:
1325:
1742:
1395:
393:
367:
1390:
1135:
1659:
1380:
1352:
653:
247:
phenomenon to the growth of the market for novels and the establishment of periodicals which regularly reviewed novels.
78:
that applies computational methods to literary data, usually derived from large digital libraries, for the purposes of
1829:
1697:
1682:
1654:
1519:
1514:
1089:
1834:
1434:
1405:
1183:
1277:
1130:
556:
287:
1803:
1727:
1459:
1415:
1300:
1198:
699:
1707:
1677:
1344:
1564:
1257:
1235:
1225:
1193:
1168:
999:
Klein, Lauren F. (2013). "The Image of
Absence: Archival Silence, Data Visualization, and James Hemings".
972:
Klein, Lauren F. (2013). "The Image of
Absence: Archival Silence, Data Visualization, and James Hemings".
1424:
948:"A Quantitative Literary History of 2,958 Nineteenth-Century British Novels: The Semantic Cohort Method"
922:"A Quantitative Literary History of 2,958 Nineteenth-Century British Novels: The Semantic Cohort Method"
896:"A Quantitative Literary History of 2,958 Nineteenth-Century British Novels: The Semantic Cohort Method"
724:
Moretti, Franco (2009). "Style, Inc. Reflections on Seven
Thousand Titles (British Novels, 1740–1850)".
256:
1777:
1453:
1429:
1282:
260:
259:
findings with close reading, Underwood concludes his article with a discussion of the integration of
180:
672:
1757:
1687:
1644:
1600:
1372:
1362:
1357:
1245:
1767:
1639:
1504:
1267:
1250:
1108:
876:
831:
786:
741:
537:
475:
436:
334:
307:
291:
200:
172:
156:
518:"Close Reading with Computers: Genre Signals, Parts of Speech, and David Mitchell's Cloud Atlas"
579:
494:
333:
European Literary Text Collection (ELTeC) containing digital full-texts of novels in different
1772:
1484:
1292:
1203:
678:
608:
1649:
1534:
1509:
1310:
1213:
1008:
981:
866:
858:
821:
813:
776:
768:
733:
529:
467:
428:
326:
322:
303:
276:
243:
179:
are a regular characteristic of distant reading, and are often accompanied by a reliance on
152:
120:
103:
79:
75:
1761:
1722:
1717:
1585:
1315:
1163:
1145:
1469:
1449:
1173:
947:
921:
895:
602:
264:
230:
213:
204:
176:
168:
140:
95:
1823:
1732:
1544:
1524:
1305:
880:
835:
790:
479:
440:
346:
299:
218:
160:
124:
108:
541:
1712:
330:
209:
196:
229:
In "Style, Inc. Reflections on Seven Thousand Titles (British Novels, 1740–1850)"
329:
is written. The objectives of the project include coordinating the creation of a
1669:
1549:
1262:
1178:
1155:
1103:
318:
271:
234:
1272:
1058:
745:
471:
432:
239:
1012:
985:
1140:
280:
184:
1044:
862:
817:
772:
199:
takes a broad view of what he frames as problems of interpretation in the
1615:
1595:
1580:
1559:
1529:
1474:
1439:
1320:
557:"'Operationalizing': Or, the Function of Measurement in Literary Theory"
1752:
1610:
1590:
1464:
1208:
1123:
871:
826:
781:
533:
295:
252:
517:
1118:
1113:
455:
416:
314:
313:
The COST Action 'Distant Reading for European Literary History' is a
1026:
849:
Underwood, Ted (2018). "Why Literary Time is Measured in Minutes".
804:
Underwood, Ted (2018). "Why Literary Time is Measured in Minutes".
759:
Underwood, Ted (2018). "Why Literary Time is Measured in Minutes".
737:
221:
questions the "unavowed imperialism of English" in Moretti's work.
1808:
1444:
630:"Mind Your P's and B's: The Digital Humanities and Interpretation"
1330:
127:(from the 1980s). Moretti’s conception of literary evolution in
1062:
1605:
15:
317:
networking project bringing together scholars interested in
171:
a focus on the analysis of long-term histories and trends.
151:
One of the central principles of distant reading is that
131:
is quite similar to the psychologist Colin Martindale’s (
36:
94:
The term "distant reading" is generally attributed to
654:"Literature Is not Data: Against Digital Humanities"
1786:
1741:
1696:
1668:
1628:
1573:
1495:
1483:
1414:
1371:
1343:
1291:
1154:
1096:
604:
Metaphors of Mind: An Eighteenth-Century Dictionary
321:building, quantitative text analysis, and European
31:
may be too technical for most readers to understand
1027:"Distant Reading for European Literary History"
1074:
677:. Columbia University Press. pp. 107–8.
607:. Baltimore: Johns Hopkins University Press.
255:across three centuries. Having also combined
242:of these titles with contextual knowledge of
8:
310:as it relates to the construction of race.
1492:
1288:
1081:
1067:
1059:
1045:"ELTeC: European Literary Text Collection"
870:
825:
780:
59:Learn how and when to remove this message
43:, without removing the technical details.
358:
955:Pamphlets of the Stanford Literary Lab
929:Pamphlets of the Stanford Literary Lab
903:Pamphlets of the Stanford Literary Lab
719:
717:
647:
645:
643:
41:make it understandable to non-experts
7:
1540:Simple Knowledge Organization System
946:Heuser, Ryan; Le-Khac, Long (2012).
920:Heuser, Ryan; Le-Khac, Long (2012).
894:Heuser, Ryan; Le-Khac, Long (2012).
671:Spivak, Gayatri Chakravorty (2005).
387:
385:
286:Lauren F. Klein trains methods from
456:"The Slaughterhouse of Literature"
417:"The Slaughterhouse of Literature"
14:
1555:Thesaurus (information retrieval)
394:"Conjectures on World Literature"
368:"Conjectures on World Literature"
580:"A Genealogy of Distant Reading"
495:"A Genealogy of Distant Reading"
20:
652:Marche, Stephen (28 Oct 2012).
100:Conjectures on World Literature
1136:Natural language understanding
1:
1660:Optical character recognition
628:Fish, Stanley (23 Jan 2012).
192:Criticisms of distant reading
1353:Multi-document summarization
584:Digital Humanities Quarterly
499:Digital Humanities Quarterly
1683:Latent Dirichlet allocation
1655:Natural language generation
1520:Machine-readable dictionary
1515:Linguistic Linked Open Data
1090:Natural language processing
658:Los Angeles Review of Books
1851:
1435:Explicit semantic analysis
1184:Deep linguistic processing
1278:Word-sense disambiguation
1131:Computational linguistics
516:Eve, Martin Paul (2017).
472:10.1215/00267929-61-1-207
460:Modern Language Quarterly
433:10.1215/00267929-61-1-207
421:Modern Language Quarterly
288:computational linguistics
1804:Natural Language Toolkit
1728:Pronunciation assessment
1630:Automatic identification
1460:Latent semantic analysis
1416:Distributional semantics
1301:Compound-term processing
1199:Named-entity recognition
1013:10.1215/00029831-2367310
986:10.1215/00029831-2367310
555:Moretti, Franco (2013).
454:Moretti, Franco (2000).
415:Moretti, Franco (2000).
392:Moretti, Franco (2000).
366:Moretti, Franco (2000).
1708:Automated essay scoring
1678:Document classification
1345:Automatic summarization
698:Arac, Jonathan (2002).
578:Underwood, Ted (2017).
493:Underwood, Ted (2017).
147:Principles and practice
1565:Universal Dependencies
1258:Terminology extraction
1241:Semantic decomposition
1236:Semantic role labeling
1226:Part-of-speech tagging
1194:Information extraction
1179:Coreference resolution
1169:Collocation extraction
601:Pasanek, Brad (2015).
98:and his 2000 article,
1326:Sentence segmentation
863:10.1353/elh.2018.0013
818:10.1353/elh.2018.0013
773:10.1353/elh.2018.0013
674:Death of a Discipline
240:quantitative analysis
123:(from the 1960s) and
1778:Voice user interface
1489:datasets and corpora
1430:Document-term matrix
1283:Word-sense induction
261:quantitative methods
181:quantitative methods
173:Empirical approaches
86:literary criticism.
1758:Interactive fiction
1688:Pachinko allocation
1645:Speech segmentation
1601:Google Ngram Viewer
1373:Machine translation
1363:Text simplification
1358:Sentence extraction
1246:Semantic similarity
1001:American Literature
974:American Literature
1830:Literary criticism
1768:Question answering
1640:Speech recognition
1505:Corpus linguistics
1485:Language resources
1268:Textual entailment
1251:Sentiment analysis
700:"Anglo-Globalism?"
534:10.3368/ss.46.3.76
335:European languages
308:data visualisation
292:data visualisation
201:digital humanities
157:literary criticism
74:is an approach in
1835:Reading (process)
1817:
1816:
1773:Virtual assistant
1698:Computer-assisted
1624:
1623:
1381:Computer-assisted
1339:
1338:
1331:Word segmentation
1293:Text segmentation
1231:Semantic analysis
1219:Syntactic parsing
1204:Ontology learning
294:on an archive of
69:
68:
61:
1842:
1794:Formal semantics
1743:Natural language
1650:Speech synthesis
1632:and data capture
1535:Semantic network
1510:Lexical resource
1493:
1311:Lexical analysis
1289:
1214:Semantic parsing
1083:
1076:
1069:
1060:
1053:
1052:
1041:
1035:
1034:
1023:
1017:
1016:
996:
990:
989:
969:
963:
962:
952:
943:
937:
936:
926:
917:
911:
910:
900:
891:
885:
884:
874:
846:
840:
839:
829:
801:
795:
794:
784:
756:
750:
749:
726:Critical Inquiry
721:
712:
711:
695:
689:
688:
668:
662:
661:
649:
638:
637:
625:
619:
618:
598:
592:
591:
575:
569:
568:
552:
546:
545:
513:
507:
506:
490:
484:
483:
451:
445:
444:
412:
406:
405:
389:
380:
379:
363:
327:literary history
323:literary history
304:Thomas Jefferson
244:literary history
153:literary history
121:Raymond Williams
104:literary studies
80:literary history
76:literary studies
64:
57:
53:
50:
44:
24:
23:
16:
1850:
1849:
1845:
1844:
1843:
1841:
1840:
1839:
1820:
1819:
1818:
1813:
1782:
1762:Syntax guessing
1744:
1737:
1723:Predictive text
1718:Grammar checker
1699:
1692:
1664:
1631:
1620:
1586:Bank of English
1569:
1497:
1488:
1479:
1410:
1367:
1335:
1287:
1189:Distant reading
1164:Argument mining
1150:
1146:Text processing
1092:
1087:
1057:
1056:
1049:Distant Reading
1043:
1042:
1038:
1031:Distant Reading
1025:
1024:
1020:
998:
997:
993:
971:
970:
966:
950:
945:
944:
940:
924:
919:
918:
914:
898:
893:
892:
888:
848:
847:
843:
803:
802:
798:
758:
757:
753:
723:
722:
715:
704:New Left Review
697:
696:
692:
685:
670:
669:
665:
651:
650:
641:
627:
626:
622:
615:
600:
599:
595:
577:
576:
572:
561:New Left Review
554:
553:
549:
515:
514:
510:
492:
491:
487:
453:
452:
448:
414:
413:
409:
398:New Left Review
391:
390:
383:
372:New Left Review
365:
364:
360:
355:
343:
227:
194:
149:
129:Distant Reading
92:
72:Distant reading
65:
54:
48:
45:
37:help improve it
34:
25:
21:
12:
11:
5:
1848:
1846:
1838:
1837:
1832:
1822:
1821:
1815:
1814:
1812:
1811:
1806:
1801:
1796:
1790:
1788:
1784:
1783:
1781:
1780:
1775:
1770:
1765:
1755:
1749:
1747:
1745:user interface
1739:
1738:
1736:
1735:
1730:
1725:
1720:
1715:
1710:
1704:
1702:
1694:
1693:
1691:
1690:
1685:
1680:
1674:
1672:
1666:
1665:
1663:
1662:
1657:
1652:
1647:
1642:
1636:
1634:
1626:
1625:
1622:
1621:
1619:
1618:
1613:
1608:
1603:
1598:
1593:
1588:
1583:
1577:
1575:
1571:
1570:
1568:
1567:
1562:
1557:
1552:
1547:
1542:
1537:
1532:
1527:
1522:
1517:
1512:
1507:
1501:
1499:
1490:
1481:
1480:
1478:
1477:
1472:
1470:Word embedding
1467:
1462:
1457:
1450:Language model
1447:
1442:
1437:
1432:
1427:
1421:
1419:
1412:
1411:
1409:
1408:
1403:
1401:Transfer-based
1398:
1393:
1388:
1383:
1377:
1375:
1369:
1368:
1366:
1365:
1360:
1355:
1349:
1347:
1341:
1340:
1337:
1336:
1334:
1333:
1328:
1323:
1318:
1313:
1308:
1303:
1297:
1295:
1286:
1285:
1280:
1275:
1270:
1265:
1260:
1254:
1253:
1248:
1243:
1238:
1233:
1228:
1223:
1222:
1221:
1216:
1206:
1201:
1196:
1191:
1186:
1181:
1176:
1174:Concept mining
1171:
1166:
1160:
1158:
1152:
1151:
1149:
1148:
1143:
1138:
1133:
1128:
1127:
1126:
1121:
1111:
1106:
1100:
1098:
1094:
1093:
1088:
1086:
1085:
1078:
1071:
1063:
1055:
1054:
1036:
1018:
991:
980:(4): 661–688.
964:
938:
912:
886:
841:
796:
767:(2): 341–365.
751:
746:10.1086/606125
738:10.1086/605619
732:(1): 134–158.
713:
690:
683:
663:
639:
634:New York Times
620:
613:
593:
570:
547:
508:
485:
446:
407:
381:
357:
356:
354:
351:
350:
349:
342:
339:
265:literary study
231:Franco Moretti
226:
223:
214:Gayatri Spivak
205:Stephen Marche
193:
190:
177:literary study
169:Annales school
148:
145:
141:literary study
133:Clockwork Muse
96:Franco Moretti
91:
88:
67:
66:
28:
26:
19:
13:
10:
9:
6:
4:
3:
2:
1847:
1836:
1833:
1831:
1828:
1827:
1825:
1810:
1807:
1805:
1802:
1800:
1799:Hallucination
1797:
1795:
1792:
1791:
1789:
1785:
1779:
1776:
1774:
1771:
1769:
1766:
1763:
1759:
1756:
1754:
1751:
1750:
1748:
1746:
1740:
1734:
1733:Spell checker
1731:
1729:
1726:
1724:
1721:
1719:
1716:
1714:
1711:
1709:
1706:
1705:
1703:
1701:
1695:
1689:
1686:
1684:
1681:
1679:
1676:
1675:
1673:
1671:
1667:
1661:
1658:
1656:
1653:
1651:
1648:
1646:
1643:
1641:
1638:
1637:
1635:
1633:
1627:
1617:
1614:
1612:
1609:
1607:
1604:
1602:
1599:
1597:
1594:
1592:
1589:
1587:
1584:
1582:
1579:
1578:
1576:
1572:
1566:
1563:
1561:
1558:
1556:
1553:
1551:
1548:
1546:
1545:Speech corpus
1543:
1541:
1538:
1536:
1533:
1531:
1528:
1526:
1525:Parallel text
1523:
1521:
1518:
1516:
1513:
1511:
1508:
1506:
1503:
1502:
1500:
1494:
1491:
1486:
1482:
1476:
1473:
1471:
1468:
1466:
1463:
1461:
1458:
1455:
1451:
1448:
1446:
1443:
1441:
1438:
1436:
1433:
1431:
1428:
1426:
1423:
1422:
1420:
1417:
1413:
1407:
1404:
1402:
1399:
1397:
1394:
1392:
1389:
1387:
1386:Example-based
1384:
1382:
1379:
1378:
1376:
1374:
1370:
1364:
1361:
1359:
1356:
1354:
1351:
1350:
1348:
1346:
1342:
1332:
1329:
1327:
1324:
1322:
1319:
1317:
1316:Text chunking
1314:
1312:
1309:
1307:
1306:Lemmatisation
1304:
1302:
1299:
1298:
1296:
1294:
1290:
1284:
1281:
1279:
1276:
1274:
1271:
1269:
1266:
1264:
1261:
1259:
1256:
1255:
1252:
1249:
1247:
1244:
1242:
1239:
1237:
1234:
1232:
1229:
1227:
1224:
1220:
1217:
1215:
1212:
1211:
1210:
1207:
1205:
1202:
1200:
1197:
1195:
1192:
1190:
1187:
1185:
1182:
1180:
1177:
1175:
1172:
1170:
1167:
1165:
1162:
1161:
1159:
1157:
1156:Text analysis
1153:
1147:
1144:
1142:
1139:
1137:
1134:
1132:
1129:
1125:
1122:
1120:
1117:
1116:
1115:
1112:
1110:
1107:
1105:
1102:
1101:
1099:
1097:General terms
1095:
1091:
1084:
1079:
1077:
1072:
1070:
1065:
1064:
1061:
1050:
1046:
1040:
1037:
1032:
1028:
1022:
1019:
1014:
1010:
1006:
1002:
995:
992:
987:
983:
979:
975:
968:
965:
960:
956:
949:
942:
939:
934:
930:
923:
916:
913:
908:
904:
897:
890:
887:
882:
878:
873:
868:
864:
860:
856:
852:
845:
842:
837:
833:
828:
823:
819:
815:
811:
807:
800:
797:
792:
788:
783:
778:
774:
770:
766:
762:
755:
752:
747:
743:
739:
735:
731:
727:
720:
718:
714:
709:
705:
701:
694:
691:
686:
684:9780231129459
680:
676:
675:
667:
664:
659:
655:
648:
646:
644:
640:
635:
631:
624:
621:
616:
614:9781421416885
610:
606:
605:
597:
594:
589:
585:
581:
574:
571:
566:
562:
558:
551:
548:
543:
539:
535:
531:
527:
523:
519:
512:
509:
504:
500:
496:
489:
486:
481:
477:
473:
469:
465:
461:
457:
450:
447:
442:
438:
434:
430:
426:
422:
418:
411:
408:
403:
399:
395:
388:
386:
382:
377:
373:
369:
362:
359:
352:
348:
347:Close reading
345:
344:
340:
338:
336:
332:
328:
324:
320:
316:
311:
309:
305:
301:
300:James Hemings
297:
293:
289:
284:
282:
278:
273:
268:
266:
262:
258:
254:
248:
245:
241:
236:
232:
224:
222:
220:
219:Jonathan Arac
215:
211:
206:
202:
198:
191:
189:
186:
182:
178:
174:
170:
164:
162:
161:close reading
158:
154:
146:
144:
142:
136:
134:
130:
126:
125:Janice Radway
122:
116:
112:
110:
109:close reading
105:
101:
97:
89:
87:
83:
82:and theory.
81:
77:
73:
63:
60:
52:
42:
38:
32:
29:This article
27:
18:
17:
1713:Concordancer
1188:
1109:Bag-of-words
1048:
1039:
1030:
1021:
1004:
1000:
994:
977:
973:
967:
958:
954:
941:
932:
928:
915:
906:
902:
889:
854:
850:
844:
809:
805:
799:
764:
760:
754:
729:
725:
707:
703:
693:
673:
666:
657:
633:
623:
603:
596:
587:
583:
573:
564:
560:
550:
525:
521:
511:
502:
498:
488:
463:
459:
449:
424:
420:
410:
401:
397:
375:
371:
361:
331:multilingual
312:
285:
269:
257:quantitative
249:
228:
210:postcolonial
197:Stanley Fish
195:
165:
150:
137:
132:
128:
117:
113:
99:
93:
84:
71:
70:
55:
46:
30:
1670:Topic model
1550:Text corpus
1396:Statistical
1263:Text mining
1104:AI-complete
872:2142/100076
827:2142/100076
782:2142/100076
212:theorists.
1824:Categories
1391:Rule-based
1273:Truecasing
1141:Stop words
1007:(4): 661.
857:(2): 363.
812:(2): 342.
466:(1): 208.
427:(1): 207.
353:References
1700:reviewing
1498:standards
1496:Types and
881:192215143
836:192215143
791:192215143
522:SubStance
480:161329715
441:161329715
281:modernism
185:computers
1616:Wikidata
1596:FrameNet
1581:BabelNet
1560:Treebank
1530:PropBank
1475:Word2vec
1440:fastText
1321:Stemming
542:54614638
341:See also
315:European
225:Examples
49:May 2020
1787:Related
1753:Chatbot
1611:WordNet
1591:DBpedia
1465:Seq2seq
1209:Parsing
1124:Trigram
296:slavery
277:realism
253:fiction
235:corpora
90:History
35:Please
1760:(c.f.
1418:models
1406:Neural
1119:Bigram
1114:n-gram
879:
834:
789:
744:
681:
611:
567:: 103.
540:
478:
439:
319:corpus
272:corpus
1809:spaCy
1454:large
1445:GloVe
961:: 45.
951:(PDF)
925:(PDF)
899:(PDF)
877:S2CID
832:S2CID
787:S2CID
742:JSTOR
710:: 44.
538:S2CID
528:(3).
476:S2CID
437:S2CID
404:: 55.
263:into
1574:Data
1425:BERT
935:: 2.
679:ISBN
609:ISBN
590:(2).
505:(2).
290:and
155:and
1606:UBY
1009:doi
982:doi
867:hdl
859:doi
851:ELH
822:hdl
814:doi
806:ELH
777:hdl
769:doi
761:ELH
734:doi
530:doi
468:doi
429:doi
279:to
175:to
39:to
1826::
1047:.
1029:.
1005:85
1003:.
978:85
976:.
957:.
953:.
931:.
927:.
905:.
901:.
875:.
865:.
855:85
853:.
830:.
820:.
810:85
808:.
785:.
775:.
765:85
763:.
740:.
730:36
728:.
716:^
708:16
706:.
702:.
656:.
642:^
632:.
588:11
586:.
582:.
565:84
563:.
559:.
536:.
526:46
524:.
520:.
503:11
501:.
497:.
474:.
464:61
462:.
458:.
435:.
425:61
423:.
419:.
400:.
396:.
384:^
374:.
370:.
337:.
302:,
283:.
163:.
143:.
1764:)
1487:,
1456:)
1452:(
1082:e
1075:t
1068:v
1051:.
1033:.
1015:.
1011::
988:.
984::
959:4
933:4
909:.
907:4
883:.
869::
861::
838:.
824::
816::
793:.
779::
771::
748:.
736::
687:.
660:.
636:.
617:.
544:.
532::
482:.
470::
443:.
431::
402:1
378:.
376:1
62:)
56:(
51:)
47:(
33:.
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.