82:
921:
In interlinear morphological glosses, various forms of punctuation separate the glosses. Typically, the words are aligned with their glosses; within words, a hyphen is used when a boundary is marked in both the text and its gloss, a period when a boundary appears in only one. That is, there should be
1281:
In the manual correction phase, the database creators manually corrected the boundaries of the interlinear gloss instances discovered by the sequence-labelling method in Step 2 of the automatic construction phase. The creators then verified the language names and language codes in a second and third
1530:
Researchers have used interlinear glosses is to obtain the morphological paradigms of the object language (i.e., the language being glossed). To automatically create morphological paradigms from interlinear glosses, researchers have created tables for every stem in the gloss and a (possibly empty)
1276:
Third, each interlinear gloss instance was assigned a language name (e.g., Tagalog) and an ISO 693-3 language ID. Language names and IDs were automatically assigned to interlinear glosses using
Coreference Resolution models from Natural Language Processing, where the interlinear gloss instance was
1264:
The Online
Database of Interlinear Text (ODIN) is a database of over 200,000 instances of interlinear glosses for more than 1,500 languages extracted from scholarly linguistic research. The database was constructed in two phases: automatic construction followed by manual correction. The automatic
275:
More modern 19th- and 20th-century approaches took to glossing vertically, aligning the same sort of word-by-word content in such a way that the metalanguage terms were placed vertically below the source language terms. In this style, the given example might be rendered thus (here
English gloss):
1269:
First, search engines (e.g., Google, Bing) were queried to retrieve scholarly documents that were likely to contain interlinear glosses. The queries comprised terms relevant to linguistic research such as grammatical morphemes (e.g., "NOM", short for nominative; "3SG", short for 3rd person
859:. if the morpheme-by-morpheme gloss (middle line) contains an element that does not correspond to an overt element in the example, a standard strategy is to include an overt "ø" in the object-language text, which is separated by a hyphen like an overt element would be:
103:
Interlinear glosses have been used for a variety of purposes over a long period of time. One common usage has been to annotate bilingual textbooks for language education. This sort of interlinearization serves to help make the meaning of a
922:
the same number of words separated with spaces in the text and its gloss, as well as the same number of hyphenated morphemes within a word and its gloss. This is the basic system, and can be applied universally. For example:
700:. At the sub-word level, segmentable morphemes are separated by hyphens, both in the example and in the gloss. There should be the same number of hyphens in the example and in the gloss, as shown in the following example:
694:. According to the Leipzig Glossing Rules, it is standard to left-align the words in the object language with the corresponding words in the metalanguage; this alignment can be seen between lines (1-3) and line (4).
264:
This "inline" style allows examples to be included within the flow of text, and for the word order of the target language to be written in an order which approximates the target language syntax. (In the gloss here,
271:
is reordered from the corresponding source order to approximate German syntax more naturally.) Even so, this approach requires the readers to "re-align" the correspondences between source and target forms.
1486:
Given the morpheme segmented line (first line above) and the free translation line (third line above), the task is to produce the middle glossed line comprising stem translations (e.g.,
445:
This approach is denser and also requires effort to read, but it is less reliant on the grammatical structure of the metalanguage for expressing the semantics of the target forms.
513:
a free translation, which may be placed in a separate paragraph or on the facing page if the structures of the languages are too different for it to follow the text line by line.
1012:, when a single word in the source language happens to correspond to a phrase in the glossing language, though a period would still be used for other situations, such as Greek
1273:
Second, each line in an extracted document was tagged for whether it was a line belonging to an interlinear gloss or not using sequence-labeling methods from
Machine Learning.
111:
Such annotations have occasionally been expressed not through interlinear layout, but rather through enumeration of words in the object and meta language. One such example is
365:
Finally, modern linguists have adopted the practice of using abbreviated grammatical category labels. A 2008 publication which repeats this example labels it as follows:
460:
Though there is no formal specification for the IGT format, the
Leipzig Glossing Rules are a set of guidelines that aim to standardize the format as much as possible.
1753:
Lehmann, Christian (2004-01-23). "Directions for interlinear morphemic translations". In Geert Booij; Christian
Lehmann; Joachim Mugdan; Stavros Skopeteas (eds.).
1502:). Sequence prediction models from Natural Language Processing have been used to perform this task. Two factors contribute to the difficulty of this task:
69:
and its translation, and the structure of the original language. In its simplest form, an interlinear gloss is a literal, word-for-word translation of the
2173:
1407:
Natural
Language Processing models leveraging interlinear gloss resources, such as the Online Database of Interlinear Text, have been developed.
2134:
Proceedings of the 2012 Conference of the North
American Chapter of the Association for Computational Linguistics: Human Language Technologies
2097:
Proceedings of the 2015 Conference of the North
American Chapter of the Association for Computational Linguistics: Human Language Technologies
823:; a list of standard abbreviations for grammatical categories that are widely used in linguistics can be found in the Leipzig Glossing Rules.
1785:
41:(series of brief explanations, such as definitions or pronunciations) placed between lines, such as between a line of original text and its
463:
An interlinear text for linguistics will commonly consist of some or all of the following, usually in this order, from top to bottom:
1718:, often displayed as interlinear glosses under the tagged words, sometimes at the same time as an interlinear word-by-word translation
1694:
grammatical category). A statistical machine learning model for morphological inflection can be used to fill in the missing entries.
1727:
889:
is treated similarly to affixation but with a tilde (instead of the standard hyphen) that connects the copied element to the stem:
2167:
2155:
2161:
1277:
tagged with the language name (and ID) that appears in the scholarly document the interlinear gloss instance was extracted from.
20:
1286:
The language distribution of interlinear gloss instances in Online
Database of Interlinear Text after phase 1 and (phase 2)
2181:. A forum for recommendations on the Interlinar Morphemic Glossing of ancient languages as attested in ancient manuscripts.
829:. When a single object-language element corresponds to several metalanguage elements, they are separated by periods. E.g.,
2233:
2050:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
1027:
may be separated with a double hyphen (or, for ease of typing, an equal sign) rather than a hyphen. A French example:
1415:
Natural Language Processing systems, for example, have been developed to automatically produce interlinear glosses.:
49:. When glossed, each line of the original text acquires one or more corresponding lines of transcription known as an
1757:. Handbücher der Sprach- und Kommunikationswissenschaft. Vol. 2. Berlin: W. de Gruyter. pp. 1834–1857.
1866:
Xia, Fei; Lewis, William; Wayne, Michael; Slayden, Glenn; Georgi, Ryan; Crowgey, Joshua; Bender, Emily (2016).
449:
1531:
slot for every grammatical category (e.g., ERG) in the gloss. For instance, given the glossed sentence below:
815:) are inflectional affixes representing future tense and negation. These inflectional affixes are glossed as
1972:. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP): 5251–5262.
81:
2197:
Listing of older interlinear and construed texts, mostly from Latin or Ancient Greek and mostly to English
1715:
767:
2201:
1148:
1091:
2228:
2223:
2206:
112:
108:
explicit without attempting to formally model the structural characteristics of the source language.
898:
1850:
From Aari to Zulu: massively multilingual creation of language tools using interlinear glossed tex
528:
2184:
2110:
2073:
2053:
2026:
1983:
1937:
1887:
1551:
38:
1248:
A few other conventions which are sometimes seen are illustrated in the Leipzig Glossing Rules.
505:-by-morpheme gloss, where morphemes within a word are separated by hyphens or other punctuation,
1777:
1867:
1781:
1576:
967:
116:
1918:
Xingyuan, Zhao; Satoru, Ozaki; Anastasopoulos, Antonios; Neubig, Graham; Levin, Lori (2020).
2150:
2100:
2063:
2016:
1973:
1927:
1926:. Proceedings of the 28th International Conference on Computational Linguistics: 5397–5408.
1879:
1811:
1769:
1513:
Some words in the morpheme segmented line have multiple correspondences in the gloss (e.g.,
551:
518:
92:
950:
2194:
1615:
1594:
1588:
1563:
1557:
1181:
951:
869:
96:
1510:
is the last word in the translation but the second word in the morpheme segmented line).
1506:
The translation is not necessarily in alignment with the morpheme segmented line (e.g.,
1932:
1709:
1682:
would be filled (since it was observed in the interlinear gloss data) but the slot for
495:
2013:
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
1978:
1920:"Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations"
1812:"The Leipzig Glossing Rules. Conventions for Interlinear Morpheme by Morpheme Glosses"
2217:
2077:
2068:
1987:
1941:
1770:
1730:, nineteenth-century composer and promoter of interlinear texts for language learning
1154:
1108:
1069:
935:
724:
2151:
The Leipzig Glossing Rules: Conventions for interlinear morpheme-by-morpheme glosses
2114:
2030:
2015:. Brussels, Belgium: Association for Computational Linguistics. pp. 2883–2889.
1256:
Efforts have been undertaken to digitize IGT for hundreds of the world's languages.
1891:
761:
718:
2178:
1964:
Moeller, Sarah; Liu, Ling; Yang, Changbing; Kann, Katharina; Hulden, Mans (2020).
2099:. Denver, Colorado: Association for Computational Linguistics. pp. 922–931.
1919:
1712:– a gloss sometimes used with Chinese or Japanese to show the pronunciation
544:
537:
468:
105:
70:
66:
42:
26:
1965:
16:
Explanatory matter inserted between a line of original text and its translation
2045:
1883:
1733:
1582:
839:
1065:
543:
3. a gloss showing the underlying tones in citation form (before undergoing
65:
for short. Such glosses help the reader follow the relationship between the
2021:
1686:
would be empty (assuming that no other interlinear gloss instance contains
2105:
2189:
2052:. Online: Association for Computational Linguistics. pp. 1901–1907.
1868:"Enriching a massively multilingual database of interlinear glossed text"
1721:
502:
488:
46:
30:
972:
409:
381:
373:
2136:. Montréal, Canada: Association for Computational Linguistics: 396–406.
2092:
2008:
362:
Here word ordering is determined by the syntax of the object language.
1810:
Bickel, Balthasar; Bernard Comrie; Martin Haspelmath (February 2008).
1494:) and the grammatical category labels corresponding to affixes (e.g.,
2130:"Leveraging supplemental representations for sequential transduction"
1755:
Morphologie. Ein internationales Handbuch zur Flexion und Wortbildung
1703:
1024:
897:
2185:
Online Interlinear of Biblical Greek Scriptures (New Testament) text
2129:
1772:
Language typology and language universals: an international handbook
2058:
1575:
1470:
966:
766:
413:
389:
2009:"An Encoder-Decoder Approach to the Paradigm Cell Filling Problem"
1174:
1061:
80:
1614:
1593:
1587:
1562:
1556:
1076:
sulat, susulat, sumulat, sumusulat (verbal declensions) (Tagalog)
868:
1724:, often displayed as a gloss or annotation to the original text.
1540:
1023:
However, sometimes finer distinctions may be made. For example,
1455:
1447:
1424:
1216:
1208:
1197:
934:
723:
1550:
1451:
1443:
760:
717:
2093:"Inflection Generation as Discriminative String Transduction"
1212:
1193:
385:
1581:
838:
399:
2091:
Nicolai, Garrett; Cherry, Colin; Kondrak, Grzegorz (2015).
1526:
Automatic discovery of morphological structure from glosses
1459:
2046:"Applying the Transformer to Character-level Transduction"
1068:, transfixes, etc.) may be set off by angle brackets, and
377:
1706:– Japanese tradition of glossing Classical Chinese texts
1265:
construction stage itself was completed in three steps:
1180:
Morphemes which cannot be easily separated out, such as
1184:, may be marked with a backslash rather than a period:
799:) is translated into the corresponding English lexeme (
484:
a conventional transliteration into the Latin alphabet,
448:
In computing, special text markers are provided in the
521:
clause has been transcribed with five lines of text:
452:
to indicate the start and end of interlinear glosses.
1005:
An underscore may be used instead of a period, as in
681:
goa2 iau2-boe7 koat4-teng7 tang1-si5 boeh4 tng2-khi3.
675:
goa1 iau1-boe3 koat2-teng3 tang7-si5 boeh2 tng1-khi3.
526:
439:
1SG.SUBJ-3SG.OBJ-mach-APPL DET 1SG.POSS-Sohn ein Haus
1625:
evening-INS 1.SG.NOM run-PFV.PST.SG.FEM in store.ACC
1966:"IG2P: From Interlinear Glossed Texts to Paradigms"
1913:
1911:
1909:
1907:
1403:
Automatic processing of interlinear gloss instances
2044:Wu, Shijie; Cotterell, Ryan; Hulden, Mans (2021).
687:(5.) "I have not yet decided when I shall return."
1959:
1957:
1816:Dept. of Linguistics – Resources – Glossing Rules
777:now they-OBL-GEN farm forever behind stay-FUT-NEG
1861:
1859:
774:Gila abur-u-n ferma hamišaluǧ güǧüna amuqʼ-da-č
2190:ODIN - The Online Database of INterlinear text
1836:A Basic Vocabulary for a Beginner in Taiwanese
1805:
1803:
1801:
1799:
1797:
1479:you-GEN camel we.OBL-ERG.1.PL-steal-PRT be.NEG
780:'Now their farm will not stay behind forever.'
810:
804:
794:
788:
8:
2128:Bhargava, Aditya; Kondrak, Grzegorz (2012).
266:
86:
2168:Towards a General Model of Interlinear Text
2007:Silfverberg, Miikka; Hulden, Mans (2018).
669:goá iáu-boē koat-tēng tang-sî boeh tńg-khì
352:ni- c- chihui -lia in no- piltzin ce calli
252:ni- c- chihui -lia in no- piltzin ce calli
2104:
2067:
2057:
2020:
1977:
1931:
1646:
1284:
998:room-from speed-with go_out-perfective-I
2202:"The New Old Way of Learning Languages"
1745:
1632:There would be a paradigm for the stem
1072:with tildes, rather than with hyphens:
684:(4.) I not-yet decide when want return.
258:ich mache es für der mein Sohn ein Haus
436:ni-c-chihui-lia in no-piltzin ce calli
7:
2179:Glossing Ancient Languages and Texts
1628:'In the evening I ran to the store.'
1476:mi-s ħumukuli elu-ab-ok'ek'-asi anu
1260:Online Database of Interlinear Text
1060:Affixes which cause discontinuity (
698:Morpheme-by-morpheme correspondence
550:4. a morpheme-by-morpheme gloss in
355:I it make for to-the my son a house
2156:Interlinear Glossed Text Standards
1282:pass over the data, respectively.
1236:'to our fathers' (the singular of
803:) while the inflectional affixes (
14:
1872:Language Resources and Evaluation
995:room-ABL speed-COM go.out-PFV-1sg
1933:10.18653/v1/2020.coling-main.471
1852:(PhD). University of Washington.
1838:by Ko Chek Hoan and Tan Pang Tin
1622:Vecher-om ya pobeja-la v magazin
88:Toussaint–Langenscheidt Spanisch
2195:Latinum Interlinear Method page
2162:Interlinear Glossed Text Levels
1979:10.18653/v1/2020.emnlp-main.424
2069:10.18653/v1/2021.eacl-main.163
1648:(Partial) paradigm for pobeja
21:List of glossing abbreviations
1:
2174:Interlinear Morphemic Glosses
1776:. Walter de Gruyter. p.
1482:'We didn't steal your camel.'
517:As an example, the following
1311:interlinear gloss instances
1305:interlinear gloss instances
811:
805:
795:
789:
755:
747:
739:
731:
712:
704:
658:
655:
652:
644:
641:
638:
630:
627:
624:
616:
613:
610:
602:
599:
596:
588:
585:
582:
572:
567:
562:
527:
1768:Haspelmath, Martin (2008).
1291:Range of interlinear gloss
1252:Interlinear gloss resources
1233:our-DAT.PL father\PL-DAT.PL
827:One-to-many correspondences
785:Grammatical category labels
2252:
1001:'I left the room quickly.'
557:5. an English translation:
18:
1884:10.1007/s10579-015-9325-4
358:"I made my son a house."
55:interlinear glossed text
1609:
1601:
1570:
1546:
1535:
992:oda-dan hız-lı çık-tı-m
893:
428:
420:
405:
395:
369:
344:
336:
328:
320:
312:
304:
296:
288:
280:
238:
224:
210:
196:
182:
168:
154:
140:
126:
61:) – an
1716:Part-of-speech tagging
1465:
1438:
1430:
1419:
1117:
1098:
1087:
1079:
1037:
1034:
964:
961:
948:
945:
932:
929:
863:
833:
692:Word-by-word alignment
540:for the surface tones,
450:Specials Unicode block
431:
423:
408:
398:
372:
267:
244:
230:
216:
202:
188:
174:
160:
146:
132:
100:
87:
1848:Georgi, Ryan (2016).
1177:for other examples.)
84:
2207:The American Scholar
2022:10.18653/v1/D18-1315
926:Odadan hızlı çıktım.
113:Wilhelm von Humboldt
85:Interlinear text in
2234:Reordered languages
2106:10.3115/v1/N15-1093
1649:
1287:
1167:contemplative~write
1132:contemplative~write
977:go_out-perfective-I
1690:inflected for the
1647:
1411:Automatic glossing
1393:189,244 (157,114)
1285:
1149:contemplative mood
1142:⟨um⟩
1138:⟨um⟩
1120:⟨um⟩
1101:⟨um⟩
1092:contemplative mood
857:Non-overt elements
501:a word-by-word or
101:
1787:978-3-11-011423-2
1676:
1675:
1439:elu-ab-ok'ek'-asi
1400:
1399:
1020:'to the houses'.
536:2. a gloss using
255:1 3 2 4 5 6 7 8 9
249:1 2 3 4 5 6 7 8 9
117:Classical Nahuatl
115:'s annotation of
35:interlinear gloss
2241:
2138:
2137:
2125:
2119:
2118:
2108:
2088:
2082:
2081:
2071:
2061:
2041:
2035:
2034:
2024:
2004:
1998:
1997:
1995:
1994:
1981:
1961:
1952:
1951:
1949:
1948:
1935:
1915:
1902:
1901:
1899:
1898:
1863:
1854:
1853:
1845:
1839:
1832:
1826:
1825:
1823:
1822:
1807:
1792:
1791:
1775:
1765:
1759:
1758:
1750:
1669:PFV.PST.SG.MASC
1650:
1618:
1597:
1591:
1585:
1579:
1566:
1560:
1554:
1542:
1472:
1461:
1457:
1453:
1449:
1445:
1426:
1365:12,822 (15,560)
1351:40,260 (46,420)
1337:97,158 (81,218)
1323:36,691 (10,814)
1288:
1218:
1214:
1210:
1199:
1195:
1166:
1164:
1158:
1143:
1139:
1136:sulat su~sulat s
1131:
1129:
1121:
1112:
1102:
1019:
1010:
974:
970:
954:
938:
901:
872:
842:
814:
808:
798:
792:
770:
764:
727:
721:
680:
674:
668:
575:
570:
565:
533:transliteration,
532:
525:1. the standard
519:Taiwanese Minnan
498:transliteration,
415:
411:
401:
391:
387:
383:
379:
375:
270:
93:Spanish-language
90:
51:interlinear text
2251:
2250:
2244:
2243:
2242:
2240:
2239:
2238:
2214:
2213:
2147:
2142:
2141:
2127:
2126:
2122:
2090:
2089:
2085:
2043:
2042:
2038:
2006:
2005:
2001:
1992:
1990:
1963:
1962:
1955:
1946:
1944:
1917:
1916:
1905:
1896:
1894:
1865:
1864:
1857:
1847:
1846:
1842:
1833:
1829:
1820:
1818:
1809:
1808:
1795:
1788:
1767:
1766:
1762:
1752:
1751:
1747:
1742:
1700:
1692:PFV.PST.SG.MASC
1684:PFV.PST.SG.MASC
1661:PFV.PST.SG.FEM
1642:PFV.PST.SG.MASC
1636:with slots for
1630:
1620:
1607:
1599:
1568:
1544:
1528:
1484:
1474:
1463:
1436:
1428:
1413:
1405:
1262:
1254:
1246:
1230:unser-n Väter-n
1228:
1220:
1201:
1171:
1165:trigger⟩
1162:
1160:
1152:
1141:
1137:
1134:
1127:
1125:
1119:
1115:
1106:
1100:
1096:
1085:
1077:
1058:
1048:
1040:
1032:
1017:
1008:
1003:
990:
979:
959:
943:
927:
919:
914:
904:
884:
874:
854:
844:
782:
772:
753:
745:
737:
729:
710:
689:
678:
672:
666:
664:
650:
636:
622:
608:
594:
580:
573:
568:
563:
458:
443:
434:
426:
418:
403:
393:
370:ni-c-chihui-lia
360:
350:
342:
334:
326:
318:
310:
302:
294:
286:
262:
247:
233:
219:
205:
191:
177:
163:
149:
135:
79:
23:
17:
12:
11:
5:
2249:
2248:
2245:
2237:
2236:
2231:
2226:
2216:
2215:
2212:
2211:
2210:, Autumn 2008.
2198:
2192:
2187:
2182:
2176:
2171:
2165:
2159:
2153:
2146:
2145:External links
2143:
2140:
2139:
2120:
2083:
2036:
1999:
1953:
1903:
1878:(2): 321–349.
1855:
1840:
1827:
1793:
1786:
1760:
1744:
1743:
1741:
1738:
1737:
1736:
1731:
1728:James Hamilton
1725:
1719:
1713:
1707:
1699:
1696:
1680:PFV.PST.SG.FEM
1674:
1673:
1670:
1666:
1665:
1662:
1658:
1657:
1654:
1638:PFV.PST.SG.FEM
1608:
1600:
1569:
1545:
1534:
1533:
1527:
1524:
1523:
1522:
1511:
1464:
1437:
1429:
1418:
1417:
1412:
1409:
1404:
1401:
1398:
1397:
1394:
1391:
1390:1,326 (1,493)
1388:
1384:
1383:
1380:
1379:2,313 (3,012)
1377:
1374:
1370:
1369:
1366:
1363:
1360:
1356:
1355:
1354:21.27 (29.55)
1352:
1349:
1346:
1342:
1341:
1340:51.34 (51.69)
1338:
1335:
1332:
1328:
1327:
1324:
1321:
1318:
1314:
1313:
1307:
1301:
1295:
1279:
1278:
1274:
1271:
1261:
1258:
1253:
1250:
1221:
1202:
1187:
1186:
1116:
1097:
1086:
1078:
1075:
1074:
1041:
1033:
1030:
1029:
980:
960:
944:
928:
925:
924:
918:
915:
892:
891:
862:
861:
832:
831:
754:
746:
738:
730:
711:
703:
702:
651:
637:
623:
609:
595:
581:
561:
560:
559:
558:
555:
548:
541:
534:
515:
514:
507:
506:
499:
496:morphophonemic
492:
491:transcription,
485:
482:
471:(typically in
457:
454:
427:
419:
404:
394:
368:
367:
343:
335:
327:
319:
311:
303:
295:
287:
279:
278:
234:
220:
206:
192:
178:
164:
150:
136:
122:
121:
99:speakers, 1910
78:
75:
15:
13:
10:
9:
6:
4:
3:
2:
2247:
2246:
2235:
2232:
2230:
2227:
2225:
2222:
2221:
2219:
2209:
2208:
2203:
2200:Ernest Blum,
2199:
2196:
2193:
2191:
2188:
2186:
2183:
2180:
2177:
2175:
2172:
2169:
2166:
2163:
2160:
2157:
2154:
2152:
2149:
2148:
2144:
2135:
2131:
2124:
2121:
2116:
2112:
2107:
2102:
2098:
2094:
2087:
2084:
2079:
2075:
2070:
2065:
2060:
2055:
2051:
2047:
2040:
2037:
2032:
2028:
2023:
2018:
2014:
2010:
2003:
2000:
1989:
1985:
1980:
1975:
1971:
1967:
1960:
1958:
1954:
1943:
1939:
1934:
1929:
1925:
1921:
1914:
1912:
1910:
1908:
1904:
1893:
1889:
1885:
1881:
1877:
1873:
1869:
1862:
1860:
1856:
1851:
1844:
1841:
1837:
1834:Example from
1831:
1828:
1817:
1813:
1806:
1804:
1802:
1800:
1798:
1794:
1789:
1783:
1779:
1774:
1773:
1764:
1761:
1756:
1749:
1746:
1739:
1735:
1732:
1729:
1726:
1723:
1720:
1717:
1714:
1711:
1708:
1705:
1702:
1701:
1697:
1695:
1693:
1689:
1685:
1681:
1678:The slot for
1671:
1668:
1667:
1663:
1660:
1659:
1655:
1652:
1651:
1645:
1643:
1639:
1635:
1629:
1626:
1623:
1619:
1617:
1611:
1606:
1603:
1598:
1596:
1590:
1584:
1578:
1572:
1567:
1565:
1559:
1553:
1548:
1543:
1537:
1532:
1525:
1520:
1516:
1512:
1509:
1505:
1504:
1503:
1501:
1497:
1493:
1489:
1483:
1480:
1477:
1473:
1467:
1462:
1440:
1435:
1432:
1427:
1421:
1416:
1410:
1408:
1402:
1395:
1392:
1389:
1386:
1385:
1381:
1378:
1375:
1372:
1371:
1367:
1364:
1361:
1358:
1357:
1353:
1350:
1347:
1344:
1343:
1339:
1336:
1333:
1330:
1329:
1326:19.39 (6.88)
1325:
1322:
1319:
1316:
1315:
1312:
1308:
1306:
1302:
1300:
1296:
1294:
1290:
1289:
1283:
1275:
1272:
1268:
1267:
1266:
1259:
1257:
1251:
1249:
1245:
1243:
1240:'fathers' is
1239:
1234:
1231:
1227:
1224:
1219:
1205:
1200:
1190:
1185:
1183:
1178:
1176:
1170:
1168:
1161:⟨agent
1157:.past⟩
1156:
1155:agent trigger
1150:
1145:
1133:
1123:
1114:
1110:
1109:agent trigger
1104:
1095:
1093:
1089:
1084:
1081:
1073:
1071:
1070:reduplication
1067:
1063:
1057:
1056:'I love you.'
1054:
1051:
1047:
1044:
1039:
1036:
1028:
1026:
1021:
1015:
1011:
1002:
999:
996:
993:
989:
986:
983:
978:
975:
969:
963:
958:
955:
953:
947:
942:
939:
937:
931:
923:
916:
913:
910:
907:
903:
900:
895:
890:
888:
887:Reduplication
883:
880:
877:
873:
871:
865:
860:
858:
853:
852:'to come out'
850:
847:
843:
841:
835:
830:
828:
824:
822:
818:
813:
807:
802:
797:
791:
786:
781:
778:
775:
771:
769:
763:
757:
752:
749:
744:
741:
736:
733:
728:
726:
720:
714:
709:
706:
701:
699:
695:
693:
688:
685:
682:
676:
670:
663:
660:
657:
654:
649:
646:
643:
640:
635:
632:
629:
626:
621:
618:
615:
612:
607:
604:
601:
598:
593:
590:
587:
584:
579:
576:
571:
566:
556:
553:
549:
546:
542:
539:
535:
531:
530:
524:
523:
522:
520:
512:
511:
510:
509:and finally
504:
500:
497:
493:
490:
486:
483:
480:
479:
474:
470:
467:The original
466:
465:
464:
461:
455:
453:
451:
446:
442:
440:
437:
433:
430:
425:
422:
417:
407:
402:
397:
392:
371:
366:
363:
359:
356:
353:
349:
346:
341:
338:
333:
330:
325:
322:
317:
314:
309:
306:
301:
298:
293:
290:
285:
282:
277:
273:
269:
261:
259:
256:
253:
250:
246:
243:
240:
237:
232:
229:
226:
223:
218:
215:
212:
209:
204:
201:
198:
195:
190:
187:
184:
181:
176:
173:
170:
167:
162:
159:
156:
153:
148:
145:
142:
139:
134:
131:
128:
125:
120:
118:
114:
109:
107:
98:
95:textbook for
94:
89:
83:
76:
74:
72:
68:
64:
60:
56:
52:
48:
45:into another
44:
40:
36:
32:
28:
22:
2205:
2133:
2123:
2096:
2086:
2049:
2039:
2012:
2002:
1991:. Retrieved
1969:
1945:. Retrieved
1923:
1895:. Retrieved
1875:
1871:
1849:
1843:
1835:
1830:
1819:. Retrieved
1815:
1771:
1763:
1754:
1748:
1691:
1687:
1683:
1679:
1677:
1641:
1637:
1633:
1631:
1627:
1624:
1621:
1612:
1604:
1573:
1549:
1538:
1529:
1518:
1514:
1507:
1499:
1495:
1491:
1487:
1485:
1481:
1478:
1475:
1468:
1441:
1433:
1422:
1414:
1406:
1382:1.22 (1.92)
1368:6.78 (9.96)
1310:
1304:
1298:
1292:
1280:
1263:
1255:
1247:
1241:
1237:
1235:
1232:
1229:
1225:
1222:
1206:
1203:
1191:
1188:
1179:
1172:
1169:
1146:
1135:
1124:
1105:
1090:
1082:
1059:
1055:
1052:
1049:
1045:
1042:
1022:
1013:
1006:
1004:
1000:
997:
994:
991:
987:
984:
981:
976:
956:
940:
920:
911:
908:
905:
896:
886:
885:
881:
878:
875:
866:
856:
855:
851:
849:come.out-INF
848:
845:
836:
826:
825:
820:
816:
800:
793:, the stem (
784:
783:
779:
776:
773:
758:
750:
742:
734:
715:
707:
697:
696:
691:
690:
686:
683:
677:
671:
665:
661:
647:
633:
619:
605:
591:
577:
538:tone numbers
516:
508:
477:
476:
472:
462:
459:
447:
444:
441:
438:
435:
364:
361:
357:
354:
351:
347:
339:
331:
323:
315:
307:
299:
291:
283:
274:
263:
260:
257:
254:
251:
248:
241:
235:
227:
221:
213:
207:
199:
193:
185:
179:
171:
165:
157:
151:
143:
137:
129:
123:
110:
102:
62:
58:
54:
50:
34:
24:
2229:Linguistics
2224:Translation
1656:inflection
1317:>10,000
1309:Percent of
1066:circumfixes
917:Punctuation
912:'is buying'
545:tone sandhi
478:bold italic
469:orthography
106:source text
71:source text
67:source text
63:interlinear
43:translation
27:linguistics
2218:Categories
2059:2005.10213
1993:2021-12-15
1947:2021-12-15
1897:2021-12-15
1821:2010-06-30
1740:References
1734:Metaphrase
1664:pobeja-la
1396:100 (100)
1376:838 (862)
1362:326 (460)
1348:122 (139)
1331:1000-9999
1303:Number of
1299:languages
1297:Number of
1293:instances
1270:singular).
1053:I⹀you⹀love
1050:je⹀te⹀aime
1038:I⹀you⹀love
1035:je⹀te⹀aime
1031:Je t'aime.
1018:FEM.PL.DAT
957:speed-with
790:amuqʼ-da-č
756:amuqʼ-da-č
529:pe̍h-ōe-jī
406:no-piltzin
19:See also:
2078:218718982
1988:226262296
1942:227231816
1722:Treebanks
1710:Ruby text
1571:pobeja-la
1536:Vecher-om
941:room-from
837:come.out-
740:hamišaluǧ
617:koat-teng
614:koat-teng
611:koat-tēng
456:Structure
2170:(E-MELD)
2164:(E-MELD)
2158:(E-MELD)
2115:14929030
2031:53082616
1698:See also
1539:evening-
1500:ERG.1.PL
1431:ħumukuli
1345:100-999
1334:37 (31)
1223:(German)
1153:⟨
1130:trigger⟩
1088:su~sulat
1043:(French)
962:çık-tı-m
909:IPFV~buy
713:abur-u-n
659:tng-khi.
656:tng-khi.
503:morpheme
489:phonetic
268:mache es
47:language
31:pedagogy
1892:2674996
1610:magazin
1458:-steal-
1207:father\
1204:Väter-n
1189:unser-n
1151:~write
1144:u~sulat
1122:u~sulat
1062:infixes
1025:clitics
1014:oikíais
1007:go_out-
982:Turkish
965:go.out-
930:oda-dan
906:bi~bili
894:bi~bili
879:boy-NOM
846:çık-mak
834:çık-mak
809:) and (
743:forever
662:return.
653:tńg-khì
631:tang-si
628:tang-si
625:tang-sî
606:not-yet
603:iau-boe
600:iau-boe
597:iáu-boē
552:English
329:piltzin
211:piltzin
77:History
2113:
2076:
2029:
1986:
1940:
1924:COLING
1890:
1784:
1704:Kanbun
1688:pobeja
1634:pobeja
1613:store.
1519:be.NEG
1387:Total
1359:10-99
1320:3 (1)
1226:
1182:umlaut
1163:
1159:write
1147:write
1140:ulat s
1128:
1126:⟨agent
1111:.past⟩
1094:~write
1046:
1016:house.
988:
985:
949:speed-
946:hız-lı
876:puer-ø
864:puer-ø
751:behind
748:güǧüna
620:decide
473:italic
388:-mach-
316:to-the
297:chihui
155:chihui
97:German
2111:S2CID
2074:S2CID
2054:arXiv
2027:S2CID
1984:S2CID
1970:EMNLP
1938:S2CID
1888:S2CID
1653:Slot
1508:camel
1434:camel
1242:Vater
1238:Väter
1175:affix
1173:(See
1113:write
1083:write
1080:sulat
933:room-
882:'boy'
787:. In
759:stay-
732:ferma
716:they-
554:, and
429:calli
416:-Sohn
348:house
345:calli
239:calli
147:mache
39:gloss
37:is a
33:, an
1782:ISBN
1640:and
1574:run-
1423:you-
1420:mi-s
1373:1-9
1192:our-
1103:ulat
902:~buy
899:IPFV
867:boy-
819:and
801:stay
796:amuq
735:farm
705:Gila
679:(3.)
673:(2.)
667:(1.)
648:want
645:boeh
642:boeh
639:boeh
634:when
578:(4.)
574:(3.)
569:(2.)
564:(1.)
432:Haus
414:POSS
390:APPL
378:SUBJ
305:-lia
300:make
245:Haus
217:Sohn
203:mein
169:-lia
91:, a
29:and
2101:doi
2064:doi
2017:doi
1974:doi
1928:doi
1880:doi
1778:715
1616:ACC
1595:FEM
1583:PST
1577:PFV
1564:NOM
1541:INS
1515:anu
1492:you
1471:NEG
1469:be.
1466:anu
1460:PRT
1448:ERG
1444:OBL
1442:we.
1425:GEN
1213:DAT
1194:DAT
1009:PFV
973:1sg
968:PFV
952:COM
936:ABL
870:NOM
840:INF
821:NEG
817:FUT
768:NEG
762:FUT
725:GEN
719:OBL
708:now
589:goa
586:goa
583:goá
475:or
424:ein
410:1SG
400:DET
386:OBJ
382:3SG
374:1SG
332:son
321:no-
308:for
281:ni-
231:ein
197:no-
189:der
175:für
133:ich
127:ni-
59:IGT
53:or
25:In
2220::
2204:,
2132:.
2109:.
2095:.
2072:.
2062:.
2048:.
2025:.
2011:.
1982:.
1968:.
1956:^
1936:.
1922:.
1906:^
1886:.
1876:50
1874:.
1870:.
1858:^
1814:.
1796:^
1780:.
1672:?
1644::
1605:in
1589:SG
1558:SG
1547:ya
1521:).
1488:mi
1456:PL
1217:PL
1209:PL
1198:PL
1064:,
806:da
547:),
494:a
487:a
481:),
421:ce
396:in
337:ce
324:my
313:in
292:it
289:c-
225:ce
183:in
161:es
141:c-
119::
73:.
2117:.
2103::
2080:.
2066::
2056::
2033:.
2019::
1996:.
1976::
1950:.
1930::
1900:.
1882::
1824:.
1790:.
1602:v
1592:.
1586:.
1580:.
1561:.
1555:.
1552:1
1517::
1498::
1496:a
1490::
1454:.
1452:1
1450:.
1446:-
1244:)
1215:.
1211:-
1196:.
1118:s
1107:⟨
1099:s
971:-
812:č
765:-
722:-
592:I
412:.
384:.
380:-
376:.
340:a
284:I
242:9
236:9
228:8
222:8
214:7
208:7
200:6
194:6
186:5
180:5
172:4
166:4
158:2
152:3
144:3
138:2
130:1
124:1
57:(
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.