Knowledge (XXG)

Interlinear gloss

Source 📝

93: 932:
In interlinear morphological glosses, various forms of punctuation separate the glosses. Typically, the words are aligned with their glosses; within words, a hyphen is used when a boundary is marked in both the text and its gloss, a period when a boundary appears in only one. That is, there should be
1292:
In the manual correction phase, the database creators manually corrected the boundaries of the interlinear gloss instances discovered by the sequence-labelling method in Step 2 of the automatic construction phase. The creators then verified the language names and language codes in a second and third
1541:
Researchers have used interlinear glosses is to obtain the morphological paradigms of the object language (i.e., the language being glossed). To automatically create morphological paradigms from interlinear glosses, researchers have created tables for every stem in the gloss and a (possibly empty)
1287:
Third, each interlinear gloss instance was assigned a language name (e.g., Tagalog) and an ISO 693-3 language ID. Language names and IDs were automatically assigned to interlinear glosses using Coreference Resolution models from Natural Language Processing, where the interlinear gloss instance was
1275:
The Online Database of Interlinear Text (ODIN) is a database of over 200,000 instances of interlinear glosses for more than 1,500 languages extracted from scholarly linguistic research. The database was constructed in two phases: automatic construction followed by manual correction. The automatic
286:
More modern 19th- and 20th-century approaches took to glossing vertically, aligning the same sort of word-by-word content in such a way that the metalanguage terms were placed vertically below the source language terms. In this style, the given example might be rendered thus (here English gloss):
1280:
First, search engines (e.g., Google, Bing) were queried to retrieve scholarly documents that were likely to contain interlinear glosses. The queries comprised terms relevant to linguistic research such as grammatical morphemes (e.g., "NOM", short for nominative; "3SG", short for 3rd person
870:. if the morpheme-by-morpheme gloss (middle line) contains an element that does not correspond to an overt element in the example, a standard strategy is to include an overt "ø" in the object-language text, which is separated by a hyphen like an overt element would be: 114:
Interlinear glosses have been used for a variety of purposes over a long period of time. One common usage has been to annotate bilingual textbooks for language education. This sort of interlinearization serves to help make the meaning of a
933:
the same number of words separated with spaces in the text and its gloss, as well as the same number of hyphenated morphemes within a word and its gloss. This is the basic system, and can be applied universally. For example:
711:. At the sub-word level, segmentable morphemes are separated by hyphens, both in the example and in the gloss. There should be the same number of hyphens in the example and in the gloss, as shown in the following example: 705:. According to the Leipzig Glossing Rules, it is standard to left-align the words in the object language with the corresponding words in the metalanguage; this alignment can be seen between lines (1-3) and line (4). 275:
This "inline" style allows examples to be included within the flow of text, and for the word order of the target language to be written in an order which approximates the target language syntax. (In the gloss here,
282:
is reordered from the corresponding source order to approximate German syntax more naturally.) Even so, this approach requires the readers to "re-align" the correspondences between source and target forms.
1497:
Given the morpheme segmented line (first line above) and the free translation line (third line above), the task is to produce the middle glossed line comprising stem translations (e.g.,
456:
This approach is denser and also requires effort to read, but it is less reliant on the grammatical structure of the metalanguage for expressing the semantics of the target forms.
524:
a free translation, which may be placed in a separate paragraph or on the facing page if the structures of the languages are too different for it to follow the text line by line.
1023:, when a single word in the source language happens to correspond to a phrase in the glossing language, though a period would still be used for other situations, such as Greek 1284:
Second, each line in an extracted document was tagged for whether it was a line belonging to an interlinear gloss or not using sequence-labeling methods from Machine Learning.
122:
Such annotations have occasionally been expressed not through interlinear layout, but rather through enumeration of words in the object and meta language. One such example is
376:
Finally, modern linguists have adopted the practice of using abbreviated grammatical category labels. A 2008 publication which repeats this example labels it as follows:
471:
Though there is no formal specification for the IGT format, the Leipzig Glossing Rules are a set of guidelines that aim to standardize the format as much as possible.
1764:
Lehmann, Christian (2004-01-23). "Directions for interlinear morphemic translations". In Geert Booij; Christian Lehmann; Joachim Mugdan; Stavros Skopeteas (eds.).
1513:). Sequence prediction models from Natural Language Processing have been used to perform this task. Two factors contribute to the difficulty of this task: 80:
and its translation, and the structure of the original language. In its simplest form, an interlinear gloss is a literal, word-for-word translation of the
2184: 1418:
Natural Language Processing models leveraging interlinear gloss resources, such as the Online Database of Interlinear Text, have been developed.
2145:
Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
2108:
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
834:; a list of standard abbreviations for grammatical categories that are widely used in linguistics can be found in the Leipzig Glossing Rules. 1796: 52:(series of brief explanations, such as definitions or pronunciations) placed between lines, such as between a line of original text and its 474:
An interlinear text for linguistics will commonly consist of some or all of the following, usually in this order, from top to bottom:
1729:, often displayed as interlinear glosses under the tagged words, sometimes at the same time as an interlinear word-by-word translation 1705:
grammatical category). A statistical machine learning model for morphological inflection can be used to fill in the missing entries.
1738: 900:
is treated similarly to affixation but with a tilde (instead of the standard hyphen) that connects the copied element to the stem:
2178: 2166: 2172: 1288:
tagged with the language name (and ID) that appears in the scholarly document the interlinear gloss instance was extracted from.
31: 1297:
The language distribution of interlinear gloss instances in Online Database of Interlinear Text after phase 1 and (phase 2)
2192:. A forum for recommendations on the Interlinar Morphemic Glossing of ancient languages as attested in ancient manuscripts. 840:. When a single object-language element corresponds to several metalanguage elements, they are separated by periods. E.g., 2244: 2061:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
1038:
may be separated with a double hyphen (or, for ease of typing, an equal sign) rather than a hyphen. A French example:
1426:
Natural Language Processing systems, for example, have been developed to automatically produce interlinear glosses.:
60:. When glossed, each line of the original text acquires one or more corresponding lines of transcription known as an 1768:. Handbücher der Sprach- und Kommunikationswissenschaft. Vol. 2. Berlin: W. de Gruyter. pp. 1834–1857. 1877:
Xia, Fei; Lewis, William; Wayne, Michael; Slayden, Glenn; Georgi, Ryan; Crowgey, Joshua; Bender, Emily (2016).
460: 1542:
slot for every grammatical category (e.g., ERG) in the gloss. For instance, given the glossed sentence below:
826:) are inflectional affixes representing future tense and negation. These inflectional affixes are glossed as 1983:. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP): 5251–5262. 92: 2208:
Listing of older interlinear and construed texts, mostly from Latin or Ancient Greek and mostly to English
1726: 778: 2212: 1159: 1102: 2239: 2234: 2217: 123: 119:
explicit without attempting to formally model the structural characteristics of the source language.
909: 1861:
From Aari to Zulu: massively multilingual creation of language tools using interlinear glossed tex
539: 2195: 2121: 2084: 2064: 2037: 1994: 1948: 1898: 1562: 49: 1259:
A few other conventions which are sometimes seen are illustrated in the Leipzig Glossing Rules.
516:-by-morpheme gloss, where morphemes within a word are separated by hyphens or other punctuation, 1788: 1878: 1792: 1587: 978: 127: 1929:
Xingyuan, Zhao; Satoru, Ozaki; Anastasopoulos, Antonios; Neubig, Graham; Levin, Lori (2020).
2161: 2111: 2074: 2027: 1984: 1938: 1937:. Proceedings of the 28th International Conference on Computational Linguistics: 5397–5408. 1890: 1822: 1780: 1524:
Some words in the morpheme segmented line have multiple correspondences in the gloss (e.g.,
562: 529: 103: 961: 2205: 1626: 1605: 1599: 1574: 1568: 1192: 962: 880: 107: 1521:
is the last word in the translation but the second word in the morpheme segmented line).
1517:
The translation is not necessarily in alignment with the morpheme segmented line (e.g.,
1943: 1720: 1693:
would be filled (since it was observed in the interlinear gloss data) but the slot for
506: 2024:
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
1989: 1931:"Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations" 1823:"The Leipzig Glossing Rules. Conventions for Interlinear Morpheme by Morpheme Glosses" 2228: 2088: 2079: 1998: 1952: 1781: 1741:, nineteenth-century composer and promoter of interlinear texts for language learning 1165: 1119: 1080: 946: 735: 2162:
The Leipzig Glossing Rules: Conventions for interlinear morpheme-by-morpheme glosses
2125: 2041: 2026:. Brussels, Belgium: Association for Computational Linguistics. pp. 2883–2889. 1267:
Efforts have been undertaken to digitize IGT for hundreds of the world's languages.
17: 1902: 772: 729: 2189: 1975:
Moeller, Sarah; Liu, Ling; Yang, Changbing; Kann, Katharina; Hulden, Mans (2020).
2110:. Denver, Colorado: Association for Computational Linguistics. pp. 922–931. 1930: 1723:– a gloss sometimes used with Chinese or Japanese to show the pronunciation 555: 548: 479: 116: 81: 77: 53: 37: 1976: 27:
Explanatory matter inserted between a line of original text and its translation
2056: 1894: 1744: 1593: 850: 1076: 554:
3. a gloss showing the underlying tones in citation form (before undergoing
76:
for short. Such glosses help the reader follow the relationship between the
2032: 1697:
would be empty (assuming that no other interlinear gloss instance contains
2116: 2200: 2063:. Online: Association for Computational Linguistics. pp. 1901–1907. 1879:"Enriching a massively multilingual database of interlinear glossed text" 1732: 513: 499: 57: 41: 983: 420: 392: 384: 2147:. Montréal, Canada: Association for Computational Linguistics: 396–406. 2103: 2019: 373:
Here word ordering is determined by the syntax of the object language.
1821:
Bickel, Balthasar; Bernard Comrie; Martin Haspelmath (February 2008).
1505:) and the grammatical category labels corresponding to affixes (e.g., 2141:"Leveraging supplemental representations for sequential transduction" 1766:
Morphologie. Ein internationales Handbuch zur Flexion und Wortbildung
1714: 1035: 908: 2196:
Online Interlinear of Biblical Greek Scriptures (New Testament) text
2140: 1783:
Language typology and language universals: an international handbook
2069: 1586: 1481: 977: 777: 424: 400: 2020:"An Encoder-Decoder Approach to the Paradigm Cell Filling Problem" 1185: 1072: 91: 1625: 1604: 1598: 1573: 1567: 1087:
sulat, susulat, sumulat, sumusulat (verbal declensions) (Tagalog)
879: 1735:, often displayed as a gloss or annotation to the original text. 1551: 1034:
However, sometimes finer distinctions may be made. For example,
1466: 1458: 1435: 1227: 1219: 1208: 945: 734: 1561: 1462: 1454: 771: 728: 2104:"Inflection Generation as Discriminative String Transduction" 1223: 1204: 396: 1592: 849: 410: 2102:
Nicolai, Garrett; Cherry, Colin; Kondrak, Grzegorz (2015).
1537:
Automatic discovery of morphological structure from glosses
1470: 2057:"Applying the Transformer to Character-level Transduction" 1079:, transfixes, etc.) may be set off by angle brackets, and 388: 1717:– Japanese tradition of glossing Classical Chinese texts 1276:
construction stage itself was completed in three steps:
1191:
Morphemes which cannot be easily separated out, such as
1195:, may be marked with a backslash rather than a period: 810:) is translated into the corresponding English lexeme ( 495:
a conventional transliteration into the Latin alphabet,
459:
In computing, special text markers are provided in the
532:
clause has been transcribed with five lines of text:
463:
to indicate the start and end of interlinear glosses.
1016:
An underscore may be used instead of a period, as in
692:
goa2 iau2-boe7 koat4-teng7 tang1-si5 boeh4 tng2-khi3.
686:
goa1 iau1-boe3 koat2-teng3 tang7-si5 boeh2 tng1-khi3.
537: 450:
1SG.SUBJ-3SG.OBJ-mach-APPL DET 1SG.POSS-Sohn ein Haus
1636:
evening-INS 1.SG.NOM run-PFV.PST.SG.FEM in store.ACC
1977:"IG2P: From Interlinear Glossed Texts to Paradigms" 1924: 1922: 1920: 1918: 1414:
Automatic processing of interlinear gloss instances
2055:Wu, Shijie; Cotterell, Ryan; Hulden, Mans (2021). 698:(5.) "I have not yet decided when I shall return." 1970: 1968: 1827:Dept. of Linguistics – Resources – Glossing Rules 788:now they-OBL-GEN farm forever behind stay-FUT-NEG 1872: 1870: 785:Gila abur-u-n ferma hamišaluǧ güǧüna amuqʼ-da-č 2201:ODIN - The Online Database of INterlinear text 1847:A Basic Vocabulary for a Beginner in Taiwanese 1816: 1814: 1812: 1810: 1808: 1490:you-GEN camel we.OBL-ERG.1.PL-steal-PRT be.NEG 791:'Now their farm will not stay behind forever.' 821: 815: 805: 799: 8: 2139:Bhargava, Aditya; Kondrak, Grzegorz (2012). 277: 97: 2179:Towards a General Model of Interlinear Text 2018:Silfverberg, Miikka; Hulden, Mans (2018). 680:goá iáu-boē koat-tēng tang-sî boeh tńg-khì 363:ni- c- chihui -lia in no- piltzin ce calli 263:ni- c- chihui -lia in no- piltzin ce calli 2115: 2078: 2068: 2031: 1988: 1942: 1657: 1295: 1009:room-from speed-with go_out-perfective-I 2213:"The New Old Way of Learning Languages" 1756: 1643:There would be a paradigm for the stem 1083:with tildes, rather than with hyphens: 695:(4.) I not-yet decide when want return. 269:ich mache es für der mein Sohn ein Haus 447:ni-c-chihui-lia in no-piltzin ce calli 7: 2190:Glossing Ancient Languages and Texts 1639:'In the evening I ran to the store.' 1487:mi-s ħumukuli elu-ab-ok'ek'-asi anu 1271:Online Database of Interlinear Text 1071:Affixes which cause discontinuity ( 709:Morpheme-by-morpheme correspondence 561:4. a morpheme-by-morpheme gloss in 366:I it make for to-the my son a house 2167:Interlinear Glossed Text Standards 1293:pass over the data, respectively. 1247:'to our fathers' (the singular of 814:) while the inflectional affixes ( 25: 1883:Language Resources and Evaluation 1006:room-ABL speed-COM go.out-PFV-1sg 1944:10.18653/v1/2020.coling-main.471 1863:(PhD). University of Washington. 1849:by Ko Chek Hoan and Tan Pang Tin 1633:Vecher-om ya pobeja-la v magazin 99:Toussaint–Langenscheidt Spanisch 2206:Latinum Interlinear Method page 2173:Interlinear Glossed Text Levels 1990:10.18653/v1/2020.emnlp-main.424 2080:10.18653/v1/2021.eacl-main.163 1659:(Partial) paradigm for pobeja 32:List of glossing abbreviations 1: 2185:Interlinear Morphemic Glosses 1787:. Walter de Gruyter. p.  1493:'We didn't steal your camel.' 528:As an example, the following 1322:interlinear gloss instances 1316:interlinear gloss instances 822: 816: 806: 800: 766: 758: 750: 742: 723: 715: 669: 666: 663: 655: 652: 649: 641: 638: 635: 627: 624: 621: 613: 610: 607: 599: 596: 593: 583: 578: 573: 538: 1779:Haspelmath, Martin (2008). 1302:Range of interlinear gloss 1263:Interlinear gloss resources 1244:our-DAT.PL father\PL-DAT.PL 838:One-to-many correspondences 796:Grammatical category labels 2261: 1012:'I left the room quickly.' 568:5. an English translation: 29: 1895:10.1007/s10579-015-9325-4 369:"I made my son a house." 66:interlinear glossed text 1620: 1612: 1581: 1557: 1546: 1003:oda-dan hız-lı çık-tı-m 904: 439: 431: 416: 406: 380: 355: 347: 339: 331: 323: 315: 307: 299: 291: 249: 235: 221: 207: 193: 179: 165: 151: 137: 72:) – an 1727:Part-of-speech tagging 1476: 1449: 1441: 1430: 1128: 1109: 1098: 1090: 1048: 1045: 975: 972: 959: 956: 943: 940: 874: 844: 703:Word-by-word alignment 551:for the surface tones, 461:Specials Unicode block 442: 434: 419: 409: 383: 278: 255: 241: 227: 213: 199: 185: 171: 157: 143: 111: 98: 1859:Georgi, Ryan (2016). 1188:for other examples.) 95: 2218:The American Scholar 2033:10.18653/v1/D18-1315 937:Odadan hızlı çıktım. 124:Wilhelm von Humboldt 96:Interlinear text in 18:Interlinear glossing 2245:Reordered languages 2117:10.3115/v1/N15-1093 1660: 1298: 1178:contemplative~write 1143:contemplative~write 988:go_out-perfective-I 1701:inflected for the 1658: 1422:Automatic glossing 1404:189,244 (157,114) 1296: 1160:contemplative mood 1153:⟨um⟩ 1149:⟨um⟩ 1131:⟨um⟩ 1112:⟨um⟩ 1103:contemplative mood 868:Non-overt elements 512:a word-by-word or 112: 1798:978-3-11-011423-2 1687: 1686: 1450:elu-ab-ok'ek'-asi 1411: 1410: 1031:'to the houses'. 547:2. a gloss using 266:1 3 2 4 5 6 7 8 9 260:1 2 3 4 5 6 7 8 9 128:Classical Nahuatl 126:'s annotation of 46:interlinear gloss 16:(Redirected from 2252: 2149: 2148: 2136: 2130: 2129: 2119: 2099: 2093: 2092: 2082: 2072: 2052: 2046: 2045: 2035: 2015: 2009: 2008: 2006: 2005: 1992: 1972: 1963: 1962: 1960: 1959: 1946: 1926: 1913: 1912: 1910: 1909: 1874: 1865: 1864: 1856: 1850: 1843: 1837: 1836: 1834: 1833: 1818: 1803: 1802: 1786: 1776: 1770: 1769: 1761: 1680:PFV.PST.SG.MASC 1661: 1629: 1608: 1602: 1596: 1590: 1577: 1571: 1565: 1553: 1483: 1472: 1468: 1464: 1460: 1456: 1437: 1376:12,822 (15,560) 1362:40,260 (46,420) 1348:97,158 (81,218) 1334:36,691 (10,814) 1299: 1229: 1225: 1221: 1210: 1206: 1177: 1175: 1169: 1154: 1150: 1147:sulat su~sulat s 1142: 1140: 1132: 1123: 1113: 1030: 1021: 985: 981: 965: 949: 912: 883: 853: 825: 819: 809: 803: 781: 775: 738: 732: 691: 685: 679: 586: 581: 576: 544:transliteration, 543: 536:1. the standard 530:Taiwanese Minnan 509:transliteration, 426: 422: 412: 402: 398: 394: 390: 386: 281: 104:Spanish-language 101: 62:interlinear text 21: 2260: 2259: 2255: 2254: 2253: 2251: 2250: 2249: 2225: 2224: 2158: 2153: 2152: 2138: 2137: 2133: 2101: 2100: 2096: 2054: 2053: 2049: 2017: 2016: 2012: 2003: 2001: 1974: 1973: 1966: 1957: 1955: 1928: 1927: 1916: 1907: 1905: 1876: 1875: 1868: 1858: 1857: 1853: 1844: 1840: 1831: 1829: 1820: 1819: 1806: 1799: 1778: 1777: 1773: 1763: 1762: 1758: 1753: 1711: 1703:PFV.PST.SG.MASC 1695:PFV.PST.SG.MASC 1672:PFV.PST.SG.FEM 1653:PFV.PST.SG.MASC 1647:with slots for 1641: 1631: 1618: 1610: 1579: 1555: 1539: 1495: 1485: 1474: 1447: 1439: 1424: 1416: 1273: 1265: 1257: 1241:unser-n Väter-n 1239: 1231: 1212: 1182: 1176:trigger⟩ 1173: 1171: 1163: 1152: 1148: 1145: 1138: 1136: 1130: 1126: 1117: 1111: 1107: 1096: 1088: 1069: 1059: 1051: 1043: 1028: 1019: 1014: 1001: 990: 970: 954: 938: 930: 925: 915: 895: 885: 865: 855: 793: 783: 764: 756: 748: 740: 721: 700: 689: 683: 677: 675: 661: 647: 633: 619: 605: 591: 584: 579: 574: 469: 454: 445: 437: 429: 414: 404: 381:ni-c-chihui-lia 371: 361: 353: 345: 337: 329: 321: 313: 305: 297: 273: 258: 244: 230: 216: 202: 188: 174: 160: 146: 90: 34: 28: 23: 22: 15: 12: 11: 5: 2258: 2256: 2248: 2247: 2242: 2237: 2227: 2226: 2223: 2222: 2221:, Autumn 2008. 2209: 2203: 2198: 2193: 2187: 2182: 2176: 2170: 2164: 2157: 2156:External links 2154: 2151: 2150: 2131: 2094: 2047: 2010: 1964: 1914: 1889:(2): 321–349. 1866: 1851: 1838: 1804: 1797: 1771: 1755: 1754: 1752: 1749: 1748: 1747: 1742: 1739:James Hamilton 1736: 1730: 1724: 1718: 1710: 1707: 1691:PFV.PST.SG.FEM 1685: 1684: 1681: 1677: 1676: 1673: 1669: 1668: 1665: 1649:PFV.PST.SG.FEM 1619: 1611: 1580: 1556: 1545: 1544: 1538: 1535: 1534: 1533: 1522: 1475: 1448: 1440: 1429: 1428: 1423: 1420: 1415: 1412: 1409: 1408: 1405: 1402: 1401:1,326 (1,493) 1399: 1395: 1394: 1391: 1390:2,313 (3,012) 1388: 1385: 1381: 1380: 1377: 1374: 1371: 1367: 1366: 1365:21.27 (29.55) 1363: 1360: 1357: 1353: 1352: 1351:51.34 (51.69) 1349: 1346: 1343: 1339: 1338: 1335: 1332: 1329: 1325: 1324: 1318: 1312: 1306: 1290: 1289: 1285: 1282: 1272: 1269: 1264: 1261: 1232: 1213: 1198: 1197: 1127: 1108: 1097: 1089: 1086: 1085: 1052: 1044: 1041: 1040: 991: 971: 955: 939: 936: 935: 929: 926: 903: 902: 873: 872: 843: 842: 765: 757: 749: 741: 722: 714: 713: 662: 648: 634: 620: 606: 592: 572: 571: 570: 569: 566: 559: 552: 545: 526: 525: 518: 517: 510: 507:morphophonemic 503: 502:transcription, 496: 493: 482:(typically in 468: 465: 438: 430: 415: 405: 379: 378: 354: 346: 338: 330: 322: 314: 306: 298: 290: 289: 245: 231: 217: 203: 189: 175: 161: 147: 133: 132: 110:speakers, 1910 89: 86: 26: 24: 14: 13: 10: 9: 6: 4: 3: 2: 2257: 2246: 2243: 2241: 2238: 2236: 2233: 2232: 2230: 2220: 2219: 2214: 2211:Ernest Blum, 2210: 2207: 2204: 2202: 2199: 2197: 2194: 2191: 2188: 2186: 2183: 2180: 2177: 2174: 2171: 2168: 2165: 2163: 2160: 2159: 2155: 2146: 2142: 2135: 2132: 2127: 2123: 2118: 2113: 2109: 2105: 2098: 2095: 2090: 2086: 2081: 2076: 2071: 2066: 2062: 2058: 2051: 2048: 2043: 2039: 2034: 2029: 2025: 2021: 2014: 2011: 2000: 1996: 1991: 1986: 1982: 1978: 1971: 1969: 1965: 1954: 1950: 1945: 1940: 1936: 1932: 1925: 1923: 1921: 1919: 1915: 1904: 1900: 1896: 1892: 1888: 1884: 1880: 1873: 1871: 1867: 1862: 1855: 1852: 1848: 1845:Example from 1842: 1839: 1828: 1824: 1817: 1815: 1813: 1811: 1809: 1805: 1800: 1794: 1790: 1785: 1784: 1775: 1772: 1767: 1760: 1757: 1750: 1746: 1743: 1740: 1737: 1734: 1731: 1728: 1725: 1722: 1719: 1716: 1713: 1712: 1708: 1706: 1704: 1700: 1696: 1692: 1689:The slot for 1682: 1679: 1678: 1674: 1671: 1670: 1666: 1663: 1662: 1656: 1654: 1650: 1646: 1640: 1637: 1634: 1630: 1628: 1622: 1617: 1614: 1609: 1607: 1601: 1595: 1589: 1583: 1578: 1576: 1570: 1564: 1559: 1554: 1548: 1543: 1536: 1531: 1527: 1523: 1520: 1516: 1515: 1514: 1512: 1508: 1504: 1500: 1494: 1491: 1488: 1484: 1478: 1473: 1451: 1446: 1443: 1438: 1432: 1427: 1421: 1419: 1413: 1406: 1403: 1400: 1397: 1396: 1392: 1389: 1386: 1383: 1382: 1378: 1375: 1372: 1369: 1368: 1364: 1361: 1358: 1355: 1354: 1350: 1347: 1344: 1341: 1340: 1337:19.39 (6.88) 1336: 1333: 1330: 1327: 1326: 1323: 1319: 1317: 1313: 1311: 1307: 1305: 1301: 1300: 1294: 1286: 1283: 1279: 1278: 1277: 1270: 1268: 1262: 1260: 1256: 1254: 1251:'fathers' is 1250: 1245: 1242: 1238: 1235: 1230: 1216: 1211: 1201: 1196: 1194: 1189: 1187: 1181: 1179: 1172:⟨agent 1168:.past⟩ 1167: 1166:agent trigger 1161: 1156: 1144: 1134: 1125: 1121: 1120:agent trigger 1115: 1106: 1104: 1100: 1095: 1092: 1084: 1082: 1081:reduplication 1078: 1074: 1068: 1067:'I love you.' 1065: 1062: 1058: 1055: 1050: 1047: 1039: 1037: 1032: 1026: 1022: 1013: 1010: 1007: 1004: 1000: 997: 994: 989: 986: 980: 974: 969: 966: 964: 958: 953: 950: 948: 942: 934: 927: 924: 921: 918: 914: 911: 906: 901: 899: 898:Reduplication 894: 891: 888: 884: 882: 876: 871: 869: 864: 863:'to come out' 861: 858: 854: 852: 846: 841: 839: 835: 833: 829: 824: 818: 813: 808: 802: 797: 792: 789: 786: 782: 780: 774: 768: 763: 760: 755: 752: 747: 744: 739: 737: 731: 725: 720: 717: 712: 710: 706: 704: 699: 696: 693: 687: 681: 674: 671: 668: 665: 660: 657: 654: 651: 646: 643: 640: 637: 632: 629: 626: 623: 618: 615: 612: 609: 604: 601: 598: 595: 590: 587: 582: 577: 567: 564: 560: 557: 553: 550: 546: 542: 541: 535: 534: 533: 531: 523: 522: 521: 520:and finally 515: 511: 508: 504: 501: 497: 494: 491: 490: 485: 481: 478:The original 477: 476: 475: 472: 466: 464: 462: 457: 453: 451: 448: 444: 441: 436: 433: 428: 418: 413: 408: 403: 382: 377: 374: 370: 367: 364: 360: 357: 352: 349: 344: 341: 336: 333: 328: 325: 320: 317: 312: 309: 304: 301: 296: 293: 288: 284: 280: 272: 270: 267: 264: 261: 257: 254: 251: 248: 243: 240: 237: 234: 229: 226: 223: 220: 215: 212: 209: 206: 201: 198: 195: 192: 187: 184: 181: 178: 173: 170: 167: 164: 159: 156: 153: 150: 145: 142: 139: 136: 131: 129: 125: 120: 118: 109: 106:textbook for 105: 100: 94: 87: 85: 83: 79: 75: 71: 67: 63: 59: 56:into another 55: 51: 47: 43: 39: 33: 19: 2216: 2144: 2134: 2107: 2097: 2060: 2050: 2023: 2013: 2002:. Retrieved 1980: 1956:. Retrieved 1934: 1906:. Retrieved 1886: 1882: 1860: 1854: 1846: 1841: 1830:. Retrieved 1826: 1782: 1774: 1765: 1759: 1702: 1698: 1694: 1690: 1688: 1652: 1648: 1644: 1642: 1638: 1635: 1632: 1623: 1615: 1584: 1560: 1549: 1540: 1529: 1525: 1518: 1510: 1506: 1502: 1498: 1496: 1492: 1489: 1486: 1479: 1452: 1444: 1433: 1425: 1417: 1393:1.22 (1.92) 1379:6.78 (9.96) 1321: 1315: 1309: 1303: 1291: 1274: 1266: 1258: 1252: 1248: 1246: 1243: 1240: 1236: 1233: 1217: 1214: 1202: 1199: 1190: 1183: 1180: 1157: 1146: 1135: 1116: 1101: 1093: 1070: 1066: 1063: 1060: 1056: 1053: 1033: 1024: 1017: 1015: 1011: 1008: 1005: 1002: 998: 995: 992: 987: 967: 951: 931: 922: 919: 916: 907: 897: 896: 892: 889: 886: 877: 867: 866: 862: 860:come.out-INF 859: 856: 847: 837: 836: 831: 827: 811: 804:, the stem ( 795: 794: 790: 787: 784: 769: 761: 753: 745: 726: 718: 708: 707: 702: 701: 697: 694: 688: 682: 676: 672: 658: 644: 630: 616: 602: 588: 549:tone numbers 527: 519: 488: 487: 483: 473: 470: 458: 455: 452: 449: 446: 375: 372: 368: 365: 362: 358: 350: 342: 334: 326: 318: 310: 302: 294: 285: 274: 271: 268: 265: 262: 259: 252: 246: 238: 232: 224: 218: 210: 204: 196: 190: 182: 176: 168: 162: 154: 148: 140: 134: 121: 113: 73: 69: 65: 61: 45: 35: 2240:Linguistics 2235:Translation 1667:inflection 1328:>10,000 1320:Percent of 1077:circumfixes 928:Punctuation 923:'is buying' 556:tone sandhi 489:bold italic 480:orthography 117:source text 82:source text 78:source text 74:interlinear 54:translation 38:linguistics 2229:Categories 2070:2005.10213 2004:2021-12-15 1958:2021-12-15 1908:2021-12-15 1832:2010-06-30 1751:References 1745:Metaphrase 1675:pobeja-la 1407:100 (100) 1387:838 (862) 1373:326 (460) 1359:122 (139) 1342:1000-9999 1314:Number of 1310:languages 1308:Number of 1304:instances 1281:singular). 1064:I⹀you⹀love 1061:je⹀te⹀aime 1049:I⹀you⹀love 1046:je⹀te⹀aime 1042:Je t'aime. 1029:FEM.PL.DAT 968:speed-with 801:amuqʼ-da-č 767:amuqʼ-da-č 540:pe̍h-ōe-jī 417:no-piltzin 30:See also: 2089:218718982 1999:226262296 1953:227231816 1733:Treebanks 1721:Ruby text 1582:pobeja-la 1547:Vecher-om 952:room-from 848:come.out- 751:hamišaluǧ 628:koat-teng 625:koat-teng 622:koat-tēng 467:Structure 2181:(E-MELD) 2175:(E-MELD) 2169:(E-MELD) 2126:14929030 2042:53082616 1709:See also 1550:evening- 1511:ERG.1.PL 1442:ħumukuli 1356:100-999 1345:37 (31) 1234:(German) 1164:⟨ 1141:trigger⟩ 1099:su~sulat 1054:(French) 973:çık-tı-m 920:IPFV~buy 724:abur-u-n 670:tng-khi. 667:tng-khi. 514:morpheme 500:phonetic 279:mache es 58:language 42:pedagogy 1903:2674996 1621:magazin 1469:-steal- 1218:father\ 1215:Väter-n 1200:unser-n 1162:~write 1155:u~sulat 1133:u~sulat 1073:infixes 1036:clitics 1025:oikíais 1018:go_out- 993:Turkish 976:go.out- 941:oda-dan 917:bi~bili 905:bi~bili 890:boy-NOM 857:çık-mak 845:çık-mak 820:) and ( 754:forever 673:return. 664:tńg-khì 642:tang-si 639:tang-si 636:tang-sî 617:not-yet 614:iau-boe 611:iau-boe 608:iáu-boē 563:English 340:piltzin 222:piltzin 88:History 2124:  2087:  2040:  1997:  1951:  1935:COLING 1901:  1795:  1715:Kanbun 1699:pobeja 1645:pobeja 1624:store. 1530:be.NEG 1398:Total 1370:10-99 1331:3 (1) 1237:  1193:umlaut 1174:  1170:write 1158:write 1151:ulat s 1139:  1137:⟨agent 1122:.past⟩ 1105:~write 1057:  1027:house. 999:  996:  960:speed- 957:hız-lı 887:puer-ø 875:puer-ø 762:behind 759:güǧüna 631:decide 484:italic 399:-mach- 327:to-the 308:chihui 166:chihui 108:German 2122:S2CID 2085:S2CID 2065:arXiv 2038:S2CID 1995:S2CID 1981:EMNLP 1949:S2CID 1899:S2CID 1664:Slot 1519:camel 1445:camel 1253:Vater 1249:Väter 1186:affix 1184:(See 1124:write 1094:write 1091:sulat 944:room- 893:'boy' 798:. In 770:stay- 743:ferma 727:they- 565:, and 440:calli 427:-Sohn 359:house 356:calli 250:calli 158:mache 50:gloss 48:is a 44:, an 1793:ISBN 1651:and 1585:run- 1434:you- 1431:mi-s 1384:1-9 1203:our- 1114:ulat 913:~buy 910:IPFV 878:boy- 830:and 812:stay 807:amuq 746:farm 716:Gila 690:(3.) 684:(2.) 678:(1.) 659:want 656:boeh 653:boeh 650:boeh 645:when 589:(4.) 585:(3.) 580:(2.) 575:(1.) 443:Haus 425:POSS 401:APPL 389:SUBJ 316:-lia 311:make 256:Haus 228:Sohn 214:mein 180:-lia 102:, a 40:and 2112:doi 2075:doi 2028:doi 1985:doi 1939:doi 1891:doi 1789:715 1627:ACC 1606:FEM 1594:PST 1588:PFV 1575:NOM 1552:INS 1526:anu 1503:you 1482:NEG 1480:be. 1477:anu 1471:PRT 1459:ERG 1455:OBL 1453:we. 1436:GEN 1224:DAT 1205:DAT 1020:PFV 984:1sg 979:PFV 963:COM 947:ABL 881:NOM 851:INF 832:NEG 828:FUT 779:NEG 773:FUT 736:GEN 730:OBL 719:now 600:goa 597:goa 594:goá 486:or 435:ein 421:1SG 411:DET 397:OBJ 393:3SG 385:1SG 343:son 332:no- 319:for 292:ni- 242:ein 208:no- 200:der 186:für 144:ich 138:ni- 70:IGT 64:or 36:In 2231:: 2215:, 2143:. 2120:. 2106:. 2083:. 2073:. 2059:. 2036:. 2022:. 1993:. 1979:. 1967:^ 1947:. 1933:. 1917:^ 1897:. 1887:50 1885:. 1881:. 1869:^ 1825:. 1807:^ 1791:. 1683:? 1655:: 1616:in 1600:SG 1569:SG 1558:ya 1532:). 1499:mi 1467:PL 1228:PL 1220:PL 1209:PL 1075:, 817:da 558:), 505:a 498:a 492:), 432:ce 407:in 348:ce 335:my 324:in 303:it 300:c- 236:ce 194:in 172:es 152:c- 130:: 84:. 2128:. 2114:: 2091:. 2077:: 2067:: 2044:. 2030:: 2007:. 1987:: 1961:. 1941:: 1911:. 1893:: 1835:. 1801:. 1613:v 1603:. 1597:. 1591:. 1572:. 1566:. 1563:1 1528:: 1509:: 1507:a 1501:: 1465:. 1463:1 1461:. 1457:- 1255:) 1226:. 1222:- 1207:. 1129:s 1118:⟨ 1110:s 982:- 823:č 776:- 733:- 603:I 423:. 395:. 391:- 387:. 351:a 295:I 253:9 247:9 239:8 233:8 225:7 219:7 211:6 205:6 197:5 191:5 183:4 177:4 169:2 163:3 155:3 149:2 141:1 135:1 68:( 20:)

Index

Interlinear glossing
List of glossing abbreviations
linguistics
pedagogy
gloss
translation
language
source text
source text

Spanish-language
German
source text
Wilhelm von Humboldt
Classical Nahuatl
Specials Unicode block
orthography
phonetic
morphophonemic
morpheme
Taiwanese Minnan
pe̍h-ōe-jī
tone numbers
tone sandhi
English
OBL
GEN
FUT
NEG
INF

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.