Knowledge (XXG)

Interlinear gloss

Source 📝

82: 921:
In interlinear morphological glosses, various forms of punctuation separate the glosses. Typically, the words are aligned with their glosses; within words, a hyphen is used when a boundary is marked in both the text and its gloss, a period when a boundary appears in only one. That is, there should be
1281:
In the manual correction phase, the database creators manually corrected the boundaries of the interlinear gloss instances discovered by the sequence-labelling method in Step 2 of the automatic construction phase. The creators then verified the language names and language codes in a second and third
1530:
Researchers have used interlinear glosses is to obtain the morphological paradigms of the object language (i.e., the language being glossed). To automatically create morphological paradigms from interlinear glosses, researchers have created tables for every stem in the gloss and a (possibly empty)
1276:
Third, each interlinear gloss instance was assigned a language name (e.g., Tagalog) and an ISO 693-3 language ID. Language names and IDs were automatically assigned to interlinear glosses using Coreference Resolution models from Natural Language Processing, where the interlinear gloss instance was
1264:
The Online Database of Interlinear Text (ODIN) is a database of over 200,000 instances of interlinear glosses for more than 1,500 languages extracted from scholarly linguistic research. The database was constructed in two phases: automatic construction followed by manual correction. The automatic
275:
More modern 19th- and 20th-century approaches took to glossing vertically, aligning the same sort of word-by-word content in such a way that the metalanguage terms were placed vertically below the source language terms. In this style, the given example might be rendered thus (here English gloss):
1269:
First, search engines (e.g., Google, Bing) were queried to retrieve scholarly documents that were likely to contain interlinear glosses. The queries comprised terms relevant to linguistic research such as grammatical morphemes (e.g., "NOM", short for nominative; "3SG", short for 3rd person
859:. if the morpheme-by-morpheme gloss (middle line) contains an element that does not correspond to an overt element in the example, a standard strategy is to include an overt "ø" in the object-language text, which is separated by a hyphen like an overt element would be: 103:
Interlinear glosses have been used for a variety of purposes over a long period of time. One common usage has been to annotate bilingual textbooks for language education. This sort of interlinearization serves to help make the meaning of a
922:
the same number of words separated with spaces in the text and its gloss, as well as the same number of hyphenated morphemes within a word and its gloss. This is the basic system, and can be applied universally. For example:
700:. At the sub-word level, segmentable morphemes are separated by hyphens, both in the example and in the gloss. There should be the same number of hyphens in the example and in the gloss, as shown in the following example: 694:. According to the Leipzig Glossing Rules, it is standard to left-align the words in the object language with the corresponding words in the metalanguage; this alignment can be seen between lines (1-3) and line (4). 264:
This "inline" style allows examples to be included within the flow of text, and for the word order of the target language to be written in an order which approximates the target language syntax. (In the gloss here,
271:
is reordered from the corresponding source order to approximate German syntax more naturally.) Even so, this approach requires the readers to "re-align" the correspondences between source and target forms.
1486:
Given the morpheme segmented line (first line above) and the free translation line (third line above), the task is to produce the middle glossed line comprising stem translations (e.g.,
445:
This approach is denser and also requires effort to read, but it is less reliant on the grammatical structure of the metalanguage for expressing the semantics of the target forms.
513:
a free translation, which may be placed in a separate paragraph or on the facing page if the structures of the languages are too different for it to follow the text line by line.
1012:, when a single word in the source language happens to correspond to a phrase in the glossing language, though a period would still be used for other situations, such as Greek 1273:
Second, each line in an extracted document was tagged for whether it was a line belonging to an interlinear gloss or not using sequence-labeling methods from Machine Learning.
111:
Such annotations have occasionally been expressed not through interlinear layout, but rather through enumeration of words in the object and meta language. One such example is
365:
Finally, modern linguists have adopted the practice of using abbreviated grammatical category labels. A 2008 publication which repeats this example labels it as follows:
460:
Though there is no formal specification for the IGT format, the Leipzig Glossing Rules are a set of guidelines that aim to standardize the format as much as possible.
1753:
Lehmann, Christian (2004-01-23). "Directions for interlinear morphemic translations". In Geert Booij; Christian Lehmann; Joachim Mugdan; Stavros Skopeteas (eds.).
1502:). Sequence prediction models from Natural Language Processing have been used to perform this task. Two factors contribute to the difficulty of this task: 69:
and its translation, and the structure of the original language. In its simplest form, an interlinear gloss is a literal, word-for-word translation of the
2173: 1407:
Natural Language Processing models leveraging interlinear gloss resources, such as the Online Database of Interlinear Text, have been developed.
2134:
Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
2097:
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
823:; a list of standard abbreviations for grammatical categories that are widely used in linguistics can be found in the Leipzig Glossing Rules. 1785: 41:(series of brief explanations, such as definitions or pronunciations) placed between lines, such as between a line of original text and its 463:
An interlinear text for linguistics will commonly consist of some or all of the following, usually in this order, from top to bottom:
1718:, often displayed as interlinear glosses under the tagged words, sometimes at the same time as an interlinear word-by-word translation 1694:
grammatical category). A statistical machine learning model for morphological inflection can be used to fill in the missing entries.
1727: 889:
is treated similarly to affixation but with a tilde (instead of the standard hyphen) that connects the copied element to the stem:
2167: 2155: 2161: 1277:
tagged with the language name (and ID) that appears in the scholarly document the interlinear gloss instance was extracted from.
20: 1286:
The language distribution of interlinear gloss instances in Online Database of Interlinear Text after phase 1 and (phase 2)
2181:. A forum for recommendations on the Interlinar Morphemic Glossing of ancient languages as attested in ancient manuscripts. 829:. When a single object-language element corresponds to several metalanguage elements, they are separated by periods. E.g., 2233: 2050:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
1027:
may be separated with a double hyphen (or, for ease of typing, an equal sign) rather than a hyphen. A French example:
1415:
Natural Language Processing systems, for example, have been developed to automatically produce interlinear glosses.:
49:. When glossed, each line of the original text acquires one or more corresponding lines of transcription known as an 1757:. Handbücher der Sprach- und Kommunikationswissenschaft. Vol. 2. Berlin: W. de Gruyter. pp. 1834–1857. 1866:
Xia, Fei; Lewis, William; Wayne, Michael; Slayden, Glenn; Georgi, Ryan; Crowgey, Joshua; Bender, Emily (2016).
449: 1531:
slot for every grammatical category (e.g., ERG) in the gloss. For instance, given the glossed sentence below:
815:) are inflectional affixes representing future tense and negation. These inflectional affixes are glossed as 1972:. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP): 5251–5262. 81: 2197:
Listing of older interlinear and construed texts, mostly from Latin or Ancient Greek and mostly to English
1715: 767: 2201: 1148: 1091: 2228: 2223: 2206: 112: 108:
explicit without attempting to formally model the structural characteristics of the source language.
898: 1850:
From Aari to Zulu: massively multilingual creation of language tools using interlinear glossed tex
528: 2184: 2110: 2073: 2053: 2026: 1983: 1937: 1887: 1551: 38: 1248:
A few other conventions which are sometimes seen are illustrated in the Leipzig Glossing Rules.
505:-by-morpheme gloss, where morphemes within a word are separated by hyphens or other punctuation, 1777: 1867: 1781: 1576: 967: 116: 1918:
Xingyuan, Zhao; Satoru, Ozaki; Anastasopoulos, Antonios; Neubig, Graham; Levin, Lori (2020).
2150: 2100: 2063: 2016: 1973: 1927: 1926:. Proceedings of the 28th International Conference on Computational Linguistics: 5397–5408. 1879: 1811: 1769: 1513:
Some words in the morpheme segmented line have multiple correspondences in the gloss (e.g.,
551: 518: 92: 950: 2194: 1615: 1594: 1588: 1563: 1557: 1181: 951: 869: 96: 1510:
is the last word in the translation but the second word in the morpheme segmented line).
1506:
The translation is not necessarily in alignment with the morpheme segmented line (e.g.,
1932: 1709: 1682:
would be filled (since it was observed in the interlinear gloss data) but the slot for
495: 2013:
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
1978: 1920:"Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations" 1812:"The Leipzig Glossing Rules. Conventions for Interlinear Morpheme by Morpheme Glosses" 2217: 2077: 2068: 1987: 1941: 1770: 1730:, nineteenth-century composer and promoter of interlinear texts for language learning 1154: 1108: 1069: 935: 724: 2151:
The Leipzig Glossing Rules: Conventions for interlinear morpheme-by-morpheme glosses
2114: 2030: 2015:. Brussels, Belgium: Association for Computational Linguistics. pp. 2883–2889. 1256:
Efforts have been undertaken to digitize IGT for hundreds of the world's languages.
1891: 761: 718: 2178: 1964:
Moeller, Sarah; Liu, Ling; Yang, Changbing; Kann, Katharina; Hulden, Mans (2020).
2099:. Denver, Colorado: Association for Computational Linguistics. pp. 922–931. 1919: 1712:– a gloss sometimes used with Chinese or Japanese to show the pronunciation 544: 537: 468: 105: 70: 66: 42: 26: 1965: 16:
Explanatory matter inserted between a line of original text and its translation
2045: 1883: 1733: 1582: 839: 1065: 543:
3. a gloss showing the underlying tones in citation form (before undergoing
65:
for short. Such glosses help the reader follow the relationship between the
2021: 1686:
would be empty (assuming that no other interlinear gloss instance contains
2105: 2189: 2052:. Online: Association for Computational Linguistics. pp. 1901–1907. 1868:"Enriching a massively multilingual database of interlinear glossed text" 1721: 502: 488: 46: 30: 972: 409: 381: 373: 2136:. Montréal, Canada: Association for Computational Linguistics: 396–406. 2092: 2008: 362:
Here word ordering is determined by the syntax of the object language.
1810:
Bickel, Balthasar; Bernard Comrie; Martin Haspelmath (February 2008).
1494:) and the grammatical category labels corresponding to affixes (e.g., 2130:"Leveraging supplemental representations for sequential transduction" 1755:
Morphologie. Ein internationales Handbuch zur Flexion und Wortbildung
1703: 1024: 897: 2185:
Online Interlinear of Biblical Greek Scriptures (New Testament) text
2129: 1772:
Language typology and language universals: an international handbook
2058: 1575: 1470: 966: 766: 413: 389: 2009:"An Encoder-Decoder Approach to the Paradigm Cell Filling Problem" 1174: 1061: 80: 1614: 1593: 1587: 1562: 1556: 1076:
sulat, susulat, sumulat, sumusulat (verbal declensions) (Tagalog)
868: 1724:, often displayed as a gloss or annotation to the original text. 1540: 1023:
However, sometimes finer distinctions may be made. For example,
1455: 1447: 1424: 1216: 1208: 1197: 934: 723: 1550: 1451: 1443: 760: 717: 2093:"Inflection Generation as Discriminative String Transduction" 1212: 1193: 385: 1581: 838: 399: 2091:
Nicolai, Garrett; Cherry, Colin; Kondrak, Grzegorz (2015).
1526:
Automatic discovery of morphological structure from glosses
1459: 2046:"Applying the Transformer to Character-level Transduction" 1068:, transfixes, etc.) may be set off by angle brackets, and 377: 1706:– Japanese tradition of glossing Classical Chinese texts 1265:
construction stage itself was completed in three steps:
1180:
Morphemes which cannot be easily separated out, such as
1184:, may be marked with a backslash rather than a period: 799:) is translated into the corresponding English lexeme ( 484:
a conventional transliteration into the Latin alphabet,
448:
In computing, special text markers are provided in the
521:
clause has been transcribed with five lines of text:
452:
to indicate the start and end of interlinear glosses.
1005:
An underscore may be used instead of a period, as in
681:
goa2 iau2-boe7 koat4-teng7 tang1-si5 boeh4 tng2-khi3.
675:
goa1 iau1-boe3 koat2-teng3 tang7-si5 boeh2 tng1-khi3.
526: 439:
1SG.SUBJ-3SG.OBJ-mach-APPL DET 1SG.POSS-Sohn ein Haus
1625:
evening-INS 1.SG.NOM run-PFV.PST.SG.FEM in store.ACC
1966:"IG2P: From Interlinear Glossed Texts to Paradigms" 1913: 1911: 1909: 1907: 1403:
Automatic processing of interlinear gloss instances
2044:Wu, Shijie; Cotterell, Ryan; Hulden, Mans (2021). 687:(5.) "I have not yet decided when I shall return." 1959: 1957: 1816:Dept. of Linguistics – Resources – Glossing Rules 777:now they-OBL-GEN farm forever behind stay-FUT-NEG 1861: 1859: 774:Gila abur-u-n ferma hamišaluǧ güǧüna amuqʼ-da-č 2190:ODIN - The Online Database of INterlinear text 1836:A Basic Vocabulary for a Beginner in Taiwanese 1805: 1803: 1801: 1799: 1797: 1479:you-GEN camel we.OBL-ERG.1.PL-steal-PRT be.NEG 780:'Now their farm will not stay behind forever.' 810: 804: 794: 788: 8: 2128:Bhargava, Aditya; Kondrak, Grzegorz (2012). 266: 86: 2168:Towards a General Model of Interlinear Text 2007:Silfverberg, Miikka; Hulden, Mans (2018). 669:goá iáu-boē koat-tēng tang-sî boeh tńg-khì 352:ni- c- chihui -lia in no- piltzin ce calli 252:ni- c- chihui -lia in no- piltzin ce calli 2104: 2067: 2057: 2020: 1977: 1931: 1646: 1284: 998:room-from speed-with go_out-perfective-I 2202:"The New Old Way of Learning Languages" 1745: 1632:There would be a paradigm for the stem 1072:with tildes, rather than with hyphens: 684:(4.) I not-yet decide when want return. 258:ich mache es für der mein Sohn ein Haus 436:ni-c-chihui-lia in no-piltzin ce calli 7: 2179:Glossing Ancient Languages and Texts 1628:'In the evening I ran to the store.' 1476:mi-s ħumukuli elu-ab-ok'ek'-asi anu 1260:Online Database of Interlinear Text 1060:Affixes which cause discontinuity ( 698:Morpheme-by-morpheme correspondence 550:4. a morpheme-by-morpheme gloss in 355:I it make for to-the my son a house 2156:Interlinear Glossed Text Standards 1282:pass over the data, respectively. 1236:'to our fathers' (the singular of 803:) while the inflectional affixes ( 14: 1872:Language Resources and Evaluation 995:room-ABL speed-COM go.out-PFV-1sg 1933:10.18653/v1/2020.coling-main.471 1852:(PhD). University of Washington. 1838:by Ko Chek Hoan and Tan Pang Tin 1622:Vecher-om ya pobeja-la v magazin 88:Toussaint–Langenscheidt Spanisch 2195:Latinum Interlinear Method page 2162:Interlinear Glossed Text Levels 1979:10.18653/v1/2020.emnlp-main.424 2069:10.18653/v1/2021.eacl-main.163 1648:(Partial) paradigm for pobeja 21:List of glossing abbreviations 1: 2174:Interlinear Morphemic Glosses 1776:. Walter de Gruyter. p.  1482:'We didn't steal your camel.' 517:As an example, the following 1311:interlinear gloss instances 1305:interlinear gloss instances 811: 805: 795: 789: 755: 747: 739: 731: 712: 704: 658: 655: 652: 644: 641: 638: 630: 627: 624: 616: 613: 610: 602: 599: 596: 588: 585: 582: 572: 567: 562: 527: 1768:Haspelmath, Martin (2008). 1291:Range of interlinear gloss 1252:Interlinear gloss resources 1233:our-DAT.PL father\PL-DAT.PL 827:One-to-many correspondences 785:Grammatical category labels 2252: 1001:'I left the room quickly.' 557:5. an English translation: 18: 1884:10.1007/s10579-015-9325-4 358:"I made my son a house." 55:interlinear glossed text 1609: 1601: 1570: 1546: 1535: 992:oda-dan hız-lı çık-tı-m 893: 428: 420: 405: 395: 369: 344: 336: 328: 320: 312: 304: 296: 288: 280: 238: 224: 210: 196: 182: 168: 154: 140: 126: 61:) – an 1716:Part-of-speech tagging 1465: 1438: 1430: 1419: 1117: 1098: 1087: 1079: 1037: 1034: 964: 961: 948: 945: 932: 929: 863: 833: 692:Word-by-word alignment 540:for the surface tones, 450:Specials Unicode block 431: 423: 408: 398: 372: 267: 244: 230: 216: 202: 188: 174: 160: 146: 132: 100: 87: 1848:Georgi, Ryan (2016). 1177:for other examples.) 84: 2207:The American Scholar 2022:10.18653/v1/D18-1315 926:Odadan hızlı çıktım. 113:Wilhelm von Humboldt 85:Interlinear text in 2234:Reordered languages 2106:10.3115/v1/N15-1093 1649: 1287: 1167:contemplative~write 1132:contemplative~write 977:go_out-perfective-I 1690:inflected for the 1647: 1411:Automatic glossing 1393:189,244 (157,114) 1285: 1149:contemplative mood 1142:⟨um⟩ 1138:⟨um⟩ 1120:⟨um⟩ 1101:⟨um⟩ 1092:contemplative mood 857:Non-overt elements 501:a word-by-word or 101: 1787:978-3-11-011423-2 1676: 1675: 1439:elu-ab-ok'ek'-asi 1400: 1399: 1020:'to the houses'. 536:2. a gloss using 255:1 3 2 4 5 6 7 8 9 249:1 2 3 4 5 6 7 8 9 117:Classical Nahuatl 115:'s annotation of 35:interlinear gloss 2241: 2138: 2137: 2125: 2119: 2118: 2108: 2088: 2082: 2081: 2071: 2061: 2041: 2035: 2034: 2024: 2004: 1998: 1997: 1995: 1994: 1981: 1961: 1952: 1951: 1949: 1948: 1935: 1915: 1902: 1901: 1899: 1898: 1863: 1854: 1853: 1845: 1839: 1832: 1826: 1825: 1823: 1822: 1807: 1792: 1791: 1775: 1765: 1759: 1758: 1750: 1669:PFV.PST.SG.MASC 1650: 1618: 1597: 1591: 1585: 1579: 1566: 1560: 1554: 1542: 1472: 1461: 1457: 1453: 1449: 1445: 1426: 1365:12,822 (15,560) 1351:40,260 (46,420) 1337:97,158 (81,218) 1323:36,691 (10,814) 1288: 1218: 1214: 1210: 1199: 1195: 1166: 1164: 1158: 1143: 1139: 1136:sulat su~sulat s 1131: 1129: 1121: 1112: 1102: 1019: 1010: 974: 970: 954: 938: 901: 872: 842: 814: 808: 798: 792: 770: 764: 727: 721: 680: 674: 668: 575: 570: 565: 533:transliteration, 532: 525:1. the standard 519:Taiwanese Minnan 498:transliteration, 415: 411: 401: 391: 387: 383: 379: 375: 270: 93:Spanish-language 90: 51:interlinear text 2251: 2250: 2244: 2243: 2242: 2240: 2239: 2238: 2214: 2213: 2147: 2142: 2141: 2127: 2126: 2122: 2090: 2089: 2085: 2043: 2042: 2038: 2006: 2005: 2001: 1992: 1990: 1963: 1962: 1955: 1946: 1944: 1917: 1916: 1905: 1896: 1894: 1865: 1864: 1857: 1847: 1846: 1842: 1833: 1829: 1820: 1818: 1809: 1808: 1795: 1788: 1767: 1766: 1762: 1752: 1751: 1747: 1742: 1700: 1692:PFV.PST.SG.MASC 1684:PFV.PST.SG.MASC 1661:PFV.PST.SG.FEM 1642:PFV.PST.SG.MASC 1636:with slots for 1630: 1620: 1607: 1599: 1568: 1544: 1528: 1484: 1474: 1463: 1436: 1428: 1413: 1405: 1262: 1254: 1246: 1230:unser-n Väter-n 1228: 1220: 1201: 1171: 1165:trigger⟩ 1162: 1160: 1152: 1141: 1137: 1134: 1127: 1125: 1119: 1115: 1106: 1100: 1096: 1085: 1077: 1058: 1048: 1040: 1032: 1017: 1008: 1003: 990: 979: 959: 943: 927: 919: 914: 904: 884: 874: 854: 844: 782: 772: 753: 745: 737: 729: 710: 689: 678: 672: 666: 664: 650: 636: 622: 608: 594: 580: 573: 568: 563: 458: 443: 434: 426: 418: 403: 393: 370:ni-c-chihui-lia 360: 350: 342: 334: 326: 318: 310: 302: 294: 286: 262: 247: 233: 219: 205: 191: 177: 163: 149: 135: 79: 23: 17: 12: 11: 5: 2249: 2248: 2245: 2237: 2236: 2231: 2226: 2216: 2215: 2212: 2211: 2210:, Autumn 2008. 2198: 2192: 2187: 2182: 2176: 2171: 2165: 2159: 2153: 2146: 2145:External links 2143: 2140: 2139: 2120: 2083: 2036: 1999: 1953: 1903: 1878:(2): 321–349. 1855: 1840: 1827: 1793: 1786: 1760: 1744: 1743: 1741: 1738: 1737: 1736: 1731: 1728:James Hamilton 1725: 1719: 1713: 1707: 1699: 1696: 1680:PFV.PST.SG.FEM 1674: 1673: 1670: 1666: 1665: 1662: 1658: 1657: 1654: 1638:PFV.PST.SG.FEM 1608: 1600: 1569: 1545: 1534: 1533: 1527: 1524: 1523: 1522: 1511: 1464: 1437: 1429: 1418: 1417: 1412: 1409: 1404: 1401: 1398: 1397: 1394: 1391: 1390:1,326 (1,493) 1388: 1384: 1383: 1380: 1379:2,313 (3,012) 1377: 1374: 1370: 1369: 1366: 1363: 1360: 1356: 1355: 1354:21.27 (29.55) 1352: 1349: 1346: 1342: 1341: 1340:51.34 (51.69) 1338: 1335: 1332: 1328: 1327: 1324: 1321: 1318: 1314: 1313: 1307: 1301: 1295: 1279: 1278: 1274: 1271: 1261: 1258: 1253: 1250: 1221: 1202: 1187: 1186: 1116: 1097: 1086: 1078: 1075: 1074: 1041: 1033: 1030: 1029: 980: 960: 944: 928: 925: 924: 918: 915: 892: 891: 862: 861: 832: 831: 754: 746: 738: 730: 711: 703: 702: 651: 637: 623: 609: 595: 581: 561: 560: 559: 558: 555: 548: 541: 534: 515: 514: 507: 506: 499: 496:morphophonemic 492: 491:transcription, 485: 482: 471:(typically in 457: 454: 427: 419: 404: 394: 368: 367: 343: 335: 327: 319: 311: 303: 295: 287: 279: 278: 234: 220: 206: 192: 178: 164: 150: 136: 122: 121: 99:speakers, 1910 78: 75: 15: 13: 10: 9: 6: 4: 3: 2: 2247: 2246: 2235: 2232: 2230: 2227: 2225: 2222: 2221: 2219: 2209: 2208: 2203: 2200:Ernest Blum, 2199: 2196: 2193: 2191: 2188: 2186: 2183: 2180: 2177: 2175: 2172: 2169: 2166: 2163: 2160: 2157: 2154: 2152: 2149: 2148: 2144: 2135: 2131: 2124: 2121: 2116: 2112: 2107: 2102: 2098: 2094: 2087: 2084: 2079: 2075: 2070: 2065: 2060: 2055: 2051: 2047: 2040: 2037: 2032: 2028: 2023: 2018: 2014: 2010: 2003: 2000: 1989: 1985: 1980: 1975: 1971: 1967: 1960: 1958: 1954: 1943: 1939: 1934: 1929: 1925: 1921: 1914: 1912: 1910: 1908: 1904: 1893: 1889: 1885: 1881: 1877: 1873: 1869: 1862: 1860: 1856: 1851: 1844: 1841: 1837: 1834:Example from 1831: 1828: 1817: 1813: 1806: 1804: 1802: 1800: 1798: 1794: 1789: 1783: 1779: 1774: 1773: 1764: 1761: 1756: 1749: 1746: 1739: 1735: 1732: 1729: 1726: 1723: 1720: 1717: 1714: 1711: 1708: 1705: 1702: 1701: 1697: 1695: 1693: 1689: 1685: 1681: 1678:The slot for 1671: 1668: 1667: 1663: 1660: 1659: 1655: 1652: 1651: 1645: 1643: 1639: 1635: 1629: 1626: 1623: 1619: 1617: 1611: 1606: 1603: 1598: 1596: 1590: 1584: 1578: 1572: 1567: 1565: 1559: 1553: 1548: 1543: 1537: 1532: 1525: 1520: 1516: 1512: 1509: 1505: 1504: 1503: 1501: 1497: 1493: 1489: 1483: 1480: 1477: 1473: 1467: 1462: 1440: 1435: 1432: 1427: 1421: 1416: 1410: 1408: 1402: 1395: 1392: 1389: 1386: 1385: 1381: 1378: 1375: 1372: 1371: 1367: 1364: 1361: 1358: 1357: 1353: 1350: 1347: 1344: 1343: 1339: 1336: 1333: 1330: 1329: 1326:19.39 (6.88) 1325: 1322: 1319: 1316: 1315: 1312: 1308: 1306: 1302: 1300: 1296: 1294: 1290: 1289: 1283: 1275: 1272: 1268: 1267: 1266: 1259: 1257: 1251: 1249: 1245: 1243: 1240:'fathers' is 1239: 1234: 1231: 1227: 1224: 1219: 1205: 1200: 1190: 1185: 1183: 1178: 1176: 1170: 1168: 1161:⟨agent 1157:.past⟩ 1156: 1155:agent trigger 1150: 1145: 1133: 1123: 1114: 1110: 1109:agent trigger 1104: 1095: 1093: 1089: 1084: 1081: 1073: 1071: 1070:reduplication 1067: 1063: 1057: 1056:'I love you.' 1054: 1051: 1047: 1044: 1039: 1036: 1028: 1026: 1021: 1015: 1011: 1002: 999: 996: 993: 989: 986: 983: 978: 975: 969: 963: 958: 955: 953: 947: 942: 939: 937: 931: 923: 916: 913: 910: 907: 903: 900: 895: 890: 888: 887:Reduplication 883: 880: 877: 873: 871: 865: 860: 858: 853: 852:'to come out' 850: 847: 843: 841: 835: 830: 828: 824: 822: 818: 813: 807: 802: 797: 791: 786: 781: 778: 775: 771: 769: 763: 757: 752: 749: 744: 741: 736: 733: 728: 726: 720: 714: 709: 706: 701: 699: 695: 693: 688: 685: 682: 676: 670: 663: 660: 657: 654: 649: 646: 643: 640: 635: 632: 629: 626: 621: 618: 615: 612: 607: 604: 601: 598: 593: 590: 587: 584: 579: 576: 571: 566: 556: 553: 549: 546: 542: 539: 535: 531: 530: 524: 523: 522: 520: 512: 511: 510: 509:and finally 504: 500: 497: 493: 490: 486: 483: 480: 479: 474: 470: 467:The original 466: 465: 464: 461: 455: 453: 451: 446: 442: 440: 437: 433: 430: 425: 422: 417: 407: 402: 397: 392: 371: 366: 363: 359: 356: 353: 349: 346: 341: 338: 333: 330: 325: 322: 317: 314: 309: 306: 301: 298: 293: 290: 285: 282: 277: 273: 269: 261: 259: 256: 253: 250: 246: 243: 240: 237: 232: 229: 226: 223: 218: 215: 212: 209: 204: 201: 198: 195: 190: 187: 184: 181: 176: 173: 170: 167: 162: 159: 156: 153: 148: 145: 142: 139: 134: 131: 128: 125: 120: 118: 114: 109: 107: 98: 95:textbook for 94: 89: 83: 76: 74: 72: 68: 64: 60: 56: 52: 48: 45:into another 44: 40: 36: 32: 28: 22: 2205: 2133: 2123: 2096: 2086: 2049: 2039: 2012: 2002: 1991:. Retrieved 1969: 1945:. Retrieved 1923: 1895:. Retrieved 1875: 1871: 1849: 1843: 1835: 1830: 1819:. Retrieved 1815: 1771: 1763: 1754: 1748: 1691: 1687: 1683: 1679: 1677: 1641: 1637: 1633: 1631: 1627: 1624: 1621: 1612: 1604: 1573: 1549: 1538: 1529: 1518: 1514: 1507: 1499: 1495: 1491: 1487: 1485: 1481: 1478: 1475: 1468: 1441: 1433: 1422: 1414: 1406: 1382:1.22 (1.92) 1368:6.78 (9.96) 1310: 1304: 1298: 1292: 1280: 1263: 1255: 1247: 1241: 1237: 1235: 1232: 1229: 1225: 1222: 1206: 1203: 1191: 1188: 1179: 1172: 1169: 1146: 1135: 1124: 1105: 1090: 1082: 1059: 1055: 1052: 1049: 1045: 1042: 1022: 1013: 1006: 1004: 1000: 997: 994: 991: 987: 984: 981: 976: 956: 940: 920: 911: 908: 905: 896: 886: 885: 881: 878: 875: 866: 856: 855: 851: 849:come.out-INF 848: 845: 836: 826: 825: 820: 816: 800: 793:, the stem ( 784: 783: 779: 776: 773: 758: 750: 742: 734: 715: 707: 697: 696: 691: 690: 686: 683: 677: 671: 665: 661: 647: 633: 619: 605: 591: 577: 538:tone numbers 516: 508: 477: 476: 472: 462: 459: 447: 444: 441: 438: 435: 364: 361: 357: 354: 351: 347: 339: 331: 323: 315: 307: 299: 291: 283: 274: 263: 260: 257: 254: 251: 248: 241: 235: 227: 221: 213: 207: 199: 193: 185: 179: 171: 165: 157: 151: 143: 137: 129: 123: 110: 102: 62: 58: 54: 50: 34: 24: 2229:Linguistics 2224:Translation 1656:inflection 1317:>10,000 1309:Percent of 1066:circumfixes 917:Punctuation 912:'is buying' 545:tone sandhi 478:bold italic 469:orthography 106:source text 71:source text 67:source text 63:interlinear 43:translation 27:linguistics 2218:Categories 2059:2005.10213 1993:2021-12-15 1947:2021-12-15 1897:2021-12-15 1821:2010-06-30 1740:References 1734:Metaphrase 1664:pobeja-la 1396:100 (100) 1376:838 (862) 1362:326 (460) 1348:122 (139) 1331:1000-9999 1303:Number of 1299:languages 1297:Number of 1293:instances 1270:singular). 1053:I⹀you⹀love 1050:je⹀te⹀aime 1038:I⹀you⹀love 1035:je⹀te⹀aime 1031:Je t'aime. 1018:FEM.PL.DAT 957:speed-with 790:amuqʼ-da-č 756:amuqʼ-da-č 529:pe̍h-ōe-jī 406:no-piltzin 19:See also: 2078:218718982 1988:226262296 1942:227231816 1722:Treebanks 1710:Ruby text 1571:pobeja-la 1536:Vecher-om 941:room-from 837:come.out- 740:hamišaluǧ 617:koat-teng 614:koat-teng 611:koat-tēng 456:Structure 2170:(E-MELD) 2164:(E-MELD) 2158:(E-MELD) 2115:14929030 2031:53082616 1698:See also 1539:evening- 1500:ERG.1.PL 1431:ħumukuli 1345:100-999 1334:37 (31) 1223:(German) 1153:⟨ 1130:trigger⟩ 1088:su~sulat 1043:(French) 962:çık-tı-m 909:IPFV~buy 713:abur-u-n 659:tng-khi. 656:tng-khi. 503:morpheme 489:phonetic 268:mache es 47:language 31:pedagogy 1892:2674996 1610:magazin 1458:-steal- 1207:father\ 1204:Väter-n 1189:unser-n 1151:~write 1144:u~sulat 1122:u~sulat 1062:infixes 1025:clitics 1014:oikíais 1007:go_out- 982:Turkish 965:go.out- 930:oda-dan 906:bi~bili 894:bi~bili 879:boy-NOM 846:çık-mak 834:çık-mak 809:) and ( 743:forever 662:return. 653:tńg-khì 631:tang-si 628:tang-si 625:tang-sî 606:not-yet 603:iau-boe 600:iau-boe 597:iáu-boē 552:English 329:piltzin 211:piltzin 77:History 2113:  2076:  2029:  1986:  1940:  1924:COLING 1890:  1784:  1704:Kanbun 1688:pobeja 1634:pobeja 1613:store. 1519:be.NEG 1387:Total 1359:10-99 1320:3 (1) 1226:  1182:umlaut 1163:  1159:write 1147:write 1140:ulat s 1128:  1126:⟨agent 1111:.past⟩ 1094:~write 1046:  1016:house. 988:  985:  949:speed- 946:hız-lı 876:puer-ø 864:puer-ø 751:behind 748:güǧüna 620:decide 473:italic 388:-mach- 316:to-the 297:chihui 155:chihui 97:German 2111:S2CID 2074:S2CID 2054:arXiv 2027:S2CID 1984:S2CID 1970:EMNLP 1938:S2CID 1888:S2CID 1653:Slot 1508:camel 1434:camel 1242:Vater 1238:Väter 1175:affix 1173:(See 1113:write 1083:write 1080:sulat 933:room- 882:'boy' 787:. In 759:stay- 732:ferma 716:they- 554:, and 429:calli 416:-Sohn 348:house 345:calli 239:calli 147:mache 39:gloss 37:is a 33:, an 1782:ISBN 1640:and 1574:run- 1423:you- 1420:mi-s 1373:1-9 1192:our- 1103:ulat 902:~buy 899:IPFV 867:boy- 819:and 801:stay 796:amuq 735:farm 705:Gila 679:(3.) 673:(2.) 667:(1.) 648:want 645:boeh 642:boeh 639:boeh 634:when 578:(4.) 574:(3.) 569:(2.) 564:(1.) 432:Haus 414:POSS 390:APPL 378:SUBJ 305:-lia 300:make 245:Haus 217:Sohn 203:mein 169:-lia 91:, a 29:and 2101:doi 2064:doi 2017:doi 1974:doi 1928:doi 1880:doi 1778:715 1616:ACC 1595:FEM 1583:PST 1577:PFV 1564:NOM 1541:INS 1515:anu 1492:you 1471:NEG 1469:be. 1466:anu 1460:PRT 1448:ERG 1444:OBL 1442:we. 1425:GEN 1213:DAT 1194:DAT 1009:PFV 973:1sg 968:PFV 952:COM 936:ABL 870:NOM 840:INF 821:NEG 817:FUT 768:NEG 762:FUT 725:GEN 719:OBL 708:now 589:goa 586:goa 583:goá 475:or 424:ein 410:1SG 400:DET 386:OBJ 382:3SG 374:1SG 332:son 321:no- 308:for 281:ni- 231:ein 197:no- 189:der 175:für 133:ich 127:ni- 59:IGT 53:or 25:In 2220:: 2204:, 2132:. 2109:. 2095:. 2072:. 2062:. 2048:. 2025:. 2011:. 1982:. 1968:. 1956:^ 1936:. 1922:. 1906:^ 1886:. 1876:50 1874:. 1870:. 1858:^ 1814:. 1796:^ 1780:. 1672:? 1644:: 1605:in 1589:SG 1558:SG 1547:ya 1521:). 1488:mi 1456:PL 1217:PL 1209:PL 1198:PL 1064:, 806:da 547:), 494:a 487:a 481:), 421:ce 396:in 337:ce 324:my 313:in 292:it 289:c- 225:ce 183:in 161:es 141:c- 119:: 73:. 2117:. 2103:: 2080:. 2066:: 2056:: 2033:. 2019:: 1996:. 1976:: 1950:. 1930:: 1900:. 1882:: 1824:. 1790:. 1602:v 1592:. 1586:. 1580:. 1561:. 1555:. 1552:1 1517:: 1498:: 1496:a 1490:: 1454:. 1452:1 1450:. 1446:- 1244:) 1215:. 1211:- 1196:. 1118:s 1107:⟨ 1099:s 971:- 812:č 765:- 722:- 592:I 412:. 384:. 380:- 376:. 340:a 284:I 242:9 236:9 228:8 222:8 214:7 208:7 200:6 194:6 186:5 180:5 172:4 166:4 158:2 152:3 144:3 138:2 130:1 124:1 57:(

Index

List of glossing abbreviations
linguistics
pedagogy
gloss
translation
language
source text
source text

Spanish-language
German
source text
Wilhelm von Humboldt
Classical Nahuatl
Specials Unicode block
orthography
phonetic
morphophonemic
morpheme
Taiwanese Minnan
pe̍h-ōe-jī
tone numbers
tone sandhi
English
OBL
GEN
FUT
NEG
INF
NOM

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.