Knowledge

Lemma (morphology)

Source 📝

1339: 498:
forms "go", "goes", "going", "went", and "gone". The relationship between an inflected form and its lemma is usually denoted by an angle bracket, e.g., "went" < "go". Of course, the disadvantage of such simplifications is the inability to look up a declined or conjugated form of the word, but some
594:
is the part of the word that never changes even when morphologically inflected; a lemma is the least marked form of the word. In linguistic analysis, the stem is defined more generally as a form without any of its possible inflectional morphemes (but including derivational morphemes and may contain
715:
entries appears. The headword is used to locate the entry, and dictates its alphabetical position. Depending on the size and nature of the dictionary or encyclopedia, the entry may include alternative meanings of the word, its
823:, has around 330,000 headwords. These values are cited by the dictionary makers and may not use exactly the same definition of a headword. In addition, headwords may not accurately reflect a dictionary's physical size. The 378:
the third-person singular masculine of the past/perfect tense is the least-marked form and is used for entries in modern dictionaries. In older dictionaries, which are still commonly used, the
599:
is taken into account, the definition of the unchangeable part of the word is not useful, as can be seen in the phonological forms of the words in the preceding example: "produced"
809: 1235: 1110: 972: 526:
for determining word frequency. In that usage, the specific definition of "lemma" is flexible depending on the task it is being used for.
1316: 1250: 797: 315:
is traditionally used, but some modern dictionaries use the infinitive instead (except for Bulgarian, which lacks infinitives; for
1371: 1175: 547: 433:, words are highly inflected by case (genitive, nominative, dative and vocative) and by their place within a sentence because of 924:
A minor... problem can arise when the canonical form of the headword, i.e. the form in which it is to be cited, is to be chosen.
106:
refers to the particular form that is chosen by convention to represent the lexeme. Lemmas have special significance in highly
917: 838:
The term 'lemma' comes from the practice in Greco-Roman antiquity of using the word to refer to the headwords of marginal
1326: 1240: 102:, in this context, refers to the set of all the inflected or alternating forms in the paradigm of a single word, and 1103: 956: 1004: 803: 1070: 192:, the citation form of regular adjectives and nouns is usually the masculine singular. If the language also has 868: 732:
or phrases that contain the headword, and encyclopedic information about the concepts represented by the word.
31: 1280: 1215: 1210: 1190: 815: 500: 434: 319:
in Ancient Greek, an uncontracted first person singular present tense is used to reveal the contract vowel:
234: 1361: 1321: 1285: 1255: 1220: 578:
when unstressed). Dictionaries usually give the pronunciation used when the word is pronounced alone (its
575: 571: 567: 563: 418: 1225: 1205: 1195: 1096: 423: 146:
form, but there are several exceptions such as the use of the infinitive for verbs in some languages.
1295: 1185: 543: 474: 166: 1200: 316: 230: 1338: 1366: 1342: 1270: 1265: 1260: 1245: 1180: 883: 839: 523: 421:, the verb stem (which is also the imperative form - the least marked one) is often cited, e.g., 308: 189: 174: 154: 240: 1300: 968: 913: 395: 371: 170: 1230: 986: 960: 888: 637: 602: 356: 245: 193: 119: 115: 1275: 1074: 978: 878: 832: 820: 403: 383: 359:
dictionaries list verbs not under their root, but under the first infinitive, marked with
219: 208: 131: 111: 255:. English verbs usually have an infinitive, which in its bare form (without the particle 1128: 1046: 579: 479: 430: 414: 284: 1355: 1024: 873: 847: 729: 721: 712: 551: 535: 504: 312: 300: 127: 503:, list "went". Multilingual dictionaries vary in how they deal with this issue: the 1119: 304: 35: 17: 1151: 399: 295:
has no infinitive, and both lemmas are their lexemes' present tense forms). For
831:, for instance, include exhaustive historical reviews and exact citations from 1167: 1136: 725: 708: 680: 582:) and with stress, but they may also note common weak forms of pronunciation. 495: 379: 204: 143: 107: 386:, which also uses the third-person singular masculine perfect form, e.g. ברא 224: 213: 142:
The form of a word that is chosen to serve as the lemma is usually the least
1156: 990: 982: 717: 684: 596: 591: 519: 964: 807:(OED) has around 273,000 headwords along with 220,000 other lemmas, while 1290: 1141: 539: 676: 250: 1146: 1009: 843: 675:
Some lexemes have several stems but one lemma. For instance the verb "
1067: 863: 375: 91: 478:(literally, "Carthage must be destroyed") is a common way of citing 441:, the lemma for the noun meaning "speaker", has a variety of forms: 764: 760: 756: 737: 382:
of the word, either a verb or a noun, is used. This is similar to
296: 200: 196:, the citation form is often the masculine singular nominative. 150: 67: 1092: 1088: 664: 620: 347: 338: 329: 320: 646: 611: 287:
with no infinitive the present tense is used (for example,
661: 617: 542:
environment (the neighbouring sounds) or on the degree of
655: 908:
Zgusta, Ladislav (2006). Dolezal, Fredric F. M. (ed.).
374:, the non-past (present and future) tense is used. For 134:, although lemmatisation is at least partly arbitrary. 938:
Frequency Analysis of English Usage: Lexicon and Usage
683:: the past tense was co-opted from a different verb, " 658: 623: 741:
may contain the following (simplified) definitions:
652: 643: 640: 626: 608: 605: 1309: 1165: 1126: 649: 614: 486:("I hold Carthage to be in need of destruction"). 546:in a sentence. An example of the latter is the 494:In a dictionary, the lemma "go" represents the 791:to know how to act in your own best interests. 130:. The lemma can be viewed as the chief of the 1104: 8: 835:not usually found in standard dictionaries. 810:Webster's Third New International Dictionary 755:A common food made from the combination of 472:Some phrases are cited in a sort of lemma: 199:For many languages, the citation form of a 1111: 1097: 1089: 789:to know which side your bread is buttered 98:as the lemma by which they are indexed. 900: 801:contains around 500,000 headwords. The 679:" has the stems "go" and "went" due to 173:, the citation form uses a form of the 1025:"Glossary - Oxford English Dictionary" 850:plural form is sometimes used, namely 165:. For multiword lexemes that contain 7: 1077:at the BBAW, retrieved 22-June-2012. 259:) is its least marked (for example, 149:For English, the citation form of a 1317:International scientific vocabulary 507:dictionary of German does not list 43: 936:Francis, W. N.; Kučera, H (1982). 819:(DWB), the largest lexicon of the 25: 798:Academic Dictionary of Lithuanian 586:Difference between stem and lemma 482:, but what he said was nearer to 122:. The process of determining the 1337: 636: 601: 484:censeo Carthaginem esse delendam 70:forms. In English, for example, 27:Root word of a set of word forms 1236:Language-for-specific-purposes 1: 707:under which a set of related 188:. In European languages with 126:for a given lexeme is called 854:(Greek λῆμμα, pl. λήμματα). 311:, the first person singular 1327:List of online dictionaries 940:. Boston: Houghton Mifflin. 157:(and non-possessive) form: 1388: 957:Cambridge University Press 735:For example, the headword 534:A word may have different 348: 339: 330: 321: 1335: 1049:. www.merriam-webster.com 1005:Oxford English Dictionary 910:Lexicography then and now 804:Oxford English Dictionary 728:, related lemmas such as 515:), but the Cassell does. 410:is attached to the stem. 239: 233: 1068:The Deutsches Wörterbuch 951:Rochelle Lieber (2022). 869:Lexical Markup Framework 291:has only one form while 1372:Linguistics terminology 846:; for this reason, the 813:has about 470,000. The 1322:List of lexicographers 1008:, 3rd. edition, 2018, 953:Introducing morphology 782:To coat in breadcrumbs 595:multiple roots). When 419:agglutinative language 249: 223: 212: 90:are forms of the same 1251:Monolingual learner's 965:10.1017/9781108957960 548:weak and strong forms 167:possessive adjectives 816:Deutsches Wörterbuch 501:Webster's Dictionary 475:Carthago delenda est 1291:Spelling dictionary 1201:Defining vocabulary 550:of certain English 538:, depending on its 499:dictionaries, like 108:inflected languages 18:Lemma (linguistics) 1343:Linguistics portal 1176:Advanced learner's 1073:2016-08-12 at the 884:Root (linguistics) 570:when stressed but 524:corpus linguistics 522:are used often in 190:grammatical gender 175:indefinite pronoun 171:reflexive pronouns 1349: 1348: 974:978-1-108-95796-0 634:vs. "production" 435:initial mutations 16:(Redirected from 1379: 1341: 1241:Machine-readable 1113: 1106: 1099: 1090: 1078: 1065: 1059: 1058: 1056: 1054: 1043: 1037: 1036: 1034: 1032: 1027:. public.oed.com 1021: 1015: 1001: 995: 994: 955:(3rd ed.). 948: 942: 941: 933: 927: 926: 905: 889:Uninflected word 833:source documents 671: 670: 667: 666: 663: 660: 657: 654: 651: 648: 645: 642: 633: 632: 629: 628: 625: 622: 619: 616: 613: 610: 607: 577: 573: 569: 565: 351: 350: 342: 341: 333: 332: 324: 323: 317:contracted verbs 243: 237: 45: 21: 1387: 1386: 1382: 1381: 1380: 1378: 1377: 1376: 1352: 1351: 1350: 1345: 1331: 1305: 1161: 1129:reference works 1122: 1117: 1087: 1082: 1081: 1075:Wayback Machine 1066: 1062: 1052: 1050: 1045: 1044: 1040: 1030: 1028: 1023: 1022: 1018: 1002: 998: 975: 950: 949: 945: 935: 934: 930: 920: 912:. p. 202. 907: 906: 902: 897: 879:Principal parts 860: 821:German language 693: 639: 635: 604: 600: 588: 532: 492: 285:defective verbs 263:is chosen over 186:perjure oneself 140: 132:principal parts 60:dictionary form 28: 23: 22: 15: 12: 11: 5: 1385: 1383: 1375: 1374: 1369: 1364: 1354: 1353: 1347: 1346: 1336: 1333: 1332: 1330: 1329: 1324: 1319: 1313: 1311: 1307: 1306: 1304: 1303: 1298: 1293: 1288: 1283: 1278: 1273: 1268: 1263: 1258: 1253: 1248: 1243: 1238: 1233: 1228: 1223: 1218: 1213: 1208: 1203: 1198: 1193: 1188: 1183: 1178: 1172: 1170: 1163: 1162: 1160: 1159: 1154: 1149: 1144: 1139: 1133: 1131: 1124: 1123: 1118: 1116: 1115: 1108: 1101: 1093: 1086: 1085:External links 1083: 1080: 1079: 1060: 1047:"Mwunabridged" 1038: 1016: 1014:, definition 5 996: 973: 943: 928: 918: 899: 898: 896: 893: 892: 891: 886: 881: 876: 871: 866: 859: 856: 793: 792: 785: 784: 783: 775: 774: 773: 767: 748: 730:compound words 692: 689: 587: 584: 580:isolation form 552:function words 536:pronunciations 531: 528: 491: 488: 139: 136: 56:canonical form 26: 24: 14: 13: 10: 9: 6: 4: 3: 2: 1384: 1373: 1370: 1368: 1365: 1363: 1362:Lexical units 1360: 1359: 1357: 1344: 1340: 1334: 1328: 1325: 1323: 1320: 1318: 1315: 1314: 1312: 1308: 1302: 1299: 1297: 1294: 1292: 1289: 1287: 1284: 1282: 1279: 1277: 1274: 1272: 1269: 1267: 1264: 1262: 1259: 1257: 1254: 1252: 1249: 1247: 1244: 1242: 1239: 1237: 1234: 1232: 1229: 1227: 1224: 1222: 1219: 1217: 1214: 1212: 1209: 1207: 1204: 1202: 1199: 1197: 1194: 1192: 1189: 1187: 1184: 1182: 1179: 1177: 1174: 1173: 1171: 1169: 1164: 1158: 1155: 1153: 1150: 1148: 1145: 1143: 1140: 1138: 1135: 1134: 1132: 1130: 1125: 1121: 1114: 1109: 1107: 1102: 1100: 1095: 1094: 1091: 1084: 1076: 1072: 1069: 1064: 1061: 1048: 1042: 1039: 1026: 1020: 1017: 1013: 1012: 1007: 1006: 1000: 997: 992: 988: 984: 980: 976: 970: 966: 962: 958: 954: 947: 944: 939: 932: 929: 925: 921: 915: 911: 904: 901: 894: 890: 887: 885: 882: 880: 877: 875: 874:Null morpheme 872: 870: 867: 865: 862: 861: 857: 855: 853: 849: 848:Ancient Greek 845: 841: 836: 834: 830: 826: 822: 818: 817: 812: 811: 806: 805: 800: 799: 790: 786: 781: 780: 779: 776: 772: 768: 766: 762: 758: 754: 753: 752: 749: 747: 744: 743: 742: 740: 739: 733: 731: 727: 723: 722:pronunciation 719: 714: 713:encyclopaedia 710: 706: 702: 698: 690: 688: 686: 682: 678: 673: 669: 631: 598: 593: 585: 583: 581: 561: 557: 553: 549: 545: 541: 537: 530:Pronunciation 529: 527: 525: 521: 516: 514: 510: 506: 505:Langenscheidt 502: 497: 489: 487: 485: 481: 477: 476: 470: 468: 464: 460: 456: 452: 448: 444: 440: 436: 432: 427: 426: 425: 420: 416: 411: 409: 405: 401: 397: 393: 389: 385: 381: 377: 373: 368: 366: 362: 358: 354: 345: 336: 327: 318: 314: 313:present tense 310: 306: 302: 301:Ancient Greek 298: 294: 290: 286: 282: 278: 274: 270: 266: 262: 258: 254: 253: 252: 247: 242: 236: 232: 228: 227: 226: 221: 217: 216: 215: 210: 206: 202: 197: 195: 191: 187: 183: 182:do one's best 179: 176: 172: 168: 164: 160: 156: 152: 147: 145: 137: 135: 133: 129: 128:lemmatisation 125: 121: 117: 113: 109: 105: 101: 97: 93: 89: 85: 81: 77: 73: 69: 65: 64:citation form 61: 57: 53: 49: 41: 37: 33: 19: 1281:Single-field 1216:Etymological 1211:Encyclopedic 1191:Biographical 1168:dictionaries 1120:Lexicography 1063: 1051:. Retrieved 1041: 1029:. Retrieved 1019: 1010: 1003: 999: 952: 946: 937: 931: 923: 909: 903: 851: 837: 828: 824: 814: 808: 802: 796: 794: 788: 777: 770: 750: 745: 736: 734: 704: 700: 696: 694: 674: 589: 562:(pronounced 559: 555: 533: 517: 512: 508: 493: 490:Lexicography 483: 473: 471: 466: 462: 458: 454: 450: 446: 442: 438: 428: 422: 412: 407: 391: 390:create, כפר 387: 369: 364: 360: 355:"I love" ). 352: 343: 334: 325: 305:Modern Greek 292: 288: 280: 276: 272: 268: 264: 260: 256: 248: 222: 211: 198: 185: 181: 177: 162: 161:rather than 158: 148: 141: 123: 103: 99: 95: 87: 83: 79: 75: 71: 66:of a set of 63: 59: 55: 51: 47: 39: 36:lexicography 29: 1286:Specialized 1256:Multi-field 1221:Explanatory 1152:Phrase book 726:inflections 467:gcainteoirí 463:chainteoirí 437:. The noun 400:verbal noun 337:"I love" , 1356:Categories 1226:Historical 1206:Electronic 1196:Conceptual 1137:Dictionary 991:Q125778052 919:3484391294 895:References 709:dictionary 681:suppletion 520:word stems 518:Lemmas or 459:cainteoirí 455:chainteora 447:gcainteoir 443:chainteoir 380:triliteral 231:Hindustani 205:infinitive 138:Morphology 32:morphology 1367:Morphemes 1296:Sub-field 1186:Bilingual 1166:Types of 1157:Thesaurus 1127:Types of 1053:3 October 1031:3 October 983:35578155M 718:etymology 701:catchword 597:phonology 496:inflected 451:cainteora 439:cainteoir 398:uses the 309:Bulgarian 54:) is the 1142:Glossary 1071:Archived 987:Wikidata 858:See also 827:and the 787:— 697:headword 691:Headword 540:phonetic 396:Georgian 372:Japanese 277:breaking 265:to break 155:singular 110:such as 88:breaking 1271:Rhyming 1266:Reverse 1261:Picture 1246:Medical 1181:Anagram 1147:Lexicon 852:lemmata 844:scholia 840:glosses 771:(slang) 703:is the 685:to wend 572:/s(ə)m/ 357:Finnish 283:); for 246:Spanish 203:is the 153:is the 120:Russian 116:Turkish 94:, with 52:lemmata 1301:Visual 989:  981:  971:  916:  864:Lexeme 778:(verb) 769:Money 751:(noun) 544:stress 511:(< 404:Korean 402:. For 394:deny. 392:kaphar 388:bara' 384:Hebrew 376:Arabic 344:agapáō 340:ἀγαπάω 326:philéō 307:, and 281:broken 279:, and 269:breaks 220:German 209:French 144:marked 118:, and 112:Arabic 100:Lexeme 92:lexeme 84:broken 76:breaks 48:lemmas 1310:Other 1231:Idiom 765:yeast 761:water 757:flour 746:Bread 738:bread 705:lemma 677:to go 576:/bət/ 568:/bʌt/ 564:/sʌm/ 554:like 513:gehen 431:Irish 417:, an 415:Tamil 365:-(t)ä 361:-(t)a 353:agapō 349:ἀγαπῶ 335:philō 322:φιλέω 297:Latin 293:shall 273:broke 261:break 225:gehen 214:aller 194:cases 159:mouse 124:lemma 104:lemma 96:break 80:broke 72:break 62:, or 40:lemma 1276:Rime 1055:2016 1033:2016 1011:s.v. 969:ISBN 914:ISBN 795:The 763:and 724:and 592:stem 590:The 558:and 556:some 509:ging 480:Cato 465:and 370:For 346:for 331:φιλῶ 328:for 289:must 241:جانا 235:जाना 201:verb 163:mice 151:noun 86:and 68:word 38:, a 34:and 961:doi 842:in 829:DWB 825:OED 711:or 699:or 687:". 560:but 429:In 424:இரு 413:In 408:-da 178:one 169:or 50:or 44:pl. 30:In 1358:: 985:. 979:OL 977:. 967:. 959:. 922:. 759:, 720:, 695:A 672:. 665:ən 621:uː 618:dj 574:, 566:, 469:. 461:, 457:, 453:, 449:, 445:, 406:, 367:. 363:, 303:, 299:, 275:, 271:, 267:, 257:to 251:ir 244:, 229:, 218:, 207:: 184:, 180:: 114:, 82:, 78:, 74:, 58:, 46:: 1112:e 1105:t 1098:v 1057:. 1035:. 993:. 963:: 668:/ 662:ʃ 659:k 656:ʌ 653:d 650:ˈ 647:ə 644:r 641:p 638:/ 630:/ 627:t 624:s 615:ˈ 612:ə 609:r 606:p 603:/ 238:/ 42:( 20:)

Index

Lemma (linguistics)
morphology
lexicography
word
lexeme
inflected languages
Arabic
Turkish
Russian
lemmatisation
principal parts
marked
noun
singular
possessive adjectives
reflexive pronouns
indefinite pronoun
grammatical gender
cases
verb
infinitive
French
aller
German
gehen
Hindustani
जाना
جانا
Spanish
ir

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.