3245:साङ्ख्य-योग-न्याय-वैशेषिक-पूर्व-मीमांसा-वेदान्त-नारद-शाण्डिल्य-भक्ति-सूत्र-गीता-वाल्मीकीय-रामायण-भागवतादि-सिद्धान्त-बोध-पुरः-सर-समधिकृताशेष-तुलसी-दास-साहित्य-सौहित्य-स्वाध्याय-प्रवचन-व्याख्यान-परम-प्रवीणाः (IAST sāṅkhya-yoga-nyāya-vaiśeṣika-pūrva-mīmāṃsā-vedānta-nārada-śāṇḍilya-bhakti-sūtra-gītā-vālmīkīya-rāmāyaṇa-bhāgavatādi-siddhānta-bodha-puraḥ-sara-samadhikṛtāśeṣa-tulasī-dāsa-sāhitya-sauhitya-svādhyāya-pravacana-vyākhyāna-parama-pravīṇāḥ): Literally
2210:
3064:" would be a "ball" in the "foot" context. In itself, this does not alter the meaning of the main word. The added context only makes it more precise. As such, a "football" must be understood as a "ball". However, as is the case with "football", a well established compound word may have gained a special meaning in the language's
3101:
the
Germanic family of languages, English is unusual in that compounds are normally written in separate parts. This would be an error in other Germanic languages such as Norwegian, Swedish, Danish, German and Dutch. However, this is merely an orthographic convention: As in other Germanic languages, arbitrary
3100:
Because a compound is understood as a word in its own right, it may in turn be used in new compounds, so forming an arbitrarily long word is trivial. This contrasts to
Romance languages, where prepositions are more used to specify word relationships instead of concatenating the words. As a member of
724:
All natural languages have compound nouns. The positioning of the words (i.e. the most common order of constituents in phrases where nouns are modified by adjectives, by possessors, by other nouns, etc.) varies according to the language. While
Germanic languages, for example, are left-branching when
3088:
and inflection rules of the main word. That is to say, since "fish" and "shape" are nouns, "starfish" and "star shape" must also be nouns, and they must take plural forms as "starfish" and "star shapes", definite singular forms as "the starfish" and "the star shape", and so on. This principle also
470:
Compounds can be rather long when translating technical documents from
English to some other language, since the lengths of the words are theoretically unlimited, especially in chemical terminology. For example, when translating an English technical document to Swedish, the term "Motion estimation
70:
that creates compound lexemes. Compounding occurs when two or more words or signs are joined to make a longer word or sign. Consequently, a compound is a unit composed of more than one stem, forming words or signs. If the joining of the words or signs is orthographically represented with a hyphen,
3187:
Sanskrit is very rich in compound formation with seven major compound types and as many as 55 sub-types. The compound formation process is productive, so it is not possible to list all
Sanskrit compounds in a dictionary. Compounds of two or three words are more frequent, but longer compounds with
2309:
967:'wardrobe' (lit. 'store clothes'). These compounds are formally invariable in the plural (but in many cases they have been reanalyzed as plural forms, and a singular form has appeared). French and Italian have these same compounds with the noun in the singular form: Italian
3378:, in recent decades written English has displayed a noticeable trend towards increased use of compounds. Recently, many words have been made by taking syllables of words and compounding them, such as pixel (picture element) and bit (binary digit). This is called a
3401:(the "English disease"). Because the Dutch language and the Scandinavian languages rely heavily on the distinction between the compound word and the sequence of the separate words it consists of, this has serious implications. For example, the Danish adjective
576:
of some unexpressed semantic category (such as a person, plant, or animal): none (neither) of its components can be perceived as a formal head, and its meaning often cannot be transparently guessed from its constituent parts. For example, the
English compound
585:
is not a verb but a noun. The meaning of this type of compound can be glossed as "(one) whose B is A", where B is the second element of the compound and A the first. A bahuvrihi compound is one whose nature is expressed by neither of the words: thus a
3144:'academic village'). In agglutinative compound nouns, an agglutinating infix is typically used: пароход 'steamship': пар + о + ход. Compound nouns may be created as noun+noun, adjective + noun, noun + adjective (rare), noun + verb (or, rather, noun +
454:
3300:. On including word boundaries with sandhi (vedānta=veda-anta, rāmāyaṇa=rāma-ayana, bhāgavatādi=bhāgavata-ādi, siddhānta=siddha-anta, samadhikṛtāśeṣa=samadhikṛta-aśeṣa, svādhyāya=sva-adhyāya), this is a compound of 35 words and 86 syllables.
418:, extremely extendable compound words can be found in the language of chemical compounds, where, in the cases of biochemistry and polymers, they can be practically unlimited in length, mostly because the German rule suggests combining all
3365:
there is another process easily compared to compounding. Blending is the blending of two morphemes to create a new word called a portmanteau. This is different from compounding in that it breaks the strict linear order of compounding.
3151:
Compound adjectives may be formed either per se (бело-розовый 'white-pink') or as a result of compounding during the derivation of an adjective from a multi-word term: Каменноостровский проспект () 'Stone Island Avenue', a street in
3222:कमला-कुच-कुङ्कुम-पिञ्जरीकृत-वक्षः-स्थल-विराजित-महा-कौस्तुभ-मणि-मरीचि-माला-निराकृत-त्रि-भुवन-तिमिर (IAST kamalā-kuca-kuṅkuma-piñjarīkṛta-vakṣaḥ-sthala-virājita-mahā-kaustubha-maṇi-marīci-mālā-nirākṛta-tri-bhuvana-timira): Literally
1217:), one of the verbs is the primary, and determines the primary semantics and also the argument structure. The secondary verb, often called a vector verb or explicator, provides fine distinctions, usually in temporality or
931:
marker li-/la ‘for/of’ appears or is absent when the first element is definite. The second criterion deals with the appearance/absence of the possessive marker li-/la ‘for/of’ when the first element is preceded by a
3971:
Sanskrit
Computational Linguistics: 4th International Symposium, New Delhi, India, December 10–12, 2010: Proceedings (Volume 6465 of Lecture Notes in Computer Science / Lecture Notes in Artificial Intelligence
222:
Some languages easily form compounds from what in other languages would be a multi-word expression. This can result in unusually long words, a phenomenon known in German (which is one such language) as
3309:
Also in sign languages, compounding is a productive word formation process. Both endocentric and exocentric compounds have been described for a variety of sign languages. Copulative compounds or
1301:. With a few exceptions, all compound verbs alternate with their simple counterparts. That is, removing the vector does not affect grammaticality at all nor the meaning very much: निकला
461:
In
Finnish, although there is theoretically no limit to the length of compound words, words consisting of more than three components are rare. Internet folklore sometimes suggests that
458:("Association for Subordinate Officials of the Main Electric Maintenance Building of the Danube Steam Shipping"), but there is no evidence that this association ever actually existed.
4346:
3432:
compound nouns when it enhances comprehensibility and readability. This is done mostly with very long compound words by separating them into two or more smaller compounds, like
467:(airplane jet turbine engine auxiliary mechanic non-commissioned officer student) is the longest word in Finnish, but evidence of its actual use is scant and anecdotal at best.
4506:
581:
is neither a kind of collar nor a white thing. In an exocentric compound, the word class is determined lexically, disregarding the class of the constituents. For example, a
4292:
3192:
are not rare in
Sanskrit literature. Some examples are below (hyphens below show individual word boundaries for ease of reading but are not required in original Sanskrit).
4310:
4298:
4268:
4262:
4304:
4286:
857:, whereby compound nouns are virtually always required to be solid or at least hyphenated; even the hyphenated styling is used less now than it was in centuries past.
155:
is unusual in that even simple compounds made since the 18th century tend to be written in separate parts. This would be an error in other
Germanic languages such as
3124:
Compound nouns may be agglutinative compounds, hyphenated compounds (стол-книга 'folding table', lit. 'table-book', "book-like table"), or abbreviated compounds (
1707:, etc.). Hindi has a small number of simple (i.e., one-word) postpositions and a large number of compound postpositions, mostly consisting of simple postposition
2940:(Non-standard Tamil), parts of words from other languages may be morphed into Tamil. Common examples include 'ratta-azhuttam' (blood pressure) from the Sanskrit
536:, i.e. the categorical part that contains the basic meaning of the whole compound, and modifiers, which restrict this meaning. For example, the English compound
106:
The meaning of the compound may be similar to or different from the meaning of its components in isolation. The component stems of a compound may be of the same
4484:
3559:
3313:, which are composed of two or more nouns from the same semantic category to denote that semantic category, also occur regularly in many sign languages. The
448:("beginning of the discussion of a regulation on tendering of Danube steamboat shipping company captain hats") etc. According to several editions of the
3341:
are signed together to create a new word meaning breakfast. This is an example of a sequential compound; in sign languages, it is also possible to form
4895:
4339:
4256:
955:, for example, such compounds consist of a verb conjugated for the second person singular imperative followed by a noun (singular or plural): e.g.,
192:
would be written in English as "Danube steamship transport company captain" and not as "Danubesteamshiptransportcompanycaptain".
5064:
3105:, for example "girl scout troop", "city council member", and "cellar door", can be made up on the spot and used as compound nouns in English too.
183:, for example "girl scout troop", "city council member", and "cellar door", can be made up on the spot and used as compound nouns in English too.
3593:
Donau - This refers to the Danube River, the second-longest river in Europe, which flows through several countries including Germany and Austria.
1201:; this is however not quite a true compound since they are connected by a conjunction and the second missing arguments may be taken as a case of
3444:(Bokmål and Nynorsk alike), and is even encouraged between parts of the word that have very different pronunciation, such as when one part is a
195:
The meaning of compounds may not always be transparent from their components, necessitating familiarity with usage and context. The addition of
446:
Donaudampfschifffahrtsgesellschaftskapitänsmützenreinigungsausschreibungsverordnungsdiskussionsanfang
3979:
3771:
3729:
3671:
3643:
3761:
1285:"reading" usually remain the same. An exception to this is the passive voice, in which both English and Japanese modify the main verb, i.e.
1190:, a Dravidian language, van̪t̪u paːr, lit. "come see". In each case, the two verbs together determine the semantics and argument structure.
4805:
4496:
4332:
3719:
3611:
So, "Donaudampfschiffahrtsgesellschaftskapitänsmütze" can be understood as the "Danube steam navigation company captain's cap" in English.
3409:("smoke freely"). In Dutch, compounds written with spaces may also be confused, but can also be interpreted as a sequence of a noun and a
5059:
3084:
in modern biology. Also syntactically, the compound word behaves like the main word – the whole compound word (or phrase) inherits the
4666:
1044:, it is claimed that all verbs are V+N compounds, such as "do a sleep", or "run a dive", and the language has only three basic verbs:
411:
tend to create very long words with derivational morphemes. Compounds may or may not require the use of derivational morphemes also.
4316:
3345:
compounds, where one hand represents one lexeme while the other simultaneously represents another lexeme. An example is the sign for
4820:
4651:
4280:
4274:
4250:
4095:
3952:
3910:
441:
3121:, and several types of compounds exist, both in terms of compounded parts of speech and of the way of the formation of a compound.
4591:
3350:
3211:प्रवर-मुकुट-मणि-मरीचि-मञ्जरी-चय-चर्चित-चरण-युगल (IAST pravara-mukuṭa-maṇi-marīci-mañjarī-caya-carcita-caraṇa-yugala): Literally,
5008:
4661:
3633:
5105:
4656:
4401:
3590:
The word illustrates where compounds can be formed by combining multiple words together to create new, often complex, terms.
3170:
2310:
Rinderkennzeichnungs- und Rindfleischetikettierungsüberwachungsaufgabenübertragungsgesetz
449:
4071:
1085:
Verb–verb compounds are sequences of more than one verb acting together to determine clause structure. They have two types:
4143:"Deaf Culture: Exploring Deaf Communities in the United States by Irene W. Leigh, Jean F. Andrews, and Raychelle L. Harris"
3417:("commission meeting") being read as "commission of the meeting" rather than "meeting of the commission" (normally spelled
4925:
4646:
731:
can be spaced, hyphenated, or solid, and they sometimes change orthographically in that direction over time, reflecting a
4618:
864:, compound nouns are often formed by left-hand heads with prepositional components inserted before the modifier, as in
4963:
4948:
4920:
4785:
4780:
4355:
4001:
3901:
Chiara Melloni, Antonietta Bisetto, "Parasynthetic compounds: data and theory", in Sergio Scalies, Irene Vogel, eds.,
3513:
3076:, at that), and also the game involving such a ball. Another example of special and altered meaning is "starfish" – a
1060:
1010:
359:, though in some it is combined with an explicit genitive case, so that both parts of the compound are marked, e.g.
4700:
4671:
4449:
4189:
1041:
725:
it comes to noun phrases (the modifiers come before the head), the Romance languages are usually right-branching.
4543:
4396:
3883:
1009:
Also common in English is another type of verb–noun (or noun–verb) compound, in which an argument of the verb is
5069:
4993:
4725:
4681:
4566:
4464:
3165:
Quite a few Russian words are borrowed from other languages in an already-compounded form, including numerous "
3159:
3094:
1648:
1091:
216:
3969:
Kumar, Anil; Mittal, Vipul; Kulkarni, Amba (2010). "Sanskrit Compound Processor". In Jha, Girish Nath (ed.).
1611:
If you are not giving away free information on your web site then a huge proportion of your business is just
978:
This construction exists in English, generally with the verb and noun both in uninflected form: examples are
614:
tradition) are compounds with two semantic heads, for example in a gradual scale (such as a mix of colours).
455:
Donaudampfschiffahrtselektrizitätenhauptbetriebswerkbauunterbeamtengesellschaft
4973:
4943:
4610:
3538:
1344:
1202:
945:
850:
1638:
the term 'compound verb' refers to noun-plus-verb compounds, not to the verb–verb compounds discussed here.
1331:, where as many as 20% of verb forms in running text are compound. They exist but are less common in other
743:
guides under the aphorism that "compound nouns tend to solidify as they age"; thus a compound noun such as
548:
is the modifier, is understood as a house intended for a dog. Endocentric compounds tend to be of the same
4830:
4523:
4501:
4491:
4459:
4434:
3425:
3386:
3362:
3330:
408:
5100:
4690:
3554:
3441:
3379:
3322:
3287:; and the most skilled in comprehensive self-study, discoursing and expounding of the complete works of
3228:
jewel hanging on the chest, which has been made reddish-yellow by the saffron from the bosom of Kamalā (
3213:
O the one whose dual feet are covered by the cluster of brilliant rays from the gems of the best crowns
3405:(literally "smokefree", meaning no smoking allowed) if separated into its composite parts, would mean
1225:(tense and/or agreement markers). The main verb usually appears in conjunctive participial (sometimes
5043:
4719:
4695:
4548:
3523:
1525:
1332:
339:(school), it is the head that is modified: the compound literally means "house-of book", with בַּיִת
204:
141:
5023:
4953:
4910:
4866:
4638:
4628:
4623:
4511:
3503:
3272:
1699:
formed by prepositions and nouns are common in English and the Romance languages (consider English
1324:
1316:
240:
3374:
Although there is no universally agreed-upon guideline regarding the use of compound words in the
5033:
4905:
4770:
4533:
4516:
4374:
4170:
3549:
3508:
3498:
3476:
3461:
3182:
3166:
3133:
3073:
3049:
1218:
948:
is formed of a verb and its object, and in effect transforms a simple verbal clause into a noun.
854:
532:
266:
251:
156:
148:
1025:, etc. The noun is often an instrumental complement. From these gerunds new verbs can be made:
5038:
4750:
4558:
4469:
4162:
4091:
3975:
3948:
3906:
3852:
3767:
3725:
3667:
3639:
3493:
2373:
1635:
1372:
1305:'(He) went out.' In a few languages both components of the compound verb can be finite forms:
924:
611:
527:
356:
3787:
2956:
in Tamil order to remove the consonant-cluster. This also happens with English, for examples
4915:
4800:
4775:
4576:
4479:
4154:
4083:
3844:
3471:
3466:
3375:
3284:
3114:
3053:
1631:
1360:
1356:
1336:
1320:
1315:
Compound verbs are very common in some languages, such as the northern Indo-Aryan languages
1037:
952:
876:
728:
620:
are lexemes that have two (contrary or simultaneous) attributes that classify the compound.
160:
152:
3440:(car manual). Such practice is also permitted in other Germanic languages, e.g. Danish and
3413:(which is unmarked in Dutch) in formal abbreviated writing. This may lead to, for example,
2209:
5095:
5027:
4988:
4983:
4851:
4581:
4454:
4429:
4411:
3792:
3481:
3314:
3293:
2976:
2748:
2657:
1368:
1364:
1340:
1306:
933:
861:
415:
346:
330:
274:
244:
168:
164:
3832:
269:, the relationship between the elements of a compound may be marked with a case or other
2921:' (exterior). Sometimes phonemes may be inserted during the blending process such as in
2643:'outside' (As two nouns compound the consonant sound 'b' fortifies into 'p' becoming 안팎
4735:
4715:
4439:
3736:
The compound can be a closed compound, where the two words are written together (e.g.,
3596:
Dampf - Means "steam" in German, often used in the context of steamboats or steamships.
3565:
3132:'). Some compounds look like acronym, while in fact they are an agglutinations of type
3118:
1352:
1348:
1328:
1187:
549:
172:
107:
67:
522:
5089:
4998:
4810:
4790:
4571:
4174:
4045:
3276:
3153:
3137:
1496:
1380:
1376:
1080:
300:
248:
233:
125:—or they may belong to different parts of speech, as in the case of the English word
47:
4226:, eds. Lieber, Rochelle & Pavol Štekauer, 2009. Oxford: Oxford University Press.
4023:
3389:
there is an unofficial trend toward splitting compound words, known in Norwegian as
4978:
4070:
Quer, Josep; Cecchetto, Carlo; Donati, Caterina; Geraci, Carlo, eds. (2017-11-20).
3570:
3533:
3353:, which is produced by simultaneously signing a one-handed version of the sign for
3216:
3102:
3090:
1652:
1100:
419:
4117:
3325:, for instance, is a combination of the nouns 'father' and 'mother'. The sign for
1245:, lit. "exit fell", means 'departed' or 'was blurted out'. In these examples निकल
444:
captain hat), which can of course be made even longer and even more absurd, e.g.
4935:
4815:
4528:
4444:
4421:
4369:
3923:
3805:
a compound whose word components are separated by a space in printing or writing
3543:
3260:
3145:
2898:
2573:
1696:
1277:"start" change according to tense, negation, and the like, while the main verbs
751:
745:
736:
515:
180:
176:
31:
17:
3068:. Only this defines "football" as a particular type of ball (unambiguously the
2314:: literally cattle-marking- and beef-labeling-supervision-duties-delegation law
1095:, two actions, often sequential, are expressed in a single clause. For example:
739:
to something stronger in its solidification. This theme has been summarized in
464:
lentokonesuihkuturbiinimoottoriapumekaanikkoaliupseerioppilas
236:
as well, where compounds are also created by combining two or more sign stems.
4538:
4324:
4087:
3650:
English compounds cannot be defined as an uninterrupted sequence of characters
3085:
3065:
1222:
928:
763:
757:
561:
4166:
3856:
3666:(17th ed.). Chicago: The University of Chicago Press. pp. 443–444.
4406:
3996:
3833:"Identifying N+N compounding in Modern Standard Arabic and Jordanian Arabic"
3528:
3445:
3225:
3057:
1391:
906:
Occasionally, two synonymous nouns can form a compound noun, resulting in a
769:
732:
568:
314:
130:
51:
4142:
2902:
2897:(Classical Tamil), rules for compounding are laid down in grammars such as
4158:
912:
661:
A+B denotes a special kind of an unexpressed different semantic meaning C
4881:
4861:
4846:
4825:
4795:
4740:
4705:
4586:
3410:
3288:
3239:
3201:
3077:
3061:
1795:
907:
270:
3888:
2760:
sounds like a kind of fictional science rather than scientific fiction.)
2752:. In English, the head of a compound word is the last morpheme: science
5018:
4876:
4856:
4730:
4474:
4389:
3848:
3518:
3449:
3310:
3280:
3268:
3264:
3248:
3229:
3129:
3125:
2912:
1647:
Parasynthetic compounds are formed by a combination of compounding and
606:
591:
573:
215:) should not be confused with nominal composition, as this is actually
3678:
Compounds defined. An open compound is spelled as two or more words (
3599:
Schiffahrt - This translates to "navigation" or "shipping" in English.
1507:
4384:
4379:
3429:
3297:
2743:
1791:
1387:
1014:
920:
814:
749:
begins as spaced in most attestations and then becomes hyphenated as
200:
39:
3224:
O the one who dispels the darkness of three worlds by the shine of
2213:
Sewage-treatment-facility – The German language has many compounds.
923:, there are two distinct criteria unique to Arabic, or potentially
477:, though in reality, the word would most likely be divided in two:
5074:
4710:
3817:
3256:
2918:
2208:
1596:
Compound verb equivalents in English (examples from the internet):
1230:
1132:
740:
437:
Donaudampfschifffahrtsgesellschaftskapitänsmütze
304:
196:
1163:
496:
A common semantic classification of compounds yields four types:
378:
4596:
3296:. The hyphens show only those word boundaries where there is no
3252:
3235:
3197:
3081:
3069:
2930:
2922:
1063:, of which noun incorporation into a verbal root (as in English
474:
rörelseuppskattningssökintervallsinställningar
115:
43:
4328:
4257:
Compounds and multi-word expressions in the languages of Europe
3686:). A hyphenated compound is spelled with one or more hyphens (
590:
person is neither white nor a collar (the collar's colour is a
392:
4871:
1511:
480:
sökintervallsinställningar för rörelseuppskattning
3698:). A closed (or solid) compound is spelled as a single word (
2356:
2341:
2322:
1687:(verbal suffix), with the meaning 'to put into an envelope'.
1176:
1145:
699:
A and B provide different descriptions for the same referent
3945:
Student Dictionary of Compound Words of the Russian Language
3740:), an open compound, where they are written separate (e.g.,
3056:), compounds are formed by prepending what is effectively a
1149:
927:
in general. The initial criterion involves whether the
834:
When written, there is no space or intervening punctuation.
262:
Compound formation rules vary widely across language types.
189:
Donaudampfschifffahrtsgesellschaftskapitän
83:. If they are joined without an intervening space, it is a
4118:"Word formation: compounding and blending in sign language"
3924:"Diccionario De La Lengua Española : limpiaparabrisas"
3247:
the acclaimed forerunner in understanding of the canons of
2756:. Conversely, the Spanish head is located at the front, so
1790:
Chinese (traditional/simplified Chinese; Standard Chinese
594:
for socioeconomic status). Other English examples include
452:, the longest published German word has 79 letters and is
434:(radio remote control), and the often quoted jocular word
388:
3060:(disambiguation context) to the main word. For example, "
1671:'having', with the meaning 'having a black eye'; Italian
355:(house-of). This latter pattern is common throughout the
27:
Lexeme (word or sign) that consists of more than one stem
422:
with the noun as the last stem. German examples include
140:. With very few exceptions, English compound words are
872:'windmill', lit. 'mill (that works)-by-means-of wind'.
232:
Compounding extends beyond spoken languages to include
3602:
Gesellschaft - Means "society" or "company" in German.
2614:
2604:
2551:
2205:'electricity meter': 'three-phase kilowatt hour meter'
1241:, lit. "exit went", means 'went out', while निकल पड़ा
179:
convention: as in other Germanic languages, arbitrary
4065:
4063:
3608:
Mütze - This translates to "cap" or "hat" in English.
2925:
from 'kō' (king) + 'il' (home). Other types are like
471:
search range settings" can be directly translated to
99:), then the result – at least in English – may be an
2648:
5052:
5007:
4962:
4934:
4894:
4839:
4761:
4749:
4680:
4637:
4609:
4557:
4420:
4362:
1013:into the verb, which is then usually turned into a
879:, one way of forming compound nouns is as follows:
3744:), or hyphenated, with a hyphen in between (e.g.,
959:(modelled on "skyscraper", lit. 'scratch skies'),
97:school bus, high school, lowest common denominator
4293:Compounds and multi-word expressions in Hungarian
3760:McArthur, Thomas Burns; McArthur, Roshan (2005).
3200:Himālaya, decomposed as hima-ālaya): Name of the
1257:are the vector verbs. Similarly, in both English
483:– "search range settings for motion estimation".
4236:Scalise Sergio & Irene Vogel (eds.) (2010),
3763:Concise Oxford Companion to the English Language
3397:(literally "separate writing"), and in Dutch as
944:A type of compound that is fairly common in the
796:The words are not visibly connected in writing.
4317:Compounds and multi-word expressions in Spanish
4311:Compounds and multi-word expressions in Russian
4299:Compounds and multi-word expressions in Italian
4269:Compounds and multi-word expressions in Finnish
4263:Compounds and multi-word expressions in English
4259:by Rita Finkbeiner and Barbara Schlücker, 2019
3721:The Routledge Companion to Linguistics in India
2539:pomodoro: pomo d'oro = apple of Gold = tomatoes
2458:
2450:
2444:
2438:
2432:
2426:
2420:
2404:
2396:
2390:
2384:
1655:and a derivational affix. For example, English
1193:Serial verb expressions in English may include
4305:Compounds and multi-word expressions in Polish
4281:Compounds and multi-word expressions in German
4275:Compounds and multi-word expressions in French
3831:Altakhaineh, Abdel Rahman Mitib (2018-02-27).
552:(word class) as their head, as in the case of
4340:
4287:Compounds and multi-word expressions in Greek
4233:, Cambridge University Press, Cambridge 2003.
3632:Sanchez-Stockhammer, Christina (2018-05-03).
3208:. A compound of two words and four syllables.
2880:
2874:
2865:
2859:
2853:
2845:
2839:
2823:
2815:
2809:
2803:
2795:
2789:
2783:
2775:
2769:
2763:
2737:
2731:
2725:
2531:
2525:
2519:
2511:
2505:
2499:
2491:
2485:
2479:
2200:
2192:
2186:
2180:
2174:
2166:
2160:
2154:
2146:
2140:
2134:
2126:
2120:
2114:
2106:
2100:
2094:
1943:arbeidsongeschiktheidsverzekering
680:A+B denotes 'the sum' of what A and B denote
478:
472:
462:
8:
3560:Tweebuffelsmeteenskootmorsdoodgeskietfontein
3219:. A compound of nine words and 25 syllables.
2307:
2299:
2293:
2287:
2279:
2273:
2267:
2261:
2253:
2247:
2241:
2233:
2227:
2221:
1778:
1769:
1760:
1749:
1740:
1731:
1390:substrate speakers living in the Ecuadorian
1375:, and in northeast Caucasian languages like
435:
429:
423:
366:
350:
340:
334:
290:
284:
278:
224:
187:
4190:"The Rise and Fall of the English Sentence"
2910:
2202:kolmivaihekilowattituntimittari
2081:
2075:
2069:
2063:
2055:
2049:
2043:
2035:
2029:
2023:
2017:
2009:
2003:
1997:
1989:
1983:
1973:
1969:rioolwaterzuiveringsinstallatie
1967:
1959:
1953:
1947:
1941:
1711:followed by a specific postposition (e.g.,
1233:
324:
318:
308:
4758:
4554:
4347:
4333:
4325:
3638:. Cambridge University Press. p. 26.
3605:Kapitän - Refers to the captain of a ship.
3242:. A compound of 16 words and 44 syllables.
1269:-start" "start reading", the vector verbs
95:). If they are joined with a space (e.g.
3662:University of Chicago press, ed. (2017).
3357:and a one-handed version of the sign for
2858:'windshield wiper' is a nested compound:
1394:have innovated compound verbs in Spanish:
4238:Cross-Disciplinary Issues in Compounding
4046:"Virudavali – Jagadguru Rambhadracharya"
3903:Cross-Disciplinary Issues in Compounding
3766:. Oxford University Press. p. 237.
2929:(butter) from 'veḷḷai' (white) + 'nei' (
775:
622:
404:"servant of-the-god: the servant of God"
3624:
3583:
1312:lit. "died-3pl went-3pl" '(They) died.'
1071:, etc.) is most prevalent (see below).
3964:
3962:
3960:
3292:. An adjective used in a panegyric of
2828:'keeping the head low in a bad mood':
1036:In the Australian Aboriginal language
3871:Jingulu Grammar, Dictionary and Texts
7:
4806:Simple Knowledge Organization System
3635:English Compounds and their Spelling
3333:follows the same concept. The words
2542:portacenere = porta cenere = ashtray
1475:'In anger (he/she) smashed the pot.'
868:'railway', lit. 'road of iron', and
110:—as in the case of the English word
3718:Nagarajan, Hemalatha (2022-10-20).
2933:); note how 'veḷḷai' becomes 'ven'.
2905:, in various forms, under the name
2022:'customer service representative':
1779:
1770:
1761:
1750:
1741:
1732:
367:
4319:by Jesús Fernández-Domínguez, 2019
4224:The Oxford Handbook of Compounding
1864:building (with more than 1 storey)
1524:Unknown glossing abbreviation(s) (
910:. One example is the English word
735:identity that evolves from a mere
25:
4821:Thesaurus (information retrieval)
3428:of 1996 introduced the option of
3089:holds for languages that express
2068:'possibilities for advancement':
1723:Examples from different languages
1059:A special kind of compounding is
963:'corkscrew' (lit. 'pull corks'),
442:Danube-Steamboat-Shipping Company
50:) that consists of more than one
4082:. De Gruyter. pp. 163–270.
3873:, PhD thesis (MIT, 1992), 138–9.
3438:Kraftfahrzeugs-Betriebsanleitung
3351:Sign Language of the Netherlands
3117:compounding is a common type of
2948:'azhuttam' (pressure); note how
2909:. Examples of compounds include
2873:, which is itself a compound of
1486:'We will kill you.' (Cf. Quichua
1195:What did you go and do that for?
1027:(a mother) breastfeeds (a child)
975:'toaster' (lit. 'toast bread').
855:a somewhat different orthography
777:Types of English compound nouns
642:A+B denotes a special kind of B
572:in the Sanskrit tradition) is a
4218:English Linguistics: Essentials
4022:Kumar, Animesh (May 23, 2007).
3162:is also a source of compounds.
1929:謝謝/谢谢 'thanks': Repeating of 謝
333:compound, the word בֵּית סֵפֶר
144:on their first component stem.
4402:Natural language understanding
2960:(coffee shop) is from English
2045:universiteitsbibliotheek
2019:klantenservicemedewerker
761:, or the spaced compound noun
450:Guinness Book of World Records
1:
4926:Optical character recognition
2609:'national diet building': 国会
1894:百科全書/百科全书 'encyclopaedia': 百
1624:herself a house out of brush.
1249:is the primary verb, and गया
329:(family). Conversely, in the
243:" are compounds derived from
175:. However, this is merely an
4619:Multi-document summarization
4277:by Kristel Van Goethem, 2018
4188:Sedivy, Julie (2017-11-16).
4048:. Shri Tulsi Peeth Seva Nyas
3974:. Springer. pp. 57–69.
3370:Recent trends in orthography
2746:from the English expression
2742:, 'fiction' (This word is a
2564:'awakening (someone)' (+ 時計
2065:doorgroeimogelijkheden
1630:Caution: In descriptions of
1029:and from them new compounds
303:suffix); and similarly, the
4949:Latent Dirichlet allocation
4921:Natural language generation
4786:Machine-readable dictionary
4781:Linguistic Linked Open Data
4356:Natural language processing
4313:by Ingeborg Ohnheiser, 2019
4307:by Bozena Cetnarowska, 2019
4002:Online Etymology Dictionary
3887:, Third Edition, June 2005
3664:The Chicago manual of style
3514:Incorporation (linguistics)
3173:: автомобиль 'automobile'.
2911:
2649:
2615:
2605:
2552:
1158:
1140:
817:is used to join the words.
351:
345:(house) having entered the
341:
335:
5122:
4701:Explicit semantic analysis
4450:Deep linguistic processing
4289:by Maria Koliopoulou, 2019
4283:by Barbara Schlücker, 2019
3684:lowest common denominoator
3204:mountain range. Literally
3180:
2463:'telephone conversation':
2357:
2342:
2323:
1972:'sewage treatment plant':
1768:'the anaesthesia needle':
1078:
1031:mother-child breastfeeding
773:without being hyphenated.
767:directly becomes solid as
4544:Word-sense disambiguation
4397:Computational linguistics
4301:by Francesca Masini, 2019
4231:Word-formation in English
4220:, Cornelsen, Berlin 2005.
4088:10.1515/9781501511806-009
3884:Oxford English Dictionary
3573:: a phrase becomes a word
3562:, South African placename
3294:Jagadguru Rambhadracharya
3275:, Śāṇḍilya Bhakti Sūtra,
3215:, from the Sanskrit work
2844:'seesaw' (contraction of
2709:'white person/American':
2536:'crystal (pane of) glass'
2437:'study' or 'knowledge' +
1999:verjaardagskalender
1386:Under the influence of a
1234:
530:tradition) consists of a
440:(originally only two Fs,
5070:Natural Language Toolkit
4994:Pronunciation assessment
4896:Automatic identification
4726:Latent semantic analysis
4682:Distributional semantics
4567:Compound-term processing
4465:Named-entity recognition
4141:Hill, Joseph C. (2017).
3926:. Real Academia Española
3724:. Taylor & Francis.
3160:Reduplication in Russian
2968:in Tamil, and the Tamil
2635:'inside and outside': 안
2353:'people' + κράτος 'rule'
1946:'disability insurance':
1840:摩天樓/摩天楼 'skyscraper': 摩
1739:'the physics teachers':
755:and eventually solid as
715:Syntactic classification
428:(color television set),
283:consists of the lexemes
217:morphological derivation
4974:Automated essay scoring
4944:Document classification
4611:Automatic summarization
4271:by Irma Hyvärinen, 2019
4240:, Amsterdam, Benjamins.
4024:"Sruti Krta Rama Stuti"
3539:Phono-semantic matching
3456:Compounding by language
3436:(railway underpass) or
3136:+ word: Академгородок '
2364:'rose-fingered': ῥόδον
1643:Parasynthetic compounds
1517:huañuchi-shpa shitashun
1345:Tibeto-Burman languages
1221:, and also carries the
946:Indo-European languages
492:Semantic classification
431:Funkfernbedienung
409:Agglutinative languages
401:servant-NOM DEF-god-GEN
295:(license) joined by an
4831:Universal Dependencies
4524:Terminology extraction
4507:Semantic decomposition
4502:Semantic role labeling
4492:Part-of-speech tagging
4460:Information extraction
4445:Coreference resolution
4435:Collocation extraction
4295:by Ferenc Kiefer, 2019
3818:"Seattle FinnFest '09"
3788:"open compound (noun)"
3434:Eisenbahn-Unterführung
3426:German spelling reform
3387:Scandinavian languages
3363:American Sign Language
3331:American Sign Language
2881:
2875:
2866:
2860:
2854:
2846:
2840:
2824:
2816:
2810:
2804:
2796:
2790:
2784:
2776:
2770:
2764:
2738:
2732:
2726:
2532:
2526:
2520:
2512:
2506:
2500:
2492:
2486:
2480:
2459:
2451:
2445:
2439:
2433:
2427:
2421:
2405:
2397:
2391:
2385:
2308:
2300:
2294:
2288:
2280:
2274:
2268:
2262:
2254:
2248:
2242:
2234:
2228:
2222:
2214:
2201:
2193:
2187:
2181:
2175:
2167:
2161:
2155:
2147:
2141:
2135:
2127:
2121:
2115:
2107:
2101:
2095:
2082:
2076:
2070:
2064:
2056:
2050:
2048:'university library':
2044:
2036:
2030:
2024:
2018:
2010:
2004:
1998:
1990:
1984:
1974:
1968:
1960:
1954:
1948:
1942:
1910:(branch of) study + 全
1737:muʕallim-uu l-fiizyaaʔ
1589:we will kill-throw you
1229:) form. For examples,
1199:He just upped and left
1155:
1137:
1113:
1105:
851:West Germanic language
729:English compound nouns
618:Appositional compounds
479:
473:
463:
436:
430:
425:Farbfernsehgerät
424:
384:
373:
325:
319:
309:
291:
285:
279:
258:Formation of compounds
225:
188:
114:, composed of the two
5106:Linguistic morphology
4592:Sentence segmentation
4265:by Laurie Bauer, 2019
4159:10.1353/sls.2017.0025
4147:Sign Language Studies
3555:Syllabic abbreviation
3415:commissie vergadering
3380:syllabic abbreviation
3323:Italian Sign Language
3238:in the Kakabhushundi
3190:running through pages
3142:akademichesky gorodok
3093:by inflection (as in
2613:'national diet' + 議事
2330:'philosopher': φίλος
2212:
2002:'birthday calendar':
1867:打印機/打印机 'printer': 打
1520:kill-CP throw.1PL.FUT
1295:PASSIVE-(CONJUNCTIVE)
1042:Pama–Nyungan language
971:'skyscraper', French
5044:Voice user interface
4755:datasets and corpora
4696:Document-term matrix
4549:Word-sense induction
4072:"Part 4: Morphology"
3696:non-English-speaking
3524:Multiword expression
3419:commissievergadering
2917:from 'kō' (king) + '
2524:'windscreen wiper':
2376:applied to the Dawn)
1918:entire/complete + 書
1691:Compound adpositions
1333:Indo-Aryan languages
903:: road-possessive).
891:: mill-possessive);
602:Copulative compounds
229:("tapeworm words").
186:For example, German
5024:Interactive fiction
4954:Pachinko allocation
4911:Speech segmentation
4867:Google Ngram Viewer
4639:Machine translation
4629:Text simplification
4624:Sentence extraction
4512:Semantic similarity
3504:Etymological calque
3462:Classical compounds
3273:Nārada Bhakti Sūtra
3167:classical compounds
2850:'goes up and down')
2730:'science fiction':
2660:/Anishinaabemowin:
2349:'democracy': δῆμος
2179:'railway station':
2125:'knowledge data' +
1802:學生/学生 'student': 學
1586:tere ko mār DāleNge
1583:तेरे को मार डालेंगे
1325:Dravidian languages
1261:and Japanese 読み始める
1075:Verb–verb compounds
940:Verb–noun compounds
778:
720:Noun–noun compounds
273:. For example, the
241:classical compounds
147:As a member of the
73:hyphenated compound
64:nominal composition
42:(less precisely, a
5034:Question answering
4906:Speech recognition
4771:Corpus linguistics
4751:Language resources
4534:Textual entailment
4517:Sentiment analysis
4253:, encyclopedia.com
3913:, 2010, p. 199-218
3849:10.1111/stul.12087
3837:Studia Linguistica
3550:Status constructus
3509:Genitive connector
3499:Bracketing paradox
3477:Sanskrit compounds
3289:Gosvāmī Tulasīdāsa
3279:, the Ramayana of
3234:, an adjective of
3183:Sanskrit compounds
3050:Germanic languages
3044:Germanic languages
2619:'proceedings' + 堂
2600:'returning (home)'
2581:'preference' + 焼き
2562:samashi (-zamashi)
2368:'rose' + δάκτυλος
2334:'beloved' + σοφία
2215:
1613:upping and leaving
1534:Likewise in Hindi:
831:Solid (or closed)
776:
289:(sea captain) and
267:synthetic language
199:to words (such as
129:, composed of the
66:is the process of
5083:
5082:
5039:Virtual assistant
4964:Computer-assisted
4890:
4889:
4647:Computer-assisted
4605:
4604:
4597:Word segmentation
4559:Text segmentation
4497:Semantic analysis
4485:Syntactic parsing
4470:Ontology learning
4216:Kortmann, Bernd:
4122:www.handspeak.com
4026:. Stutimandal.com
3995:Harper, Douglas.
3981:978-3-642-17527-5
3773:978-0-19-280637-6
3731:978-1-000-77574-7
3673:978-0-226-28705-8
3645:978-1-108-18727-5
3494:Compound modifier
3467:English compounds
3385:In Dutch and the
3206:the abode of snow
3177:Sanskrit language
3171:internationalisms
3080:is in fact not a
2986:'peanut butter':
2556:'alarm clock': 目
1821:太空/太空 'space': 太
1766:ʔibrat l-muxaddir
1719:, "underneath").
1636:Iranian languages
1482:Botaremos matándo
1296:
1268:
1215:complex predicate
925:Semitic languages
849:German, a fellow
847:
846:
793:Spaced (or open)
712:
711:
357:Semitic languages
16:(Redirected from
5113:
5060:Formal semantics
5009:Natural language
4916:Speech synthesis
4898:and data capture
4801:Semantic network
4776:Lexical resource
4759:
4577:Lexical analysis
4555:
4480:Semantic parsing
4349:
4342:
4335:
4326:
4204:
4203:
4201:
4200:
4185:
4179:
4178:
4138:
4132:
4131:
4129:
4128:
4114:
4108:
4107:
4105:
4104:
4067:
4058:
4057:
4055:
4053:
4042:
4036:
4035:
4033:
4031:
4019:
4013:
4012:
4010:
4009:
3992:
3986:
3985:
3966:
3955:
3942:
3936:
3935:
3933:
3931:
3920:
3914:
3899:
3893:
3880:
3874:
3867:
3861:
3860:
3828:
3822:
3821:
3814:
3808:
3807:
3802:
3800:
3784:
3778:
3777:
3757:
3751:
3750:
3715:
3709:
3708:
3659:
3653:
3652:
3629:
3612:
3588:
3472:German compounds
3393:, in Swedish as
3376:English language
3115:Russian language
3109:Russian language
2964:, which becomes
2916:
2884:
2878:
2869:
2863:
2857:
2855:limpiaparabrisas
2849:
2843:
2827:
2819:
2813:
2807:
2799:
2793:
2787:
2779:
2773:
2767:
2741:
2735:
2729:
2652:
2618:
2608:
2555:
2553:mezamashi(dokei)
2535:
2529:
2523:
2515:
2509:
2503:
2495:
2489:
2483:
2462:
2454:
2448:
2442:
2436:
2430:
2425:'encyclopedia':
2424:
2408:
2400:
2394:
2388:
2360:
2359:
2345:
2344:
2326:
2325:
2313:
2303:
2298:'barb/barbed' +
2297:
2291:
2283:
2277:
2271:
2265:
2257:
2251:
2245:
2237:
2231:
2225:
2204:
2196:
2190:
2184:
2178:
2170:
2164:
2158:
2150:
2144:
2138:
2130:
2124:
2118:
2110:
2104:
2098:
2086:'possibilities'.
2085:
2079:
2073:
2067:
2059:
2053:
2047:
2039:
2033:
2027:
2021:
2013:
2007:
2001:
1993:
1987:
1977:
1971:
1963:
1957:
1951:
1945:
1883:stamp/print + 機
1782:
1781:
1773:
1772:
1764:
1763:
1753:
1752:
1744:
1743:
1735:
1734:
1651:, with multiple
1529:
1513:
1509:
1498:
1357:Turkic languages
1299:start to be read
1294:
1287:start to be read
1266:
1237:
1236:
1178:
1165:
1151:
1147:
1127:"turn and leave"
779:
623:
544:is the head and
482:
476:
466:
439:
433:
427:
394:
390:
380:
370:
369:
354:
349:to become בֵּית
344:
338:
328:
322:
312:
294:
288:
282:
228:
191:
81:hunter-gatherer)
71:the result is a
21:
18:Word-compounding
5121:
5120:
5116:
5115:
5114:
5112:
5111:
5110:
5086:
5085:
5084:
5079:
5048:
5028:Syntax guessing
5010:
5003:
4989:Predictive text
4984:Grammar checker
4965:
4958:
4930:
4897:
4886:
4852:Bank of English
4835:
4763:
4754:
4745:
4676:
4633:
4601:
4553:
4455:Distant reading
4430:Argument mining
4416:
4412:Text processing
4358:
4353:
4247:
4213:
4208:
4207:
4198:
4196:
4187:
4186:
4182:
4140:
4139:
4135:
4126:
4124:
4116:
4115:
4111:
4102:
4100:
4098:
4069:
4068:
4061:
4051:
4049:
4044:
4043:
4039:
4029:
4027:
4021:
4020:
4016:
4007:
4005:
3994:
3993:
3989:
3982:
3968:
3967:
3958:
3943:
3939:
3929:
3927:
3922:
3921:
3917:
3900:
3896:
3881:
3877:
3869:R. Pensalfini,
3868:
3864:
3830:
3829:
3825:
3816:
3815:
3811:
3798:
3796:
3793:Merriam-Webster
3786:
3785:
3781:
3774:
3759:
3758:
3754:
3732:
3717:
3716:
3712:
3674:
3661:
3660:
3656:
3646:
3631:
3630:
3626:
3621:
3616:
3615:
3589:
3585:
3580:
3490:
3482:Arabic compound
3458:
3372:
3307:
3285:Śrīmadbhāgavata
3185:
3179:
3111:
3046:
2758:ciencia ficción
2749:science fiction
2736:, 'science', +
2727:ciencia-ficción
2707:gichi-mookomaan
2647:rather than 안밖
2431:'everything' +
2401:'path' or 'way'
2374:Homeric epithet
2318:Ancient Greek:
2292:'barbed wire':
2054:'university' +
1994:'installation'.
1958:'inaptitude' +
1725:
1693:
1675:is composed of
1659:is composed of
1645:
1593:
1581:
1570:
1559:
1548:
1531:
1523:
1515:
1500:
1477:
1453:
1445:
1437:
1425:
1413:
1405:
1184:
1167:
1153:
1135:
1129:
1119:
1111:
1103:
1083:
1077:
1023:finger-pointing
942:
934:cardinal number
825:hunter-gatherer
722:
717:
703:hunter-gatherer
494:
489:
406:
396:
382:
371:
364:
347:construct state
331:Hebrew language
260:
245:classical Latin
197:affix morphemes
149:Germanic family
85:closed compound
28:
23:
22:
15:
12:
11:
5:
5119:
5117:
5109:
5108:
5103:
5098:
5088:
5087:
5081:
5080:
5078:
5077:
5072:
5067:
5062:
5056:
5054:
5050:
5049:
5047:
5046:
5041:
5036:
5031:
5021:
5015:
5013:
5011:user interface
5005:
5004:
5002:
5001:
4996:
4991:
4986:
4981:
4976:
4970:
4968:
4960:
4959:
4957:
4956:
4951:
4946:
4940:
4938:
4932:
4931:
4929:
4928:
4923:
4918:
4913:
4908:
4902:
4900:
4892:
4891:
4888:
4887:
4885:
4884:
4879:
4874:
4869:
4864:
4859:
4854:
4849:
4843:
4841:
4837:
4836:
4834:
4833:
4828:
4823:
4818:
4813:
4808:
4803:
4798:
4793:
4788:
4783:
4778:
4773:
4767:
4765:
4756:
4747:
4746:
4744:
4743:
4738:
4736:Word embedding
4733:
4728:
4723:
4716:Language model
4713:
4708:
4703:
4698:
4693:
4687:
4685:
4678:
4677:
4675:
4674:
4669:
4667:Transfer-based
4664:
4659:
4654:
4649:
4643:
4641:
4635:
4634:
4632:
4631:
4626:
4621:
4615:
4613:
4607:
4606:
4603:
4602:
4600:
4599:
4594:
4589:
4584:
4579:
4574:
4569:
4563:
4561:
4552:
4551:
4546:
4541:
4536:
4531:
4526:
4520:
4519:
4514:
4509:
4504:
4499:
4494:
4489:
4488:
4487:
4482:
4472:
4467:
4462:
4457:
4452:
4447:
4442:
4440:Concept mining
4437:
4432:
4426:
4424:
4418:
4417:
4415:
4414:
4409:
4404:
4399:
4394:
4393:
4392:
4387:
4377:
4372:
4366:
4364:
4360:
4359:
4354:
4352:
4351:
4344:
4337:
4329:
4323:
4322:
4321:
4320:
4314:
4308:
4302:
4296:
4290:
4284:
4278:
4272:
4266:
4254:
4246:
4245:External links
4243:
4242:
4241:
4234:
4227:
4221:
4212:
4209:
4206:
4205:
4180:
4153:(1): 162–165.
4133:
4109:
4096:
4059:
4037:
4014:
3987:
3980:
3956:
3937:
3915:
3894:
3875:
3862:
3823:
3809:
3779:
3772:
3752:
3730:
3710:
3672:
3654:
3644:
3623:
3622:
3620:
3617:
3614:
3613:
3610:
3609:
3606:
3603:
3600:
3597:
3594:
3582:
3581:
3579:
3576:
3575:
3574:
3568:
3566:Word formation
3563:
3557:
3552:
3547:
3541:
3536:
3531:
3526:
3521:
3516:
3511:
3506:
3501:
3496:
3489:
3486:
3485:
3484:
3479:
3474:
3469:
3464:
3457:
3454:
3399:Engelse ziekte
3371:
3368:
3306:
3305:Sign languages
3303:
3302:
3301:
3243:
3220:
3209:
3181:Main article:
3178:
3175:
3119:word formation
3110:
3107:
3095:North Germanic
3045:
3042:
3041:
3040:
3026:
3012:
2995:
2974:
2973:
2934:
2887:
2886:
2851:
2837:
2821:
2801:
2781:
2761:
2719:
2718:
2704:
2690:
2676:
2665:mashkikiwaaboo
2655:
2654:
2625:
2624:
2601:
2598:kaeri (-gaeri)
2592:'day trip': 日
2586:
2569:
2566:tokei (-dokei)
2544:
2543:
2540:
2537:
2521:tergicristallo
2517:
2497:
2473:
2472:
2467:'telephone' +
2456:
2443:'dictionary' (
2422:alfræðiorðabók
2418:
2402:
2378:
2377:
2354:
2339:
2316:
2315:
2305:
2285:
2266:'automobile':
2259:
2239:
2226:'skyscraper':
2207:
2206:
2198:
2172:
2152:
2132:
2112:
2099:'dictionary':
2088:
2087:
2061:
2041:
2028:'customers' +
2015:
1995:
1965:
1955:ongeschiktheid
1935:
1934:
1927:
1892:
1865:
1838:
1819:
1788:
1787:
1758:
1733:معلمو الفيزياء
1724:
1721:
1692:
1689:
1644:
1641:
1640:
1639:
1627:
1626:
1622:took and built
1617:
1608:
1598:
1597:
1571:
1560:
1549:
1538:
1537:
1536:
1535:
1501:
1490:
1489:
1488:
1487:
1446:
1438:
1426:
1414:
1406:
1398:
1397:
1396:
1395:
1384:
1313:
1310:kecc-ar ker-ar
1291:yomarehajimeru
1154:
1136:
1131:
1130:
1112:
1104:
1099:
1098:
1097:
1096:
1079:Main article:
1076:
1073:
941:
938:
845:
844:
835:
832:
828:
827:
818:
811:
807:
806:
797:
794:
790:
789:
786:
783:
721:
718:
716:
713:
710:
709:
700:
697:
691:
690:
681:
678:
672:
671:
662:
659:
653:
652:
643:
640:
634:
633:
630:
627:
550:part of speech
511:
510:
507:
504:
501:
493:
490:
488:
485:
398:ʕabd-u l-lāh-i
383:
372:
365:
362:
361:
323:of the lexeme
317:genitive form
299:(originally a
280:Kapitänspatent
259:
256:
234:Sign languages
226:Bandwurmwörter
151:of languages,
108:part of speech
68:word formation
26:
24:
14:
13:
10:
9:
6:
4:
3:
2:
5118:
5107:
5104:
5102:
5099:
5097:
5094:
5093:
5091:
5076:
5073:
5071:
5068:
5066:
5065:Hallucination
5063:
5061:
5058:
5057:
5055:
5051:
5045:
5042:
5040:
5037:
5035:
5032:
5029:
5025:
5022:
5020:
5017:
5016:
5014:
5012:
5006:
5000:
4999:Spell checker
4997:
4995:
4992:
4990:
4987:
4985:
4982:
4980:
4977:
4975:
4972:
4971:
4969:
4967:
4961:
4955:
4952:
4950:
4947:
4945:
4942:
4941:
4939:
4937:
4933:
4927:
4924:
4922:
4919:
4917:
4914:
4912:
4909:
4907:
4904:
4903:
4901:
4899:
4893:
4883:
4880:
4878:
4875:
4873:
4870:
4868:
4865:
4863:
4860:
4858:
4855:
4853:
4850:
4848:
4845:
4844:
4842:
4838:
4832:
4829:
4827:
4824:
4822:
4819:
4817:
4814:
4812:
4811:Speech corpus
4809:
4807:
4804:
4802:
4799:
4797:
4794:
4792:
4791:Parallel text
4789:
4787:
4784:
4782:
4779:
4777:
4774:
4772:
4769:
4768:
4766:
4760:
4757:
4752:
4748:
4742:
4739:
4737:
4734:
4732:
4729:
4727:
4724:
4721:
4717:
4714:
4712:
4709:
4707:
4704:
4702:
4699:
4697:
4694:
4692:
4689:
4688:
4686:
4683:
4679:
4673:
4670:
4668:
4665:
4663:
4660:
4658:
4655:
4653:
4652:Example-based
4650:
4648:
4645:
4644:
4642:
4640:
4636:
4630:
4627:
4625:
4622:
4620:
4617:
4616:
4614:
4612:
4608:
4598:
4595:
4593:
4590:
4588:
4585:
4583:
4582:Text chunking
4580:
4578:
4575:
4573:
4572:Lemmatisation
4570:
4568:
4565:
4564:
4562:
4560:
4556:
4550:
4547:
4545:
4542:
4540:
4537:
4535:
4532:
4530:
4527:
4525:
4522:
4521:
4518:
4515:
4513:
4510:
4508:
4505:
4503:
4500:
4498:
4495:
4493:
4490:
4486:
4483:
4481:
4478:
4477:
4476:
4473:
4471:
4468:
4466:
4463:
4461:
4458:
4456:
4453:
4451:
4448:
4446:
4443:
4441:
4438:
4436:
4433:
4431:
4428:
4427:
4425:
4423:
4422:Text analysis
4419:
4413:
4410:
4408:
4405:
4403:
4400:
4398:
4395:
4391:
4388:
4386:
4383:
4382:
4381:
4378:
4376:
4373:
4371:
4368:
4367:
4365:
4363:General terms
4361:
4357:
4350:
4345:
4343:
4338:
4336:
4331:
4330:
4327:
4318:
4315:
4312:
4309:
4306:
4303:
4300:
4297:
4294:
4291:
4288:
4285:
4282:
4279:
4276:
4273:
4270:
4267:
4264:
4261:
4260:
4258:
4255:
4252:
4251:Compound word
4249:
4248:
4244:
4239:
4235:
4232:
4228:
4225:
4222:
4219:
4215:
4214:
4210:
4195:
4191:
4184:
4181:
4176:
4172:
4168:
4164:
4160:
4156:
4152:
4148:
4144:
4137:
4134:
4123:
4119:
4113:
4110:
4099:
4097:9781501511806
4093:
4089:
4085:
4081:
4077:
4073:
4066:
4064:
4060:
4047:
4041:
4038:
4025:
4018:
4015:
4004:
4003:
3998:
3991:
3988:
3983:
3977:
3973:
3965:
3963:
3961:
3957:
3954:
3953:0-8285-5190-1
3950:
3946:
3941:
3938:
3925:
3919:
3916:
3912:
3911:9789027248275
3908:
3904:
3898:
3895:
3892:
3891:
3886:
3885:
3879:
3876:
3872:
3866:
3863:
3858:
3854:
3850:
3846:
3842:
3838:
3834:
3827:
3824:
3819:
3813:
3810:
3806:
3795:
3794:
3789:
3783:
3780:
3775:
3769:
3765:
3764:
3756:
3753:
3749:
3747:
3743:
3739:
3733:
3727:
3723:
3722:
3714:
3711:
3707:
3705:
3701:
3697:
3693:
3692:kilowatt-hour
3689:
3688:mass-produced
3685:
3681:
3675:
3669:
3665:
3658:
3655:
3651:
3647:
3641:
3637:
3636:
3628:
3625:
3618:
3607:
3604:
3601:
3598:
3595:
3592:
3591:
3587:
3584:
3577:
3572:
3569:
3567:
3564:
3561:
3558:
3556:
3553:
3551:
3548:
3545:
3542:
3540:
3537:
3535:
3532:
3530:
3527:
3525:
3522:
3520:
3517:
3515:
3512:
3510:
3507:
3505:
3502:
3500:
3497:
3495:
3492:
3491:
3487:
3483:
3480:
3478:
3475:
3473:
3470:
3468:
3465:
3463:
3460:
3459:
3455:
3453:
3451:
3447:
3443:
3439:
3435:
3431:
3427:
3422:
3420:
3416:
3412:
3408:
3404:
3400:
3396:
3392:
3388:
3383:
3381:
3377:
3369:
3367:
3364:
3360:
3356:
3352:
3348:
3344:
3340:
3336:
3332:
3328:
3324:
3320:
3316:
3312:
3304:
3299:
3295:
3291:
3290:
3286:
3282:
3278:
3277:Bhagavad Gītā
3274:
3270:
3266:
3265:Pūrva Mīmāṃsā
3262:
3258:
3254:
3250:
3244:
3241:
3237:
3233:
3231:
3227:
3221:
3218:
3214:
3210:
3207:
3203:
3199:
3195:
3194:
3193:
3191:
3184:
3176:
3174:
3172:
3168:
3163:
3161:
3157:
3155:
3154:St.Petersburg
3149:
3147:
3143:
3139:
3138:Akademgorodok
3135:
3131:
3127:
3122:
3120:
3116:
3108:
3106:
3104:
3098:
3096:
3092:
3087:
3083:
3079:
3075:
3071:
3067:
3063:
3059:
3055:
3051:
3043:
3038:
3034:
3030:
3027:
3024:
3020:
3016:
3013:
3011:
3007:
3003:
2999:
2996:
2993:
2990:'squirrel' +
2989:
2985:
2982:
2981:
2980:
2978:
2972:meaning shop.
2971:
2967:
2963:
2959:
2955:
2951:
2947:
2943:
2939:
2935:
2932:
2928:
2924:
2920:
2915:
2914:
2908:
2904:
2900:
2896:
2892:
2891:
2890:
2883:
2877:
2872:
2868:
2862:
2856:
2852:
2848:
2842:
2838:
2835:
2831:
2826:
2822:
2820:'(the) water'
2818:
2812:
2806:
2802:
2798:
2792:
2786:
2782:
2778:
2772:
2768:'centipede':
2766:
2762:
2759:
2755:
2751:
2750:
2745:
2740:
2734:
2728:
2724:
2723:
2722:
2716:
2712:
2708:
2705:
2702:
2698:
2694:
2691:
2688:
2684:
2681:'raspberry':
2680:
2677:
2674:
2671:'medicine' +
2670:
2666:
2663:
2662:
2661:
2659:
2651:
2646:
2642:
2639:'inside' + 밖
2638:
2634:
2630:
2629:
2628:
2622:
2617:
2612:
2607:
2602:
2599:
2595:
2591:
2587:
2584:
2580:
2576:
2575:
2570:
2567:
2563:
2559:
2554:
2549:
2548:
2547:
2541:
2538:
2534:
2528:
2522:
2518:
2514:
2508:
2502:
2498:
2494:
2490:'thousand' +
2488:
2484:'millipede':
2482:
2478:
2477:
2476:
2470:
2466:
2461:
2457:
2453:
2447:
2441:
2435:
2429:
2423:
2419:
2416:
2412:
2407:
2403:
2399:
2393:
2387:
2383:
2382:
2381:
2375:
2371:
2367:
2363:
2362:rhododáktylos
2355:
2352:
2348:
2340:
2337:
2333:
2329:
2321:
2320:
2319:
2312:
2311:
2306:
2302:
2296:
2290:
2286:
2282:
2276:
2270:
2264:
2263:Kraftfahrzeug
2260:
2256:
2250:
2244:
2240:
2236:
2230:
2224:
2223:Wolkenkratzer
2220:
2219:
2218:
2211:
2203:
2199:
2195:
2189:
2183:
2177:
2176:rautatieasema
2173:
2169:
2163:
2157:
2153:
2149:
2143:
2139:'Wednesday':
2137:
2133:
2129:
2123:
2117:
2113:
2109:
2103:
2097:
2093:
2092:
2091:
2084:
2083:mogelijkheden
2078:
2072:
2066:
2062:
2058:
2052:
2046:
2042:
2038:
2032:
2026:
2020:
2016:
2012:
2008:'birthday' +
2006:
2000:
1996:
1992:
1988:'cleaning' +
1986:
1981:
1976:
1970:
1966:
1962:
1956:
1950:
1944:
1940:
1939:
1938:
1932:
1928:
1925:
1921:
1917:
1913:
1909:
1905:
1901:
1897:
1893:
1890:
1886:
1882:
1878:
1874:
1870:
1866:
1863:
1859:
1855:
1851:
1847:
1843:
1839:
1836:
1832:
1828:
1824:
1820:
1817:
1813:
1809:
1805:
1801:
1800:
1799:
1797:
1793:
1786:'anaesthesia'
1785:
1776:
1767:
1759:
1756:
1748:'teachers' +
1747:
1738:
1730:
1729:
1728:
1722:
1720:
1718:
1714:
1710:
1706:
1702:
1698:
1690:
1688:
1686:
1682:
1678:
1674:
1670:
1666:
1662:
1658:
1654:
1653:lexical stems
1650:
1642:
1637:
1633:
1629:
1628:
1625:
1623:
1620:Big Pig, she
1618:
1616:
1614:
1609:
1607:
1605:
1602:What did you
1600:
1599:
1595:
1594:
1592:
1590:
1587:
1584:
1580:
1577:
1574:
1569:
1566:
1563:
1558:
1555:
1552:
1547:
1544:
1541:
1533:
1532:
1530:
1527:
1521:
1518:
1514:
1504:
1499:
1493:
1492:huañuchi-shpa
1485:
1483:
1479:
1478:
1476:
1473:
1471:
1468:
1463:
1461:
1458:
1452:
1449:
1444:
1441:
1436:
1435:
1431:
1430:
1424:
1423:
1419:
1418:
1412:
1409:
1404:
1401:
1393:
1389:
1385:
1382:
1378:
1374:
1370:
1366:
1362:
1358:
1354:
1350:
1346:
1342:
1338:
1334:
1330:
1326:
1322:
1318:
1314:
1311:
1308:
1304:
1300:
1292:
1288:
1284:
1280:
1276:
1272:
1264:
1260:
1259:start reading
1256:
1252:
1248:
1244:
1240:
1232:
1228:
1224:
1220:
1216:
1212:
1211:compound verb
1208:
1207:
1206:
1204:
1200:
1196:
1191:
1189:
1183:
1180:
1179:.PTCP see-IMP
1173:
1172:jā-kar dekh-o
1170:
1166:
1160:
1157:
1152:
1142:
1139:
1134:
1128:
1125:
1122:
1118:
1115:
1110:
1107:
1102:
1094:
1093:
1088:
1087:
1086:
1082:
1081:Compound verb
1074:
1072:
1070:
1066:
1062:
1061:incorporation
1057:
1055:
1051:
1047:
1043:
1039:
1034:
1032:
1028:
1024:
1020:
1019:breastfeeding
1016:
1012:
1007:
1005:
1001:
997:
993:
989:
985:
981:
976:
974:
970:
966:
962:
958:
954:
949:
947:
939:
937:
935:
930:
926:
922:
917:
915:
914:
909:
904:
902:
898:
894:
890:
886:
882:
878:
873:
871:
870:moulin à vent
867:
866:chemin-de-fer
863:
858:
856:
852:
843:
839:
836:
833:
830:
829:
826:
822:
819:
816:
812:
809:
808:
805:
801:
798:
795:
792:
791:
787:
784:
781:
780:
774:
772:
771:
766:
765:
760:
759:
754:
753:
748:
747:
742:
738:
734:
730:
726:
719:
714:
708:
704:
701:
698:
696:
693:
692:
689:
685:
682:
679:
677:
674:
673:
670:
666:
663:
660:
658:
655:
654:
651:
647:
644:
641:
639:
636:
635:
631:
628:
625:
624:
621:
619:
615:
613:
609:
608:
603:
599:
597:
593:
589:
584:
580:
575:
571:
570:
565:
563:
557:
555:
551:
547:
543:
539:
535:
534:
529:
525:
524:
519:
517:
508:
505:
502:
499:
498:
497:
491:
486:
484:
481:
475:
468:
465:
459:
457:
456:
451:
447:
443:
438:
432:
426:
421:
420:noun adjuncts
417:
412:
410:
405:
402:
399:
395:
386:
381:
375:
360:
358:
353:
348:
343:
337:
332:
327:
321:
316:
313:contains the
311:
310:paterfamilias
306:
302:
301:genitive case
298:
293:
287:
281:
276:
272:
268:
263:
257:
255:
253:
250:
249:ancient Greek
246:
242:
237:
235:
230:
227:
220:
218:
214:
210:
206:
202:
198:
193:
190:
184:
182:
178:
174:
170:
166:
162:
158:
154:
150:
145:
143:
139:
136:and the noun
135:
132:
128:
124:
120:
117:
113:
109:
104:
102:
101:open compound
98:
94:
90:
86:
82:
78:
74:
69:
65:
61:
57:
53:
49:
45:
41:
37:
33:
19:
5101:Word coinage
4979:Concordancer
4375:Bag-of-words
4237:
4230:
4229:Plag, Ingo:
4223:
4217:
4197:. Retrieved
4193:
4183:
4150:
4146:
4136:
4125:. Retrieved
4121:
4112:
4101:. Retrieved
4079:
4075:
4050:. Retrieved
4040:
4028:. Retrieved
4017:
4006:. Retrieved
4000:
3990:
3970:
3944:
3940:
3928:. Retrieved
3918:
3902:
3897:
3889:
3882:
3878:
3870:
3865:
3840:
3836:
3826:
3812:
3804:
3797:. Retrieved
3791:
3782:
3762:
3755:
3745:
3741:
3737:
3735:
3720:
3713:
3703:
3699:
3695:
3691:
3687:
3683:
3679:
3677:
3663:
3657:
3649:
3634:
3627:
3586:
3571:Univerbation
3534:Noun adjunct
3437:
3433:
3423:
3418:
3414:
3406:
3402:
3398:
3395:särskrivning
3394:
3390:
3384:
3373:
3358:
3354:
3346:
3343:simultaneous
3342:
3338:
3334:
3326:
3318:
3308:
3246:
3223:
3217:Panchatantra
3212:
3205:
3189:
3186:
3164:
3158:
3150:
3141:
3123:
3112:
3103:noun phrases
3099:
3091:definiteness
3070:round object
3047:
3036:
3035:' people' +
3032:
3028:
3022:
3018:
3017:'kerosene':
3014:
3009:
3005:
3001:
2997:
2991:
2987:
2983:
2977:Tłįchǫ Yatiì
2975:
2969:
2965:
2961:
2957:
2953:
2949:
2945:
2944:(blood) and
2941:
2937:
2926:
2906:
2894:
2888:
2870:
2833:
2829:
2808:'umbrella':
2774:'hundred' +
2757:
2753:
2747:
2720:
2714:
2710:
2706:
2700:
2696:
2693:dibik-giizis
2692:
2686:
2682:
2678:
2672:
2668:
2664:
2656:
2644:
2640:
2636:
2632:
2626:
2620:
2610:
2606:kokkaigijidō
2597:
2593:
2589:
2582:
2578:
2572:
2565:
2561:
2560:'eye' + 覚まし
2557:
2545:
2530:'to wash' +
2474:
2468:
2464:
2414:
2413:'journey' +
2410:
2379:
2372:'finger' (a
2369:
2365:
2361:
2358:ῥοδοδάκτυλος
2350:
2346:
2335:
2331:
2327:
2317:
2289:Stacheldraht
2216:
2119:'computer':
2089:
2074:'through' +
2051:universiteit
2034:'service' +
1979:
1964:'insurance'.
1936:
1930:
1923:
1919:
1915:
1911:
1907:
1903:
1902:hundred + 科
1899:
1895:
1888:
1884:
1880:
1876:
1872:
1868:
1861:
1857:
1853:
1849:
1845:
1841:
1834:
1830:
1826:
1822:
1818:living being
1815:
1811:
1807:
1803:
1789:
1783:
1774:
1765:
1754:
1745:
1736:
1726:
1716:
1712:
1708:
1704:
1700:
1697:prepositions
1694:
1684:
1683:'envelope',
1680:
1676:
1672:
1668:
1664:
1660:
1656:
1646:
1621:
1619:
1612:
1610:
1603:
1601:
1591:
1588:
1585:
1582:
1578:
1575:
1572:
1567:
1564:
1561:
1556:
1553:
1550:
1545:
1542:
1539:
1522:
1519:
1516:
1505:
1502:
1494:
1491:
1481:
1480:
1474:
1469:
1466:
1464:
1459:
1456:
1454:
1450:
1447:
1442:
1439:
1433:
1432:
1428:
1427:
1421:
1420:
1416:
1415:
1410:
1407:
1402:
1399:
1309:
1302:
1298:
1290:
1286:
1282:
1278:
1274:
1270:
1263:yomihajimeru
1262:
1258:
1254:
1250:
1246:
1242:
1238:
1226:
1214:
1210:
1198:
1194:
1192:
1185:
1182:"go and see"
1181:
1174:
1171:
1168:
1161:
1143:
1126:
1123:
1120:
1116:
1108:
1090:
1084:
1068:
1065:backstabbing
1064:
1058:
1053:
1049:
1045:
1035:
1030:
1026:
1022:
1018:
1011:incorporated
1008:
1004:know-nothing
1003:
999:
995:
991:
987:
983:
979:
977:
972:
968:
964:
960:
956:
950:
943:
918:
911:
905:
900:
896:
892:
888:
884:
883:'windmill' (
881:yeldeğirmeni
880:
874:
869:
865:
859:
848:
841:
837:
824:
820:
803:
799:
785:Description
768:
762:
756:
750:
744:
727:
723:
706:
702:
695:appositional
694:
687:
683:
675:
668:
664:
656:
649:
645:
637:
629:Description
617:
616:
605:
601:
600:
595:
588:white-collar
587:
582:
579:white-collar
578:
567:
560:
558:
553:
545:
541:
537:
531:
521:
513:
512:
509:appositional
495:
469:
460:
453:
445:
413:
407:
403:
400:
397:
387:
376:
296:
264:
261:
238:
231:
221:
212:
208:
194:
185:
181:noun phrases
177:orthographic
146:
137:
133:
126:
122:
118:
111:
105:
100:
96:
92:
88:
84:
80:
76:
72:
63:
59:
55:
35:
29:
4936:Topic model
4816:Text corpus
4662:Statistical
4529:Text mining
4370:AI-complete
4052:October 25,
3930:16 February
3843:(1): 1–36.
3680:high school
3544:Portmanteau
3430:hyphenating
3391:særskriving
3146:verbal noun
3074:dance party
3052:(including
3031:'bannock':
2938:koṭuntamizh
2899:Tolkappiyam
2847:sube y baja
2788:'railway':
2785:ferrocarril
2596:'day' + 帰り
2574:okonomiyaki
2504:'railway':
2417:'apparatus'
2409:'vehicle':
2389:'railway':
2380:Icelandic:
2328:philósophos
2284:'machinery'
2275:fahren/fahr
2246:'railway':
2232:'clouds' +
2145:'middle' +
2136:keskiviikko
2057:bibliotheek
2014:'calendar'.
1991:installatie
1961:verzekering
1952:'labour' +
1875:strike + 印
1794:/Cantonese
1777:'needle' +
1762:إبرة المخدر
1746:muʕallim-uu
1465:from anger
1293:lit. "read-
1289:and 読まれ始める
1267:CONJUNCTIVE
1092:serial verb
1000:dreadnought
973:grille-pain
969:grattacielo
965:guardarropa
961:sacacorchos
957:rascacielos
895:'railway' (
810:Hyphenated
737:collocation
707:maidservant
684:bittersweet
638:endocentric
516:endocentric
500:endocentric
239:So-called "
60:composition
56:Compounding
32:linguistics
5090:Categories
4657:Rule-based
4539:Truecasing
4407:Stop words
4199:2020-08-02
4127:2022-10-25
4103:2019-02-19
4008:2011-07-17
3997:"Himalaya"
3799:29 January
3746:short-term
3738:blackboard
3704:smartphone
3619:References
3407:røyk fritt
3128:: колхоз '
3086:word class
3072:, not the
3066:vocabulary
2984:dlòotsǫ̀ǫ̀
2958:kāpi-kaṭai
2885:'breezes'.
2871:windshield
2867:parabrisas
2864:'clean' +
2814:'stops' +
2699:'night' +
2546:Japanese:
2481:millepiedi
2471:'dialogue'
2449:'words' +
2347:dēmokratíā
2343:δημοκρατία
2278:'drive' +
2272:'power' +
2060:'library'.
2037:medewerker
2005:verjaardag
1982:'water' +
1978:'sewer' +
1848:touch + 天
1829:great + 空
1810:learn + 生
1715:, "near";
1703:, Spanish
1657:black-eyed
1649:derivation
1634:and other
1568:kill-throw
1317:Hindustani
1243:nikal paRā
1239:nikal gayā
1223:inflection
1124:turn leave
1069:breastfeed
1017:, such as
996:pickpocket
980:spoilsport
929:possessive
889:değirmen-i
821:place-name
800:place name
752:place-name
746:place name
676:copulative
657:exocentric
562:exocentric
506:copulative
503:exocentric
487:Subclasses
213:employment
4966:reviewing
4764:standards
4762:Types and
4175:148714617
4167:1533-6263
4080:Blueprint
3857:0039-3193
3742:ice cream
3700:birthrate
3546:compounds
3529:Neologism
3446:loan word
3442:Norwegian
3403:røykfritt
3327:breakfast
3261:Vaiśeṣika
3226:Kaustubha
3058:namespace
3025:'its fat'
3021:'land' +
3015:dè gotłeè
3006:'sick' +
3000:'hospital
2979:/Dogrib:
2879:'stop' +
2832:'head' +
2825:cabizbajo
2794:'iron' +
2721:Spanish:
2715:mookomaan
2667:'tonic':
2585:'cooking'
2550:目覚まし(時計)
2533:cristallo
2510:'iron' +
2475:Italian:
2406:farartæki
2395:'iron' +
2386:járnbraut
2324:φιλόσοφος
2252:'iron' +
2243:Eisenbahn
2238:'scraper'
2197:'station'
2191:'road' +
2185:'iron' +
2165:'land' +
2159:'world':
2131:'machine'
2116:tietokone
2105:'word' +
2096:sanakirja
2090:Finnish:
2080:'grow' +
2040:'worker'.
1985:zuivering
1837:emptiness
1757:'physics'
1705:encima de
1701:on top of
1695:Compound
1673:imbustare
1606:that for?
1604:go and do
1503:shitashun
1460:rompiendo
1455:De rabia
1429:rompiendo
1392:altiplano
1253:and पड़ा
1169:जाकर देखो
992:cutthroat
988:breakfast
893:demiryolu
842:scarecrow
838:placename
804:ice cream
788:Examples
764:file name
758:placename
688:sleepwalk
669:scarecrow
650:smalltalk
632:Examples
583:must-have
569:bahuvrihi
523:tatpuruṣa
336:bet sefer
277:compound
157:Norwegian
131:adjective
127:blackbird
93:blackbird
77:must-have
4882:Wikidata
4862:FrameNet
4847:BabelNet
4826:Treebank
4796:PropBank
4741:Word2vec
4706:fastText
4587:Stemming
4194:Nautilus
3488:See also
3411:genitive
3355:Saturday
3240:Rāmāyaṇa
3202:Himalaya
3196:हिमालय (
3140:' (from
3126:acronyms
3078:starfish
3062:football
3029:dǫ łèt'è
2952:becomes
2946:Cemmozhi
2907:punarcci
2895:Cemmozhi
2841:subibaja
2805:paraguas
2765:ciempiés
2713:'big' +
2695:'moon':
2685:'red' +
2679:miskomin
2675:'liquid'
2669:mashkiki
2627:Korean:
2501:ferrovia
2370:dáktylos
2338:'wisdom'
2217:German:
2011:kalender
1856:sky + 樓
1796:Jyutping
1784:muxaddir
1755:fiizyaaʔ
1751:الفيزياء
1727:Arabic:
1717:ke nīche
1470:breaking
1434:breaking
1373:Japanese
1297:-start"
1275:hajimeru
1273:and 始める
1235:निकल गया
1203:ellipsis
908:pleonasm
899:: iron,
887:: wind,
770:filename
733:semantic
646:darkroom
612:Sanskrit
596:barefoot
564:compound
554:doghouse
540:, where
538:doghouse
528:Sanskrit
518:compound
377:servant-
368:عبد الله
320:familias
271:morpheme
207:, as in
205:prefixes
201:suffixes
142:stressed
112:footpath
89:footpath
36:compound
5053:Related
5019:Chatbot
4877:WordNet
4857:DBpedia
4731:Seq2seq
4475:Parsing
4390:Trigram
4211:Sources
4030:July 1,
3972:Series)
3947:(1978)
3519:Kenning
3450:acronym
3359:Sunday.
3347:weekend
3339:morning
3319:parents
3311:dvandva
3281:Vālmīki
3269:Vedānta
3249:Sāṅkhya
3230:Lakshmi
3130:kolkhoz
3113:In the
3054:English
3039:'bread'
3010:'house'
2913:kopuram
2889:Tamil:
2754:fiction
2739:ficción
2733:ciencia
2717:'knife'
2689:'berry'
2590:higaeri
2579:okonomi
2527:tergere
2455:'book')
2440:orðabók
2295:Stachel
2258:'track'
2235:Kratzer
2156:maailma
2031:service
2025:klanten
1937:Dutch:
1891:machine
1632:Persian
1576:DāleNge
1573:डालेंगे
1472:the pot
1462:la olla
1388:Quichua
1361:Turkish
1337:Marathi
1321:Punjabi
1281:and 読み
1279:reading
1121:trɔ dzo
1038:Jingulu
1033:, etc.
984:killjoy
953:Spanish
913:pathway
877:Turkish
665:redhead
610:in the
607:dvandva
592:metonym
574:hyponym
526:in the
385:l-lāh-i
326:familia
315:archaic
307:lexeme
286:Kapitän
161:Swedish
153:English
87:(e.g.,
75:(e.g.,
5096:Syntax
5026:(c.f.
4684:models
4672:Neural
4385:Bigram
4380:n-gram
4173:
4165:
4094:
3978:
3951:
3909:
3855:
3770:
3728:
3670:
3642:
3448:or an
3298:sandhi
3023:gotłeè
2998:eyakǫ̀
2994:'dung'
2992:tsǫ̀ǫ̀
2962:coffee
2927:vennai
2903:Nannūl
2882:brisas
2861:limpia
2836:'down'
2830:cabeza
2800:'lane'
2797:carril
2780:'feet'
2744:calque
2701:giizis
2673:waaboo
2658:Ojibwe
2623:'hall'
2611:kokkai
2603:国会議事堂
2577:: お好み
2571:お好み焼き
2568:clock)
2496:'feet'
2460:símtal
2366:rhódon
2336:sophíā
2332:phílos
2304:'wire'
2229:Wolken
2151:'week'
2148:viikko
2111:'book'
1949:arbeid
1792:Pinyin
1775:ʔibrat
1713:ke pas
1679:'in',
1667:, and
1506:throw.
1369:Korean
1365:Kyrgyz
1353:Newari
1341:Nepali
1323:, and
1307:Kurukh
1303:nikalā
1265:"read-
1219:aspect
1159:dekh-o
1141:jā-kar
1052:, and
1015:gerund
1002:, and
921:Arabic
862:French
853:, has
815:hyphen
416:German
374:ʕabd-u
363:Arabic
292:Patent
275:German
209:employ
171:, and
169:German
165:Danish
103:.
40:lexeme
5075:spaCy
4720:large
4711:GloVe
4171:S2CID
4076:Sign
3578:Notes
3257:Nyāya
3188:some
3169:" or
3037:łèt'è
2970:kaṭai
2954:ratta
2950:rakta
2942:rakta
2923:kovil
2919:puram
2817:aguas
2791:ferro
2711:gichi
2703:'sun'
2697:dibik
2683:misko
2650:anbak
2645:anpak
2633:anpak
2516:'way'
2507:ferro
2493:piedi
2487:mille
2434:fræði
2411:farar
2398:braut
2351:dêmos
2301:Draht
2269:Kraft
2249:Eisen
2194:asema
2182:rauta
2171:'air'
2142:keski
2122:tieto
2108:kirja
2077:groei
1980:water
1975:riool
1933:thank
1916:cyun4
1900:baak3
1835:hung1
1827:taai3
1816:sang1
1812:shēng
1742:معلمو
1681:busta
1661:black
1495:kill-
1411:anger
1408:rabia
1367:, in
1359:like
1355:, in
1349:Limbu
1347:like
1343:, in
1335:like
1329:Tamil
1327:like
1271:start
1247:nikal
1231:Hindi
1209:In a
1197:, or
1188:Tamil
1133:Hindi
1117:leave
1089:In a
901:yol-u
897:demir
782:Type
741:usage
626:Type
542:house
391:-god-
342:bayit
305:Latin
265:In a
252:roots
173:Dutch
134:black
116:nouns
38:is a
4840:Data
4691:BERT
4163:ISSN
4092:ISBN
4078:Gram
4054:2011
4032:2011
3976:ISBN
3949:ISBN
3932:2013
3907:ISBN
3890:s.v.
3853:ISSN
3801:2024
3768:ISBN
3726:ISBN
3668:ISBN
3640:ISBN
3424:The
3337:and
3317:for
3315:sign
3253:Yoga
3236:Rama
3198:IAST
3134:stem
3082:fish
2988:dlòo
2966:kāpi
2931:ghee
2901:and
2876:para
2834:bajo
2811:para
2777:pies
2771:cien
2687:miin
2616:giji
2588:日帰り
2583:yaki
2446:orða
2415:tæki
2392:járn
2281:Zeug
2255:Bahn
2168:ilma
2128:kone
2102:sana
2071:door
1926:book
1924:syu1
1912:quán
1889:gei1
1881:yan3
1873:daa2
1862:lau2
1854:tin1
1850:tiān
1831:kōng
1808:hok6
1780:مخدر
1771:إبرة
1685:-are
1557:will
1543:tere
1540:तेरे
1526:help
1457:puso
1448:olla
1417:puso
1403:from
1381:Avar
1379:and
1377:Tsez
1371:and
1363:and
1351:and
1339:and
1319:and
1283:yomi
1255:paRā
1251:gayā
1227:zero
1213:(or
1177:CONJ
1162:see-
1156:देखो
1150:PTCP
1146:CONJ
1138:जाकर
1109:turn
1050:make
1040:, a
533:head
138:bird
123:path
121:and
119:foot
52:stem
48:sign
44:word
34:, a
4872:UBY
4155:doi
4084:doi
3845:doi
3421:).
3361:In
3349:in
3335:eat
3329:in
3321:in
3148:).
3097:).
3048:In
3008:kǫ̀
3004:eya
2936:In
2893:In
2641:bak
2631:안팎
2513:via
2469:tal
2465:sím
2452:bók
2188:tie
2162:maa
1931:xiè
1920:shū
1908:fo1
1896:bǎi
1877:yìn
1858:lóu
1846:mo1
1823:tài
1804:xué
1798:):
1677:in-
1669:-ed
1665:eye
1579:you
1565:mār
1562:मार
1512:FUT
1508:1PL
1467:put
1451:pot
1443:the
1422:put
1186:In
1175:go-
1164:IMP
1144:go-
1114:dzo
1106:trɔ
1101:Ewe
1054:run
951:In
919:In
885:yel
875:In
860:In
559:An
546:dog
514:An
414:In
393:GEN
389:DEF
379:NOM
352:bet
297:-s-
247:or
203:or
62:or
46:or
30:In
5092::
4192:.
4169:.
4161:.
4151:18
4149:.
4145:.
4120:.
4090:.
4074:.
4062:^
3999:.
3959:^
3905:,
3851:.
3841:73
3839:.
3835:.
3803:.
3790:.
3748:).
3734:.
3706:).
3702:,
3694:,
3690:,
3682:,
3676:.
3648:.
3452:.
3382:.
3283:,
3271:,
3267:,
3263:,
3259:,
3255:,
3251:,
3156:.
3033:dǫ
3019:dè
3002::
2637:an
2621:dō
2594:hi
2558:me
2428:al
1904:kē
1885:jī
1869:dǎ
1842:mó
1709:ke
1663:,
1554:ko
1551:को
1546:we
1528:);
1497:CP
1484:te
1440:la
1400:De
1205:.
1067:,
1056:.
1048:,
1046:do
1021:,
1006:.
998:,
994:,
990:,
986:,
982:,
936:.
916:.
840:,
823:,
813:A
802:,
705:,
686:,
667:,
648:,
598:.
556:.
254:.
219:.
211:→
167:,
163:,
159:,
91:,
79:,
58:,
54:.
5030:)
4753:,
4722:)
4718:(
4348:e
4341:t
4334:v
4202:.
4177:.
4157::
4130:.
4106:.
4086::
4056:.
4034:.
4011:.
3984:.
3934:.
3859:.
3847::
3820:.
3776:.
3232:)
2653:)
1922:/
1914:/
1906:/
1898:/
1887:/
1879:/
1871:/
1860:/
1852:/
1844:/
1833:/
1825:/
1814:/
1806:/
1615:.
1510:.
1383:.
1148:.
604:(
566:(
520:(
20:)
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.