25:
1135:
159:, etc.) can take contextual information into account and thus compose chunks in such a way that they better reflect the semantic relations between the basic constituents. That is, these more advanced methods get around the problem that combinations of elementary constituents can have different higher level meanings depending on the context of the sentence.
386:
546:
139:
which first identifies constituent parts of sentences (nouns, verbs, adjectives, etc.) and then links them to higher order units that have discrete grammatical meanings (
249:
147:, verb groups, etc.). While the most elementary chunking algorithms simply link constituent parts on the basis of elementary search patterns (e.g., as specified by
1200:
524:
935:
379:
319:
271:
1205:
1176:
1104:
845:
536:
372:
333:
1099:
706:
1195:
860:
691:
108:
631:
42:
1048:
701:
89:
696:
441:
46:
61:
965:
686:
1169:
658:
68:
1003:
988:
960:
825:
820:
395:
163:
740:
711:
489:
35:
1142:
583:
436:
170:
for computer languages. Under the name "shallow structure hypothesis", it is also used as an explanation for why
75:
1109:
1033:
765:
721:
606:
504:
359:
1162:
1013:
983:
650:
57:
267:
870:
563:
541:
531:
499:
474:
354:
136:
730:
243:
1083:
759:
735:
588:
1063:
993:
950:
906:
678:
668:
663:
551:
1073:
945:
810:
573:
556:
414:
231:
148:
1078:
790:
598:
509:
1146:
337:
82:
955:
840:
815:
616:
519:
223:
167:
152:
1067:
1028:
1023:
891:
494:
469:
451:
196:
171:
214:
Clahsen, Felser, Harald, Claudia (2006). "Grammatical
Processing in Language Learners".
775:
755:
479:
156:
1189:
1038:
850:
830:
611:
235:
1018:
975:
855:
568:
484:
461:
409:
144:
24:
578:
364:
227:
446:
16:
Analysis of a sentence which first identifies constituent parts of sentences
1134:
921:
901:
886:
865:
835:
780:
745:
626:
307:
1058:
916:
896:
770:
514:
429:
310:
131:
286:
424:
419:
349:
328:
1114:
750:
636:
325:
140:
368:
911:
18:
174:
learners often fail to parse complex sentences correctly.
1150:
203:. Singapore: Pearson Education Inc. pp. 577–586.
316:
1092:
1047:
1002:
974:
934:
879:
801:
789:
720:
677:
649:
597:
460:
402:
49:. Unsourced material may be challenged and removed.
317:GATE General Architecture for Text Engineering
1170:
380:
287:"Parsing By Chunks | Principle-Based Parsing"
8:
248:: CS1 maint: multiple names: authors list (
1177:
1163:
798:
594:
387:
373:
365:
272:Association for Computational Linguistics
109:Learn how and when to remove this message
188:
241:
7:
1201:Tasks of natural language processing
1131:
1129:
846:Simple Knowledge Organization System
47:adding citations to reliable sources
1149:. You can help Knowledge (XXG) by
166:. It is similar to the concept of
14:
861:Thesaurus (information retrieval)
162:It is a technique widely used in
1133:
268:"NP Chunking (State of the art)"
23:
1206:Computational linguistics stubs
34:needs additional citations for
442:Natural language understanding
201:Speech and Language Processing
1:
966:Optical character recognition
659:Multi-document summarization
989:Latent Dirichlet allocation
961:Natural language generation
826:Machine-readable dictionary
821:Linguistic Linked Open Data
396:Natural language processing
199:; Martin, James H. (2000).
164:natural language processing
153:machine learning techniques
1222:
1128:
741:Explicit semantic analysis
490:Deep linguistic processing
1143:computational linguistics
584:Word-sense disambiguation
437:Computational linguistics
228:10.1017/S0142716406060024
216:Applied Psycholinguistics
1196:Natural language parsing
1110:Natural Language Toolkit
1034:Pronunciation assessment
936:Automatic identification
766:Latent semantic analysis
722:Distributional semantics
607:Compound-term processing
505:Named-entity recognition
360:Named entity recognition
1014:Automated essay scoring
984:Document classification
651:Automatic summarization
334:Illinois Shallow Parser
151:), approaches that use
1145:-related article is a
871:Universal Dependencies
564:Terminology extraction
547:Semantic decomposition
542:Semantic role labeling
532:Part-of-speech tagging
500:Information extraction
485:Coreference resolution
475:Collocation extraction
355:Semantic role labeling
285:Abney, Steven (1991).
135:) is an analysis of a
632:Sentence segmentation
1084:Voice user interface
795:datasets and corpora
736:Document-term matrix
589:Word-sense induction
43:improve this article
1064:Interactive fiction
994:Pachinko allocation
951:Speech segmentation
907:Google Ngram Viewer
679:Machine translation
669:Text simplification
664:Sentence extraction
552:Semantic similarity
322:includes a chunker.
313:includes a chunker.
296:. pp. 257–278.
149:regular expressions
1074:Question answering
946:Speech recognition
811:Corpus linguistics
791:Language resources
574:Textual entailment
557:Sentiment analysis
1158:
1157:
1123:
1122:
1079:Virtual assistant
1004:Computer-assisted
930:
929:
687:Computer-assisted
645:
644:
637:Word segmentation
599:Text segmentation
537:Semantic analysis
525:Syntactic parsing
510:Ontology learning
119:
118:
111:
93:
58:"Shallow parsing"
1213:
1179:
1172:
1165:
1137:
1130:
1100:Formal semantics
1049:Natural language
956:Speech synthesis
938:and data capture
841:Semantic network
816:Lexical resource
799:
617:Lexical analysis
595:
520:Semantic parsing
389:
382:
375:
366:
297:
294:www.vinartus.net
291:
281:
279:
278:
254:
253:
247:
239:
211:
205:
204:
197:Jurafsky, Daniel
193:
168:lexical analysis
114:
107:
103:
100:
94:
92:
51:
27:
19:
1221:
1220:
1216:
1215:
1214:
1212:
1211:
1210:
1186:
1185:
1184:
1183:
1126:
1124:
1119:
1088:
1068:Syntax guessing
1050:
1043:
1029:Predictive text
1024:Grammar checker
1005:
998:
970:
937:
926:
892:Bank of English
875:
803:
794:
785:
716:
673:
641:
593:
495:Distant reading
470:Argument mining
456:
452:Text processing
398:
393:
346:
336:Shallow Parser
304:
289:
284:
276:
274:
266:
263:
258:
257:
240:
213:
212:
208:
195:
194:
190:
185:
180:
172:second language
122:Shallow parsing
115:
104:
98:
95:
52:
50:
40:
28:
17:
12:
11:
5:
1219:
1217:
1209:
1208:
1203:
1198:
1188:
1187:
1182:
1181:
1174:
1167:
1159:
1156:
1155:
1138:
1121:
1120:
1118:
1117:
1112:
1107:
1102:
1096:
1094:
1090:
1089:
1087:
1086:
1081:
1076:
1071:
1061:
1055:
1053:
1051:user interface
1045:
1044:
1042:
1041:
1036:
1031:
1026:
1021:
1016:
1010:
1008:
1000:
999:
997:
996:
991:
986:
980:
978:
972:
971:
969:
968:
963:
958:
953:
948:
942:
940:
932:
931:
928:
927:
925:
924:
919:
914:
909:
904:
899:
894:
889:
883:
881:
877:
876:
874:
873:
868:
863:
858:
853:
848:
843:
838:
833:
828:
823:
818:
813:
807:
805:
796:
787:
786:
784:
783:
778:
776:Word embedding
773:
768:
763:
756:Language model
753:
748:
743:
738:
733:
727:
725:
718:
717:
715:
714:
709:
707:Transfer-based
704:
699:
694:
689:
683:
681:
675:
674:
672:
671:
666:
661:
655:
653:
647:
646:
643:
642:
640:
639:
634:
629:
624:
619:
614:
609:
603:
601:
592:
591:
586:
581:
576:
571:
566:
560:
559:
554:
549:
544:
539:
534:
529:
528:
527:
522:
512:
507:
502:
497:
492:
487:
482:
480:Concept mining
477:
472:
466:
464:
458:
457:
455:
454:
449:
444:
439:
434:
433:
432:
427:
417:
412:
406:
404:
400:
399:
394:
392:
391:
384:
377:
369:
363:
362:
357:
352:
345:
342:
341:
340:
331:
323:
314:
308:Apache OpenNLP
303:
302:External links
300:
299:
298:
282:
262:
259:
256:
255:
206:
187:
186:
184:
181:
179:
176:
157:topic modeling
155:(classifiers,
117:
116:
31:
29:
22:
15:
13:
10:
9:
6:
4:
3:
2:
1218:
1207:
1204:
1202:
1199:
1197:
1194:
1193:
1191:
1180:
1175:
1173:
1168:
1166:
1161:
1160:
1154:
1152:
1148:
1144:
1139:
1136:
1132:
1127:
1116:
1113:
1111:
1108:
1106:
1105:Hallucination
1103:
1101:
1098:
1097:
1095:
1091:
1085:
1082:
1080:
1077:
1075:
1072:
1069:
1065:
1062:
1060:
1057:
1056:
1054:
1052:
1046:
1040:
1039:Spell checker
1037:
1035:
1032:
1030:
1027:
1025:
1022:
1020:
1017:
1015:
1012:
1011:
1009:
1007:
1001:
995:
992:
990:
987:
985:
982:
981:
979:
977:
973:
967:
964:
962:
959:
957:
954:
952:
949:
947:
944:
943:
941:
939:
933:
923:
920:
918:
915:
913:
910:
908:
905:
903:
900:
898:
895:
893:
890:
888:
885:
884:
882:
878:
872:
869:
867:
864:
862:
859:
857:
854:
852:
851:Speech corpus
849:
847:
844:
842:
839:
837:
834:
832:
831:Parallel text
829:
827:
824:
822:
819:
817:
814:
812:
809:
808:
806:
800:
797:
792:
788:
782:
779:
777:
774:
772:
769:
767:
764:
761:
757:
754:
752:
749:
747:
744:
742:
739:
737:
734:
732:
729:
728:
726:
723:
719:
713:
710:
708:
705:
703:
700:
698:
695:
693:
692:Example-based
690:
688:
685:
684:
682:
680:
676:
670:
667:
665:
662:
660:
657:
656:
654:
652:
648:
638:
635:
633:
630:
628:
625:
623:
622:Text chunking
620:
618:
615:
613:
612:Lemmatisation
610:
608:
605:
604:
602:
600:
596:
590:
587:
585:
582:
580:
577:
575:
572:
570:
567:
565:
562:
561:
558:
555:
553:
550:
548:
545:
543:
540:
538:
535:
533:
530:
526:
523:
521:
518:
517:
516:
513:
511:
508:
506:
503:
501:
498:
496:
493:
491:
488:
486:
483:
481:
478:
476:
473:
471:
468:
467:
465:
463:
462:Text analysis
459:
453:
450:
448:
445:
443:
440:
438:
435:
431:
428:
426:
423:
422:
421:
418:
416:
413:
411:
408:
407:
405:
403:General terms
401:
397:
390:
385:
383:
378:
376:
371:
370:
367:
361:
358:
356:
353:
351:
348:
347:
343:
339:
335:
332:
330:
327:
324:
321:
318:
315:
312:
309:
306:
305:
301:
295:
288:
283:
273:
269:
265:
264:
260:
251:
245:
237:
233:
229:
225:
221:
217:
210:
207:
202:
198:
192:
189:
182:
177:
175:
173:
169:
165:
160:
158:
154:
150:
146:
142:
138:
134:
133:
127:
123:
113:
110:
102:
99:February 2016
91:
88:
84:
81:
77:
74:
70:
67:
63:
60: –
59:
55:
54:Find sources:
48:
44:
38:
37:
32:This article
30:
26:
21:
20:
1151:expanding it
1140:
1125:
1019:Concordancer
621:
415:Bag-of-words
293:
275:. Retrieved
244:cite journal
219:
215:
209:
200:
191:
161:
129:
125:
121:
120:
105:
96:
86:
79:
72:
65:
53:
41:Please help
36:verification
33:
976:Topic model
856:Text corpus
702:Statistical
569:Text mining
410:AI-complete
1190:Categories
697:Rule-based
579:Truecasing
447:Stop words
277:2016-01-30
178:References
143:groups or
69:newspapers
1006:reviewing
804:standards
802:Types and
183:Citations
922:Wikidata
902:FrameNet
887:BabelNet
866:Treebank
836:PropBank
781:Word2vec
746:fastText
627:Stemming
344:See also
329:chunking
236:15990215
222:: 3–42.
137:sentence
126:chunking
1093:Related
1059:Chatbot
917:WordNet
897:DBpedia
771:Seq2seq
515:Parsing
430:Trigram
311:OpenNLP
261:Sources
145:phrases
132:parsing
83:scholar
1066:(c.f.
724:models
712:Neural
425:Bigram
420:n-gram
350:Parser
234:
130:light
124:(also
85:
78:
71:
64:
56:
1141:This
1115:spaCy
760:large
751:GloVe
290:(PDF)
232:S2CID
90:JSTOR
76:books
1147:stub
880:Data
731:BERT
338:Demo
326:NLTK
320:GATE
250:link
141:noun
62:news
912:UBY
224:doi
128:or
45:by
1192::
292:.
270:.
246:}}
242:{{
230:.
220:27
218:.
1178:e
1171:t
1164:v
1153:.
1070:)
793:,
762:)
758:(
388:e
381:t
374:v
280:.
252:)
238:.
226::
112:)
106:(
101:)
97:(
87:·
80:·
73:·
66:·
39:.
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.