Shallow parsing - Knowledge (XXG)

Shallow parsing (also chunking or light parsing) is an analysis of a sentence which first identifies the constituent parts of the sentence (nouns, verbs, adjectives, etc.) and then links them to higher-order units that have discrete grammatical meanings (noun groups or phrases, verb groups, etc.). While the most elementary chunking algorithms simply link constituent parts on the basis of elementary search patterns (e.g., as specified by regular expressions), approaches that use machine learning techniques (classifiers, topic modeling, etc.) can take contextual information into account and thus compose chunks in such a way that they better reflect the semantic relations between the basic constituents. That is, these more advanced methods get around the problem that combinations of elementary constituents can have different higher-level meanings depending on the context of the sentence.
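The elementary, pattern-based kind of chunking mentioned above can be illustrated with a minimal sketch. It assumes the input has already been tagged with Penn Treebank-style part-of-speech tags, and it uses a single hypothetical noun-group pattern (optional determiner, any adjectives, one or more nouns). The function name `chunk_noun_phrases` and the "NP"/"O" labels are illustrative choices, not part of any particular library.

```python
import re

# Hypothetical noun-group pattern over Penn Treebank-style tags:
# an optional determiner, any number of adjectives, one or more nouns.
NP_PATTERN = re.compile(r"(?:<DT>)?(?:<JJ[RS]?>)*(?:<NNP?S?>)+")


def chunk_noun_phrases(tagged):
    """Group (word, tag) pairs into chunks.

    Returns a list of (label, words) tuples, where label is "NP" for a
    matched noun group and "O" for a token outside any chunk.
    """
    # Encode the tag sequence as one string, e.g. "<DT><JJ><NN>".
    tag_string = "".join(f"<{tag}>" for _, tag in tagged)

    # Map the character offset at which each tag starts to its token index.
    starts = {}
    offset = 0
    for index, (_, tag) in enumerate(tagged):
        starts[offset] = index
        offset += len(tag) + 2  # account for the surrounding "<" and ">"
    starts[offset] = len(tagged)

    chunks = []
    position = 0
    for match in NP_PATTERN.finditer(tag_string):
        begin, end = starts[match.start()], starts[match.end()]
        # Tokens between the previous chunk and this one stay unchunked.
        chunks.extend(("O", [word]) for word, _ in tagged[position:begin])
        chunks.append(("NP", [word for word, _ in tagged[begin:end]]))
        position = end
    chunks.extend(("O", [word]) for word, _ in tagged[position:])
    return chunks


if __name__ == "__main__":
    sentence = [
        ("The", "DT"), ("little", "JJ"), ("yellow", "JJ"), ("dog", "NN"),
        ("barked", "VBD"), ("at", "IN"), ("the", "DT"), ("cat", "NN"),
    ]
    for label, words in chunk_noun_phrases(sentence):
        print(label, " ".join(words))
```

Because the pattern looks only at the tag sequence, it cannot use sentence context to decide between competing groupings, which is the limitation that the machine learning approaches are meant to address.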

Shallow parsing is a technique widely used in natural language processing. It is similar to the concept of lexical analysis for computer languages. Under the name "shallow structure hypothesis", it is also used as an explanation for why second language learners often fail to parse complex sentences correctly (Clahsen and Felser 2006).

References

Citations

1. Jurafsky, Daniel; Martin, James H. (2000). Speech and Language Processing. Singapore: Pearson Education Inc. pp. 577–586.
2. Clahsen, Harald; Felser, Claudia (2006). "Grammatical Processing in Language Learners". Applied Psycholinguistics. 27: 3–42. doi:10.1017/S0142716406060024. S2CID 15990215.

Sources

"NP Chunking (State of the art)". Association for Computational Linguistics. Retrieved 2016-01-30.
Abney, Steven (1991). "Parsing By Chunks | Principle-Based Parsing" (PDF). www.vinartus.net. pp. 257–278.

External links

Apache OpenNLP: OpenNLP includes a chunker.
GATE General Architecture for Text Engineering: GATE includes a chunker.
NLTK chunking
Illinois Shallow Parser: Shallow Parser Demo

See also

Parser
Semantic role labeling
Named entity recognition

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.
