Knowledge (XXG)

Parse tree

Source 📝

31: 160: 485:
see all nodes as terminal, which means they do not acknowledge the distinction between terminal and non-terminal categories. They are simpler on average than constituency-based parse trees because they contain fewer nodes. The dependency-based parse tree for the example sentence above is as follows:
439:
node. A root node is a node that does not have any branches on top of it. Within a sentence, there is only ever one root node. A branch node is a parent node that connects to two or more child nodes. A leaf node, however, is a terminal node that does not dominate other nodes in the tree. S is the
460:
of the sentence. A parent node is one that has at least one other node linked by a branch under it. In the example, S is a parent of both N and VP. A child node is one that has at least one node directly above it to which it is linked by a branch of a tree. From the example,
556:), but are often given instead in the form of "bracketed expressions", which occupy less space in the memory. For example, a bracketed expression corresponding to the constituency-based tree given above may be something like: 167:
A parse tree is made up of nodes and branches. In the picture the parse tree is the entire structure, starting from S and ending in each of the leaf nodes (John, ball, the, hit). In a parse tree, each node is either a
521:
The constituency vs. dependency distinction is far-reaching. Whether the additional syntactic structure associated with constituency-based parse trees is necessary or beneficial is a matter of debate.
143:. A phrase marker is a linguistic expression marked as to its phrase structure. This may be presented in the form of a tree, or as a bracketed expression. Phrase markers are generated by applying 734:
As with trees, the precise construction of such expressions and the amount of detail shown can depend on the theory being applied and on the points that the query author wishes to illustrate.
729: 237: 909: 191:
node is one which has at least one node directly above it to which it is linked by a branch of the tree. Again from our example, hit is a child node of V.
280: 510:
structure is acknowledged. Any complete sub-tree of the tree is a constituent. Thus this dependency-based parse tree acknowledges the subject noun
496: 506:
This parse tree lacks the phrasal categories (S, VP, and NP) seen in the constituency-based counterpart above. Like the constituency-based tree,
820: 866:
See Carnie (2013:118ff.) for an introduction to the basic concepts of syntax trees (e.g. root node, terminal node, non-terminal node, etc.).
530: 140: 1005: 978: 1264: 1223: 780: 205:
For binary trees (where each parent node has two immediate child nodes), the number of possible parse trees for a sentence with
1228: 187:
node is one which has at least one other node linked by a branch under it. In the example, S is a parent of both NP and VP. A
180:
node. In the above example, S is a root node, NP and VP are branch nodes, while John, ball, the, and hit are all leaf nodes.
1233: 966: 117: 1259: 1171: 1073: 849: 759: 749: 507: 94: 75: 63: 267:
categories. The image below represents a constituency-based parse tree; it shows the syntactic structure of the
1068: 1040: 770: 248: 101: 1176: 1119: 1078: 148: 810: 310: 144: 109: 1201: 1181: 1045: 998: 549: 542: 100:
Parse trees are usually constructed based on either the constituency relation of constituency grammars (
55: 1213: 891:, Ludwig Eichinger, Hans-Werner Eroms, Peter Hellwig, Hans Heringer, and Hennig Lobin (eds.) 2003/6. 744: 125: 86: 67: 147:, and themselves are subject to further transformational rules. A set of possible parse trees for a 1218: 1186: 1124: 1102: 356: 1150: 888: 754: 482: 396: 279: 105: 93:
used for teaching grammar, parse trees do not use distinct symbol shapes for different types of
562: 1196: 1140: 1057: 816: 400: 331: 285:
The parse tree is the entire structure, starting from S and ending in each of the leaf nodes (
85:
Concrete syntax trees reflect the syntax of the input language, making them distinct from the
1092: 1022: 991: 899: 775: 457: 335: 268: 113: 90: 905:
Chiswell, Ian and Wilfrid Hodges 2007. Mathematical logic. Oxford: Oxford University Press.
215: 1254: 376: 264: 495: 538: 210: 1248: 1114: 1030: 252: 1145: 961: 534: 892: 1191: 1097: 951: 352: 327: 256: 836: 198:
is a function (node) which is either a root or a branch in that tree whereas a
17: 1107: 972: 948:
Enhanced version of phpSyntaxTree in Ruby with Unicode and Vectorized graphics
1087: 1035: 260: 933: 893:
Dependency and valency: An international handbook of contemporary research
942:– Online parse tree drawing site (improved version that supports Unicode) 121: 1014: 764: 30: 923: 159: 983: 939: 59: 945: 955: 518:
as constituents just like the constituency-based parse tree does.
158: 29: 183:
Nodes can also be referred to as parent nodes and child nodes. A
417: 372: 987: 545:. Then, this application may undergo further transformations. 330:. The first (leftmost) NP, a single noun "John", serves as the 928: 247:
The constituency-based parse trees of constituency grammars (
671: 656: 653: 614: 611: 585: 251:) distinguish between terminal and non-terminal nodes. The 529:
Phrase markers, or P-markers, were introduced in early
202:
is a function (node) in a parse tree which is a leaf.
565: 301:). The following abbreviations are used in the tree: 218: 837:
The structure of shared forests in ambiguous parsing
1164: 1133: 1056: 1021: 975:
Dependency Parse Introduction (Christopher Manning)
553: 850:"The parsetree Package for Drawing Trees in LaTeX" 723: 231: 908:Aho, A. V., Sethi, R., and Ullman, J. D. 1986. 548:Phrase markers may be presented in the form of 473:are also sometimes used for this relationship. 910:Compilers: Principles, techniques, & tools 999: 537:and others. A phrase marker representing the 8: 902:, 3rd edition. Malden, MA: Wiley-Blackwell. 456:(N) are all leaf nodes. The leaves are the 440:root node, NP and VP are branch nodes, and 1006: 992: 984: 704: 695: 680: 670: 669: 652: 651: 636: 627: 610: 609: 594: 584: 583: 570: 564: 313:, the top-level structure in this example 223: 217: 792: 541:of a sentence is generated by applying 334:of the sentence. The second one is the 875:See for example Ágel et al. 2003/2006. 962:TreeForm Syntax Tree Drawing Software 259:categories of the grammar, while the 151:sentence is called a "parse forest." 89:used in computer programming. Unlike 7: 481:The dependency-based parse trees of 835:Billot, Sylvie, and Bernard Lang. " 531:transformational generative grammar 141:transformational generative grammar 108:. Parse trees may be generated for 967:Visual Introduction to Parse Trees 427:Each node in the tree is either a 78:; in theoretical syntax, the term 25: 979:Penn Treebank II Constituent Tags 900:Syntax: A generative introduction 809:Noam Chomsky (26 December 2014). 799:See Chiswell and Hodges 2007: 34. 1224:History of compiler construction 936:– Online parse tree drawing site 781:Terminal and nonterminal symbols 494: 465:is a child node of V. The terms 278: 104:) or the dependency relation of 1229:Comparison of parser generators 969:Introduction and Transformation 958:package for drawing parse trees 812:Aspects of the Theory of Syntax 124:of computer languages, such as 912:. Reading, MA: Addison-Wesley. 718: 715: 712: 709: 692: 685: 666: 648: 641: 624: 606: 599: 580: 567: 554:constituency-based parse trees 243:Constituency-based parse trees 91:Reed-Kellogg sentence diagrams 27:Tree in formal language theory 1: 131:A related concept is that of 895:. Berlin: Walter de Gruyter. 552:(as in the above section on 477:Dependency-based parse trees 74:itself is used primarily in 1234:Operator-precedence grammar 929:Linguistic Tree Constructor 514:and the object noun phrase 118:natural language processing 1281: 760:Computational linguistics 750:Constituent (linguistics) 724:{\displaystyle \ \ \ ]]]} 249:phrase structure grammars 102:phrase structure grammars 76:computational linguistics 771:Phrase structure grammar 54:) is an ordered, rooted 1265:Trees (data structures) 1177:Definite clause grammar 940:phpSyntaxTree (Unicode) 399:, in this instance the 375:. In this case, it's a 149:syntactically ambiguous 725: 543:phrase structure rules 355:, which serves as the 233: 209:words is given by the 164: 145:phrase structure rules 35: 1182:Deterministic parsing 726: 234: 232:{\displaystyle C_{n}} 162: 126:programming languages 120:), as well as during 87:abstract syntax trees 33: 745:Abstract syntax tree 563: 216: 196:nonterminal function 68:context-free grammar 58:that represents the 52:concrete syntax tree 1219:Scannerless parsing 1187:Dynamic programming 483:dependency grammars 163:A simple parse tree 106:dependency grammars 1015:Parsing algorithms 924:Syntax Tree Editor 755:Dependency grammar 721: 533:, as developed by 229: 165: 66:according to some 36: 34:Parse tree to SAAB 1260:Generative syntax 1242: 1241: 1041:Recursive descent 898:Carnie, A. 2013. 822:978-0-262-52740-8 767:(syntax analysis) 707: 703: 690: 683: 679: 664: 646: 639: 635: 622: 604: 597: 593: 578: 273:John hit the ball 200:terminal function 114:natural languages 46:(also known as a 16:(Redirected from 1272: 1197:Parser generator 1120:Recursive ascent 1008: 1001: 994: 985: 973:OpenCourseOnline 876: 873: 867: 864: 858: 857: 854:www1.essex.ac.uk 846: 840: 833: 827: 826: 806: 800: 797: 776:Sentence diagram 730: 728: 727: 722: 708: 705: 701: 700: 699: 688: 684: 681: 677: 676: 675: 674: 662: 661: 660: 659: 644: 640: 637: 633: 632: 631: 620: 619: 618: 617: 602: 598: 595: 591: 590: 589: 588: 576: 575: 574: 498: 401:definite article 338:of the sentence. 282: 238: 236: 235: 230: 228: 227: 82:is more common. 21: 1280: 1279: 1275: 1274: 1273: 1271: 1270: 1269: 1245: 1244: 1243: 1238: 1160: 1129: 1052: 1017: 1012: 920: 915: 884: 879: 874: 870: 865: 861: 848: 847: 843: 834: 830: 823: 808: 807: 803: 798: 794: 790: 785: 740: 691: 665: 647: 623: 605: 579: 566: 561: 560: 527: 479: 377:transitive verb 263:are labeled by 255:are labeled by 245: 219: 214: 213: 157: 62:structure of a 48:derivation tree 28: 23: 22: 18:Derivation tree 15: 12: 11: 5: 1278: 1276: 1268: 1267: 1262: 1257: 1247: 1246: 1240: 1239: 1237: 1236: 1231: 1226: 1221: 1216: 1211: 1206: 1205: 1204: 1194: 1189: 1184: 1179: 1174: 1168: 1166: 1165:Related topics 1162: 1161: 1159: 1158: 1155: 1154: 1153: 1143: 1137: 1135: 1131: 1130: 1128: 1127: 1122: 1117: 1112: 1111: 1110: 1105: 1100: 1095: 1085: 1084: 1083: 1082: 1081: 1071: 1062: 1060: 1054: 1053: 1051: 1050: 1049: 1048: 1046:Tail recursive 1038: 1033: 1027: 1025: 1019: 1018: 1013: 1011: 1010: 1003: 996: 988: 982: 981: 976: 970: 964: 959: 949: 943: 937: 931: 926: 919: 918:External links 916: 914: 913: 906: 903: 896: 885: 883: 880: 878: 877: 868: 859: 841: 828: 821: 801: 791: 789: 786: 784: 783: 778: 773: 768: 762: 757: 752: 747: 741: 739: 736: 732: 731: 720: 717: 714: 711: 698: 694: 687: 673: 668: 658: 655: 650: 643: 630: 626: 616: 613: 608: 601: 587: 582: 573: 569: 539:deep structure 526: 525:Phrase markers 523: 504: 503: 502: 501: 500: 499: 478: 475: 458:lexical tokens 425: 424: 423: 422: 421: 420: 409: 408: 407: 406: 405: 404: 388: 387: 386: 385: 384: 383: 364: 363: 362: 361: 360: 359: 344: 343: 342: 341: 340: 339: 319: 318: 317: 316: 315: 314: 253:interior nodes 244: 241: 226: 222: 211:Catalan number 156: 153: 26: 24: 14: 13: 10: 9: 6: 4: 3: 2: 1277: 1266: 1263: 1261: 1258: 1256: 1253: 1252: 1250: 1235: 1232: 1230: 1227: 1225: 1222: 1220: 1217: 1215: 1212: 1210: 1207: 1203: 1200: 1199: 1198: 1195: 1193: 1190: 1188: 1185: 1183: 1180: 1178: 1175: 1173: 1170: 1169: 1167: 1163: 1156: 1152: 1149: 1148: 1147: 1144: 1142: 1139: 1138: 1136: 1132: 1126: 1123: 1121: 1118: 1116: 1113: 1109: 1106: 1104: 1101: 1099: 1096: 1094: 1091: 1090: 1089: 1086: 1080: 1079:Shunting-yard 1077: 1076: 1075: 1072: 1070: 1067: 1066: 1064: 1063: 1061: 1059: 1055: 1047: 1044: 1043: 1042: 1039: 1037: 1034: 1032: 1029: 1028: 1026: 1024: 1020: 1016: 1009: 1004: 1002: 997: 995: 990: 989: 986: 980: 977: 974: 971: 968: 965: 963: 960: 957: 953: 950: 947: 944: 941: 938: 935: 934:phpSyntaxTree 932: 930: 927: 925: 922: 921: 917: 911: 907: 904: 901: 897: 894: 890: 887: 886: 881: 872: 869: 863: 860: 855: 851: 845: 842: 838: 832: 829: 824: 818: 815:. MIT Press. 814: 813: 805: 802: 796: 793: 787: 782: 779: 777: 774: 772: 769: 766: 763: 761: 758: 756: 753: 751: 748: 746: 743: 742: 737: 735: 696: 628: 571: 559: 558: 557: 555: 551: 546: 544: 540: 536: 532: 524: 522: 519: 517: 513: 509: 497: 493: 492: 491: 490: 489: 488: 487: 484: 476: 474: 472: 468: 464: 459: 455: 451: 447: 443: 438: 434: 430: 419: 415: 414: 413: 412: 411: 410: 402: 398: 394: 393: 392: 391: 390: 389: 381: 378: 374: 370: 369: 368: 367: 366: 365: 358: 354: 350: 349: 348: 347: 346: 345: 337: 333: 329: 325: 324: 323: 322: 321: 320: 312: 308: 307: 306: 305: 304: 303: 302: 300: 296: 292: 288: 283: 281: 276: 274: 270: 266: 262: 258: 254: 250: 242: 240: 224: 220: 212: 208: 203: 201: 197: 192: 190: 186: 181: 179: 175: 171: 161: 154: 152: 150: 146: 142: 139:, as used in 138: 134: 133:phrase marker 129: 127: 123: 119: 115: 111: 107: 103: 98: 96: 92: 88: 83: 81: 77: 73: 69: 65: 61: 57: 53: 49: 45: 41: 32: 19: 1208: 1134:Mixed, other 1125:Shift-reduce 871: 862: 853: 844: 831: 811: 804: 795: 733: 547: 535:Noam Chomsky 528: 520: 515: 511: 505: 480: 470: 466: 462: 453: 449: 445: 441: 436: 432: 428: 426: 379: 298: 294: 290: 286: 284: 277: 272: 257:non-terminal 246: 206: 204: 199: 195: 193: 188: 184: 182: 177: 173: 169: 166: 155:Nomenclature 136: 132: 130: 99: 95:constituents 84: 79: 71: 51: 47: 44:parsing tree 43: 39: 37: 1192:Memoization 1157:Statistical 1151:Left corner 1108:Generalized 1065:Precedence 946:rSyntaxTree 508:constituent 435:node, or a 353:verb phrase 328:noun phrase 176:node, or a 80:syntax tree 70:. The term 1249:Categories 1209:Parse tree 1141:Combinator 1098:Look-ahead 882:References 397:determiner 261:leaf nodes 122:processing 72:parse tree 40:parse tree 1103:Canonical 1058:Bottom-up 452:(D), and 357:predicate 271:sentence 110:sentences 60:syntactic 1074:Operator 1023:Top-down 889:Ágel, V. 738:See also 516:the ball 471:daughter 431:node, a 311:sentence 265:terminal 172:node, a 137:P-marker 765:Parsing 351:VP for 332:subject 326:NP for 269:English 1255:Syntax 1093:Simple 1069:Simple 1031:Earley 819:  702:  689:  678:  663:  645:  634:  621:  603:  592:  577:  467:mother 433:branch 416:N for 395:D for 371:V for 336:object 309:S for 185:parent 174:branch 64:string 1146:Chart 956:LaTeX 952:Qtree 788:Notes 550:trees 448:(V), 444:(N), 403:"the" 189:child 116:(see 1202:LALR 817:ISBN 706:ball 596:John 512:John 469:and 454:ball 442:John 437:leaf 429:root 418:noun 373:verb 299:ball 287:John 178:leaf 170:root 56:tree 1214:AST 1172:PEG 1115:CYK 682:the 638:hit 463:hit 450:the 446:hit 380:hit 295:the 291:hit 135:or 112:in 50:or 42:or 1251:: 1088:LR 1036:LL 954:– 852:. 839:." 297:, 293:, 289:, 275:: 239:. 194:A 128:. 97:. 38:A 1007:e 1000:t 993:v 856:. 825:. 719:] 716:] 713:] 710:] 697:N 693:[ 686:] 672:D 667:[ 657:P 654:N 649:[ 642:] 629:V 625:[ 615:P 612:V 607:[ 600:] 586:N 581:[ 572:S 568:[ 382:. 225:n 221:C 207:n 20:)

Index

Derivation tree

tree
syntactic
string
context-free grammar
computational linguistics
abstract syntax trees
Reed-Kellogg sentence diagrams
constituents
phrase structure grammars
dependency grammars
sentences
natural languages
natural language processing
processing
programming languages
transformational generative grammar
phrase structure rules
syntactically ambiguous

Catalan number
phrase structure grammars
interior nodes
non-terminal
leaf nodes
terminal
English
Parse tree PSG
sentence

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.