Knowledge (XXG)

Lancaster-Oslo-Bergen Corpus

Source 📝

36: 838: 781: 147:
Its composition was designed to match the original Brown corpus in terms of its size and genres as closely as possible using documents published in the UK in 1961 by British authors. Both corpora consist of 500 samples each comprising about 2000 words in the following genres:
898: 65: 611: 948: 822: 496: 879: 531: 903: 556: 699: 596: 815: 87: 125: 679: 489: 953: 872: 571: 913: 808: 933: 928: 48: 918: 482: 58: 52: 44: 865: 734: 719: 704: 674: 69: 923: 649: 644: 551: 521: 750: 694: 664: 536: 117: 943: 407: 724: 689: 684: 654: 591: 581: 908: 729: 566: 505: 121: 141: 849: 792: 938: 845: 669: 629: 788: 659: 526: 443: 113: 137: 546: 411: 892: 760: 561: 541: 133: 464: 430: 837: 780: 17: 709: 639: 586: 755: 714: 634: 606: 474: 129: 116:
texts which was compiled in the 1970s in collaboration between the
601: 469: 478: 29: 853: 796: 743: 620: 512: 57:but its sources remain unclear because it lacks 612:Wellington Corpus of Spoken New Zealand English 444:"CoRD | The Lancaster-Oslo/Bergen Corpus (LOB)" 640:CorCenCC National Corpus of Contemporary Welsh 873: 816: 490: 414:categories have been assigned to every word. 126:Norwegian Computing Centre for the Humanities 8: 899:1970s establishments in the United Kingdom 880: 866: 823: 809: 497: 483: 475: 132:, to provide a British counterpart to the 88:Learn how and when to remove this message 27:1970s collection of British English texts 532:Bergen Corpus of London Teenage Language 281:Miscellaneous (documents, reports, etc.) 150: 557:Corpus of Contemporary American English 470:LOB Corpus from the Oxford Text Archive 423: 949:Library and information science stubs 7: 834: 832: 777: 775: 112:is a one-million-word collection of 700:Scottish Corpus of Texts and Speech 597:Switchboard Telephone Speech Corpus 144:for American English in the 1960s. 852:. You can help Knowledge (XXG) by 795:. You can help Knowledge (XXG) by 25: 263:Belles lettres, biography, essays 836: 779: 680:Neo-Assyrian Text Corpus Project 34: 572:International Corpus of English 295:Learned and scientific writings 904:1970s establishments in Norway 1: 351:Adventure and western fiction 323:Mystery and detective fiction 577:Lancaster-Oslo-Bergen Corpus 970: 831: 774: 227:Skills, trades and hobbies 735:Thesaurus Linguae Graecae 720:Tehran Monolingual Corpus 705:Slovenian National Corpus 675:National Corpus of Polish 406:The corpus has been also 650:Croatian National Corpus 645:Croatian Language Corpus 552:Cambridge English Corpus 522:American National Corpus 43:This article includes a 844:This article about the 695:Russian National Corpus 665:German Reference Corpus 537:British National Corpus 118:University of Lancaster 72:more precise citations. 954:English language stubs 365:Romance and love story 787:This article about a 725:Tekstaro de Esperanto 690:Quranic Arabic Corpus 685:Persian Speech Corpus 655:Czech National Corpus 592:Spoken English Corpus 582:Oxford English Corpus 102:Lancaster-Oslo/Bergen 914:Lancaster University 730:TenTen Corpus Family 934:Applied linguistics 929:Linguistic research 448:varieng.helsinki.fi 919:University of Oslo 506:Corpus linguistics 122:University of Oslo 45:list of references 861: 860: 804: 803: 769: 768: 465:LOB Corpus Manual 442:Johansson, Stig. 431:LOB Corpus Manual 404: 403: 142:W. Nelson Francis 98: 97: 90: 16:(Redirected from 961: 882: 875: 868: 846:English language 840: 833: 825: 818: 811: 783: 776: 670:Hamshahri Corpus 630:Bijankhan Corpus 499: 492: 485: 476: 452: 451: 439: 433: 428: 185:Press: editorial 171:Press: reportage 151: 93: 86: 82: 79: 73: 68:this article by 59:inline citations 38: 37: 30: 21: 969: 968: 964: 963: 962: 960: 959: 958: 924:English corpora 889: 888: 887: 886: 830: 829: 789:digital library 772: 770: 765: 739: 660:Europarl Corpus 622: 616: 527:Bank of English 514: 508: 503: 461: 456: 455: 441: 440: 436: 429: 425: 420: 337:Science fiction 309:General fiction 114:British English 94: 83: 77: 74: 63: 49:related reading 39: 35: 28: 23: 22: 15: 12: 11: 5: 967: 965: 957: 956: 951: 946: 941: 936: 931: 926: 921: 916: 911: 906: 901: 891: 890: 885: 884: 877: 870: 862: 859: 858: 841: 828: 827: 820: 813: 805: 802: 801: 784: 767: 766: 764: 763: 758: 753: 751:BNC consortium 747: 745: 741: 740: 738: 737: 732: 727: 722: 717: 712: 707: 702: 697: 692: 687: 682: 677: 672: 667: 662: 657: 652: 647: 642: 637: 632: 626: 624: 618: 617: 615: 614: 609: 604: 599: 594: 589: 584: 579: 574: 569: 564: 559: 554: 549: 547:Buckeye Corpus 544: 539: 534: 529: 524: 518: 516: 510: 509: 504: 502: 501: 494: 487: 479: 473: 472: 467: 460: 459:External links 457: 454: 453: 434: 422: 421: 419: 416: 412:part-of-speech 402: 401: 398: 395: 390: 387: 386: 383: 380: 377: 373: 372: 369: 366: 363: 359: 358: 355: 352: 349: 345: 344: 341: 338: 335: 331: 330: 327: 324: 321: 317: 316: 313: 310: 307: 303: 302: 299: 296: 293: 289: 288: 285: 282: 279: 275: 274: 269: 264: 261: 257: 256: 251: 246: 243: 239: 238: 233: 228: 225: 221: 220: 217: 214: 211: 207: 206: 203: 200: 199:Press: reviews 197: 193: 192: 189: 186: 183: 179: 178: 175: 172: 169: 165: 164: 161: 158: 157:Text category 155: 96: 95: 53:external links 42: 40: 33: 26: 24: 14: 13: 10: 9: 6: 4: 3: 2: 966: 955: 952: 950: 947: 945: 944:Website stubs 942: 940: 937: 935: 932: 930: 927: 925: 922: 920: 917: 915: 912: 910: 907: 905: 902: 900: 897: 896: 894: 883: 878: 876: 871: 869: 864: 863: 857: 855: 851: 847: 842: 839: 835: 826: 821: 819: 814: 812: 807: 806: 800: 798: 794: 790: 785: 782: 778: 773: 762: 761:Sketch Engine 759: 757: 754: 752: 749: 748: 746: 744:Organizations 742: 736: 733: 731: 728: 726: 723: 721: 718: 716: 713: 711: 708: 706: 703: 701: 698: 696: 693: 691: 688: 686: 683: 681: 678: 676: 673: 671: 668: 666: 663: 661: 658: 656: 653: 651: 648: 646: 643: 641: 638: 636: 633: 631: 628: 627: 625: 621:Text corpora, 619: 613: 610: 608: 605: 603: 600: 598: 595: 593: 590: 588: 585: 583: 580: 578: 575: 573: 570: 568: 565: 563: 560: 558: 555: 553: 550: 548: 545: 543: 540: 538: 535: 533: 530: 528: 525: 523: 520: 519: 517: 513:Text corpora, 511: 507: 500: 495: 493: 488: 486: 481: 480: 477: 471: 468: 466: 463: 462: 458: 449: 445: 438: 435: 432: 427: 424: 417: 415: 413: 409: 399: 396: 394: 391: 389: 388: 384: 381: 378: 375: 374: 370: 367: 364: 361: 360: 356: 353: 350: 347: 346: 342: 339: 336: 333: 332: 328: 325: 322: 319: 318: 314: 311: 308: 305: 304: 300: 297: 294: 291: 290: 286: 283: 280: 277: 276: 273: 270: 268: 265: 262: 259: 258: 255: 252: 250: 247: 244: 241: 240: 237: 234: 232: 229: 226: 223: 222: 218: 215: 212: 209: 208: 204: 201: 198: 195: 194: 190: 187: 184: 181: 180: 176: 173: 170: 167: 166: 162: 160:Brown Corpus 159: 156: 153: 152: 149: 145: 143: 139: 135: 131: 127: 123: 119: 115: 111: 107: 103: 92: 89: 81: 78:December 2022 71: 67: 61: 60: 54: 50: 46: 41: 32: 31: 19: 854:expanding it 843: 797:expanding it 786: 771: 576: 562:Enron Corpus 542:Brown Corpus 447: 437: 426: 405: 392: 271: 266: 253: 248: 245:Popular lore 235: 230: 146: 138:Henry Kučera 136:compiled by 134:Brown Corpus 109: 105: 101: 99: 84: 75: 64:Please help 56: 909:1970s works 623:non-English 163:LOB Corpus 70:introducing 893:Categories 418:References 124:, and the 18:LOB Corpus 710:TalkBank 587:PropBank 567:EnTenTen 213:Religion 939:Corpora 756:COBUILD 715:Tatoeba 635:CHILDES 607:VerbNet 515:English 410:, i.e. 66:improve 408:tagged 379:Humour 154:Label 130:Bergen 120:, the 110:Corpus 848:is a 791:is a 602:TIMIT 393:Total 51:, or 850:stub 793:stub 400:500 140:and 100:The 397:500 371:29 357:29 329:24 315:29 301:80 287:30 219:17 205:17 191:27 177:44 106:LOB 895:: 446:. 385:9 368:29 354:29 343:6 326:24 312:29 298:80 284:30 272:77 267:75 254:44 249:48 236:38 231:36 216:17 202:17 188:27 174:44 128:, 108:) 55:, 47:, 881:e 874:t 867:v 856:. 824:e 817:t 810:v 799:. 498:e 491:t 484:v 450:. 382:9 376:R 362:P 348:N 340:6 334:M 320:L 306:K 292:J 278:H 260:G 242:F 224:E 210:D 196:C 182:B 168:A 104:( 91:) 85:( 80:) 76:( 62:. 20:)

Index

LOB Corpus
list of references
related reading
external links
inline citations
improve
introducing
Learn how and when to remove this message
British English
University of Lancaster
University of Oslo
Norwegian Computing Centre for the Humanities
Bergen
Brown Corpus
Henry Kučera
W. Nelson Francis
tagged
part-of-speech
LOB Corpus Manual
"CoRD | The Lancaster-Oslo/Bergen Corpus (LOB)"
LOB Corpus Manual
LOB Corpus from the Oxford Text Archive
v
t
e
Corpus linguistics
American National Corpus
Bank of English
Bergen Corpus of London Teenage Language
British National Corpus

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.