The Lancaster-Oslo/Bergen (LOB) Corpus is a one-million-word collection of British English texts compiled in the 1970s through a collaboration between the University of Lancaster, the University of Oslo, and the Norwegian Computing Centre for the Humanities, Bergen. It was created to provide a British counterpart to the Brown Corpus, which Henry Kučera and W. Nelson Francis compiled for American English in the 1960s.

Its composition was designed to match the original Brown Corpus as closely as possible in size and genre, using documents by British authors published in the UK in 1961. Both corpora consist of 500 samples of about 2,000 words each, distributed across the following genres:

| Label | Text category | Brown Corpus | LOB Corpus |
|-------|---------------|--------------|------------|
| A | Press: reportage | 44 | 44 |
| B | Press: editorial | 27 | 27 |
| C | Press: reviews | 17 | 17 |
| D | Religion | 17 | 17 |
| E | Skills, trades and hobbies | 36 | 38 |
| F | Popular lore | 48 | 44 |
| G | Belles lettres, biography, essays | 75 | 77 |
| H | Miscellaneous (documents, reports, etc.) | 30 | 30 |
| J | Learned and scientific writings | 80 | 80 |
| K | General fiction | 29 | 29 |
| L | Mystery and detective fiction | 24 | 24 |
| M | Science fiction | 6 | 6 |
| N | Adventure and western fiction | 29 | 29 |
| P | Romance and love story | 29 | 29 |
| R | Humour | 9 | 9 |
| Total | | 500 | 500 |

The corpus has also been tagged: a part-of-speech category has been assigned to every word.

References

LOB Corpus Manual
Johansson, Stig. "CoRD | The Lancaster-Oslo/Bergen Corpus (LOB)". varieng.helsinki.fi.

External links

LOB Corpus Manual
LOB Corpus from the Oxford Text Archive