Knowledge (XXG)

PetaBox

Source 📝

516: 122: 25: 228:
The first 100 terabyte rack became operational in Amsterdam at the Internet Archive's European arm, the Stichting Internet Archive (SIA), in June 2004. The second 80 terabyte rack became operational in their main San Francisco location that same year. The Internet Archive then spun off its Petabox
267:
contains 57 petabytes of information; book, music and video collections contain an extra 42 petabytes of information, and "unique data" account for an extra 99 petabytes of information, for a total of 212 petabytes of storage.
255:
In 2010, the fourth version of the Petabox began operation. Each Petabox allowed for 480 TB of raw storage (240 disks of 2 TB each, set up with 24 disks per 4U high rack units and with 10 units per rack) running on
252:
sites, and other enterprises. Their largest product uses 750 gigabyte disks. In 2007, the Internet Archive data center housed approximately three petabytes of Petabox storage technology.
440: 620: 381: 615: 574: 433: 263:
As of December 2021, the Internet Archive's Petabox storage system consists of four data centers, 745 nodes, and 28,000 spinning disks. The
779: 810: 108: 426: 610: 605: 547: 542: 46: 820: 552: 787: 815: 89: 61: 557: 237: 389: 35: 505: 68: 42: 630: 532: 625: 75: 490: 121: 515: 232:
Between 2004 and 2007, Capricorn replicated the Internet Archive's deployment of the Petabox for major
233: 204: 363: 57: 681: 537: 140:. It was designed by the staff of the Internet Archive and C. R. Saikley to store and process one 653: 198: 771: 635: 587: 191: 709: 704: 676: 668: 645: 597: 582: 495: 485: 449: 367: 341: 137: 470: 264: 249: 245: 241: 729: 724: 686: 211: 804: 691: 82: 734: 480: 176: 739: 24: 285: 305: 754: 658: 141: 172:
Low power: 6 kW per rack, 60 kW for the entire storage cluster
159:
No air conditioning, instead uses excess heat to help heat the building
413: 336: 418: 500: 257: 197:
Shipping container friendly: able to be run in a 20' by 8' by 8'
185: 120: 290: 422: 229:
production to the newly-formed company Capricorn Technologies.
18: 136:, is a storage unit from Capricorn Technologies and the 181:
Local computing to process the data (800 low-end PCs)
763: 747: 718: 667: 644: 596: 573: 566: 523: 463: 49:. Unsourced material may be challenged and removed. 236:, digital preservationists, government agencies, 382:"eWEEK Labs Walk-Through: the Internet Archive" 434: 8: 570: 441: 427: 419: 109:Learn how and when to remove this message 414:Petabox overview on the Internet Archive 277: 168:Design goals of the Petabox included: 144:(a million gigabytes) of information. 156:Power consumption: 3 kW/petabyte 7: 636:Collected texts of Simon Schwartzman 331: 329: 327: 325: 47:adding citations to reliable sources 16:High-volume digital storage hardware 780:Recorder: The Marion Stokes Project 14: 458:Universal access to all knowledge 621:RECAP US Federal Court Documents 514: 240:(HPC) and major research sites, 23: 364:"The Fourth Generation Petabox" 34:needs additional citations for 219:Inexpensive design and storage 1: 553:Biodiversity Heritage Library 788:Hachette v. Internet Archive 362:Jeff Kaplan (27 July 2010). 337:"Internet Archive: Petabox" 153:Density: 1.4 petabytes/rack 837: 710:Open Educational Resources 286:"Big storage on the cheap" 246:digital image repositories 238:high-performance computing 210:Software to automate full 811:Internet Archive projects 512: 456: 506:Internet Archive Scholar 306:"PetaBox Product Family" 125:Internet Archive Petabox 631:US Government Documents 533:Bibliotheca Alexandrina 310:Capricorn Technologies 203:Easy maintenance: one 175:High density: 100+ TB/ 126: 491:Open Content Alliance 234:academic institutions 124: 821:Data storage servers 205:system administrator 43:improve this article 538:Library of Congress 250:storage outsourcing 184:Multi-OS possible, 816:Computer enclosure 654:Live Music Archive 616:Children's Library 611:Canadian Libraries 606:American Libraries 548:Canadian Libraries 543:American Libraries 199:shipping container 127: 798: 797: 772:Panorama Ephemera 700: 699: 588:Libre Map Project 119: 118: 111: 93: 828: 571: 558:Sloan Foundation 518: 450:Internet Archive 443: 436: 429: 420: 401: 400: 398: 397: 388:. Archived from 378: 372: 371: 368:Internet Archive 359: 353: 352: 350: 349: 342:Internet Archive 333: 320: 319: 317: 316: 302: 296: 295: 282: 138:Internet Archive 132:, also stylized 114: 107: 103: 100: 94: 92: 51: 27: 19: 836: 835: 831: 830: 829: 827: 826: 825: 801: 800: 799: 794: 759: 743: 714: 696: 663: 640: 592: 562: 525: 519: 510: 471:Wayback Machine 459: 452: 447: 410: 405: 404: 395: 393: 380: 379: 375: 361: 360: 356: 347: 345: 335: 334: 323: 314: 312: 304: 303: 299: 284: 283: 279: 274: 265:Wayback Machine 242:medical imaging 226: 166: 150: 115: 104: 98: 95: 52: 50: 40: 28: 17: 12: 11: 5: 834: 832: 824: 823: 818: 813: 803: 802: 796: 795: 793: 792: 784: 776: 767: 765: 761: 760: 758: 757: 751: 749: 745: 744: 742: 737: 732: 730:Rick Prelinger 727: 725:Brewster Kahle 722: 720: 716: 715: 713: 712: 707: 701: 698: 697: 695: 694: 689: 687:Democracy Now! 684: 679: 673: 671: 665: 664: 662: 661: 656: 650: 648: 642: 641: 639: 638: 633: 628: 623: 618: 613: 608: 602: 600: 594: 593: 591: 590: 585: 579: 577: 568: 564: 563: 561: 560: 555: 550: 545: 540: 535: 529: 527: 521: 520: 513: 511: 509: 508: 503: 498: 493: 488: 483: 478: 473: 467: 465: 461: 460: 457: 454: 453: 448: 446: 445: 438: 431: 423: 417: 416: 409: 408:External links 406: 403: 402: 373: 354: 321: 297: 276: 275: 273: 270: 225: 222: 221: 220: 217: 214: 208: 201: 195: 189: 182: 179: 173: 165: 162: 161: 160: 157: 154: 149: 148:Specifications 146: 117: 116: 31: 29: 22: 15: 13: 10: 9: 6: 4: 3: 2: 833: 822: 819: 817: 814: 812: 809: 808: 806: 790: 789: 785: 782: 781: 777: 774: 773: 769: 768: 766: 762: 756: 753: 752: 750: 746: 741: 738: 736: 733: 731: 728: 726: 723: 721: 717: 711: 708: 706: 703: 702: 693: 692:Marion Stokes 690: 688: 685: 683: 680: 678: 675: 674: 672: 670: 666: 660: 657: 655: 652: 651: 649: 647: 643: 637: 634: 632: 629: 627: 624: 622: 619: 617: 614: 612: 609: 607: 604: 603: 601: 599: 595: 589: 586: 584: 581: 580: 578: 576: 572: 569: 565: 559: 556: 554: 551: 549: 546: 544: 541: 539: 536: 534: 531: 530: 528: 526:Collaborators 522: 517: 507: 504: 502: 499: 497: 494: 492: 489: 487: 484: 482: 479: 477: 474: 472: 469: 468: 466: 462: 455: 451: 444: 439: 437: 432: 430: 425: 424: 421: 415: 412: 411: 407: 392:on 2022-04-27 391: 387: 383: 377: 374: 369: 365: 358: 355: 344: 343: 338: 332: 330: 328: 326: 322: 311: 307: 301: 298: 293: 292: 287: 281: 278: 271: 269: 266: 261: 259: 253: 251: 247: 243: 239: 235: 230: 223: 218: 216:Easy to scale 215: 213: 209: 206: 202: 200: 196: 193: 190: 187: 183: 180: 178: 174: 171: 170: 169: 163: 158: 155: 152: 151: 147: 145: 143: 139: 135: 131: 123: 113: 110: 102: 99:December 2012 91: 88: 84: 81: 77: 74: 70: 67: 63: 60: –  59: 55: 54:Find sources: 48: 44: 38: 37: 32:This article 30: 26: 21: 20: 786: 778: 770: 735:David Rumsey 524:Partners and 481:Open Library 475: 394:. Retrieved 390:the original 385: 376: 357: 346:. Retrieved 340: 313:. Retrieved 309: 300: 289: 280: 262: 254: 231: 227: 207:per petabyte 167: 133: 129: 128: 105: 96: 86: 79: 72: 65: 53: 41:Please help 36:verification 33: 740:Jason Scott 677:NASA Images 583:NASA Images 567:Collections 486:NASA Images 244:providers, 805:Categories 496:Archive-It 396:2021-11-09 348:2023-07-10 315:2023-07-10 272:References 192:Colocation 69:newspapers 626:Microfilm 212:mirroring 58:"PetaBox" 755:Heritrix 748:Software 705:Software 659:LibriVox 464:Projects 386:PCMag UK 194:friendly 188:standard 142:petabyte 764:Related 682:FedFlix 476:PetaBox 224:History 134:Petabox 130:PetaBox 83:scholar 791:(2023) 783:(2019) 775:(2004) 719:People 164:Design 85:  78:  71:  64:  56:  669:Video 646:Audio 598:Texts 575:Image 501:SFlan 258:Linux 186:Linux 90:JSTOR 76:books 291:CNET 177:rack 62:news 45:by 807:: 384:. 366:. 339:. 324:^ 308:. 288:. 260:. 248:, 442:e 435:t 428:v 399:. 370:. 351:. 318:. 294:. 112:) 106:( 101:) 97:( 87:· 80:· 73:· 66:· 39:.

Index


verification
improve this article
adding citations to reliable sources
"PetaBox"
news
newspapers
books
scholar
JSTOR
Learn how and when to remove this message

Internet Archive
petabyte
rack
Linux
Colocation
shipping container
system administrator
mirroring
academic institutions
high-performance computing
medical imaging
digital image repositories
storage outsourcing
Linux
Wayback Machine
"Big storage on the cheap"
CNET
"PetaBox Product Family"

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.