516:
122:
25:
228:
The first 100 terabyte rack became operational in
Amsterdam at the Internet Archive's European arm, the Stichting Internet Archive (SIA), in June 2004. The second 80 terabyte rack became operational in their main San Francisco location that same year. The Internet Archive then spun off its Petabox
267:
contains 57 petabytes of information; book, music and video collections contain an extra 42 petabytes of information, and "unique data" account for an extra 99 petabytes of information, for a total of 212 petabytes of storage.
255:
In 2010, the fourth version of the
Petabox began operation. Each Petabox allowed for 480 TB of raw storage (240 disks of 2 TB each, set up with 24 disks per 4U high rack units and with 10 units per rack) running on
252:
sites, and other enterprises. Their largest product uses 750 gigabyte disks. In 2007, the
Internet Archive data center housed approximately three petabytes of Petabox storage technology.
440:
620:
381:
615:
574:
433:
263:
As of
December 2021, the Internet Archive's Petabox storage system consists of four data centers, 745 nodes, and 28,000 spinning disks. The
779:
810:
108:
426:
610:
605:
547:
542:
46:
820:
552:
787:
815:
89:
61:
557:
237:
389:
35:
505:
68:
42:
630:
532:
625:
75:
490:
121:
515:
232:
Between 2004 and 2007, Capricorn replicated the
Internet Archive's deployment of the Petabox for major
233:
204:
363:
57:
681:
537:
140:. It was designed by the staff of the Internet Archive and C. R. Saikley to store and process one
653:
198:
771:
635:
587:
191:
709:
704:
676:
668:
645:
597:
582:
495:
485:
449:
367:
341:
137:
470:
264:
249:
245:
241:
729:
724:
686:
211:
804:
691:
82:
734:
480:
176:
739:
24:
285:
305:
754:
658:
141:
172:
Low power: 6 kW per rack, 60 kW for the entire storage cluster
159:
No air conditioning, instead uses excess heat to help heat the building
413:
336:
418:
500:
257:
197:
Shipping container friendly: able to be run in a 20' by 8' by 8'
185:
120:
290:
422:
229:
production to the newly-formed company
Capricorn Technologies.
18:
136:, is a storage unit from Capricorn Technologies and the
181:
Local computing to process the data (800 low-end PCs)
763:
747:
718:
667:
644:
596:
573:
566:
523:
463:
49:. Unsourced material may be challenged and removed.
236:, digital preservationists, government agencies,
382:"eWEEK Labs Walk-Through: the Internet Archive"
434:
8:
570:
441:
427:
419:
109:Learn how and when to remove this message
414:Petabox overview on the Internet Archive
277:
168:Design goals of the Petabox included:
144:(a million gigabytes) of information.
156:Power consumption: 3 kW/petabyte
7:
636:Collected texts of Simon Schwartzman
331:
329:
327:
325:
47:adding citations to reliable sources
16:High-volume digital storage hardware
780:Recorder: The Marion Stokes Project
14:
458:Universal access to all knowledge
621:RECAP US Federal Court Documents
514:
240:(HPC) and major research sites,
23:
364:"The Fourth Generation Petabox"
34:needs additional citations for
219:Inexpensive design and storage
1:
553:Biodiversity Heritage Library
788:Hachette v. Internet Archive
362:Jeff Kaplan (27 July 2010).
337:"Internet Archive: Petabox"
153:Density: 1.4 petabytes/rack
837:
710:Open Educational Resources
286:"Big storage on the cheap"
246:digital image repositories
238:high-performance computing
210:Software to automate full
811:Internet Archive projects
512:
456:
506:Internet Archive Scholar
306:"PetaBox Product Family"
125:Internet Archive Petabox
631:US Government Documents
533:Bibliotheca Alexandrina
310:Capricorn Technologies
203:Easy maintenance: one
175:High density: 100+ TB/
126:
491:Open Content Alliance
234:academic institutions
124:
821:Data storage servers
205:system administrator
43:improve this article
538:Library of Congress
250:storage outsourcing
184:Multi-OS possible,
816:Computer enclosure
654:Live Music Archive
616:Children's Library
611:Canadian Libraries
606:American Libraries
548:Canadian Libraries
543:American Libraries
199:shipping container
127:
798:
797:
772:Panorama Ephemera
700:
699:
588:Libre Map Project
119:
118:
111:
93:
828:
571:
558:Sloan Foundation
518:
450:Internet Archive
443:
436:
429:
420:
401:
400:
398:
397:
388:. Archived from
378:
372:
371:
368:Internet Archive
359:
353:
352:
350:
349:
342:Internet Archive
333:
320:
319:
317:
316:
302:
296:
295:
282:
138:Internet Archive
132:, also stylized
114:
107:
103:
100:
94:
92:
51:
27:
19:
836:
835:
831:
830:
829:
827:
826:
825:
801:
800:
799:
794:
759:
743:
714:
696:
663:
640:
592:
562:
525:
519:
510:
471:Wayback Machine
459:
452:
447:
410:
405:
404:
395:
393:
380:
379:
375:
361:
360:
356:
347:
345:
335:
334:
323:
314:
312:
304:
303:
299:
284:
283:
279:
274:
265:Wayback Machine
242:medical imaging
226:
166:
150:
115:
104:
98:
95:
52:
50:
40:
28:
17:
12:
11:
5:
834:
832:
824:
823:
818:
813:
803:
802:
796:
795:
793:
792:
784:
776:
767:
765:
761:
760:
758:
757:
751:
749:
745:
744:
742:
737:
732:
730:Rick Prelinger
727:
725:Brewster Kahle
722:
720:
716:
715:
713:
712:
707:
701:
698:
697:
695:
694:
689:
687:Democracy Now!
684:
679:
673:
671:
665:
664:
662:
661:
656:
650:
648:
642:
641:
639:
638:
633:
628:
623:
618:
613:
608:
602:
600:
594:
593:
591:
590:
585:
579:
577:
568:
564:
563:
561:
560:
555:
550:
545:
540:
535:
529:
527:
521:
520:
513:
511:
509:
508:
503:
498:
493:
488:
483:
478:
473:
467:
465:
461:
460:
457:
454:
453:
448:
446:
445:
438:
431:
423:
417:
416:
409:
408:External links
406:
403:
402:
373:
354:
321:
297:
276:
275:
273:
270:
225:
222:
221:
220:
217:
214:
208:
201:
195:
189:
182:
179:
173:
165:
162:
161:
160:
157:
154:
149:
148:Specifications
146:
117:
116:
31:
29:
22:
15:
13:
10:
9:
6:
4:
3:
2:
833:
822:
819:
817:
814:
812:
809:
808:
806:
790:
789:
785:
782:
781:
777:
774:
773:
769:
768:
766:
762:
756:
753:
752:
750:
746:
741:
738:
736:
733:
731:
728:
726:
723:
721:
717:
711:
708:
706:
703:
702:
693:
692:Marion Stokes
690:
688:
685:
683:
680:
678:
675:
674:
672:
670:
666:
660:
657:
655:
652:
651:
649:
647:
643:
637:
634:
632:
629:
627:
624:
622:
619:
617:
614:
612:
609:
607:
604:
603:
601:
599:
595:
589:
586:
584:
581:
580:
578:
576:
572:
569:
565:
559:
556:
554:
551:
549:
546:
544:
541:
539:
536:
534:
531:
530:
528:
526:Collaborators
522:
517:
507:
504:
502:
499:
497:
494:
492:
489:
487:
484:
482:
479:
477:
474:
472:
469:
468:
466:
462:
455:
451:
444:
439:
437:
432:
430:
425:
424:
421:
415:
412:
411:
407:
392:on 2022-04-27
391:
387:
383:
377:
374:
369:
365:
358:
355:
344:
343:
338:
332:
330:
328:
326:
322:
311:
307:
301:
298:
293:
292:
287:
281:
278:
271:
269:
266:
261:
259:
253:
251:
247:
243:
239:
235:
230:
223:
218:
216:Easy to scale
215:
213:
209:
206:
202:
200:
196:
193:
190:
187:
183:
180:
178:
174:
171:
170:
169:
163:
158:
155:
152:
151:
147:
145:
143:
139:
135:
131:
123:
113:
110:
102:
99:December 2012
91:
88:
84:
81:
77:
74:
70:
67:
63:
60: –
59:
55:
54:Find sources:
48:
44:
38:
37:
32:This article
30:
26:
21:
20:
786:
778:
770:
735:David Rumsey
524:Partners and
481:Open Library
475:
394:. Retrieved
390:the original
385:
376:
357:
346:. Retrieved
340:
313:. Retrieved
309:
300:
289:
280:
262:
254:
231:
227:
207:per petabyte
167:
133:
129:
128:
105:
96:
86:
79:
72:
65:
53:
41:Please help
36:verification
33:
740:Jason Scott
677:NASA Images
583:NASA Images
567:Collections
486:NASA Images
244:providers,
805:Categories
496:Archive-It
396:2021-11-09
348:2023-07-10
315:2023-07-10
272:References
192:Colocation
69:newspapers
626:Microfilm
212:mirroring
58:"PetaBox"
755:Heritrix
748:Software
705:Software
659:LibriVox
464:Projects
386:PCMag UK
194:friendly
188:standard
142:petabyte
764:Related
682:FedFlix
476:PetaBox
224:History
134:Petabox
130:PetaBox
83:scholar
791:(2023)
783:(2019)
775:(2004)
719:People
164:Design
85:
78:
71:
64:
56:
669:Video
646:Audio
598:Texts
575:Image
501:SFlan
258:Linux
186:Linux
90:JSTOR
76:books
291:CNET
177:rack
62:news
45:by
807::
384:.
366:.
339:.
324:^
308:.
288:.
260:.
248:,
442:e
435:t
428:v
399:.
370:.
351:.
318:.
294:.
112:)
106:(
101:)
97:(
87:·
80:·
73:·
66:·
39:.
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.