Knowledge (XXG)

Help:Using archive.today


This help page is a how-to guide. It explains concepts or processes used by the Knowledge (XXG) community. It is not one of Knowledge (XXG)'s policies or guidelines, and may reflect varying levels of consensus.

Shortcuts: WP:ARCHIVEIS, WP:ARCHIVETODAY

archive.today is an on-demand web archiving service at https://archive.today. A web archiving service allows Knowledge (XXG) editors to reduce link rot by preserving a copy of an online source that can be accessed if the original page is moved, changes, or disappears. Not all web pages can be archived using archive.today.

archive.today can archive HTML web pages, style sheets, JavaScript, and digital images.

.today compared to .is, .li, .fo, .ph, .vn and .md

Besides https://archive.today, the site is accessible through other domains, including https://archive.is, .li, .fo, .ph, .vn and .md.

The owner of the service has requested Knowledge (XXG) to always use the "archive.today" domain. It is a gateway that redirects to one of the final destinations (.is, .li, .fo, .ph, .vn and .md) based on load and availability, which gives archive.today the flexibility to dynamically redirect traffic to other domains/servers.

Differences from other archivers

Other web archiving services are the Wayback Machine and at least 20 other providers in use on Knowledge (XXG). Similar to archive.today, the Wayback Machine takes snapshots of webpages at certain times, and it also offers user-initiated on-demand archiving called "Save Page Now" (SPN). Wayback and archive.today operate differently, and certain pages can be archived by one but not the other. Wayback is used in over 80% of instances.

Copyright and robots.txt

archive.today removes archived pages at the request of copyright holders per the U.S. DMCA; requests can be made with the "Report abuse" link on archive.today archived pages. Re-hosting U.S. copyrighted material without permission may be a violation of the U.S. Digital Millennium Copyright Act (DMCA). For this reason, to avoid implicating Knowledge (XXG) in violations of copyright laws and incurring DMCA take-down requests, archive.today should be used with some caution regarding U.S.-copyrighted content.

The history of robots.txt and archive providers is longer and more complex than this essay's focus. Briefly, the robots exclusion standard was never designed for use by archive providers. The use of robots.txt for this purpose is essentially a hack that led to unintended consequences. For example, when a domain is hijacked or changes ownership, the new owner may add a robots.txt that triggers archive providers to block the display of archives from the original site, even though the old site never had a robots.txt. Nevertheless, some archive providers agreed to use robots.txt as a method for end-users to signal when they didn't want their pages publicly archived and/or displayed (if already archived). archive.today does not abide by the robots exclusion standard. Wayback Machine formerly used it to avoid archiving material that site owners do not want archived.

Note that it can sometimes be a good idea to add multiple archive providers for key material. Multiple links can be added to Knowledge (XXG) using {{webarchive}}.

How to archive

There are several ways to submit a web page to archive.today for archiving. For new users, the website form is suggested. The other methods are better suited to those who use archive.today regularly.

Website form

This method is easy to use. It requires going to the archive.today website to archive a web page.

1. At https://archive.today/, enter the URL of the web page you wish to archive into the "My url is alive and I want to archive its content" field (the red one).
2. Click the "Submit" button. When the archiving process completes (it usually takes 5–15 seconds) you will be sent to the archived page.
3. It is recommended that you view the archived page to check if the archive process has been successful.

Android app

The Android app Share2Archive can access archive.today by means of a Share action. When viewing a Web page in an Android Web browser (e.g., Chrome, Edge, Firefox), Share it to Share2Archive, and the page archive will open in the default Web browser (not necessarily the same Web browser). If the page is already archived, the archived copy will open; otherwise, a new archive of the page will be initiated.

Browser extensions

Browser extensions that can archive and search archive.today, by means of a Toolbar icon and Context menu, are available for:

- Chrome
- Edge
- Firefox

Bookmarklet

Note: Bookmarklets have been deprecated in favor of Browser extensions.

A bookmarklet is a web browser bookmark that performs a certain function. The archive.today bookmarklet, when clicked, takes the URL of the page you are currently looking at and submits it to archive.today for archiving. This method is straightforward to set up, and is convenient. It is recommended that you have your Bookmarks/Favorites bar visible or at least have your bookmarks accessible within a click or two. This method only allows you to archive the page you are currently viewing. To archive a different web page you will have to use another method.

To set up the bookmarklet, first create a bookmark for any page. Then follow the next two steps to change it to work.

1. Change or enter the name for the bookmark (e.g. archive.today).
2. Change or enter the following into the Location field:

   javascript:void(open('https://archive.today/submit/?url='+encodeURIComponent(document.location)))

To use the bookmarklet, simply click on it when you are on a web page you wish to archive. It initiates the archiving process. When the process is complete (it usually takes 5–15 seconds) you will be sent to the archived page.

It is recommended that you view the archived page to check if the archive process was successful.

Firefox smart keyword

Firefox smart keywords are commonly used to perform searches through the Firefox address bar or to open a bookmark by typing a keyword into the Firefox address bar. Here we are going to use a smart keyword to submit a URL to archive.today for archiving. The key steps are:

1. Create a bookmark.
2. Set the bookmark URL to https://archive.today/?run=1&url=%s.
3. Set the bookmark keyword to something short you'll type in the address bar before the URL, e.g. a.

Below are the detailed steps. To set up the smart keyword:

1. Hit Ctrl+Shift+O to open the Bookmarks library (or click ☰ in the top right of the window, then "Bookmarks", then "Manage bookmarks").
2. Browse to a location you would like to save the smart keyword bookmark in (it should not be visible in the bookmarks toolbar).
3. In the menu at the top of the window, click "Organize", then "Add Bookmark".
4. Enter a name for the bookmark (e.g. archive.today).
5. Enter https://archive.today/?run=1&url=%s into the Location field. %s stands for the string that follows the keyword.
6. Enter a keyword for the bookmark. You should choose something short (e.g. a) and this keyword must not already be used for another bookmark.
7. Click the "Add" button. Close the Bookmarks Library.

To use the smart keyword:

1. Add the keyword you chose ("a" in the above example) followed by a space (" ") in front of the URL of the web page you would like to archive in the Firefox address bar (e.g. if you are using "a" as your keyword, the text in the address bar would be a http://www.example.com/pageyouwantoarchive.html).
2. Hit Enter. This starts the archiving process. When it completes (this usually takes 15–30 seconds) you will be sent to the archived page.
3. It is recommended that you view the archived page to check if the process has been successful.

Chrome search engine

Although this is created through Chrome's search engine feature, it functions just like a smart keyword in Firefox. This method is moderately simple to set up.

To set up the "search engine", right click the address bar and select "Edit search engines...". At the bottom of the list that comes up, you can add a "search engine".

1. Enter a name for the "search engine" in the first field (e.g. archive.today).
2. Enter a keyword for the "search engine" in the second field. You should choose something short, and this keyword must not already be used (e.g. wc).
3. Enter https://archive.today/?run=1&url=%s& into the third field.
4. Hit Enter to save the "search engine".

To use the "search engine":

1. Add the keyword you chose ("wc" in the above example) followed by a space (" ") in front of the URL of the web page you would like to archive in the Chrome address bar (e.g. if you are using "wc" as your keyword, the text in the address bar would be wc http://www.example.com/pageyouwantoarchive.html).
2. Hit Enter. You will be sent to a page containing a link to the archive URL of the web page you wished to archive.
3. It is recommended that you view the archived page to check if the archive process has been successful.

Use within Knowledge (XXG)

Links archived with archive.today should appear in long format. (See Knowledge (XXG) talk:Using archive.today § RfC: Should we use short or long format URLs?)

An example long format:

   https://archive.today/YYYYMMDDhhmmss/http://www.example.com

This archive URL can be inserted into the archive-url= parameter, together with its supporting archive-date= and url-status= parameters, in any of the citation templates. If the original URL is no longer accessible, the url-status parameter value should be set to dead. If the original URL is still accessible, the url-status parameter value should be set to live.

   <ref>{{cite web |last= |first= |title= |work= |publisher= |date= |url= |archive-url= |archive-date= |url-status= }}</ref>

Searching for previously archived web pages

Web pages previously archived through archive.today are accessible through a searchable database. Users may search by URL, domain or their wildcards.

Consensus

The request for comment (RfC) held at Knowledge (XXG):Archive.is RFC 4 ended in June 2016 with consensus to remove archive.is from the blacklist. The previous consensus, established earlier at Knowledge (XXG):Archive.is RFC 3, was to blacklist links to archive.today, as soon as all the existing links were removed.

See also: MediaWiki talk:Spam-blacklist/archives/December 2013 § archive.is

See also

- Knowledge (XXG):Link rot, how-to guide for prevention of link rot
- Knowledge (XXG):Using the Wayback Machine, how-to guide
- Knowledge (XXG):Using WebCite, how-to guide
- Talk:Perma.cc § Perma.cc and Knowledge (XXG), about using Perma.cc

References

1. "FAQ". "A page may not be archived for a number of reasons. archive.today does not support archiving Portable Document Format files, audio and video. The page may be too big (there is 50mb limit for a single page). The content may be inaccessible from the archive.today network (this is particularly likely if you are attempting to access subscription based content which your institution subscribes to on its users' behalf). Also, the content may be unreadable by the archive.today archiver (too complex JavaScript based pages can crash its browser or be executed too long time, or ones involving browser checks sometimes cause our archive engine to fail). … Pages which violate our hoster's rules (cracks, porn, etc) may be deleted. Also, completely empty pages (or pages which have nothing but text like "502 Server Timeout") may be deleted."
2. Harihareswara, Sumana (3 September 2013). "Wikitech-l – format of Recent Changes feed". Wikimedia.org technical mail list. Archived from the original on 26 October 2013.
3. "Save Pages in the Wayback Machine". Internet Archive. 2018. Archived from the original on 14 July 2020. Retrieved 19 May 2021. "Save Page Now: Put a URL into the form, press the button, and we save the page. You will instantly have a permanent URL for your page. Please note, this method only saves a single page, not the whole site."
4. "How can I delete an archived page". Blog. 24 January 2013. Archived from the original on 26 September 2013.
5. Dascalescu, Dan (18 February 2013). "Web page archiving". Wiki. Dan Dascalescu. Archived from the original on 22 September 2013.
6. "Robots.txt meant for search engines don't work well for web archives".
7. "Removing Documents From the Wayback Machine". Archived from the original on 15 October 2002.
8. "Some sites are not available because of robots.txt or other exclusions. What does that mean?". FAQ. Archived from the original on 4 October 2002.
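The bookmarklet and the smart keyword / search engine methods described on this page both come down to building a URL that carries the page address as a query parameter. The sketch below reproduces that in plain JavaScript. The function names `buildSubmitUrl` and `expandKeyword` are hypothetical and exist only for illustration; the two URL patterns are copied from this page. Percent-encoding the `%s` replacement is an assumption made here for simplicity and may not match exactly what a given browser does.

```javascript
// Sketch only: helper names are hypothetical; URL patterns are taken from this page.

// Mirrors the bookmarklet: the page URL is percent-encoded and appended
// to the archive.today submit address.
function buildSubmitUrl(pageUrl) {
  return 'https://archive.today/submit/?url=' + encodeURIComponent(pageUrl);
}

// Mirrors a smart keyword or custom search engine: the text typed after
// the keyword replaces the %s placeholder in the stored URL.
// Assumption: the replacement text is percent-encoded before substitution.
function expandKeyword(template, typedText) {
  return template.replace('%s', encodeURIComponent(typedText));
}

const page = 'http://www.example.com/pageyouwantoarchive.html';
console.log(buildSubmitUrl(page));
console.log(expandKeyword('https://archive.today/?run=1&url=%s', page));
```

Typing `a http://www.example.com/pageyouwantoarchive.html` in the address bar corresponds to calling `expandKeyword` with everything after the keyword and the space.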

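The long-format archive URL and the url-status rule described on this page can be sketched as two small helpers. This is an illustration under stated assumptions, not an official tool: the function names are hypothetical, the timestamp is assumed to be in UTC (this page does not say which time zone `YYYYMMDDhhmmss` uses), and the parameter names follow the `{{cite web}}` template shown on this page.

```javascript
// Sketch only: function names are hypothetical.

// Formats a Date as YYYYMMDDhhmmss.
// Assumption: UTC fields are used; the page does not state the time zone.
function formatTimestamp(date) {
  const pad = (n) => String(n).padStart(2, '0');
  return String(date.getUTCFullYear()) +
    pad(date.getUTCMonth() + 1) +
    pad(date.getUTCDate()) +
    pad(date.getUTCHours()) +
    pad(date.getUTCMinutes()) +
    pad(date.getUTCSeconds());
}

// Long format: https://archive.today/YYYYMMDDhhmmss/<original URL>
function buildLongArchiveUrl(date, originalUrl) {
  return 'https://archive.today/' + formatTimestamp(date) + '/' + originalUrl;
}

// url-status is "dead" when the original URL is no longer accessible,
// and "live" when it still is.
function citationParams(originalUrl, archiveUrl, archiveDate, originalIsAccessible) {
  const status = originalIsAccessible ? 'live' : 'dead';
  return '|url=' + originalUrl +
    ' |archive-url=' + archiveUrl +
    ' |archive-date=' + archiveDate +
    ' |url-status=' + status;
}

const when = new Date(Date.UTC(2021, 4, 19, 8, 5, 9));
const archived = buildLongArchiveUrl(when, 'http://www.example.com');
console.log(archived);
console.log(citationParams('http://www.example.com', archived, '19 May 2021', false));
```

In practice the long-format URL is normally copied from the archived page itself; building it by hand only makes sense when the exact snapshot timestamp is already known.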

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.