Knowledge (XXG)

User:H3llBot/ADL

Source 📝

117: 131: 147: 428:. Other error codes or failed connections are ignored. The 404 check is carried out twice within 3 days (used to be 1 day) to make sure the link is really dead and not just down for maintenance. GET (as opposed to HEAD) requests are used and redirects followed as some servers redirect to both 404 and 200 pages. 489:
A: Make sure it is not a temporary problem, often individual Wayback's servers are down. Otherwise, the page was available when it was added. Internet Archive respects robots.txt and request for content removal. So any copyright holder can contact them and ask the pages to be removed. This doesn't
461:
A: Wayback is not always reliable (in fact, it's quite unreliable most of the time with common timeouts). Often the retrieval fails at one time and succeeds for the same link at other time. Even the implemented retries and delays do not always work. Hopefully, return visits will mean fixing more
453:
A: Usually, the available copies are out of the date range the bot is comfortable using. Secondly, Wayback is not always reliable. The bot uses secondary attempts if Internet Archive returns connection errors, but even that sometimes fails. I use multiple retries and delays.
469:
A: Some web-sites don't like bots and use various ways of determining automatic processes, simplest being a check for user-agent and referrer. The bot fakes these, but even then some sites may wrongly return a 404 not found page instead of 403 forbidden as they should.
477:
A: Sometimes web-sites are temporarily down and wrongly return 404s instead of 503. Even though bot retries every dead link, it may visit within this maintenance frame. Also, preserving archive copies for live links is actually not wrong, if misleading without
497:
A: This is probably because the bot had seen that link with that access date before in another article, but has not yet checked all the links in this one. It should get back to this article eventually and mark the rest. This happens rarely.
93: 641: 709: 685: 650: 697: 691: 604: 703: 669: 584: 580: 719: 576: 654: 623: 679: 632: 389:
up to 3 month range or (2) the first archived copy after the access date up to 1 month range (used to be ±6 months). The date format is derived either from
216: 53: 644: 638: 635: 629: 620: 617: 614: 611: 599: 298: 79: 458:
Q: How many times will your bot keep coming back to the same page and making changes, can't you do them at once?
506:
This task covers several "sub-tasks", marked with the code (in edit summary or page link redirect) as follows:
46: 673: 64: 403: 393: 348: 278: 595: 212: 385:
The retrieved Wayback archive's date is either (1) the closest archived copy before the citation's
366:
comment, so it is possible to track bot added archvies. Failing that, it will mark dead links with
338: 328: 308: 72: 39: 541: 521: 440: 369: 318: 288: 258: 227: 151: 146: 268: 248: 135: 130: 89: 200: 190: 180: 170: 494:
Q: The bot only marked 1 or 2 links, but there are more dead, even from the same domain.
486:
Q: The linked Wayback page says there is no archive available! Why did bot add bad urls?
82:, used to make repetitive automated edits that would be extremely tedious to do manually. 68: 362:
to the citation (the bot will respect whitespace formatting). The bot will also add
121: 116: 17: 563:
Parse revision history to find url insertion dates when accessdate is missing
354:. The bot will attempt to retrieve the archived copy from Wayback and add 425: 421: 191:
Converting {{Wayback}} template to preceding citation's archive fields
100:
Administrators: if this bot is malfunctioning or causing harm, please
171:
Converting citation urls pointing to archived copy to archive fields
474:
Q: The link isn't dead! You added archive parameters to it anyway.
223:
of now dead links in references and citations or marking them with
547:
from citation(s) with successfully retrieved/added archived copy
236:
The bot currently only processes citation templates that have
534:
for citation(s) now dead, but with preemptively archived copy
490:
happen often and is very time consuming to verify reliably.
201:
Move links from author/editor fields to author/editorlink=
514:
to citation(s) with successfully retrieved archived copy
101: 605:
Knowledge (XXG):WikiProject_External_links/Webcitebot2
420:
Dead links are URLs whose HTTP status responses are
181:
Remove incorrect Wayback usage from citation fields
378:if it was a preemptively archived citation with 466:Q: The link isn't dead! You marked it as dead. 607:- task force of WP:EL dedicated to link repair 527:to citation(s) unable to get archived copy for 47: 8: 233:if a suitable archived copy is unavailable. 688:for replacing and archiving certain domains 575:The bot request for approval available at 217:using the Internet Archive Wayback Machine 54: 40: 161:Archiving dead links via Internet Archive 531: 511: 479: 414: 410: 386: 379: 375: 359: 355: 241: 237: 566:Use WebCite as alternative to Wayback 7: 706:for preemptively archiving new links 537:RDT – remove deadlink tag: removes 244:set. The recognized citations are: 84:The bot is approved, but currently 530:MDY – mark citation expired: set 24: 560:Check manually written references 145: 129: 115: 517:MCD – mark citation dead: adds 510:ADL – archive dead links: adds 364:<!-- Added by H3llBot --> 141:(Issues, problems, questions) 80:legitimate alternative account 1: 700:for replacing certain domains 694:for replacing certain domains 372:|bot=H3llBot}} 409:templates or the citation's 741: 711:Blevintron's BlevintronBot 672:for the same purpose bot, 198: 188: 178: 168: 158: 557:Check bare external links 144: 128: 114: 110: 29: 698:Merlissimo's MerlLinkBot 436:Q: You marked a link as 692:ThaddeusB's DeadLinkBOT 662:Other similar bot BRFAs 63:This user account is a 704:ThaddeusB's WebCiteBOT 682:for finding dead links 649:Access dates by bots: 360:|archivedate= 686:Anomie's AnomieBOT 60 532:|deadurl=yes 512:|archiveurl= 411:|accessdate= 387:|accessdate= 376:|deadurl=yes 356:|archiveurl= 242:|accessdate= 164:(ADL, MCD, MDY, RDT) 90:requests for approval 670:Tim1357's DASHBot 11 628:WebCiteBOT related: 480:|deadurl=no 380:|deadurl=no 610:Dead link related: 450:a copy on Wayback! 721:Emijrp's BOTijo 10 211:This task combats 577:WP:BRFA/H3llBot 2 299:Cite mailing list 208: 207: 106: 732: 726: 722: 716: 712: 546: 540: 533: 526: 520: 513: 481: 445: 439: 416: 412: 408: 402: 398: 392: 388: 381: 377: 373: 365: 361: 357: 353: 347: 343: 337: 333: 327: 323: 317: 313: 307: 303: 297: 293: 287: 283: 277: 273: 267: 263: 257: 253: 247: 243: 239: 232: 226: 174:(U2A, A2U, MAD) 149: 133: 119: 98: 56: 49: 42: 27: 26: 740: 739: 735: 734: 733: 731: 730: 729: 724: 720: 714: 710: 680:Ocolon's Ocobot 592: 573: 554: 544: 538: 524: 518: 504: 492: 484: 472: 464: 456: 443: 437: 434: 415:|date= 406: 400: 396: 390: 367: 363: 351: 345: 341: 335: 331: 325: 321: 315: 311: 305: 301: 295: 291: 285: 281: 275: 271: 265: 261: 255: 251: 245: 230: 224: 209: 140: 97: 88:; the relevant 83: 77: 61: 60: 35: 22: 21: 20: 12: 11: 5: 738: 736: 728: 727: 717: 707: 701: 695: 689: 683: 677: 666: 665: 663: 659: 658: 657:, no consensus 647: 626: 608: 602: 591: 590:Relevant links 588: 572: 569: 568: 567: 564: 561: 558: 553: 550: 549: 548: 535: 528: 515: 503: 500: 433: 430: 238:|url= 221:archive copies 206: 205: 196: 195: 186: 185: 176: 175: 166: 165: 157: 143: 127: 112: 111: 108: 107: 59: 58: 51: 44: 36: 31: 30: 25: 23: 15: 14: 13: 10: 9: 6: 4: 3: 2: 737: 723: 718: 713: 708: 705: 702: 699: 696: 693: 690: 687: 684: 681: 678: 675: 671: 668: 667: 664: 661: 660: 656: 652: 648: 646: 643: 640: 637: 634: 631: 627: 625: 622: 619: 616: 613: 609: 606: 603: 601: 597: 594: 593: 589: 587: 586: 582: 579:. Addendums: 578: 570: 565: 562: 559: 556: 555: 551: 543: 536: 529: 523: 516: 509: 508: 507: 501: 499: 495: 491: 487: 483: 475: 471: 467: 463: 459: 455: 451: 449: 442: 431: 429: 427: 423: 418: 405: 404:Use mdy dates 395: 394:Use dmy dates 383: 371: 350: 349:Vcite journal 340: 330: 320: 310: 300: 290: 280: 270: 260: 250: 234: 229: 222: 218: 214: 203: 202: 197: 193: 192: 187: 183: 182: 177: 173: 172: 167: 163: 162: 156: 155: 154: 148: 142: 139: 138: 132: 126: 125: 124: 118: 113: 109: 105: 103: 95: 91: 87: 81: 76: 74: 70: 66: 57: 52: 50: 45: 43: 38: 37: 34: 28: 19: 715:-- withdrawn 574: 545:}} 539:{{ 525:}} 519:{{ 505: 496: 493: 488: 485: 476: 473: 468: 465: 460: 457: 452: 447: 446:, but there 444:}} 438:{{ 435: 419: 407:}} 401:{{ 397:}} 391:{{ 384: 368:{{ 352:}} 346:{{ 342:}} 336:{{ 332:}} 326:{{ 322:}} 316:{{ 312:}} 306:{{ 302:}} 296:{{ 292:}} 286:{{ 282:}} 279:Cite journal 276:{{ 272:}} 266:{{ 262:}} 256:{{ 252:}} 246:{{ 235: 231:}} 225:{{ 220: 210: 199: 189: 179: 169: 160: 159: 153:Active tasks 152: 150: 136: 134: 122: 120: 99: 92:can be seen 85: 67:operated by 62: 32: 18:User:H3llBot 674:description 596:WP:DEADLINK 219:to provide 725:-- expired 600:WP:DEADREF 585:H3llBot 2c 581:H3llBot 2b 339:Vcite news 329:Vcite book 309:Cite video 542:dead link 522:dead link 441:Dead link 370:dead link 319:Vcite web 289:Cite book 259:Cite news 228:dead link 137:Talk page 69:Hellknowz 269:Cite web 249:Citation 213:link rot 102:block it 86:inactive 78:It is a 462:links. 417:field. 374:or set 33:H3llBot 344:, and 204:(ALA) 194:(W2A) 184:(RWF) 502:Codes 16:< 655:VP 2 651:VP 1 571:BRFA 552:TODO 358:and 240:and 123:Home 94:here 73:talk 432:FAQ 426:301 424:or 422:404 413:or 399:or 215:by 65:bot 653:, 598:, 583:, 482:. 448:is 382:. 334:, 324:, 314:, 304:, 294:, 284:, 274:, 264:, 254:, 75:). 676:. 645:6 642:5 639:4 636:3 633:2 630:1 624:5 621:4 618:3 615:2 612:1 104:. 96:. 71:( 55:e 48:t 41:v

Index

User:H3llBot
v
t
e
bot
Hellknowz
talk
legitimate alternative account
requests for approval
here
block it

Home

Talk page

Active tasks
Archiving dead links via Internet Archive
Converting citation urls pointing to archived copy to archive fields
Remove incorrect Wayback usage from citation fields
Converting {{Wayback}} template to preceding citation's archive fields
Move links from author/editor fields to author/editorlink=
link rot
using the Internet Archive Wayback Machine
dead link
Citation
Cite news
Cite web
Cite journal
Cite book

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.