Knowledge (XXG):Bots/Requests for approval/WildBot 3

Thus far 25 edits have been made, and I've discovered a number of things. Turns out that pywikipedia has code to detect valid section references - but it doesn't work correctly when there's markup; the common case being an article link in a section header. People put all kinds of crazy stuff into
Bandwidth; I just don't have it. Running the bot as is consumes a solid 20% of the bandwidth I have available. Unless I get a Toolserver account, recent changes or a database scan is off the cards. Additionally, I've got plans to make the bot smarter and more helpful, so I don't want to bomb every
Function details: At the same time as checking new pages' wiki markup for links to dab pages, the bot will also check for links containing a #section anchor to ensure the anchor appears on the target page. Normally this is a section heading, but there are techniques available (templates like {{Anchor}}
section headers. I won't bore you with the stories. I seem to have bitten off quite a large, chewy part of the world. The internal re-coding has been shaken out, so I'll soon be tidying up the code and running that in production. Josh Parris 13:57, 27 January 2010 (UTC)
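One way to picture the markup problem mentioned here: a heading such as `== History of [[New York]] ==` has to be reduced to its displayed text before it can be compared with a #section anchor. The following is only a rough sketch under that assumption; the function name and regular expressions are hypothetical and are not taken from pywikipedia or WildBot, and real headings can contain markup this does not handle.

```python
import re

# Hypothetical helpers for illustration only.
# [[Target|Label]] -> Label, [[Target]] -> Target
LINK = re.compile(r"\[\[(?:[^\]|]*\|)?([^\]]*)\]\]")
# ''italic'' and '''bold''' quote runs
FORMATTING = re.compile(r"'{2,}")
# == Heading == with matching runs of 1 to 6 equals signs
HEADING = re.compile(r"^(={1,6})\s*(.*?)\s*\1\s*$")


def heading_anchor_text(line):
    """Return the displayed text of a wikitext heading, or None if not a heading."""
    match = HEADING.match(line)
    if not match:
        return None
    text = match.group(2)
    text = LINK.sub(r"\1", text)      # keep only the visible part of links
    text = FORMATTING.sub("", text)   # drop bold/italic quote markup
    return re.sub(r"\s+", " ", text).strip()
```

Comparing the result of `heading_anchor_text` with the fragment of a link is then a plain string comparison, which is where headings containing links would otherwise be missed.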
Coding... adding this functionality has demanded a substantial internal redesign for WildBot, as it's no longer making one edit to a talk page (at least, not internally). The hard figures above were produced by a very rough draft. Josh Parris 22:20, 24 January 2010 (UTC)
If I might add, this has been terribly buggy. I'm going to be keeping a very close eye on it in its early life; the multitude of problems that turned up during the trial hasn't endeared the code to me. Josh Parris 11:48, 29 January 2010 (UTC)
Damn, one other thing. It is generally frowned upon for bots to download the HTML markup. If I may suggest a more server-friendly version: use http://en.wikipedia.org/search/?title=<title: -->
Isn't there an inline template for this, similar to {{deadlink}} or something? If there is, it would certainly be more helpful. Would you mind telling me why you think this should be limited to new pages? You could do a dump scan for the whole project. Tim1357 (talk) 16:57, 17 January 2010 (UTC)
I've had a look; there's nothing for inline work. It may be inappropriate to inline too, because the link still kind-of works; it just goes to the target page rather than a part thereof. Josh Parris 00:58, 18 January 2010 (UTC)
I was thinking along these lines but couldn't think of a reason not to check the other namespaces the bot currently patrols. What difficulties do you foresee outside of mainspace? Josh Parris 22:20, 17 January 2010 (UTC)
You could get a Toolserver account if you'd like; it would probably help with the running of your bot and it's not very difficult to get one if you can demonstrate need. @harej 00:34, 18 January 2010 (UTC)
and raw HTML tags) which create an anchor without a ==section==; to detect these cases, the HTML of the target page will be downloaded and searched for these anchors.
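The check described in these function details could be sketched roughly as follows, assuming the target page's rendered HTML has already been fetched. The class and function names are hypothetical, and the rule that spaces in a #section link appear as underscores in the rendered `id` is an assumption about how the wiki writes anchors; this is not WildBot's actual code.

```python
from html.parser import HTMLParser


class AnchorCollector(HTMLParser):
    """Collect every id="..." value and every <a name="..."> value in a page."""

    def __init__(self):
        super().__init__()
        self.anchors = set()

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if value and (name == "id" or (tag == "a" and name == "name")):
                self.anchors.add(value)


def normalise(fragment):
    # Assumption: rendered anchors use underscores where the link has spaces.
    return fragment.strip().replace(" ", "_")


def anchor_exists(html, fragment):
    """Return True if the #section fragment matches an anchor in the HTML."""
    collector = AnchorCollector()
    collector.feed(html)
    return normalise(fragment) in collector.anchors
```

Because the lookup works on the rendered page, it treats anchors created by section headings, by anchor templates and by raw HTML tags alike, which is the reason given above for downloading the HTML at all.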
a seeded group of #section checking with nine hits. The rest of the results are going to be spread out through the normal run of WildBot. There's code to limit it to 50 #section edits per run. Josh Parris 01:01, 26 January 2010 (UTC)
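A per-run cap like the one mentioned here might look something like this. It is only a sketch: the constant and function names are hypothetical, and how WildBot actually queues and defers its work is not shown in this discussion.

```python
from itertools import islice

# The limit of 50 comes from the comment above; the name is hypothetical.
MAX_SECTION_EDITS_PER_RUN = 50


def take_for_this_run(pending, limit=MAX_SECTION_EDITS_PER_RUN):
    """Split pending items into those handled in this run and those deferred."""
    iterator = iter(pending)
    now = list(islice(iterator, limit))  # at most `limit` items
    later = list(iterator)               # everything else waits for a later run
    return now, later
```

Anything returned in the second list would simply be picked up by subsequent runs, which matches the idea of spreading the remaining results over the bot's normal operation.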
My reasoning is that there really is no need for notifications outside of the mainspace. Plus there are no "talk" pages for talk pages, if you know what I mean.
Estimated number of pages affected: I'd guess less than 5% of new pages have #section links, and perhaps 20% of those would be wrong. At 1000 new pages a day, this would be about 10 edits.
Ok, if you really need to download the HTML, that's fine. I just thought the templates=expand bit would be helpful; I myself just found out about it. Tim1357 (talk) 00:55, 18 January 2010 (UTC)
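For reference, the request suggested in this thread could be assembled along these lines. The `action=raw` and `templates=expand` parameters are the ones named in the discussion; the function name and the `base` argument are hypothetical, and the base URL has to be supplied by the caller because the address in the original comment is incomplete.

```python
from urllib.parse import urlencode


def raw_expanded_url(base, title):
    """Build a URL asking for a page's raw wikitext with templates expanded."""
    query = urlencode({
        "title": title,          # page to fetch
        "action": "raw",         # raw wikitext rather than rendered HTML
        "templates": "expand",   # expand templates, as suggested above
    })
    return f"{base}?{query}"
```

Calling `raw_expanded_url("http://en.wikipedia.org/search/", "New York")` yields a single URL whose response can then be scanned for headings and anchors without parsing the full HTML page.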
Hard figures show: 4% of new pages have #section links, and 32.5% of these are wrong. At 1000 new pages a day, this would be about 13 edits/day.
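The arithmetic behind both estimates is the same product: new pages per day, times the share with #section links, times the share of those that are broken. A small worked check, with a hypothetical function name and the initial guess taken at its upper bounds of 5% and 20%:

```python
def expected_edits_per_day(new_pages_per_day, share_with_section_links, share_broken):
    """Expected number of pages per day needing a broken-#section notice."""
    return new_pages_per_day * share_with_section_links * share_broken


# Initial guess from the request: under 5% of pages, perhaps 20% of those wrong.
guess = round(expected_edits_per_day(1000, 0.05, 0.20))

# Measured figures from the rough draft: 4% of pages, 32.5% of those wrong.
measured = round(expected_edits_per_day(1000, 0.04, 0.325))
```

Here `guess` comes to 10 and `measured` to 13, matching the two figures quoted in the request.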
Yes, I prefer your method over my API call. WildBot task 1 doesn't do talk pages, so no probs there. Toolserver account is in process. Josh Parris 01:07, 18 January 2010 (UTC)
DaB said he'll look at my application from 29th Dec on Sunday. Today's Sunday in Germany I believe. Or has it just finished? Anyway, WildBot's approval may help things along there. Josh Parris 00:53, 18 January 2010 (UTC)
That's pretty much what I've done; I called the API version (which I'm not sure, having seen your suggestion, is the best idea). Josh Parris 22:20, 17 January 2010 (UTC)
Approved for trial (50 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. @harej 03:30, 23 January 2010 (UTC)
Nice, this gets the thumbs up from me as long as this acts in the manner that the Disambiguation WildBot does. Tim1357 (talk) 01:10, 18 January 2010 (UTC)
That's OK; if you only want to do new pages, that's fine. You could look into the Toolserver idea if you want.
WildBot found one or more links in this article with broken #section; for more information on #section links see Knowledge (XXG):Linking#Piped links to sections of articles.
On another note, I would appreciate it if the bot works only in the main namespace. Tim1357 (talk) 16:57, 17 January 2010 (UTC)
Doing... The trial has commenced, with some preliminary results available in this
The broken #section links found were: Broadway#Golden years, New York#Histery
&action=raw&templates=expand. That solves the problem of the {{Anchor}} template. Tim1357 (talk) 17:06, 17 January 2010 (UTC)
The above discussion is preserved as an archive of the debate.
Function overview: Add checking of #section anchors for existence to existing bot
To request review of this BRFA, please start a new section at WT:BRFA.
Trial Edits
http://en.wikipedia.org/search/?diff=prev&oldid=340037212
http://en.wikipedia.org/search/?diff=prev&oldid=340037371
http://en.wikipedia.org/search/?diff=prev&oldid=340037392
http://en.wikipedia.org/search/?diff=prev&oldid=340037642
http://en.wikipedia.org/search/?diff=prev&oldid=340037710
http://en.wikipedia.org/search/?diff=prev&oldid=340037816
http://en.wikipedia.org/search/?diff=prev&oldid=340038259
http://en.wikipedia.org/search/?diff=prev&oldid=340038285
http://en.wikipedia.org/search/?diff=prev&oldid=340052590
http://en.wikipedia.org/search/?diff=prev&oldid=340054153
http://en.wikipedia.org/search/?diff=prev&oldid=340066511
http://en.wikipedia.org/search/?diff=prev&oldid=340066727
http://en.wikipedia.org/search/?diff=prev&oldid=340306801
http://en.wikipedia.org/search/?diff=prev&oldid=340306925
http://en.wikipedia.org/search/?diff=prev&oldid=340306956
http://en.wikipedia.org/search/?diff=prev&oldid=340459065
http://en.wikipedia.org/search/?diff=prev&oldid=340461285
http://en.wikipedia.org/search/?diff=prev&oldid=340461444
http://en.wikipedia.org/search/?diff=prev&oldid=340462031
http://en.wikipedia.org/search/?diff=prev&oldid=340462264
http://en.wikipedia.org/search/?diff=prev&oldid=340463673
http://en.wikipedia.org/search/?diff=prev&oldid=340465392
http://en.wikipedia.org/search/?diff=prev&oldid=340465459
http://en.wikipedia.org/search/?diff=prev&oldid=340483435
http://en.wikipedia.org/search/?diff=prev&oldid=340483807
http://en.wikipedia.org/search/?diff=prev&oldid=340486642
http://en.wikipedia.org/search/?diff=prev&oldid=340496044
http://en.wikipedia.org/search/?diff=prev&oldid=340523826
http://en.wikipedia.org/search/?diff=prev&oldid=340530881
http://en.wikipedia.org/search/?diff=prev&oldid=340534529
http://en.wikipedia.org/search/?diff=prev&oldid=340534637
http://en.wikipedia.org/search/?diff=prev&oldid=340535508
http://en.wikipedia.org/search/?diff=prev&oldid=340535773
http://en.wikipedia.org/search/?diff=prev&oldid=340536163
http://en.wikipedia.org/search/?diff=prev&oldid=340638024
http://en.wikipedia.org/search/?diff=prev&oldid=340638848
http://en.wikipedia.org/search/?diff=prev&oldid=340638907
http://en.wikipedia.org/search/?diff=prev&oldid=340639103
http://en.wikipedia.org/search/?diff=prev&oldid=340640770
http://en.wikipedia.org/search/?diff=prev&oldid=340641235
http://en.wikipedia.org/search/?diff=prev&oldid=340642113
http://en.wikipedia.org/search/?diff=prev&oldid=340642165
http://en.wikipedia.org/search/?diff=prev&oldid=340645876
http://en.wikipedia.org/search/?diff=prev&oldid=340690850
http://en.wikipedia.org/search/?diff=prev&oldid=340694909
Knowledge (XXG):Linking#Piped links to sections of articles
Knowledge (XXG):Linking#Checking links as they are created
broken page link in the 'pedia with a mere advisory. Josh Parris 22:20, 17 January 2010 (UTC)
The following discussion is an archived debate. Please do not modify it. The result of the discussion was Approved.

WildBot 3

Operator: Josh Parris
Automatic or Manually assisted: Automatic
Programming language(s): Python, pywikipedia
Source code available: https://svn.toolserver.org/svnroot/josh/ (revision 6)
Links to relevant discussions (where appropriate): Guideline: Knowledge (XXG):Linking#Checking links as they are created
Edit period(s): Continuous
Exclusion compliant (Y/N): Y, standard in pywikipedia
Already has a bot flag (Y/N): Y

Discussion

This is certainly a good idea. @harej 16:22, 17 January 2010 (UTC)
So in order of questions:
33 edits Josh Parris 04:33, 28 January 2010 (UTC)
Trial complete. I'll be posting links to the edits in a few hours. Josh Parris 02:45, 29 January 2010 (UTC)
Approved. Seems good to me. Tim1357 (talk) 00:54, 30 January 2010 (UTC)


Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.