Thus far 25 edits have been made, and I've discovered a number of things. Turns out that pywikipedia has code to detect valid section references - but it doesn't work correctly when there's markup; the common case being an article link in a section header. People put all kinds of crazy stuff into
303:
Bandwidth; I just don't have it. Running the bot as is consumes a solid 20% of the bandwidth I have available. Unless I get a
Toolserver account, recent changes or a database scan is off the cards. Additionally, I've got plans to make the bot smarter and more helpful, so I don't want to bomb every
220:
At the same time as checking new pages' wiki markup for links to dab pages, the bot will also check links containing a #section anchor to ensure the anchor appears on the target page. Normally this is a section heading, but there are techniques available (templates like
549:
section headers. I won't bore you with the stories. I seem to have bitten off quite a large, chewy part of the world. The internal re-coding has been shaken out, so I'll soon be tidying up the code and running that in production.
516:
adding this functionality has demanded a substantial internal redesign for WildBot, as it's no longer making one edit to a talk page (at least, not internally). The hard figures above were produced by a very rough draft.
829:
If I might add, this has been terribly buggy. I'm going to be keeping a very close eye on it in its early life; the multitude of problems that turned up during the trial haven't endeared the code to me.
384:
Damn, one other thing. It is generally frowned upon for bots to download the HTML markup. If I may suggest a more server-friendly version: use http://en.wikipedia.org/search/?title=<title: -->
289:
or something? If there is, it would certainly be more helpful. Would you mind telling me why you think this should be limited to new pages? You could do a dump scan for the whole project.
344:
I've had a look; there's nothing for inline work. It may be inappropriate to inline too, because the link still kind of works; it just goes to the target page rather than a part thereof.
I was thinking along these lines but couldn't think of a reason not to check the other namespaces the bot currently patrols. What difficulties do you foresee outside of mainspace?
313:
You could get a
Toolserver account if you'd like; it would probably help with the running of your bot and it's not very difficult to get one if you can demonstrate need.
231:
and raw HTML tags) which create an anchor without a ==section==; to detect these cases, the HTML of the target page will be downloaded and searched for these anchors.
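As a rough illustration of the HTML check described above (not WildBot's actual code), the sketch below collects every `id` attribute, plus `name` attributes on `<a>` tags, from a page's HTML and tests whether a #section target is among them. The names `AnchorCollector` and `anchor_exists` are hypothetical, and the spaces-to-underscores normalisation is a simplifying assumption about how rendered anchors are encoded; a real check would need to handle other escaped characters too.

```python
from html.parser import HTMLParser


class AnchorCollector(HTMLParser):
    """Collect every id attribute, plus name attributes on <a> tags."""

    def __init__(self):
        super().__init__()
        self.anchors = set()

    def handle_starttag(self, tag, attrs):
        for key, value in attrs:
            if value is None:
                continue
            if key == "id" or (tag == "a" and key == "name"):
                self.anchors.add(value)


def anchor_exists(html, section):
    """Return True if `section` matches an anchor found in `html`."""
    collector = AnchorCollector()
    collector.feed(html)
    # Assumption: spaces in a #section become underscores in the rendered id.
    target = section.strip().replace(" ", "_")
    return target in collector.anchors
```

Because it looks at rendered attributes rather than ==section== headings, this approach would also see anchors created by templates or raw HTML tags.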
538:
with nine hits. The rest of the results are going to be spread out throughout the normal run of WildBot. There's code to limit it to 50 #section edits per run.
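A minimal sketch of how such a per-run cap might look, assuming the pending results are held in a simple list; `split_run` and its parameters are hypothetical names, not taken from WildBot's source.

```python
def split_run(pending, limit=50):
    """Split pending #section results into this run's batch and the remainder."""
    this_run = pending[:limit]      # at most `limit` edits are made now
    carried_over = pending[limit:]  # the rest wait for a later run
    return this_run, carried_over
```

With 120 pending results and the default limit, the first run would take 50 and carry 70 over.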
436:
My reasoning is that there really is no need for notifications outside of the mainspace. Plus there are no "talk" pages for talk pages, if you know what I mean.
186:
I'd guess less than 5% of new pages have #section links, and perhaps 20% of those would be wrong. At 1000 new pages a day, this would be about 10 edits a day.
439:
OK, if you really need to download the HTML, that's fine. I just thought the templates=expand bit would be helpful; I myself just found out about it.
188:
Hard figures show: 4% of new pages have #section links, and 32.5% of these are wrong. At 1000 new pages a day, this would be about 13 edits a day.
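The arithmetic behind both figures can be checked with a short sketch. It assumes one edit per new page that contains at least one broken #section link; the function name is hypothetical.

```python
def expected_daily_edits(new_pages_per_day, share_with_section_links, share_broken):
    """Estimate daily edits, assuming one edit per page with a broken #section link."""
    return new_pages_per_day * share_with_section_links * share_broken


# Initial guess: 5% of pages have #section links, 20% of those are wrong.
guess = round(expected_daily_edits(1000, 0.05, 0.20))
# Measured in the rough draft: 4% and 32.5%.
measured = round(expected_daily_edits(1000, 0.04, 0.325))
print(guess, measured)  # 10 13
```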
Yes, I prefer your method over my API call. WildBot task 1 doesn't do talk pages, so no probs there. Toolserver account is in process.
on Sunday. Today's Sunday in
Germany I believe. Or has it just finished? Anyway, WildBot's approval may help things along there.
409:
That's pretty much what I've done; I called the API version (which I'm not sure, having seen your suggestion, is the best idea).
Approved for trial (50 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete.
Nice, this gets the thumbs up from me as long as this acts in the manner that the
Disambiguation Wildbot does.
433:
That's OK; if you only want to do new pages, that's fine. You could look into the Toolserver idea if you want.
found one or more links in this article with broken #section; for more information on #section links see
On another note, I would appreciate it if the bot worked only in the main namespace.
The trial has commenced; some preliminary results are available in this
253:
The broken #section links found were: Broadway#Golden years, New York#Histery
155:
385:&action=raw&templates=expand. That solves the problem of the
863:
The above discussion is preserved as an archive of the debate.
164:
Add checking of #section anchors for existence to existing bot
869:
To request review of this BRFA, please start a new section at
819:
http://en.wikipedia.org/search/?diff=prev&oldid=340694909
814:
http://en.wikipedia.org/search/?diff=prev&oldid=340690850
809:
http://en.wikipedia.org/search/?diff=prev&oldid=340645876
804:
http://en.wikipedia.org/search/?diff=prev&oldid=340642165
799:
http://en.wikipedia.org/search/?diff=prev&oldid=340642113
794:
http://en.wikipedia.org/search/?diff=prev&oldid=340641235
789:
http://en.wikipedia.org/search/?diff=prev&oldid=340640770
784:
http://en.wikipedia.org/search/?diff=prev&oldid=340639103
779:
http://en.wikipedia.org/search/?diff=prev&oldid=340638907
774:
http://en.wikipedia.org/search/?diff=prev&oldid=340638848
769:
http://en.wikipedia.org/search/?diff=prev&oldid=340638024
764:
http://en.wikipedia.org/search/?diff=prev&oldid=340536163
759:
http://en.wikipedia.org/search/?diff=prev&oldid=340535773
754:
http://en.wikipedia.org/search/?diff=prev&oldid=340535508
749:
http://en.wikipedia.org/search/?diff=prev&oldid=340534637
744:
http://en.wikipedia.org/search/?diff=prev&oldid=340534529
739:
http://en.wikipedia.org/search/?diff=prev&oldid=340530881
734:
http://en.wikipedia.org/search/?diff=prev&oldid=340523826
729:
http://en.wikipedia.org/search/?diff=prev&oldid=340496044
724:
http://en.wikipedia.org/search/?diff=prev&oldid=340486642
719:
http://en.wikipedia.org/search/?diff=prev&oldid=340483807
714:
http://en.wikipedia.org/search/?diff=prev&oldid=340483435
709:
http://en.wikipedia.org/search/?diff=prev&oldid=340465459
704:
http://en.wikipedia.org/search/?diff=prev&oldid=340465392
699:
http://en.wikipedia.org/search/?diff=prev&oldid=340463673
694:
http://en.wikipedia.org/search/?diff=prev&oldid=340462264
689:
http://en.wikipedia.org/search/?diff=prev&oldid=340462031
684:
http://en.wikipedia.org/search/?diff=prev&oldid=340461444
679:
http://en.wikipedia.org/search/?diff=prev&oldid=340461285
674:
http://en.wikipedia.org/search/?diff=prev&oldid=340459065
669:
http://en.wikipedia.org/search/?diff=prev&oldid=340306956
664:
http://en.wikipedia.org/search/?diff=prev&oldid=340306925
659:
http://en.wikipedia.org/search/?diff=prev&oldid=340306801
654:
http://en.wikipedia.org/search/?diff=prev&oldid=340066727
649:
http://en.wikipedia.org/search/?diff=prev&oldid=340066511
644:
http://en.wikipedia.org/search/?diff=prev&oldid=340054153
639:
http://en.wikipedia.org/search/?diff=prev&oldid=340052590
634:
http://en.wikipedia.org/search/?diff=prev&oldid=340038285
629:
http://en.wikipedia.org/search/?diff=prev&oldid=340038259
624:
http://en.wikipedia.org/search/?diff=prev&oldid=340037816
619:
http://en.wikipedia.org/search/?diff=prev&oldid=340037710
614:
http://en.wikipedia.org/search/?diff=prev&oldid=340037642
609:
http://en.wikipedia.org/search/?diff=prev&oldid=340037392
604:
http://en.wikipedia.org/search/?diff=prev&oldid=340037371
599:
http://en.wikipedia.org/search/?diff=prev&oldid=340037212
249:
Knowledge (XXG):Linking#Piped links to sections of articles
172:
Knowledge (XXG):Linking#Checking links as they are created
broken page link in the 'pedia with a mere advisory.
279:Isn't there an inline template for this, similar to
889:Approved Knowledge (XXG) bot requests for approval
576:I'll be posting links to the edits in a few hours
168:Links to relevant discussions (where appropriate):
33:The following discussion is an archived debate.
156:https://svn.toolserver.org/svnroot/josh/
7:
536:a seeded group of #section checking
183:Estimated number of pages affected:
28:
45:The result of the discussion was
264:This is certainly a good idea.
135:Automatic or Manually assisted:
1:
858:00:54, 30 January 2010 (UTC)
835:11:48, 29 January 2010 (UTC)
581:02:45, 29 January 2010 (UTC)
563:04:33, 28 January 2010 (UTC)
554:13:57, 27 January 2010 (UTC)
543:01:01, 26 January 2010 (UTC)
522:22:20, 24 January 2010 (UTC)
504:03:30, 23 January 2010 (UTC)
476:01:10, 18 January 2010 (UTC)
460:01:07, 18 January 2010 (UTC)
449:00:55, 18 January 2010 (UTC)
414:22:20, 17 January 2010 (UTC)
405:17:06, 17 January 2010 (UTC)
376:22:20, 17 January 2010 (UTC)
367:16:57, 17 January 2010 (UTC)
349:00:58, 18 January 2010 (UTC)
336:00:53, 18 January 2010 (UTC)
329:my application from 29th Dec
323:00:34, 18 January 2010 (UTC)
309:22:20, 17 January 2010 (UTC)
299:16:57, 17 January 2010 (UTC)
274:16:22, 17 January 2010 (UTC)
202:Y, standard in pywikipedia
905:
418:So in order of questions:
866:Please do not modify it.
36:Please do not modify it.
327:DaB said he'll look at
206:Already has a bot flag
22:Requests for approval
146:Python, pywikipedia
142:Programming language
18:Knowledge (XXG):Bots
193:Exclusion compliant
848:Seems good to me.
162:Function overview:
218:Function details:
574:Trial complete.
177:Edit period(s):
74:Approved BRFAs
158:(revision 6)
832:Josh Parris
590:Trial Edits
578:Josh Parris
560:Josh Parris
551:Josh Parris
540:Josh Parris
519:Josh Parris
457:Josh Parris
411:Josh Parris
373:Josh Parris
346:Josh Parris
333:Josh Parris
306:Josh Parris
179:Continuous
170:Guideline:
151:Source code
130:Josh Parris
395:template.
260:Discussion
153:available:
137:Automatic
114:rights log
104:page moves
846:Approved.
558:33 edits
514:Coding...
127:Operator:
109:block log
60:WildBot 3
883:Category
532:Doing...
284:deadlink
84:contribs
52:Approved
20: |
871:WT:BRFA
850:Tim1357
468:Tim1357
441:Tim1357
397:Tim1357
359:Tim1357
291:Tim1357
245:WildBot
41:WT:BRFA
390:Anchor
226:Anchor
500:harej
319:harej
270:harej
209:(Y/N)
197:(Y/N)
89:count
16:<
854:talk
472:talk
445:talk
401:talk
363:talk
295:talk
144:(s):
119:flag
99:logs
79:talk
69:BRFA
94:SUL
885::
856:)
474:)
447:)
403:)
393:}}
387:{{
365:)
297:)
287:}}
281:{{
229:}}
223:{{
214:Y
873:.
852:(
496:@
470:(
443:(
399:(
361:(
315:@
293:(
266:@
251:.
212::
200::
121:)
67:(
54:.
43:.