Knowledge (XXG)

:Bots/Requests for approval/The Anomebot2 - Knowledge (XXG)

Source 📝

55:, implemented in Python, which was one of the first Knowledge (XXG) bots. I now want to register a new bot account for running a simple bot, based on the Wikipediafs Debian package and a simple Python editing script, that will add geotags to allready-existing geographical articles without geotags. As part of this work, I have compiled a list of just under 10,000 new geotags, which has already undergone spot checks for validity. The bot will check for existing tags during its run, and will not attempt to replace any existing tags. 144:'s feedback, I've now made several more improvements, with a number of other filters being used to suppress edits in cases where geotags are already present either directly or through transclusion, and better placement of the tag within the article. I think I'm ready to run. I'd like to perform another limited test run of 100, just to make sure everything works OK, if you can approve that. If manual review of those 100 edits shows no problems, I think I'll be ready to run on the full dataset, subject to approval. -- 242:
Thanks! No, I'm fine without a botflag for now, as I'm only making one edit every 30 seconds. There's lots more geodata goodness coming: I've been working on on modularizing the data preparation, in readiness for being able to process mountains, hills, railway stations, mine workings, woodlands, etc,
218:
Update: I've regenerated the input dataset using the latest dump data, and performed a few more bot edits to validate the new list, staying within my existing test allowance. I now have geodata available for more than 15,000 towns, cities and villages alone. I can easily do the same for other classes
66:
The results are sorted by country, then place, and binned into four files. They have also been compared to the data in Koordinaten_en_CSV.txt, and labelled by whether they are new coordinates (NEW), or duplicate coordinates already in articles (dup), and, if so, whether they are exact duplicates, or
100:
The bot is intended to run at a limited edit rate, and is intended to self-check its edits independently of Wikipediafs, and to stop if its edits are not saved, or differ in any way from the intended content: it should thus automatically stop if blocked, or if the Wikipediafs or the Knowledge (XXG)
96:
Where this data differs from the existing Knowledge (XXG) data, the new data has been found to be correct in almost every case: where the data was in error because of bugs in the list compilation, I fixed the bugs, and regenerated the output data to remove any similar errors.
219:
of geographic features later, using the same bot, and the same data-generation code, just by changing the GNS and category filtering parameters. Please let me know if/when I can proceed with adding the city data. --
62:
data, and rubbing vigorously. With quite cautious checks applied to both datasets, this gives an unambigious location for 12660 out of a possible 28628 (44%) articles about non-US cities, towns and villages.
274: 128:
Thank you. I've now completed testing this bot: I've checked a number of possible error scenarios, including restarts and auto-shutoff, and it seems to be working OK. -- please see
185:(as a side effect, the category and interwiki links are also sorted into the standard order after all other page content, and are sorted in alphabetical order) 205:
New feature: as requested by Eugene, the geodata tags now have region and feature codes, for example {{coor title dm|34|47|S|150|42|E|region:AU_type:city}}
67:
if not, roughly how many km out they are. (The distance calculation uses several approximations, so treat it only as an order-of-magnitude figure).
21: 177:(with a slight tweak to the edit comment string in the middle for a couple of entries). The bot seems to be working OK in this run: 111:
One week trial period approved, please limit speed to no more than 2 edit/min; and limit the test run to 100 pages or less. —
181:
Articles with categories, interwikis and Unicode characters are handled OK, and the tag is added in the correct place: see
189: 141: 247: 236: 223: 213: 163: 148: 135: 121: 105: 89: 83: 77: 71: 174: 188:
Articles marked with a variety of pre-existing geodata template styles seem to be caught now: see
58:
The following data is the result of taking Knowledge (XXG)'s category links and the public domain
45: 52: 17: 209:
Please let me know what you think, and whether you would like me to do any more testing. --
129: 169:
I've now done another few small test runs, the most recent of which started at 23:44 with
244: 220: 210: 145: 132: 102: 268: 154: 112: 233: 59: 193: 170: 202:
Blocking the bot shuts it down as soon as it detects the write error
182: 253:
The above discussion is preserved as an archive of the debate.
153:
Additional 100 article run is OK, please post results here. —
259:
Subsequent comments should be made in a new section.
39:
Subsequent comments should be made in a new section.
275:Approved Knowledge (XXG) bot requests for approval 33:The following discussion is an archived debate. 8: 7: 28: 101:servers are malfunctioning. -- 1: 232:- do you want a botflag? -- 248:15:27, 19 August 2006 (UTC) 237:15:30, 18 August 2006 (UTC) 224:13:14, 18 August 2006 (UTC) 214:23:46, 13 August 2006 (UTC) 164:15:59, 13 August 2006 (UTC) 149:15:29, 13 August 2006 (UTC) 136:11:35, 13 August 2006 (UTC) 122:16:05, 12 August 2006 (UTC) 106:02:16, 11 August 2006 (UTC) 291: 228:Looks very solid to me -- 190:Augusta, Western Australia 173:, and ended at 00:36 with 243:etc. at a later date. -- 256:Please do not modify it. 90:User:The Anome/Geodata 4 84:User:The Anome/Geodata 3 78:User:The Anome/Geodata 2 72:User:The Anome/Geodata 1 36:Please do not modify it. 51:A long time ago, I ran 230:permissions is granted 175:Berry, New South Wales 199:Restart is working OK 22:Requests for approval 142:Eugène van der Pijll 131:for the results. -- 18:Knowledge (XXG):Bots 46:User:The Anomebot2 53:User:The Anomebot 282: 258: 160: 118: 38: 290: 289: 285: 284: 283: 281: 280: 279: 265: 264: 263: 254: 159: 156: 117: 114: 92:: countries N-Z 86:: countries J-M 80:: countries G-I 74:: countries A-F 49: 34: 26: 25: 24: 12: 11: 5: 288: 286: 278: 277: 267: 266: 262: 261: 250: 207: 206: 203: 200: 197: 186: 167: 166: 157: 151: 138: 125: 124: 115: 94: 93: 87: 81: 75: 48: 43: 42: 41: 29: 27: 15: 14: 13: 10: 9: 6: 4: 3: 2: 287: 276: 273: 272: 270: 260: 257: 251: 249: 246: 241: 240: 239: 238: 235: 231: 226: 225: 222: 216: 215: 212: 204: 201: 198: 196:for examples. 195: 191: 187: 184: 180: 179: 178: 176: 172: 165: 162: 161: 152: 150: 147: 143: 139: 137: 134: 130: 127: 126: 123: 120: 119: 110: 109: 108: 107: 104: 98: 91: 88: 85: 82: 79: 76: 73: 70: 69: 68: 64: 61: 56: 54: 47: 44: 40: 37: 31: 30: 23: 19: 255: 252: 229: 227: 217: 208: 168: 155: 113: 99: 95: 65: 57: 50: 35: 32: 140:Following 245:The Anome 221:The Anome 211:The Anome 146:The Anome 133:The Anome 103:The Anome 269:Category 158:xaosflux 116:xaosflux 60:NIMA GNS 20:‎ | 194:Yerevan 234:Tawker 192:, and 171:Aparan 183:Goris 16:< 271::

Index

Knowledge (XXG):Bots
Requests for approval
User:The Anomebot2
User:The Anomebot
NIMA GNS
User:The Anome/Geodata 1
User:The Anome/Geodata 2
User:The Anome/Geodata 3
User:The Anome/Geodata 4
The Anome
02:16, 11 August 2006 (UTC)
xaosflux
16:05, 12 August 2006 (UTC)

The Anome
11:35, 13 August 2006 (UTC)
Eugène van der Pijll
The Anome
15:29, 13 August 2006 (UTC)
xaosflux
15:59, 13 August 2006 (UTC)
Aparan
Berry, New South Wales
Goris
Augusta, Western Australia
Yerevan
The Anome
23:46, 13 August 2006 (UTC)
The Anome
13:14, 18 August 2006 (UTC)

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.