55:, implemented in Python, which was one of the first Knowledge (XXG) bots. I now want to register a new bot account for running a simple bot, based on the Wikipediafs Debian package and a simple Python editing script, that will add geotags to allready-existing geographical articles without geotags. As part of this work, I have compiled a list of just under 10,000 new geotags, which has already undergone spot checks for validity. The bot will check for existing tags during its run, and will not attempt to replace any existing tags.
144:'s feedback, I've now made several more improvements, with a number of other filters being used to suppress edits in cases where geotags are already present either directly or through transclusion, and better placement of the tag within the article. I think I'm ready to run. I'd like to perform another limited test run of 100, just to make sure everything works OK, if you can approve that. If manual review of those 100 edits shows no problems, I think I'll be ready to run on the full dataset, subject to approval. --
242:
Thanks! No, I'm fine without a botflag for now, as I'm only making one edit every 30 seconds. There's lots more geodata goodness coming: I've been working on on modularizing the data preparation, in readiness for being able to process mountains, hills, railway stations, mine workings, woodlands, etc,
218:
Update: I've regenerated the input dataset using the latest dump data, and performed a few more bot edits to validate the new list, staying within my existing test allowance. I now have geodata available for more than 15,000 towns, cities and villages alone. I can easily do the same for other classes
66:
The results are sorted by country, then place, and binned into four files. They have also been compared to the data in
Koordinaten_en_CSV.txt, and labelled by whether they are new coordinates (NEW), or duplicate coordinates already in articles (dup), and, if so, whether they are exact duplicates, or
100:
The bot is intended to run at a limited edit rate, and is intended to self-check its edits independently of
Wikipediafs, and to stop if its edits are not saved, or differ in any way from the intended content: it should thus automatically stop if blocked, or if the Wikipediafs or the Knowledge (XXG)
96:
Where this data differs from the existing
Knowledge (XXG) data, the new data has been found to be correct in almost every case: where the data was in error because of bugs in the list compilation, I fixed the bugs, and regenerated the output data to remove any similar errors.
219:
of geographic features later, using the same bot, and the same data-generation code, just by changing the GNS and category filtering parameters. Please let me know if/when I can proceed with adding the city data. --
62:
data, and rubbing vigorously. With quite cautious checks applied to both datasets, this gives an unambigious location for 12660 out of a possible 28628 (44%) articles about non-US cities, towns and villages.
274:
128:
Thank you. I've now completed testing this bot: I've checked a number of possible error scenarios, including restarts and auto-shutoff, and it seems to be working OK. -- please see
185:(as a side effect, the category and interwiki links are also sorted into the standard order after all other page content, and are sorted in alphabetical order)
205:
New feature: as requested by Eugene, the geodata tags now have region and feature codes, for example {{coor title dm|34|47|S|150|42|E|region:AU_type:city}}
67:
if not, roughly how many km out they are. (The distance calculation uses several approximations, so treat it only as an order-of-magnitude figure).
21:
177:(with a slight tweak to the edit comment string in the middle for a couple of entries). The bot seems to be working OK in this run:
111:
One week trial period approved, please limit speed to no more than 2 edit/min; and limit the test run to 100 pages or less. —
181:
Articles with categories, interwikis and
Unicode characters are handled OK, and the tag is added in the correct place: see
189:
141:
247:
236:
223:
213:
163:
148:
135:
121:
105:
89:
83:
77:
71:
174:
188:
Articles marked with a variety of pre-existing geodata template styles seem to be caught now: see
58:
The following data is the result of taking
Knowledge (XXG)'s category links and the public domain
45:
52:
17:
209:
Please let me know what you think, and whether you would like me to do any more testing. --
129:
169:
I've now done another few small test runs, the most recent of which started at 23:44 with
244:
220:
210:
145:
132:
102:
268:
154:
112:
233:
59:
193:
170:
202:
Blocking the bot shuts it down as soon as it detects the write error
182:
253:
The above discussion is preserved as an archive of the debate.
153:
Additional 100 article run is OK, please post results here. —
259:
Subsequent comments should be made in a new section.
39:
Subsequent comments should be made in a new section.
275:Approved Knowledge (XXG) bot requests for approval
33:The following discussion is an archived debate.
8:
7:
28:
101:servers are malfunctioning. --
1:
232:- do you want a botflag? --
248:15:27, 19 August 2006 (UTC)
237:15:30, 18 August 2006 (UTC)
224:13:14, 18 August 2006 (UTC)
214:23:46, 13 August 2006 (UTC)
164:15:59, 13 August 2006 (UTC)
149:15:29, 13 August 2006 (UTC)
136:11:35, 13 August 2006 (UTC)
122:16:05, 12 August 2006 (UTC)
106:02:16, 11 August 2006 (UTC)
291:
228:Looks very solid to me --
190:Augusta, Western Australia
173:, and ended at 00:36 with
243:etc. at a later date. --
256:Please do not modify it.
90:User:The Anome/Geodata 4
84:User:The Anome/Geodata 3
78:User:The Anome/Geodata 2
72:User:The Anome/Geodata 1
36:Please do not modify it.
51:A long time ago, I ran
230:permissions is granted
175:Berry, New South Wales
199:Restart is working OK
22:Requests for approval
142:Eugène van der Pijll
131:for the results. --
18:Knowledge (XXG):Bots
46:User:The Anomebot2
53:User:The Anomebot
282:
258:
160:
118:
38:
290:
289:
285:
284:
283:
281:
280:
279:
265:
264:
263:
254:
159:
156:
117:
114:
92:: countries N-Z
86:: countries J-M
80:: countries G-I
74:: countries A-F
49:
34:
26:
25:
24:
12:
11:
5:
288:
286:
278:
277:
267:
266:
262:
261:
250:
207:
206:
203:
200:
197:
186:
167:
166:
157:
151:
138:
125:
124:
115:
94:
93:
87:
81:
75:
48:
43:
42:
41:
29:
27:
15:
14:
13:
10:
9:
6:
4:
3:
2:
287:
276:
273:
272:
270:
260:
257:
251:
249:
246:
241:
240:
239:
238:
235:
231:
226:
225:
222:
216:
215:
212:
204:
201:
198:
196:for examples.
195:
191:
187:
184:
180:
179:
178:
176:
172:
165:
162:
161:
152:
150:
147:
143:
139:
137:
134:
130:
127:
126:
123:
120:
119:
110:
109:
108:
107:
104:
98:
91:
88:
85:
82:
79:
76:
73:
70:
69:
68:
64:
61:
56:
54:
47:
44:
40:
37:
31:
30:
23:
19:
255:
252:
229:
227:
217:
208:
168:
155:
113:
99:
95:
65:
57:
50:
35:
32:
140:Following
245:The Anome
221:The Anome
211:The Anome
146:The Anome
133:The Anome
103:The Anome
269:Category
158:xaosflux
116:xaosflux
60:NIMA GNS
20: |
194:Yerevan
234:Tawker
192:, and
171:Aparan
183:Goris
16:<
271::
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.