1 Dec 2005 11:02
[wishlist] [patch] Bad wikipages should be skipped
Peter Gervai <grin <at> grin.hu>
2005-12-01 10:02:30 GMT
2005-12-01 10:02:30 GMT
Hello,
I write here because SF is seemingly very down, so no bugtrack and
patchtrack available for me. You may copy it there at will.
So, there are buggy pages all around, like:
WARNING: No character set found
WARNING: No text area found on ia.wikiquote.org/w/index.php?title=Dominio%20public&action=edit.
Maybe the server is down. Retrying in 1 minutes...
I understand that it would be nice to fix 'em all, but I would
prefer the bot not blocking on them forever. I mean:
--- wikipedia.py.pregrin 2005-12-01 10:00:05.437118797 +0100
+++ wikipedia.py.postgrin 2005-12-01 10:04:56.396430481 +0100
<at> <at> -434,6 +434,9 <at> <at>
retry_idle_time *= 2
if retry_idle_time > 30:
retry_idle_time = 30
+ # faking orgasm, I mean, get outta here --g
+ output(u"DBG> enough is enough! get outta here!")
+ return "", isWatched, 'sysop'
continue
i2 = re.search('</textarea>', text).start()
if i2-i1 < 2:
And I don't even think there is a point in waiting _any time_ for a page
with no text area. I would skip it on 3rd retry, especially when the page
is retrieved fine and fast.
(Continue reading)
RSS Feed