Re: Broken articles in the database of cs: Wikipedia
Jens Frank <JeLuF <at> gmx.de>
2004-11-01 05:42:53 GMT
Done.
JeLuF
On Fri, Oct 29, 2004 at 09:15:02PM +0200, Petr Kadlec wrote:
> Hi!
>
> After downloading the cs: database dump and importing it to my local
> MediaWiki installation, I played a little with the database and I have
> found some problems. We already knew about some of them; others
> were quite surprising. We would like to clean up the database, but
> I don't know if this is the right place to ask.
>
> The problems are:
>
> Article named "kniha_nahrávek" in Wikipedia namespace -- note the
> lowercase first letter, which makes the article inaccessible. This was
> a badly named entry in LanguageCs.php corresponding to the "Upload
> log". Because of that, there are many duplicate copies of the article,
> cur_ids: 237, 238, 365, 367, 457, 459, 461, 463, 487, 528, 529, 538,
> 643, 654, 656, 689, 691, 693, 698, 700, 702, 704, 708, 710, 712, 714.
> We would like to get them all deleted.
>
> Article named "kniha_nahrávek_" in Wikipedia namespace -- in addition
> to the lowercase letter, there is a space at the end. cur_id = 836;
> the article should also be deleted.
>
> " Psaní_dat" in Wikipedia namespace -- beginning with a space, which
> also makes it inaccessible (automatic redirection to "Psaní dat"),
> cur_id = 1265. The article should also be deleted.
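
The three kinds of broken title described above (lowercase first letter,
trailing space, leading space) could be checked for with something like the
following. This is only a minimal sketch, assuming a wiki that capitalizes
the first letter of titles and stores spaces as underscores; the function
name title_problems is hypothetical and is not part of MediaWiki.

```python
def title_problems(title):
    """Return the reasons a stored title is likely unreachable.

    Hypothetical helper: it assumes the wiki capitalizes the first letter
    of every title and trims surrounding spaces on lookup, so a stored
    title that violates either rule can never be requested.
    """
    problems = []
    # Spaces are stored as underscores, so treat both characters alike.
    if title[:1] in (" ", "_"):
        problems.append("leading space")
    if title[-1:] in (" ", "_"):
        problems.append("trailing space")
    # Check the first real character, ignoring any leading spaces.
    first = title.lstrip(" _")[:1]
    if first.isalpha() and first != first.upper():
        problems.append("lowercase first letter")
    return problems


if __name__ == "__main__":
    # Titles taken from the report above, plus one well-formed title.
    for stored in ["kniha_nahrávek", "kniha_nahrávek_", " Psaní_dat",
                   "Kniha_nahrávek"]:
        print(repr(stored), title_problems(stored))
```

Running such a check over the title column of a local import would give a
list of candidate rows to review by hand before asking for deletion.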