sfkeller | 1 Apr 2006 22:34
Picon
Favicon
Gravatar

Re: Troubles with reading Articles

Dear admins,

Let me insist and ask again about the status und plans of the toolserver: Starting on Monday our team will
start over for an interim last programming round on Wikipoint database in our spare time. 

So, I'd really like to avoid redundant programming on outdated articles (residing in dumps). 

Now, we will use WikiProxy in the hope everything works as well as it gets, right?

-- Stefan 

P.S. There is yet another potential use of our Wikipoint service coming up: There is a proposal for a 'wiki
safary across germany' which has the goal to capture pictures for georeferenced articles which have not
yet one... c.f. http://de.wikipedia.org/wiki/Wikipedia_Diskussion:Bilderw%C3%BCnsche. If they
would plan on the basis of a dump, which is always some weeks old, then they probably would gather photos
where there has been already someone...

-----Original Message-----
From: toolserver-l-bounces <at> Wikipedia.org [mailto:toolserver-l-bounces <at> Wikipedia.org] On Behalf
Of Leo Büttiker
Sent: Mittwoch, 22. März 2006 22:04
To: toolserver-l <at> wikipedia.org
Subject: [Toolserver-l] Troubles with reading Articles

Hi all,
For a toolserver-project I will read all Wikipedia (pwiki_de) articles and 
parse them for geoinformation. After some troubles I've fixed now nearly all 
bugs, but I have still some troubles with opening the articles. 

I open the article with the help of the mediawiki functions in the following 
(Continue reading)

Daniel Baur | 2 Apr 2006 14:47
Favicon

Re: Troubles with reading Articles

Hello,
Am Samstag 01 April 2006 22:34 schrieb sfkeller <at> hsr.ch:
> Dear admins,
>
> Let me insist and ask again about the status und plans of the toolserver:
> Starting on Monday our team will start over for an interim last programming
> round on Wikipoint database in our spare time.
>
> So, I'd really like to avoid redundant programming on outdated articles
> (residing in dumps).
>
> Now, we will use WikiProxy in the hope everything works as well as it gets,
> right?

you can do in that way, when you don't generate to much load neither on the 
tool- nor on the master-db-server. You shouldn't not read more then a few 
articles per minute (max 10 or so).

>
> -- Stefan
>

Sincertly 
DaB.
Daniel Kinzler | 3 Apr 2006 13:01
Picon
Favicon
Gravatar

WikiProxy caches (was: Troubles with reading Articles)

Hi all

I just want to let you know that I have an extra database for the
caching now (thanks Kate!) and can now add wikis to the cache on
request. Currently, the following wikis are cached:

commons, en, de, fr, pl, nl, it, sv, pt, es, ru.

If you need more, please tell me. Note that *all* wikis can be accessed
through WikiProxy, data from "uncached" wikis is simply passed through.

-- Daniel

--

-- 
Homepage: http://brightbyte.de

FlaBot | 3 Apr 2006 20:45

Re: WikiProxy caches (was: Troubles with reading Articles)

Hi !

Great Tool !

because of the big replag at the moment i use a mod-way to access the data :

$replag=file_get_contents("http://tools.wikimedia.de/~interiot/cgi-bin/replag?raw ");
if ($replag<7200){
$text1=file_get_contents("http://tools.wikimedia.de/~daniel/WikiSense/WikiProxy.php?wiki=".$fields[1]."&title=".$fields[4]."&rev=&go=Fetch");
$text2=file_get_contents("http://tools.wikimedia.de/~daniel/WikiSense/WikiProxy.php?wiki=".$fields[2]."&title=".$fields[4]."&rev=&go=Fetch");
}else{
$text1=file_get_contents("http://".$fields[1].".wikipedia.org/w/index.php?title=".$fields[4]."&action=raw");
$text2=file_get_contents("http://".$fields[2].".wikipedia.org/w/index.php?title=".$fields[4]."&action=raw");
}

I dont know if its a good idea that i use this hack.

Perhaps daniel should itegrate it in HIS software.

flacus

Marco Schuster | 4 Apr 2006 18:19
Picon

Re: WikiProxy caches

FlaBot schrieb:
> I dont know if its a good idea that i use this hack.
Not for the people who need to parse header files(running bots or 
similar stuff with Login-Feature; these need to capture the 
SET-COOKIE-field in HTTP reply, which file_get_contents doesn't support, 
I think)

Greets,

Marco

Daniel Kinzler | 4 Apr 2006 18:39
Picon
Favicon
Gravatar

Re: WikiProxy caches

Hi FlaBot, hi all

> Great Tool !
> 
> because of the big replag at the moment i use a mod-way to access the data :
> 
[...]
> I dont know if its a good idea that i use this hack.
> 
> Perhaps daniel should itegrate it in HIS software.

No, I don't think so. The cache has to be consistent with the databases
on the toolserver, otherwise I would not be able to match up revisions
correctly. You should only use that hack if you use the article text
only, without any data from the toolserver db...

Regards,
Daniel

--

-- 
Homepage: http://brightbyte.de

interiot | 5 Apr 2006 02:36

Re: WikiProxy caches

> No, I don't think so. The cache has to be consistent with the databases
> on the toolserver, otherwise I would not be able to match up revisions
> correctly. You should only use that hack if you use the article text
> only, without any data from the toolserver db...
> 
> Regards,
> Daniel

It's not a hack.  Caches can store multiple rev_id's of wikitext, and if you use Special:Export when
fetching the latest wikitext version, then you know what rev_id to cache the wikitext under.

I have a number of tools that use wikipedia's latest rev_id, rather than the replaged one, and I
find it to be generally useful.  As you said, you have to keep in mind that any metadata you get
from the local database may be inconsistent with the latest version of the wikitext, but I still
find that there are some tools that are better off using the latest wikitext.

-Interiot

Marco Schuster | 5 Apr 2006 21:45
Picon

Toolserver replication lag

Hi!

As you all may have noticed, the replication lag has got higher and 
higher during the last days. "Replication lag is currently 1 day, 22 
hours.".

Why is the replag so high?

Greets,
Marco

Kelly Martin | 5 Apr 2006 23:53
Picon

Re: Toolserver replication lag

Lack of karma.

Kelly

On 4/5/06, Marco Schuster <CDL-Klever <at> gmx.net> wrote:
> Hi!
>
> As you all may have noticed, the replication lag has got higher and
> higher during the last days. "Replication lag is currently 1 day, 22
> hours.".
>
> Why is the replag so high?
>
> Greets,
> Marco
> _______________________________________________
> Toolserver-l mailing list
> Toolserver-l <at> Wikipedia.org
> http://mail.wikipedia.org/mailman/listinfo/toolserver-l
>

Tim Starling | 9 Apr 2006 21:12
Picon

Interiot and pgk deputised

Interiot and pgk were hanging around on IRC complaining about toolserver
lag, so I gave them both PROCESS access in MySQL. This allows them to
see what queries are running at any given time. I'm hoping that they
will terrorise anyone responsible for running inefficient lag-inducing
read queries, and thereby improve the situation.

-- Tim Starling


Gmane