Re: acceptable usage policy?
Forwarded to Wikitech-l, that is this Wikimedia server discussion list,
please reply all when replying to this.
Scott Wheeler wrote:
> Hi folks --
>
> I'm building a proof-of-concept application that does some work on the
> Wikipedia data set. I was excited to see the announcement of the new
> API since it would much simplify things for me.
>
> However, as one recent poster pointed out, what is and isn't acceptable
> usage isn't particularly clear. I'd expect once I put up announce the
> demo that things might hit (complete guesstimation) in the ballpark of
> 10k hits per day for a couple days and then probably dropping off to a
> few hundred a day. Given that Wikipedia averages 30-50k requests per
> second, it seems that such usage would probably be rounding error
> compared to Wikipedia's load. I'd cache requests that had already come
> across on my server for speed / load reasons.
>
> But what I'd like to avoid is building this nifty demo, announcing it a
> few places and then getting the plug pulled on it. In the case of you
> know it accidentally becoming The Next Big Thing, I'd naturally move
> over to a DB dump hosted elsewhere. For clarity, my project doesn't
> have the goal of being a Wikipedia mirror, the demo is just to show how
> the software works on a big data set.
>
> What would even be fine from my side would be just a heads up from
> somebody at WP if we're pissing them off, so that we could rework things
> within a couple days to use a dump.
>
(Continue reading)