Tim Starling | 1 Oct 2006 01:13
Picon
Picon
Favicon

Re: dwdiff - word by word diffs

David Gerard wrote:
> http://www.linux.com/article.pl?sid=06/09/21/1913234
> 
> Interesting. Any use for wikitext diffs, you think?

We've already got one. It's quite cool and fast. Maybe we should hawk 
ours to them rather than the other way around.

-- Tim Starling
Mark Clements | 1 Oct 2006 03:18
Picon

Re: Parsing database dumps

"Platonides" <Platonides@...> wrote in
message news:efk2vt$1gk$1 <at> sea.gmane.org...
> "Mark Clements" wrote:
> > As a separate, but related question, why is the namespace not given as
> > part
> > of the page information?
> >
> > e.g.
> >    <title>Help:Contents</title>
> >    <namespace>12</namespace>
> >    <pagetitle>Contents</pagetitle>
> >
> > Surely this would be more useful when it comes to wider application?
> >
> > - Mark Clements (HappyDog)
>
> I'd add it there as <title ns="12">Help:Contents</title>  (undefined
> parameter meaning 'old xml version', not main namespace)
> Giving title, namespace and pagetitle is redundant and should be avoided.
It
> can be several Mb for uncompressed dumps.

That's a pretty good solution, although one of the issues is that the title
includes the namespace, which needs to be removed to get the actual page
title. I feel that the <page> section should be complete in and of itself,
without requiring the header section mapping namespace names to ids. Without
knowing the mappings (ns to ns-title) that are present in the header, you
cannot interpret the title unambiguosly, for example <title ns="0">Star
Trek: The Next Generation</title> relies on the parser knowing that ns-0 is
not called 'Star Trek' in order to be interpreted properly.
(Continue reading)

Minh Nguyen | 1 Oct 2006 04:04
Gravatar

Re: Ogg Theora is too hard to play. We needYouTube-style Flash streaming video.

David Gerard wrote:
> On 30/09/06, Gregory Maxwell <gmaxwell@...> wrote:
>> On 9/30/06, Walter Vermeir <walter@...> wrote:
> 
>>> For Ogg Vorbis there is;
>>> http://www.jcraft.com/jorbis/
>>> And with that you can play a Ogg Vorbis file without a player at the
>>> client side.
> 
> 
>> I set it up weeks ago and have been able to gather fairly little
>> interest in it... thus my frustration with people insisting that we
>> must use proprietary formats for ease of use.
> 
> 
> I, and I assume most of those who saw it, thought it was an early demo
> - not something ready to roll. I also wasn't aware it ran under GCJ as
> well as proprietary Java.
> 
> As such, WHAT ARE WE WAITING FOR? ROLL IT OUT! SOMETHING USABLE! IT
> SHOULD BE ON EVERY OGG IMAGE PAGE!
> 
> (The "about" will have to mention it works fine under GCJ and is
> Free-As-In-Stallman Software all the way down.)
> 
> cc: to wikitech-l in case there's some technical hiccup to doing so
> 
> 
> - d.

(Continue reading)

Chad Perrin | 1 Oct 2006 04:18
Gravatar

Re: Ogg Theora is too hard to play. We needYouTube-style Flash streaming video.

On Sat, Sep 30, 2006 at 07:04:36PM -0700, Minh Nguyen wrote:
> 
> The Vietnamese-language projects currently link to JOrbisPlayer on all 
> its OGG description pages. [1] We use a hack in order to make this work: 
> we have [[MediaWiki:Fileinfo]] call [[MediaWiki:Playfile]], passing the 
> MIME type as a parameter. If anyone creates another OGG player on the 
> Wikimedia Toolserver, we'd be happy to link to that as well.

I'm a fan of Helix Player, myself.  I don't think there's a Windows
version of it, though.

--

-- 
CCD CopyWrite Chad Perrin [ http://ccd.apotheon.org ]
"Real ugliness is not harsh-looking syntax, but having to
build programs out of the wrong concepts." - Paul Graham
Aerik Sylvan | 1 Oct 2006 04:36

Re: Category Intersections - some early testing

Timwi  wrote:

> ...
> I thought I had mentioned before (maybe not to you) that this query is
> unnecessarily inefficient (because it scans both categories completely).
> Why do you use this adventurous approach when there is a much simpler
> and more obvious one, which also happens to be faster?
>
> SELECT c.cl_from FROM categorylinks c WHERE c.cl_from IN (
>         SELECT c2.cl_from FROM categorylinks c2
>         WHERE c2.cl_to='Fantasy_films'
> ) AND c.cl_to='Disney_films'

Showing rows 0 - 8 (9 total, Query took 0.9231 sec) <-- took longer than the
count/group by query
SQL query: SELECT c.cl_from
FROM categorylinks c
WHERE c.cl_from
IN (

SELECT c2.cl_from
FROM categorylinks c2
WHERE c2.cl_to = 'Fantasy_films'
)
AND c.cl_to = 'Disney_films'
LIMIT 0 , 30

I tried your way before, an a smaller data set, and got basically identical
results for your way and mine.  Trying your way on the larger dataset (copy
of en), it appears it may be slower than mine.  I think I recall that when
(Continue reading)

Minh Nguyen | 1 Oct 2006 07:57
Gravatar

Re: Ogg Theora is too hard to play. We needYouTube-style Flash streaming video.

Chad Perrin wrote:
> On Sat, Sep 30, 2006 at 07:04:36PM -0700, Minh Nguyen wrote:
>> The Vietnamese-language projects currently link to JOrbisPlayer on all 
>> its OGG description pages. [1] We use a hack in order to make this work: 
>> we have [[MediaWiki:Fileinfo]] call [[MediaWiki:Playfile]], passing the 
>> MIME type as a parameter. If anyone creates another OGG player on the 
>> Wikimedia Toolserver, we'd be happy to link to that as well.
> 
> I'm a fan of Helix Player, myself.  I don't think there's a Windows
> version of it, though.

Actually, the point of the links is that you don't have to download 
anything (server-side players). For other clients, we link to our audio 
help page. I suppose such a player would have to be on the Toolserver, 
since otherwise it would be remote loading.

--

-- 
Minh Nguyen <mxn@...>
[[en:User:Mxn]] [[vi:User:Mxn]] [[m:User:Mxn]]
AIM: trycom2000; Jabber: mxn@...; Blog: http://mxn.f2o.org/
brion | 1 Oct 2006 09:16
Picon
Favicon
Gravatar

MediaWiki automated test run failure 2006-10-01

An automated run of parserTests.php showed the following failures:

Running test TODO: Table security: embedded pipes
(http://mail.wikipedia.org/pipermail/wikitech-l/2006-April/034637.html)... FAILED!
Running test TODO: Link containing double-single-quotes '' (bug 4598)... FAILED!
Running test TODO: Template with thumb image (with link in description)... FAILED!
Running test Template infinite loop... FAILED!
Running test TODO: message transform: <noinclude> in transcluded template (bug 4926)... FAILED!
Running test TODO: message transform: <onlyinclude> in transcluded template (bug 4926)... FAILED!
Running test BUG 1887, part 2: A <math> with a thumbnail- math enabled... FAILED!
Running test TODO: HTML bullet list, unclosed tags (bug 5497)... FAILED!
Running test TODO: HTML ordered list, unclosed tags (bug 5497)... FAILED!
Running test TODO: HTML nested bullet list, open tags (bug 5497)... FAILED!
Running test TODO: HTML nested ordered list, open tags (bug 5497)... FAILED!
Running test TODO: Parsing optional HTML elements (Bug 6171)... FAILED!
Running test TODO: Inline HTML vs wiki block nesting... FAILED!
Running test TODO: Mixing markup for italics and bold... FAILED!
Running test TODO: 5 quotes, code coverage +1 line... FAILED!
Running test TODO: HTML Hex character encoding.... FAILED!
Running test TODO: dt/dd/dl test... FAILED!

Passed 412 of 429 tests (96.04%) FAILED!
Picon

Re: New Wikipedias created

> http://ru-sib.wikipedia.org/            Siberian/Nort Russian

Article about Siberian language was deleted from English Wikipedia as
original research (nosource, the language was created in 2005).

So it's nonsense!

--
Aleksander Sigachov
http://meta.wikimedia.orf/wiki/user:ajvol
Andre Engels | 1 Oct 2006 11:52
Picon

Re: New Wikipedias created

2006/9/30, Jens Frank <jf@...>:
> http://wuu.wikipedia.org/               Wu
> http://mzn.wikipedia.org/               Mazandarani
> http://new.wikipedia.org/               Newar / Nepal Bhasa
> http://lbe.wikipedia.org/               Lak
> http://eml.wikipedia.org/               Emilian-Romagnol / Sammarinese
> http://bxr.wikipedia.org/               Buryat (Russia)
> http://hsb.wikipedia.org/               Upper Sorbian
> http://nov.wikipedia.org/               Novial
> http://pag.wikipedia.org/               Pangasinan
> http://bar.wikipedia.org/               Bavarian
> http://bpy.wikipedia.org/               Bishnupriya Manipuri
> http://diq.wikipedia.org/               Zazaki
> http://zea.wikipedia.org/               Zealandic
> http://roa-tara.wikipedia.org/          Tarantino
> http://cbk-zam.wikipedia.org/           Zamboanga Chavacano
> http://zh-classical.wikipedia.org/      Classical Chinese
> http://cu.wikipedia.org/                Old Church Slavonic
> http://ru-sib.wikipedia.org/            Siberian/Nort Russian
> http://glk.wikipedia.org/               Gilaki
> http://cdo.wikipedia.org/               Min Dong

For my bots I would like to know:

What is the name of those languages in the languages themselves,
transliterated into Latin script.

--

-- 
Andre Engels, andreengels@...
ICQ: 6260644  --  Skype: a_engels
(Continue reading)

Filip Maljkovic | 1 Oct 2006 12:06
Picon

Re: New Wikipedias created

Andre Engels wrote:
> 2006/9/30, Jens Frank <jf@...>:
>   
>> http://wuu.wikipedia.org/               Wu
>> http://mzn.wikipedia.org/               Mazandarani
>> http://new.wikipedia.org/               Newar / Nepal Bhasa
>> http://lbe.wikipedia.org/               Lak
>> http://eml.wikipedia.org/               Emilian-Romagnol / Sammarinese
>> http://bxr.wikipedia.org/               Buryat (Russia)
>> http://hsb.wikipedia.org/               Upper Sorbian
>> http://nov.wikipedia.org/               Novial
>> http://pag.wikipedia.org/               Pangasinan
>> http://bar.wikipedia.org/               Bavarian
>> http://bpy.wikipedia.org/               Bishnupriya Manipuri
>> http://diq.wikipedia.org/               Zazaki
>> http://zea.wikipedia.org/               Zealandic
>> http://roa-tara.wikipedia.org/          Tarantino
>> http://cbk-zam.wikipedia.org/           Zamboanga Chavacano
>> http://zh-classical.wikipedia.org/      Classical Chinese
>> http://cu.wikipedia.org/                Old Church Slavonic
>> http://ru-sib.wikipedia.org/            Siberian/Nort Russian
>> http://glk.wikipedia.org/               Gilaki
>> http://cdo.wikipedia.org/               Min Dong
>>     
>
> For my bots I would like to know:
>
> What is the name of those languages in the languages themselves,
> transliterated into Latin script.
You can find most of them here: 
(Continue reading)


Gmane