Platonides | 1 Feb 23:30
Picon

Re: Commons has reached 6 million files

Daniel Schwen wrote:
> Wow, fantastic :-(
> So this had nothing to do with the timing of the mass-upload?
> 
>> Commons has just reached 6 million files! At 10:17, January 31,
>> 2010, Sailing_on_Ullswater_-_geograph.org.uk_-_173422.jpg  became our 6
>> millionth file on Commons!

Well, the mass uploaded increased our numbers a lot. I don't think we
would have passed the 6m mark yet without that bump.
And with thousands of geograph images being uploaded, other files have
it hard to be uploaded precisely at the milestone.

5800000 File:Fukushima Pref Route Sign 0043.svg
^This was uploaded 25 January.

Geograph images uploads began 29 January:
5820000 File:Christian Jacques - Aviator.png
5830000 File:Church Street, Shirley, Southampton - geograph.org.uk -
26621.jpg
5840000 File:Marine Lake, Southport - geograph.org.uk - 10649.jpg
5850000 File:West Quay, Southampton - geograph.org.uk - 26122.jpg
5860000 File:Boswens Menhir - geograph.org.uk - 75084.jpg
5870000 File:Brie de Provins.jpg
5880000 File:Main Street, Humberstone, Leicester - geograph.org.uk -
76115.jpg
5880000 File:Old Aylesbury Fire Station - Cambridge Street -
geograph.org.uk - 41616.jpg
5890000 File:Shillahill Bridge - geograph.org.uk - 62241.jpg
5900000 File:Tumulus on Withypool Hill - geograph.org.uk - 53965.jpg
(Continue reading)

Rama Neko | 2 Feb 02:08
Picon

Re: Commons has reached 6 million files

And two of these three were copyvios.
-- Rama

On 01/02/2010, Platonides <Platonides <at> gmail.com> wrote:
> Daniel Schwen wrote:
>> Wow, fantastic :-(
>> So this had nothing to do with the timing of the mass-upload?
>>
>>> Commons has just reached 6 million files! At 10:17, January 31,
>>> 2010, Sailing_on_Ullswater_-_geograph.org.uk_-_173422.jpg  became our 6
>>> millionth file on Commons!
>
> Well, the mass uploaded increased our numbers a lot. I don't think we
> would have passed the 6m mark yet without that bump.
> And with thousands of geograph images being uploaded, other files have
> it hard to be uploaded precisely at the milestone.
>
> 5800000 File:Fukushima Pref Route Sign 0043.svg
> ^This was uploaded 25 January.
>
> Geograph images uploads began 29 January:
> 5820000 File:Christian Jacques - Aviator.png
> 5830000 File:Church Street, Shirley, Southampton - geograph.org.uk -
> 26621.jpg
> 5840000 File:Marine Lake, Southport - geograph.org.uk - 10649.jpg
> 5850000 File:West Quay, Southampton - geograph.org.uk - 26122.jpg
> 5860000 File:Boswens Menhir - geograph.org.uk - 75084.jpg
> 5870000 File:Brie de Provins.jpg
> 5880000 File:Main Street, Humberstone, Leicester - geograph.org.uk -
> 76115.jpg
(Continue reading)

Guillaume Paumier | 3 Feb 21:46
Picon
Gravatar

Typical sample of our content

Dear all,

As part of the Multimedia usability project [1], we're going to set up a 
prototype environment, similar to the "sandboxes" used by the Wikipedia 
Usability initiative [2].

Importing all of Commons into the prototype would be overkill. As a 
consequence, we're looking for a small subset of content from Commons to 
use as a sample.

We would like to have a reasonable range of:
* filetypes
* file sizes
* licenses
* metadata

We don't necessarily need a wide range of topics.

Would you have any suggestion of category (or set of categories) to use 
for this purpose?

[1] http://usability.wikimedia.org/wiki/Multimedia:About
[2] http://usability.wikimedia.org/wiki/Sandbox

--

-- 
Guillaume Paumier
Product Manager, Multimedia Usability
Wikimedia Foundation
Support Free Knowledge: http://wikimediafoundation.org/wiki/Donate
(Continue reading)

geni | 3 Feb 22:45
Picon

Re: Typical sample of our content

On 3 February 2010 20:46, Guillaume Paumier <gpaumier@...> wrote:
> Dear all,
>
> As part of the Multimedia usability project [1], we're going to set up a
> prototype environment, similar to the "sandboxes" used by the Wikipedia
> Usability initiative [2].
>
> Importing all of Commons into the prototype would be overkill. As a
> consequence, we're looking for a small subset of content from Commons to
> use as a sample.
>
> We would like to have a reasonable range of:
> * filetypes
> * file sizes
> * licenses
> * metadata
>
> We don't necessarily need a wide range of topics.
>
> Would you have any suggestion of category (or set of categories) to use
> for this purpose?
>
> [1] http://usability.wikimedia.org/wiki/Multimedia:About
> [2] http://usability.wikimedia.org/wiki/Sandbox
>
> --
> Guillaume Paumier
> Product Manager, Multimedia Usability
> Wikimedia Foundation
> Support Free Knowledge: http://wikimediafoundation.org/wiki/Donate
(Continue reading)

Guillaume Paumier | 3 Feb 22:47
Picon
Gravatar

Re: Typical sample of our content

geni a écrit :
> 
> What are you looking for that hitting [[special:random/file]] as many
> times as images are needed won't provide?

Saving time?

--

-- 
Guillaume Paumier
Daniel Kinzler | 3 Feb 22:54
Picon
Favicon
Gravatar

Re: Typical sample of our content

geni schrieb:
> 
> What are you looking for that hitting [[special:random/file]] as many
> times as images are needed won't provide?

That would give a fair sample, but not necessarily a good sample. the usability
folks need "a few of each kind" for testing. getting numbers proportional to the
actual "population" of images is unnecessary and probably rather counterproductive.

So, I see two tasks:
* identifying relevant "group" of media (along the parameters guillom specified)
* sampling from  each group

-- daniel
Platonides | 3 Feb 22:49
Picon

Re: Typical sample of our content

Guillaume Paumier wrote:
> Dear all,
> 
> As part of the Multimedia usability project [1], we're going to set up a 
> prototype environment, similar to the "sandboxes" used by the Wikipedia 
> Usability initiative [2].
> 
> Importing all of Commons into the prototype would be overkill. As a 
> consequence, we're looking for a small subset of content from Commons to 
> use as a sample.
> 
> We would like to have a reasonable range of:
> * filetypes
> * file sizes
> * licenses
> * metadata
> 
> We don't necessarily need a wide range of topics.
> 
> Would you have any suggestion of category (or set of categories) to use 
> for this purpose?
> 
> [1] http://usability.wikimedia.org/wiki/Multimedia:About
> [2] http://usability.wikimedia.org/wiki/Sandbox

What are you going to test there?
For many usages you could just use commons as repository, so you
wouldn't need to "import" anything.
geni | 3 Feb 22:58
Picon

Re: Typical sample of our content

On 3 February 2010 21:54, Daniel Kinzler <daniel@...> wrote:
> geni schrieb:
>>
>> What are you looking for that hitting [[special:random/file]] as many
>> times as images are needed won't provide?
>
> That would give a fair sample, but not necessarily a good sample. the usability
> folks need "a few of each kind" for testing. getting numbers proportional to the
> actual "population" of images is unnecessary and probably rather counterproductive.
>
> So, I see two tasks:
> * identifying relevant "group" of media (along the parameters guillom specified)
> * sampling from  each group
>
> -- daniel
>

At a guess I'd say the museum categories are the best bet. Stuff like:

http://commons.wikimedia.org/wiki/Category:Science_Museum_%28London%29
http://commons.wikimedia.org/wiki/Category:British_Museum

--

-- 
geni
Guillaume Paumier | 3 Feb 23:00
Picon
Gravatar

Re: Typical sample of our content

Hi,

Platonides a écrit :
> 
> What are you going to test there?
> For many usages you could just use commons as repository, so you
> wouldn't need to "import" anything.

At some point we will work on the file description page, for example. Or 
basic editing tools such as crop / rotate etc. For these cases, we need 
"local" files.

--

-- 
Guillaume Paumier
Product Manager, Multimedia Usability
Wikimedia Foundation
Support Free Knowledge: http://wikimediafoundation.org/wiki/Donate
Daniel Kinzler | 3 Feb 23:01
Picon
Favicon
Gravatar

Re: Typical sample of our content

geni schrieb:
> On 3 February 2010 21:54, Daniel Kinzler <daniel@...> wrote:
>> geni schrieb:
>>> What are you looking for that hitting [[special:random/file]] as many
>>> times as images are needed won't provide?
>> That would give a fair sample, but not necessarily a good sample. the usability
>> folks need "a few of each kind" for testing. getting numbers proportional to the
>> actual "population" of images is unnecessary and probably rather counterproductive.
>>
>> So, I see two tasks:
>> * identifying relevant "group" of media (along the parameters guillom specified)
>> * sampling from  each group
>>
>> -- daniel
>>
> 
> At a guess I'd say the museum categories are the best bet. Stuff like:
> 
> http://commons.wikimedia.org/wiki/Category:Science_Museum_%28London%29
> http://commons.wikimedia.org/wiki/Category:British_Museum
> 

I think the museum categories are *one* interesting group of pretty uniform content.

-- daniel

Gmane