Ingo Frost | 9 Jul 2005 01:56
Picon

Re: WSJ on Wikipedia

Dear Andrew Lih,
dear scientific community,

I am a bit disappointed about the available material
that tries to measure the quality of Wikipedia articles.

The quoted newspaper article of the Wall street journal
for example just analyses technical topics but it would
be a dangerous claim to assume that quality is equally
distributed over the different fields and topics.
But you need that claim as condition for the method
of randomly picking articles and conclude for the rest.

There was another attempt to compare the Quality of
Wikipedia with other Encyclopedias in the German Computer
Newspaper C't with the same random approach.

(1) But there is a problem since it is a random way of
choosing articles to compare or to analyse. I see
some problems in non technical fields such as soft
sciences (in social science for example every theory
on society redefines all concepts of society on it's
own: how can an encyclopedia claim to have a definition?).

(2) Political terms are sometimes very complex topics
where the NPOV may not work, because there is no
right nor wrong. It is often a question of opinion
and majority that sometimes changes reality.
I observed a discussion and an edit war on the article
about Direct Democracy (in the Germen Wikipedia:
(Continue reading)

Jakob Voss | 9 Jul 2005 07:50
Picon
Gravatar

Re: Re: WSJ on Wikipedia

Ingo Frost wrote:

> Dear Andrew Lih,
> dear scientific community,
> 
> I am a bit disappointed about the available material
> that tries to measure the quality of Wikipedia articles.
> 
> The quoted newspaper article of the Wall street journal
> for example just analyses technical topics but it would
> be a dangerous claim to assume that quality is equally
> distributed over the different fields and topics.
> But you need that claim as condition for the method
> of randomly picking articles and conclude for the rest.

Journalists aren't searching for knowledge - they just want
to tell a story.

> My question: Is there a scientific study on the
> quality of the Wikipedia ariticles? Does anyone
> work on that problems? What methods could be used
> to analyse the Quality?

The only serious and promising attemt I know of is Andreas Brändle's 
Masters thesis:
* http://en.wikibooks.org/wiki/Wikimania05/Paper-AB1
* http://editthispage.blogspot.com/

Ulrich Fuchs did a little test on vandalising articles:
http://de.wikipedia.org/wiki/Wikipedia:Wikipedistik/Vandalismusanf%C3%A4lligkeit
(Continue reading)

Joachim Schroer | 12 Jul 2005 15:55
Picon

Announcement: Survey study on the motivation of contributors to Wikipedia

Dear all,

we're a team of organizational psychologists at the University of
Wuerzburg (Germany), and at the moment we're conducting a survey study
on the motivation of contributors to Wikipedia.

The study is available at:
<http://www.unipark.de/uc/wikipedia/>

Our project page is here:
<www.psychologie.uni-wuerzburg.de/ao/research/wikipedia.php?lang=en>

I have already sent the announcement to the WikiEN list:
<http://mail.wikipedia.org/pipermail/wikien-l/2005-July/026281.html>

The questionnaire is in English, but participants from other Wikipedias
are of course invited to take part as well. Do you have ideas how to
make the survey known in other projects?

We'd also be happy...
- if you could forward this link to other people who might be
  interested, but do not read the mailing lists, and
- if you might put up links to our survey where they fit.

Thank you!
Best wishes from Wuerzburg,

Joachim Schroer

--

-- 
(Continue reading)

Anthere | 12 Jul 2005 19:35
Picon
Favicon

Fwd: [Wiki-research-l] Announcement: Survey study on the motivation of contributors to Wikipedia

Hello

Best is to forward it to Wikipedia-l

http://www.unipark.de/uc/wikipedia/

Cheers

Ant

--- Joachim Schroer
<schroer@...> wrote:

> Date: Tue, 12 Jul 2005 15:55:42 +0200
> From: Joachim Schroer
> <schroer@...>
> To: wiki-research-l@...
> Subject: [Wiki-research-l] Announcement: Survey
> study on the motivation of	contributors to Wikipedi
> 
> Dear all,
> 
> we're a team of organizational psychologists at the
> University of
> Wuerzburg (Germany), and at the moment we're
> conducting a survey study
> on the motivation of contributors to Wikipedia.
> 
> The study is available at:
> <http://www.unipark.de/uc/wikipedia/>
(Continue reading)

Andrew Lih | 29 Jul 2005 08:03
Picon
Gravatar

Pre-Wikimania - Research BOAF

Hi all,
 
Hope to see many of you at Wikimania next week (yes, it's only one week away).
 
I want to propose some time is carved out for a BOAF session for wiki researchers. Seems Friday and Sunday eves are free, or it could be Thursday before things get started.
 
Here are some issues I'd love to talk to other folks about, please feel free to add:
 
1. Heuristics for recognizing patterns in edit histories. Most pressing is an algorithm to determine what constitutes an edit war, vandalism or any other type of "noise" in the system if one's measuring "substantive" edits. (This is hard - even the "I'll know it when I see it" method is problematic, as evidenced by the recent dispute with and departure of RickK.) Much of the research myself, Jakob Voss, Cathy Ma and others do depend on analyzing edit histories and drawing conclusions about article quality. So far, none of the research I've seen has "factored out" the effect of edit wars and vandalism.
 
2. Classifying types of edits, using diffs or edit summaries. There is a desire to qualitative
lexical (spelling, punctuation), factual (numbers, dates), organizational (rearranging), prose (style, tense change), etc. What are the best practices in detecting and classifying these?
 
3. Comparative approaches to reserach and modelling article "clusters." Last year, while comparing entries in a print encyclopedia against the categories in Wikipedia, the toughest part was trying to match up the taxonomical classifications, and the variation in breakdown into subtopics. How are people dealing with this mapping?
 
Please add to the list, and I will help assemble.
 
-Andrew Lih
University of Hong Kong
 
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@...
http://mail.wikipedia.org/mailman/listinfo/wiki-research-l
SJ | 29 Jul 2005 15:49
Picon

Re: Pre-Wikimania - Research BOAF

Hi Andrew,

A fine idea.  Please add any discussion topics here, linking each
section to a page devoted to that topic: 
http://meta.wikimedia.org/wiki/Wikimania:Discussions

Thursday, Friday, and Sunday evenings are pretty free.  Friday there
will be a few fun things going on.  (Trivia contest, story-telling,
etc)

SJ

On 7/29/05, Andrew Lih <andrew.lih@...> wrote:
> Hi all, 
>   
> Hope to see many of you at Wikimania next week (yes, it's only one week
> away). 
>   
> I want to propose some time is carved out for a BOAF session for wiki
> researchers. Seems Friday and Sunday eves are free, or it could be Thursday
> before things get started. 
>   
> Here are some issues I'd love to talk to other folks about, please feel free
> to add: 
>   
> 1. Heuristics for recognizing patterns in edit histories. Most pressing is
> an algorithm to determine what constitutes an edit war, vandalism or any
> other type of "noise" in the system if one's measuring "substantive" edits.
> (This is hard - even the "I'll know it when I see it" method is problematic,
> as evidenced by the recent dispute with and departure of RickK.) Much of the
> research myself, Jakob Voss, Cathy Ma and others do depend on analyzing edit
> histories and drawing conclusions about article quality. So far, none of the
> research I've seen has "factored out" the effect of edit wars and vandalism.
>   
> 2. Classifying types of edits, using diffs or edit summaries. There is a
> desire to qualitative 
> lexical (spelling, punctuation), factual (numbers, dates), organizational
> (rearranging), prose (style, tense change), etc. What are the best practices
> in detecting and classifying these? 
>   
> 3. Comparative approaches to reserach and modelling article "clusters." Last
> year, while comparing entries in a print encyclopedia against the categories
> in Wikipedia, the toughest part was trying to match up the taxonomical
> classifications, and the variation in breakdown into subtopics. How are
> people dealing with this mapping? 
>   
> Please add to the list, and I will help assemble. 
>   
> -Andrew Lih 
> University of Hong Kong 
>   
> _______________________________________________
> Wiki-research-l mailing list
> Wiki-research-l@...
> http://mail.wikipedia.org/mailman/listinfo/wiki-research-l
> 
> 
> 

--

-- 
++SJ
Jimmy Wales | 30 Jul 2005 03:27
Favicon
Gravatar

Re: Pre-Wikimania - Research BOAF

Andrew Lih wrote:
> Hi all,
>  
> Hope to see many of you at Wikimania next week (yes, it's only one week
> away).
>  
> I want to propose some time is carved out for a BOAF session for wiki
> researchers. Seems Friday and Sunday eves are free, or it could be
> Thursday before things get started.
>  
> Here are some issues I'd love to talk to other folks about, please feel
> free to add:
>  
> 1. Heuristics for recognizing patterns in edit histories. Most pressing
> is an algorithm to determine what constitutes an edit war, vandalism or
> any other type of "noise" in the system if one's measuring "substantive"
> edits. (This is hard - even the "I'll know it when I see it" method is
> problematic, as evidenced by the recent dispute with and departure
> of RickK.) Much of the research myself, Jakob Voss, Cathy Ma and others
> do depend on analyzing edit histories and drawing conclusions about
> article quality. So far, none of the research I've seen has "factored
> out" the effect of edit wars and vandalism.

Revert wars and near-revert-wars are probably easier to algorithmically
identify than other types of edit wars.  How do we distinguish between
the case of two very active editors working very pleasantly together in
a back and forth session of mutual improvement and reinforcement versus
two very active editors working unpleasantly together in a back and
forth session of mutually reinforcing downward spiral of useless edits?

I think it's pretty hard to do... algorithmically.

As Andrew suggests, we all do this all the time in our own private
evaluations of what is going on.  We know that person X is a jerk, and a
problematic editor, and so is person Y, so when we see them going crazy
on an article, we know it is bad news.  But if we see Angela and Andrew
Lih both quickly and repeatedly editing an article, we know it is
probably good news.
Jimmy Wales | 30 Jul 2005 06:48
Favicon
Gravatar

Re: Re: WSJ on Wikipedia

Ingo Frost wrote:
> I am a bit disappointed about the available material
> that tries to measure the quality of Wikipedia articles.

Me too, but let's all agree that it is a difficult thing to do.

> (2) Political terms are sometimes very complex topics
> where the NPOV may not work, because there is no
> right nor wrong.

I think this is a serious misunderstanding of NPOV and of what it might
mean _for an encyclopedia_ to be "right or wrong_.

First, not everyone believes (and I certainly don't) that on political
topics there is no right nor wrong.  But some do.  And NPOV has to deal
with all of us.  The point is that we can typicall "go meta" and avoid
taking a controversial stand ourselves.  NPOV does not require us to
choose which of two sides is right or wrong on complex topics, but
rather requires us to describe the controversy.

Therefore, _for an encyclopedia_, it is quite possible to get it right
(or get it wrong) even when the underlying issue is complex and not
readily amenable to a final judgment.

> I observed a discussion and an edit war on the article
> about Direct Democracy (in the Germen Wikipedia:
> article "Direkte Demokratie") that led to a loss
> of quality: only a minimal and weak consens
> survived the different opinions: the evolutionary
> process did not improve quality in that case.

This can certainly be true in any given case.  But I wonder if you
aren't showing your own POV here -- I often wish articles read
differently, but often -- when I'm fully honest with myself -- this is
because I wish my own view were more prominently reflected, even if it
should not be.

> My question: Is there a scientific study on the
> quality of the Wikipedia ariticles? Does anyone
> work on that problems? What methods could be used
> to analyse the Quality?

I think this is a fantastic question and what I hope this list can foster.

It's an enormously difficult problem to get right, and you've identified
some of the tough problems here.  Despite my criticism (highly technical
and based on internal jargon) of what you said about NPOV, I do think
that it is quite hard to judge the quality of certain contentious
articles because there is no simple "gold standard" to which we can refer.

In many cases, and I say this with full awareness that it is also not
true in many other cases, our articles on contentious or controversial
topics are _the best in existence_ simply because they are the _most
free from bias_.

It's easy to compare a wikipedia chart of the periodic table of elements
 against a standard source and measure if it accurately reflects
received science.  It is much harder in areas where the only reliably
objective presentation one can find _at all_ is in Wikipedia in the
first place.  :-)

This gets us into some potentially insoluble philosophical issues with
measuring "quality" so what I recommend is that we remain steadfastly
practical, thinking of things which we actually can measure and test.

--Jimbo
Stirling Newberry | 30 Jul 2005 10:54

Re: Pre-Wikimania - Research BOAF

Aggressive edit wars aren't always destructive to the final article.  
Often articles improve dramatically when editors are forcing each  
other to document every assertion and push forward. The best ways to  
measure the progress of editting algorythmically are in

1. The number of links out. Growing articles, even hostille ones,  
tend to have increasing link density, as new concepts are added in.
2. The number of links in. Growing articles, even hostile ones, tend  
to have increasing traffic in from the article space.
3. Number of links from non wiki pages. If an article is getting a  
large number of links from talk pages which also have recent  
conjugate edits, this is a very good sign that discussion has broken  
down on the page.
4. Edit wars generate RFCs, talk page comments and so on. Also  
measure on the talk page the "chili ranking" of what is on the talk  
page. Destructive edit wars are accompanied by links to wiki policies  
or citations there of. Or in otherwords, the more often NPOV is  
mentioned in close proximity to other wiki policies, the more likely  
discussion has broken down. People who are editting well generally  
have better things to talk about, even if they are arguing about them.

The best way to find out if you don't have enough RAM, is to measure  
hard drive thrash. The best way to measure edit wars is by the amount  
of "thrash" that is being generated: disappearing links in or out,  
talk page links with conjugate edits, high correlation to mailing  
list in references, mentions of wikipolicies on talk page.

On Jul 29, 2005, at 9:27 PM, Jimmy Wales wrote:

> Andrew Lih wrote:
>
>> Hi all,
>>
>> Hope to see many of you at Wikimania next week (yes, it's only one  
>> week
>> away).
>>
>> I want to propose some time is carved out for a BOAF session for wiki
>> researchers. Seems Friday and Sunday eves are free, or it could be
>> Thursday before things get started.
>>
>> Here are some issues I'd love to talk to other folks about, please  
>> feel
>> free to add:
>>
>> 1. Heuristics for recognizing patterns in edit histories. Most  
>> pressing
>> is an algorithm to determine what constitutes an edit war,  
>> vandalism or
>> any other type of "noise" in the system if one's measuring  
>> "substantive"
>> edits. (This is hard - even the "I'll know it when I see it"  
>> method is
>> problematic, as evidenced by the recent dispute with and departure
>> of RickK.) Much of the research myself, Jakob Voss, Cathy Ma and  
>> others
>> do depend on analyzing edit histories and drawing conclusions about
>> article quality. So far, none of the research I've seen has "factored
>> out" the effect of edit wars and vandalism.
>>
>
> Revert wars and near-revert-wars are probably easier to  
> algorithmically
> identify than other types of edit wars.  How do we distinguish between
> the case of two very active editors working very pleasantly  
> together in
> a back and forth session of mutual improvement and reinforcement  
> versus
> two very active editors working unpleasantly together in a back and
> forth session of mutually reinforcing downward spiral of useless  
> edits?
>
> I think it's pretty hard to do... algorithmically.
>
> As Andrew suggests, we all do this all the time in our own private
> evaluations of what is going on.  We know that person X is a jerk,  
> and a
> problematic editor, and so is person Y, so when we see them going  
> crazy
> on an article, we know it is bad news.  But if we see Angela and  
> Andrew
> Lih both quickly and repeatedly editing an article, we know it is
> probably good news.
>
>
> _______________________________________________
> Wiki-research-l mailing list
> Wiki-research-l@...
> http://mail.wikipedia.org/mailman/listinfo/wiki-research-l
>
>
>
Cormac Lawler | 30 Jul 2005 11:57
Picon

Re: Re: WSJ on Wikipedia

On 7/29/05, Jimmy Wales <jwales@...> wrote:

> Ingo Frost wrote:

[snip]

> > My question: Is there a scientific study on the
> > quality of the Wikipedia ariticles? Does anyone
> > work on that problems? What methods could be used
> > to analyse the Quality?
> 
> I think this is a fantastic question and what I hope this list can foster.
> 
> It's an enormously difficult problem to get right, and you've identified
> some of the tough problems here.  Despite my criticism (highly technical
> and based on internal jargon) of what you said about NPOV, I do think
> that it is quite hard to judge the quality of certain contentious
> articles because there is no simple "gold standard" to which we can refer.
> 
> In many cases, and I say this with full awareness that it is also not
> true in many other cases, our articles on contentious or controversial
> topics are _the best in existence_ simply because they are the _most
> free from bias_.
> 
> It's easy to compare a wikipedia chart of the periodic table of elements
>  against a standard source and measure if it accurately reflects
> received science.  It is much harder in areas where the only reliably
> objective presentation one can find _at all_ is in Wikipedia in the
> first place.  :-)
> 
> This gets us into some potentially insoluble philosophical issues with
> measuring "quality" so what I recommend is that we remain steadfastly
> practical, thinking of things which we actually can measure and test.
> 
> --Jimbo
> 

To get around these philosophical issues, I believe that the only way
to measure quality of articles (especially contentious ones) is
qualitatively, ie by asking people/experts their opinions of articles,
their experience of the community etc and analysing the types of
reactions, the emotional resonance (or lack thereof), the language
they used etc. So far, I haven't seen many qualitative studies of
Wikipedia - I did one last Christmas as a kind of pilot study for my
dissertation, which you can see here:
http://wikisource.org/wiki/A_small_scale_study_of_Wikipedia 
and some others, from Wikimania, include:
http://en.wikibooks.org/wiki/Wikimania05/Paper-PA1
http://en.wikibooks.org/wiki/Wikimania05/Paper-JT1
Note: almost all Wikimania papers are still works in progress, including mine :)

The Brockhaus and New York Times studies are examples of qualitative
studies. However there are many quantitative studies, again from
Wikimania, including Andreas Brandle's paper, which Jakob already
mentioned in this thread and user:Boud's study of NPOV and meme
evolution:
http://en.wikibooks.org/wiki/Wikimania05/Paper-BO1
also upcoming data on article validation:
http://meta.wikimedia.org/wiki/Article_validation
and many more from: 
http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_in_academic_studies

A widespread notion is that only quantitative studies are scientific,
although this debate itself is fraught with philosophical as well as
methodological issues. I think that qualitative studies are extremely
valid and I'd like to see a few more of them on Wikipedia. As we all
know, there's a great spirit of openness here and, as a fresh
researcher here, I was thrilled at such clarity and honesty in the
answers I got back (see my pilot study and its appendices). Certainly
qualitative data can be large and unwieldy, but this is a separate
matter. It's all down to what you want to find out at the end.

Cormac / Cormaggio

Gmane