Casey Brown | 2 Jul 2008 04:36
Picon

Fwd: [Wikipedia-l] New static HTML dumps available

This is an often asked question on this list...

---------- Forwarded message ----------
From: Tim Starling <tstarling@...>
Date: Tue, Jul 1, 2008 at 10:18 PM
Subject: [Wikipedia-l] New static HTML dumps available
To: wikipedia-l@...
Cc: wikitech-l@...

New static HTML dumps of all Wikipedia editions are now available:

http://static.wikipedia.org/

Altogether, the dumps are 650GB uncompressed, 40GB compressed.

I think a reasonable next step for this project would be to write filter
scripts that take a compressed dump, reduce the article count in some way,
and then recompress it, possibly in a different format. For instance, we
could have a "most popular 4GB" of the English Wikipedia, based on page
view statistics, recompressed as an SQLite database.

-- Tim Starling

_______________________________________________
Wikipedia-l mailing list
Wikipedia-l@...
https://lists.wikimedia.org/mailman/listinfo/wikipedia-l

--

-- 
Casey Brown
(Continue reading)

Cormac Lawler | 3 Jul 2008 12:32
Picon

Re: Announcement: Erik Zachte, Data Analyst

On Thu, Jul 3, 2008 at 1:18 AM, Brion Vibber <brion@...> wrote:

>
> It is with great pleasure that I welcome Erik Zachte as a part-time
> contractor to the Wikimedia Foundation. Erik will start work with us
> officially on September 1.

[snip]

Fantastic news! Perhaps this is a good time to kickstart a set of pages on
Meta to help facilitate other researchers in accessing and using data - both
qualitative and quantitative? I'd also be very happy to set up a learning
community on Wikiversity with this in mind... (cross-posted to
wiki-research-l).

Cormac
_______________________________________________
foundation-l mailing list
foundation-l@...
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/foundation-l

Cormac Lawler | 3 Jul 2008 12:34
Picon

Fwd: [Foundation-l] Announcement: Erik Zachte, Data Analyst

Forwarding whole message...

---------- Forwarded message ----------
From: Brion Vibber <brion-AeOJrEpdGNeGglJvpFV4uA@public.gmane.org>
Date: Thu, Jul 3, 2008 at 1:18 AM
Subject: [Foundation-l] Announcement: Erik Zachte, Data Analyst
To: Wikimedia Foundation Mailing List <foundation-l-RusutVdil2icGmH+5r0DM0B+6BGkLq7r@public.gmane.org>


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

It is with great pleasure that I welcome Erik Zachte as a part-time
contractor to the Wikimedia Foundation. Erik will start work with us
officially on September 1.

Erik has worked as a Technical Analyst for Air France - KLM for more
than two decades. He is probably most famous in the Wikimedia community
as the developer of "WikiStats" (stats.wikimedia.org), an amazing
statistics package that reveals data about the growth and editing
patterns in our various wiki projects.

Erik has created many other wonderful tools, featured at
http://infodisiac.com/ . In his part-time role with the Wikimedia
Foundation, he will continue to maintain and develop code to provide
critical operational metrics about our projects. A key first project,
for example, will be the integration of traffic statistics into the
WikiStats package. These metrics will be key to communications,
fundraising, internal evaluation, and for many other purposes.

We're working in parallel to provide Erik with the support he needs to
do his work, including more regularly provisioned dumps and hardware
to run his scripts.

I'm very pleased that Erik is joining our team, and look forward to
working with him. Please join me in welcoming him.  :)

- -- brion vibber (brion <at> wikimedia.org)
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.8 (Darwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAkhsGukACgkQwRnhpk1wk466xACfcX+G/DjEHk8KO3RP9TMHsMpI
n3MAn2aECpBhmf+iRqDTaEdEKlw6FJYk
=O/we
-----END PGP SIGNATURE-----

_______________________________________________
foundation-l mailing list
foundation-l-RusutVdil2icGmH+5r0DM0B+6BGkLq7r@public.gmane.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/foundation-l

_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@...
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Dirk Riehle | 3 Jul 2008 17:14
Gravatar

Re: Fwd: [Foundation-l] Announcement: Erik Zachte, Data Analyst

Saw it on foundation-l; thanks for cross-posting.

I think this is a great decision---what you can't measure, you can't 
manage, as the old adage goes.

I would like to suggest that Erik view his job not only as a 
statistician but also as a research outreach person. WikiSym has been 
growing at a healthy 25%/year rate in terms of publications, the quality 
has been increasing significantly. Quantitative research into Wikipedia 
plays a major role. I very much would like to see a continuation of 
Jakob's and Angela's 2006 workshop on research into Wikipedia.

On that note, maybe someone (from the board?) can clarify for us the 
role that research plays for the WMF. I've kind of lost track of the 
meaning of the changes from Erik M through James to Gregory now, I believe?

Thanks,
Dirk

Cormac Lawler wrote:
> Forwarding whole message...
>
> ---------- Forwarded message ----------
> From: *Brion Vibber* <brion@... <mailto:brion@...>>
> Date: Thu, Jul 3, 2008 at 1:18 AM
> Subject: [Foundation-l] Announcement: Erik Zachte, Data Analyst
> To: Wikimedia Foundation Mailing List 
> <foundation-l@... 
> <mailto:foundation-l@...>>
>
>
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> It is with great pleasure that I welcome Erik Zachte as a part-time
> contractor to the Wikimedia Foundation. Erik will start work with us
> officially on September 1.
>
> Erik has worked as a Technical Analyst for Air France - KLM for more
> than two decades. He is probably most famous in the Wikimedia community
> as the developer of "WikiStats" (stats.wikimedia.org 
> <http://stats.wikimedia.org>), an amazing
> statistics package that reveals data about the growth and editing
> patterns in our various wiki projects.
>
> Erik has created many other wonderful tools, featured at
> http://infodisiac.com/ . In his part-time role with the Wikimedia
> Foundation, he will continue to maintain and develop code to provide
> critical operational metrics about our projects. A key first project,
> for example, will be the integration of traffic statistics into the
> WikiStats package. These metrics will be key to communications,
> fundraising, internal evaluation, and for many other purposes.
>
> We're working in parallel to provide Erik with the support he needs to
> do his work, including more regularly provisioned dumps and hardware
> to run his scripts.
>
> I'm very pleased that Erik is joining our team, and look forward to
> working with him. Please join me in welcoming him.  :)
>
> - -- brion vibber (brion  <at>  wikimedia.org <http://wikimedia.org>)
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.8 (Darwin)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
>
> iEYEARECAAYFAkhsGukACgkQwRnhpk1wk466xACfcX+G/DjEHk8KO3RP9TMHsMpI
> n3MAn2aECpBhmf+iRqDTaEdEKlw6FJYk
> =O/we
> -----END PGP SIGNATURE-----
>
> _______________________________________________
> foundation-l mailing list
> foundation-l@... <mailto:foundation-l@...>
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/foundation-l
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Wiki-research-l mailing list
> Wiki-research-l@...
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>   

--

-- 
Into novel software paradigms, tools, processes?
Then submit a short paper to Onward! 2008 by July 2nd!
See http://www.oopsla.org/oopsla2008/cfp/cfp-onward.html
--
Phone: + 1 (650) 215 3459, Web: http://www.riehle.org
Gilad Ravid | 3 Jul 2008 20:58

Re: Fwd: [Foundation-l] Announcement: Erik Zachte, Data Analyst

Hi,


Have anyone know how to contact Domaas Mituzas?


I works with his stats files (http://dammit.lt/wikistats/)

and have some questions:

a) I need some back files? Is there a way to get them

b) In the last days there are files with odd numbers? I need help to interpret them


thanks,

Gilad




Domas Mituzas

Cormac Lawler wrote:

Forwarding whole message...

---------- Forwarded message ----------
From: Brion Vibber <brion-AeOJrEpdGNeGglJvpFV4uA@public.gmane.org>
Date: Thu, Jul 3, 2008 at 1:18 AM
Subject: [Foundation-l] Announcement: Erik Zachte, Data Analyst
To: Wikimedia Foundation Mailing List <foundation-l-RusutVdil2icGmH+5r0DM0B+6BGkLq7r@public.gmane.org>


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

It is with great pleasure that I welcome Erik Zachte as a part-time
contractor to the Wikimedia Foundation. Erik will start work with us
officially on September 1.

Erik has worked as a Technical Analyst for Air France - KLM for more
than two decades. He is probably most famous in the Wikimedia community
as the developer of "WikiStats" (stats.wikimedia.org), an amazing
statistics package that reveals data about the growth and editing
patterns in our various wiki projects.

Erik has created many other wonderful tools, featured at
http://infodisiac.com/ . In his part-time role with the Wikimedia
Foundation, he will continue to maintain and develop code to provide
critical operational metrics about our projects. A key first project,
for example, will be the integration of traffic statistics into the
WikiStats package. These metrics will be key to communications,
fundraising, internal evaluation, and for many other purposes.

We're working in parallel to provide Erik with the support he needs to
do his work, including more regularly provisioned dumps and hardware
to run his scripts.

I'm very pleased that Erik is joining our team, and look forward to
working with him. Please join me in welcoming him.  :)

- -- brion vibber (brion <at> wikimedia.org)
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.8 (Darwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAkhsGukACgkQwRnhpk1wk466xACfcX+G/DjEHk8KO3RP9TMHsMpI
n3MAn2aECpBhmf+iRqDTaEdEKlw6FJYk
=O/we
-----END PGP SIGNATURE-----

_______________________________________________
foundation-l mailing list
foundation-l-RusutVdil2icGmH+5r0DM0B+6BGkLq7r@public.gmane.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/foundation-l

_______________________________________________ Wiki-research-l mailing list Wiki-research-l-RusutVdil2icGmH+5r0DM0B+6BGkLq7r@public.gmane.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

-- "The Mind is not a vessel to be filled, but a fire to be kindled." -- Plutarch, On Listening to Lectures Gilad Ravid, Ph.D. Department of Industrial Engineering and Management Ben Gurion University of the Negev, Israel Office: +972-8-6472772 Mobile: +972-54-4905391 Skype: giladravid
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@...
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Cormac Lawler | 3 Jul 2008 22:53
Picon

Re: Fwd: [Foundation-l] Announcement: Erik Zachte, Data Analyst



On Thu, Jul 3, 2008 at 7:58 PM, Gilad Ravid <gilad-nwqbNM1IGQ7YtjvyW6yDsg@public.gmane.org> wrote:

Hi,


Have anyone know how to contact Domaas Mituzas?


The email address I have for Domas (from wmf mailing lists at least) is: midom.lists-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org ; his en.wp user page is: http://en.wikipedia.org/wiki/User:Midom (though I've no idea what wikis he checks most regularly).

Cheers,
Cormac
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@...
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Cormac Lawler | 3 Jul 2008 22:58
Picon

Re: Fwd: [Foundation-l] Announcement: Erik Zachte, Data Analyst


On Thu, Jul 3, 2008 at 4:14 PM, Dirk Riehle <dirk-aUuP8bbZmiIdnm+yROfE0A@public.gmane.org> wrote:

On that note, maybe someone (from the board?) can clarify for us the
role that research plays for the WMF. I've kind of lost track of the
meaning of the changes from Erik M through James to Gregory now, I believe?

Sue and Erik (M) drafted a document about Wikimedia's research goals (aligned to wmf's strategic goals), which you'll find at: <http://meta.wikimedia.org/wiki/Wikimedia_Foundation_Research_Goals>. But this is still pretty vague - I can't answer your question, and feel it would be better asked on foundation-l.

Cheers,
Cormac
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@...
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Michael Bimmler | 4 Jul 2008 12:09
Picon

Re: Fwd: [Foundation-l] Announcement: Erik Zachte, Data Analyst

On Thu, Jul 3, 2008 at 10:53 PM, Cormac Lawler <cormaggio@...> wrote:
>
>
> On Thu, Jul 3, 2008 at 7:58 PM, Gilad Ravid <gilad@...> wrote:
>>
>> Hi,
>>
>> Have anyone know how to contact Domaas Mituzas?
>
> The email address I have for Domas (from wmf mailing lists at least)
> is: midom.lists@... ; his en.wp user page
> is: http://en.wikipedia.org/wiki/User:Midom (though I've no idea what wikis
> he checks most regularly).
>

I guess, the easiest way is probably to use his official address:
domas at wikimedia.

Michael

--

-- 
Michael Bimmler
mbimmler@...
Anne GOLDENBERG | 9 Jul 2008 19:01
Picon

Study about contribution to public wikis / Étude sur la contribution aux wikis publics

Dear wiki-researcher,
you may want to participate to this study or forward this
invitation.

----------------------------------------------------------------------------------------
Dear public wiki contributor,
(Version française ci-dessous).

My name is Anne Goldenberg, I'm a french PhD student in sociology and
communication, doing a thesis on contributions in public wikis. I'm
mostly interested in the policies and the habits that contributors
develop in order to organize the collaboration in the wiki.

I've set up a small survey entitled ''What is it to contribute to a
wiki'', that I would like you to take.
It's available here :
http://www.er.uqam.ca/nobel/labcmo/portraitdulibre/index.php?sid=29168&lang=en

According to your experience with wikis, you can help us understand
what are the contributors' motivations and expectations, how they learn
to participate online, how contributions are discussed or legitimized
and what helps to have good contributions and motivated contributors.

Your answers will allow me to write an article and a thesis chapter on
the notion of contribution in the wikisphere. The result will be
anonymised and put online under a free license on my websites (see
below).  You will need about 30 to 40 min to complete the survey.

Thank you in advance for your collaboration,
I hope this will help your community,
Best regards,
Anne Goldenberg
UQAM, Montréal, Quebec, Canada
Unice, Nice, France

PS : Please, feel free to forward this email to anyone or any mailing
list that you think could be interested in this survey.


http://anne.koumbit.org
http://wikifarm.koumbit.org/anne

== Version Française ==

Cher contributeur, chère contributrice aux wikis publics,

Mon nom est Anne Goldenberg, je suis une doctorante française
en sociologie et communication et ma thèse porte sur la
contribution aux wikis publics. Je m'intéresse plus particulièrement
aux habitudes et aux politiques développées par les contributeurs afin
d'organiser la collaboration dans un wiki.

J'ai mis en place une petite enquête intitulée "Qu'est ce que
contribuer sur un wiki ?", à laquelle j'aimerai que vous participiez.
Elle est disponible ici :
http://www.er.uqam.ca/nobel/labcmo/portraitdulibre/index.php?sid=32238&lang=fr

À partir de votre expérience avec les wikis, vous nous aiderez
certainement à mieux comprendre quelles sont les motivations (et les
attentes) des contributeurs, comment s'effectue leurs apprentissage
d'une culture de la participation en ligne, comment se légitime et se
discute les contributions, qu'est ce qui favorise une bonne
participation.

Vos réponses me permettront d'écrire un article et un chapitre de thèse
sur la notion de contribution dans la wikisphère. Les résultats seront
anonymisés et mis en ligne sous licence libre sur mes sites Web (voir
ci-dessous). Il vous faudra environ 30 à 40 minutes pour répondre à
l'enquête.

Merci d'avance pour votre collaboration,
J'espère que cela contribuera à la connaissance de votre communauté,

Avec tous mes remerciements,

Anne Goldenberg,
UQAM, Montréal, Québec, Canada
Unice, Nice, France


PS : Sentez-vous libre de faire circuler ce courriel à toutes les
personnes ou les listes qui pourraient être intéressées par cette étude.

http://anne.koumbit.org
http://wikifarm.koumbit.org/anne


_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@...
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Felipe Ortega | 22 Jul 2008 11:22
Picon
Picon
Favicon
Gravatar

CfP: WIRW (at WikiSym 2008)

Apologies for duplicate reception.

=============================================================
                            CALL FOR PAPERS
=============================================================
  
  First Workshop on "Interdisciplinary Research on Wiki Communities"
                            September 8, 2008
   http://libresoft.es/Activities/Research_activities/Workshop_WikiSym

                             at WikiSym 2008
                      http://www.wikisym.org/ws2008/
                             Porto, Portugal
                          September 8-10, 2008

=================
Introduction
=================
The array of approaches to studying wikis is a source of
wealth but also a possible source of confusion: What are
appropriate methodologies for the analysis of wiki communities?
Which are the most critical parameters (both quantitative
and qualitative) for study in wiki evolution and outcomes?
Is it possible to find effective interdisciplinary approaches
to augment our overall understanding of these dynamic
creative environments?

This workshop intends to provide an opportunity to explore
these questions by researchers and practitioners willing to
participate in a “brainstorming research meeting”.

=================
How To Participate
=================
You are invited to submit a position paper (max. length 4 pages)
showing methodologies, tools and challenges in this research
area. Submissions will be reviewed for quality and relevance.

We plan to celebrate a dynamic brainstorming research session. If
you're attending WikiSym but didn't submit a paper, you're also
invited to join the workshop.

Submissions should be sent to jfelipe_at_gsyc_dot_es and
wiki_workshop_at_gsyc_dot_es. Submissions must follow the ACM SIG
Proceedings Format:
http://www.acm.org/sigs/pubs/proceed/template.html .

=================
Important Dates
=================
* Submission deadline: **August 12**.
* Notification of acceptance: August 19.
* Workshop date: September 8, 13:30-17:00 (time to be confirmed).

=================
Goals
=================
The main goal of this workshop is to provide an adequate forum for
wiki researchers to:

* Present their own approaches for studying wiki communities from
multiple perspectives (e-learning, sociology, content analysis,
quantitative analysis and data mining, social networking, etc.).
 
* Explain the main problems they face when integrating different
research approaches into a coherent line of research.

* Propose possible solutions in the form of tools, methodologies,
research strategies and ways of collaboration that can be adopted
to further such research.

* Exchange research ideas with colleagues who can provide novel
points of view that could complement existing research initiatives.

An important outcome will be a charter to create a global virtual
research planet on wiki communities. It will include wiki
researchers, wiki research tools, wiki research bibliography,
forums, blogs, wikis, etc. Its main objective will be to serve
as a common meeting point and resource center for all researchers
in this field.

=================
Topics
=================
Examples of relevant topics include, but are not limited to:

Practitioner studies:

  * Research tools supporting quantitative analysis of wiki
  communities.

  * Content quality, assessment, and author reputation.

  * Social Network Analysis, web graphs, and visualization
  techniques for wiki communities.

  * Technological innovation for supporting wiki communities
  (e.g., content distribution, P2P, security and authentication).

  * Ontological and taxonomic technologies (lexical corpus
  and authorities, semantic searches, content categorization).

Reflections upon methodological and disciplinary experiences
in studying wiki communities (e.g., sociology, education,
knowledge management, economics).

  * Dealing with human subjects review (IRB) or ethic
  guidelines (e.g., journalists interviewing versus oral
  historian)

  * Issues on privacy and data retention. How do these issues
  vary across disciplines and countries (legal/regulatory
  issues)

  * Did you try to submit your work to a journal in one
  discipline, have it rejected, but accepted in another?
  (And why was this so?)

  * Have you had any experiences working with collaborators
  from other disciplines?

  * Did you run into problems with your corpus of data and
  the tools people typically use in your discipline?

=================
Agenda
=================
In this context, the workshop organization will be threefold:

1.Present a complete perspective of the current state-of-the-art
of Wikipedia and wiki-based open communities from different points
of view. This will be achieved by presentations of position papers
explaining existing tools and research methods successfully applied
to the analysis of wiki environments.

2.Promote the creation of small working groups to consider the
integration of distinct research strategies, including opportunities
for interactions and feedback between researchers.

3.A final plenary session to consider the different solutions
proposed by the working groups, and summarizing a catalog of tools,
methodologies and opportunities of collaboration that may be offered
to wiki researchers wanting to undertake interdisciplinary analysis
in this area.

A detailed agenda is available at
http://libresoft.es/Activities/Research_activities/Workshop_WikiSym .

=================
Organizers
=================
Joseph Reagle - (NYU, USA)
Felipe Ortega - GSyC/Libresoft (Universidad Rey Juan Carlos, Spain).
Antonio J. Reinoso - GSyC/Libresoft (Universidad Rey Juan Carlos, Spain).
Rut Jesus - CPNSS (University of Copenhagen, Denmark)

Reviews Coordinator:
------------------------
Gregorio Robles - GSyC/Libresoft (Universidad Rey Juan Carlos, Spain).


Enviado desde Correo Yahoo!
La bandeja de entrada más inteligente.
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@...
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Gmane