Wikipedia data extraction
2012-03-03 10:28:21 GMT
I am using the following Perl modules to extract data from Wikipedia and Wikitravel, respectively:
Both of these modules, and the MediaWiki API itself as far as I can tell, seem to return the entire page text as a single chunk in the web service response. To extract individual sections of a wiki entry, I have to rely on pattern matching with regular expressions.
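To illustrate the kind of pattern matching meant here, the following is a minimal sketch in Python rather than the actual Perl code in use. It assumes the response contains raw wikitext in which section headings appear on their own lines as `== Title ==`. The function name `split_sections` and the sample text are hypothetical and exist only for this example.

```python
import re

# Assumption: a section heading is a line such as "== History ==" or
# "=== Middle Ages ===", with the same number of "=" on both sides.
HEADING_RE = re.compile(r"^(={2,6})\s*(.+?)\s*\1\s*$", re.MULTILINE)


def split_sections(wikitext):
    """Split raw wikitext into (level, title, body) tuples.

    The text before the first heading is returned with level 0 and an
    empty title. This is a hypothetical helper, not part of any library.
    """
    sections = []
    level, title = 0, ""
    last_end = 0
    for match in HEADING_RE.finditer(wikitext):
        # Close the section that ends where this heading begins.
        sections.append((level, title, wikitext[last_end:match.start()].strip()))
        level, title = len(match.group(1)), match.group(2)
        last_end = match.end()
    # Whatever follows the last heading belongs to the last section.
    sections.append((level, title, wikitext[last_end:].strip()))
    return sections


# Made-up sample text, for illustration only.
SAMPLE = """Paris is the capital of France.

== History ==
Founded long ago.

=== Middle Ages ===
Grew quickly.

== Get in ==
By plane or train.
"""

if __name__ == "__main__":
    for level, title, body in split_sections(SAMPLE):
        print(level, repr(title), repr(body))
```

A sketch like this is fragile: heading-like lines inside templates, comments, or `<nowiki>` blocks would be mistaken for real sections, which is the main reason a more structured alternative would be welcome.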
Is there a better way to achieve this? Is there sample code in any language (preferably Perl) that someone could share, or a tool that does this out of the box?
Any help would be appreciated.
_______________________________________________
Mediawiki-api mailing list
Mediawiki-api <at> lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-api