3 Mar 2012 11:28
Wikipedia data extraction
Ashish Mukherjee <ashish.mukherjee <at> gmail.com>
2012-03-03 10:28:21 GMT
Hi,
I am using the following Perl modules to extract data from Wikipedia and Wikitravel, respectively:
- WWW::Wikipedia
- MediaWiki::API
Both modules, and the MediaWiki API itself as far as I can tell, return the entire page text as a single chunk in the web service response. To extract individual sections of a wiki entry, I have to rely on pattern matching with regular expressions.
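For illustration, the pattern-matching approach described above looks roughly like the sketch below. It is written in Python rather than Perl and is not the actual code in use; the names `HEADING_RE` and `split_sections` and the sample text are made up for the example. It assumes section headings follow the usual wikitext form `== Title ==`, with two to six equals signs on each side.

```python
import re

# Matches a wikitext heading line such as "== History ==" or "=== Early years ===".
# The backreference \1 requires the same number of "=" on both sides.
HEADING_RE = re.compile(r"^(={2,6})[ \t]*(.+?)[ \t]*\1[ \t]*$", re.MULTILINE)


def split_sections(wikitext):
    """Split raw wikitext into a list of (level, title, body) tuples.

    The text before the first heading is returned as level 0 with an empty title.
    """
    matches = list(HEADING_RE.finditer(wikitext))

    # Lead section: everything before the first heading (or the whole text).
    lead_end = matches[0].start() if matches else len(wikitext)
    sections = [(0, "", wikitext[:lead_end].strip())]

    for i, m in enumerate(matches):
        # A section body runs until the next heading of any level, or end of text.
        body_end = matches[i + 1].start() if i + 1 < len(matches) else len(wikitext)
        sections.append((len(m.group(1)), m.group(2), wikitext[m.end():body_end].strip()))

    return sections


if __name__ == "__main__":
    # Hypothetical sample text, not taken from a real page.
    sample = (
        "Paris is the capital of France.\n\n"
        "== History ==\nFounded long ago.\n\n"
        "=== Early years ===\nDetails here.\n\n"
        "== Get in ==\nBy plane or train.\n"
    )
    for level, title, body in split_sections(sample):
        print(level, repr(title), repr(body))
```

This treats every heading as a flat boundary, so a subsection's text is not included in its parent's body, and it does nothing about templates, tables, or headings inside `<nowiki>` blocks, which is part of why the approach feels fragile.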
Is there a better way to achieve this? Can anyone share sample code in any language (preferably Perl), or is there a tool that does this out of the box?
Any help would be appreciated.
Regards,
Ashish
_______________________________________________
Mediawiki-api mailing list
Mediawiki-api <at> lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-api