RubyGems.org

wp2txt

0.5.0

WP2TXT extracts plain text data from Wikipedia dump file (encoded in XML/compressed with Bzip2) stripping all the MediaWiki markups and other metadata.

installgem install wp2txt -v 0.5.0
Authors

Yoichiro Hasebe

1,883 total downloads 236 for this version
Owners

4bc42bb182e5ceb135cbca1303a47df1

Gemfile
gem "wp2txt", "~> 0.5.0"
Versions
  1. 0.5.3 January 24, 2013
  2. 0.5.02 January 14, 2013
  3. 0.5.2 January 24, 2013
  4. 0.5.1 January 16, 2013
  5. 0.5.0 January 14, 2013
Show all versions (7 total)
Development Dependencies
  1. rspec >= 0