RubyGems Navigation menu

wp2txt 2.1.1

WP2TXT extracts text and category data from Wikipedia dump files (encoded in XML / compressed with Bzip2), removing MediaWiki markup and other metadata.

Gemfile:
=

install:
=

Versions:

  1. 2.1.1 February 21, 2026 (300 KB)
  2. 2.1.0 February 19, 2026 (299 KB)
  3. 1.1.3 May 13, 2023 (7.78 MB)
  4. 1.1.2 April 15, 2023 (7.78 MB)
  5. 1.1.1 January 25, 2023 (7.78 MB)
Show all versions (31 total)

Runtime Dependencies (8):

Development Dependencies (5):

bundler >= 0
rake >= 0
rspec >= 0
simplecov >= 0
webmock >= 0

Owners:

Pushed by:

Authors:

  • Yoichiro Hasebe

SHA 256 checksum:

=

Total downloads 71,956

For this version 167

Version Released:

License:

MIT

Required Ruby Version: >= 3.0

Links: