wp2txt 1.1.0

WP2TXT extracts text and category data from Wikipedia dump files (encoded in XML / compressed with Bzip2), removing MediaWiki markup and other metadata.

Gemfile:
gem 'wp2txt', '~> 1.1.0'

install:
gem install wp2txt -v 1.1.0

Versions:

  1. 1.1.3 - May 13, 2023 (7.78 MB)
  2. 1.1.2 - April 15, 2023 (7.78 MB)
  3. 1.1.1 - January 25, 2023 (7.78 MB)
  4. 1.1.0 - January 22, 2023 (7.78 MB)
  5. 1.0.2 - November 25, 2022 (7.78 MB)
(29 versions total)

Runtime Dependencies (7):

Development Dependencies (3):

bundler >= 0
rake >= 0
rspec >= 0

Authors:

  • Yoichiro Hasebe

SHA 256 checksum:

69dcef55fddc51082f0231ed068d3d280ef2b03630f47c73fc1ce27909d2a52f
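
To check a downloaded copy of the gem against the checksum above, a minimal Ruby sketch follows. It assumes the file was fetched locally, for example with `gem fetch wp2txt -v 1.1.0`, which saves `wp2txt-1.1.0.gem` in the current directory. The helper name `checksum_matches?` and the file path are illustrative and are not part of wp2txt or RubyGems.

```ruby
require "digest"

# SHA-256 checksum published for wp2txt 1.1.0 on the gem page.
EXPECTED_SHA256 = "69dcef55fddc51082f0231ed068d3d280ef2b03630f47c73fc1ce27909d2a52f"

# Hypothetical helper: returns true when the file's SHA-256 digest
# equals the expected hex string (compared case-insensitively).
def checksum_matches?(path, expected = EXPECTED_SHA256)
  Digest::SHA256.file(path).hexdigest == expected.downcase
end

# Assumed local path; adjust to wherever the .gem file was saved.
gem_path = "wp2txt-1.1.0.gem"
if File.exist?(gem_path)
  puts(checksum_matches?(gem_path) ? "checksum OK" : "checksum MISMATCH")
end
```

A mismatch usually means the download is incomplete or the file belongs to a different version, so it is worth fetching again before installing.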

Total downloads: 63,977

For this version: 342

License:

MIT

Required Ruby Version: >= 2.6
