wp2txt 0.5.3

WP2TXT extracts plain text from Wikipedia dump files (XML, compressed with Bzip2), stripping all MediaWiki markup and other metadata.

Gemfile:
gem 'wp2txt', '~> 0.5.3'

install:
gem install wp2txt -v 0.5.3

Versions:

  1. 1.1.3 - May 13, 2023 (7.78 MB)
  2. 1.1.2 - April 15, 2023 (7.78 MB)
  3. 1.1.1 - January 25, 2023 (7.78 MB)
  4. 1.1.0 - January 22, 2023 (7.78 MB)
  5. 1.0.2 - November 25, 2022 (7.78 MB)
  6. 0.5.3 - January 24, 2013 (279 KB)
29 versions in total.

Runtime Dependencies (6):

bundler >= 0
bzip2-ruby >= 0
json >= 0
nokogiri >= 0
sanitize >= 0
trollop >= 0

Development Dependencies (1):

rspec >= 0

Authors:

  • Yoichiro Hasebe

SHA 256 checksum:

8ab6de0e3ae0b3c777ce78375d54381e1a913a215dae36c2f570e4fc50fea8c4
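
The checksum above can be compared against a locally downloaded copy of the gem. The following is a minimal sketch, not an official procedure: the helper name `checksum_matches?` and the local path `wp2txt-0.5.3.gem` are assumptions made for illustration (the path assumes the file was fetched with `gem fetch wp2txt -v 0.5.3` into the current directory).

```ruby
require "digest"

# Checksum published on this page for wp2txt 0.5.3.
EXPECTED_SHA256 = "8ab6de0e3ae0b3c777ce78375d54381e1a913a215dae36c2f570e4fc50fea8c4"

# Hypothetical helper: returns true when the SHA-256 hex digest of the file
# at `path` equals `expected` (whitespace and letter case are ignored).
def checksum_matches?(path, expected)
  Digest::SHA256.file(path).hexdigest == expected.strip.downcase
end

# Assumed local path; adjust to wherever the .gem file was saved.
gem_path = "wp2txt-0.5.3.gem"
if File.exist?(gem_path)
  puts(checksum_matches?(gem_path, EXPECTED_SHA256) ? "checksum OK" : "checksum MISMATCH")
end
```

If the file is not present, the script prints nothing; a mismatch suggests the download is corrupted or is not the 0.5.3 release.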

Total downloads 63,494

For this version 2,852

Licenses:

N/A

Required Ruby Version: None

Links: