RubyTokenizer is a simple command-line tool for language processing. It performs low-level tokenization and returns the 10 most frequent words in a body of text. It currently supports English text only, and it segments words by filtering out whitespace, punctuation marks, parentheses, and other special characters.
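
The tokenize-then-count approach described above can be illustrated with a minimal sketch. This is not RubyTokenizer's actual code or API: the method names `tokenize` and `top_words`, the regular expression, and the alphabetical tie-breaking rule are all assumptions made for illustration.

```ruby
# Hypothetical sketch; not RubyTokenizer's actual implementation or API.

# Split text into lowercase word tokens, treating whitespace, punctuation,
# parentheses, and other non-letter characters as separators.
# Apostrophes inside a word (as in "don't") are kept (an assumption).
def tokenize(text)
  text.downcase.scan(/[a-z]+(?:'[a-z]+)*/)
end

# Return the `limit` most frequent tokens as [word, count] pairs,
# most frequent first; ties are broken alphabetically (an assumption).
def top_words(text, limit = 10)
  counts = Hash.new(0)
  tokenize(text).each { |token| counts[token] += 1 }
  counts.sort_by { |word, count| [-count, word] }.first(limit)
end

sample = "The cat (and the dog) saw the bird; the bird saw the cat."
top_words(sample, 3).each { |word, count| puts "#{word}: #{count}" }
```

Because the pattern matches only the ASCII letters a to z, this sketch shares the English-only limitation noted above; accented or non-Latin words would be split or dropped.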

Required Ruby Version

>= 0

Authors

irinarenteria

Versions

  1. 0.1.3 March 03, 2016 (5.5 KB)
  2. 0.1.2 February 26, 2016 (5 KB)
  3. 0.1.1 February 25, 2016 (5 KB)
  4. 0.1.0 February 25, 2016 (5 KB)

SHA 256 checksum