Docsplit is a command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text, page \ images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages...)
Required Ruby Version
None
Authors
Jeremy Ashkenas, Samuel Clay