RubyGems Navigation menu

pikuri-pdf 0.0.6

pikuri-pdf plugs PDF → text extraction into pikuri-core's +Pikuri::Extractor+ registry. The bundled +Pikuri::Extractors::PDF+ extractor wraps the pure-Ruby pdf-reader gem and extracts lazily: paged reads (the +read+ tool's windows) parse only the pages the window needs, so the first page of a 500-page PDF never pays for the other 499. Shipped separately from pikuri-core so the core's dependency tree stays minimal and auditable: pdf-reader and its transitive deps (Ascii85, afm, hashery, ruby-rc4, ttfunk) ride along only for hosts that opt into PDF support. Registration is explicit — +Pikuri::Extractors::PDF.register+ — so requiring the gem changes nothing by itself; the host script picks which extractors it wires in. One registration extends the +read+ tool, +web_scrape+, and the pikuri-vectordb indexer simultaneously.

Gemfile:
=

install:
=

Versions:

  1. 0.0.6 June 04, 2026 (9 KB)

Runtime Dependencies (2):

pdf-reader ~> 2.15
pikuri-core = 0.0.6

Owners:

Pushed by:

Authors:

  • Martin Vysny

SHA 256 checksum:

=

Total downloads 108

For this version 108

Version Released:

License:

MIT

Required Ruby Version: >= 3.3

New versions require MFA: true

Version published with MFA: true

Links: