ruby-skill-bench orchestrates evaluation runs of AI coding agents inside isolated git sandboxes, then scores the results using deterministic and LLM-powered judges.

Required Ruby Version

>= 3.1

Authors

Ismael Marin

Versions

  1. 1.2.0 July 01, 2026 (131 KB)
  2. 1.1.0 June 23, 2026 (102 KB)
  3. 1.0.1 May 29, 2026 (98.5 KB)
  4. 0.1.0 May 17, 2026 (89 KB)

Pushed by

SHA 256 checksum