Distillery extracts the "content" portion out of an HTML document. It applies heuristics based on element type, location, class/id name and other attributes to try and find the content part of the HTML document and return it.
Jeff Pollard
gem "distillery", "~> 0.2.10"