With the DocWrapper DSL, you can define classes that wrap HTML DOM documents and extract properties from them using either XPath or CSS selectors.
Mark Menard
gem "doc_wrapper", "~> 0.9.4"
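
To make the idea in the description concrete, here is a minimal sketch of a wrapper class that declares properties backed by XPath expressions. It is not the doc_wrapper API: the `XPathProperties` module, the `property` macro signature, and the `BookWrapper` class are hypothetical names invented for illustration. It uses REXML from Ruby's standard distribution, so it covers only XPath; CSS selectors would need an HTML parser that supports them.

```ruby
require "rexml/document"

# Hypothetical mini-DSL for illustration only. This is NOT the doc_wrapper
# API; it only shows the general pattern of declaring properties that are
# extracted from a wrapped document node.
module XPathProperties
  def self.included(base)
    base.extend(ClassMethods)
  end

  module ClassMethods
    # Defines a reader method `name` that evaluates `xpath` against the
    # wrapped node and converts the matched element's text to `type`.
    def property(name, type, xpath)
      define_method(name) do
        node = REXML::XPath.first(@node, xpath)
        return nil if node.nil?

        text = node.text.to_s.strip
        type == :integer ? Integer(text, 10) : text
      end
    end
  end

  # Stores the document node that all properties are resolved against.
  def initialize(node)
    @node = node
  end
end

# Example wrapper: each property names a type and an XPath expression.
class BookWrapper
  include XPathProperties

  property :title, :string,  ".//h1"
  property :pages, :integer, ".//span[@class='pages']"
end

html = <<~HTML
  <html><body>
    <h1>Programming Ruby</h1>
    <span class="pages">864</span>
  </body></html>
HTML

book = BookWrapper.new(REXML::Document.new(html).root)
puts book.title
puts book.pages
```

The design point this sketch tries to show is that the selectors live in the class definition, so calling code reads `book.title` instead of repeating XPath strings. Consult the gem's own documentation for its actual module names and `property` options.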