medusa-crawler 1.0.0.pre.1

Medusa: a Ruby crawler framework

Medusa is a Ruby framework for crawling websites and collecting information about the pages it visits. It is versatile, letting you write your own specialized tasks quickly and easily.

#### Features

  • Choose the links to follow on each page with `focus_crawl()`

  • Multi-threaded design for high performance

  • Tracks 301 HTTP redirects

  • Allows exclusion of URLs based on regular expressions

  • HTTPS support

  • Records response time for each page

  • Obeys robots.txt

  • In-memory or persistent storage of pages during the crawl, using Moneta adapters

  • Inherits OpenURI behavior (redirects, automatic charset and encoding detection, proxy configuration options)
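
The link-selection and URL-exclusion features listed above can be illustrated with a small sketch. This listing does not document Medusa's API, so the example below does not call Medusa at all: it uses only Ruby's standard library, and `links_to_follow` and `skip_patterns` are hypothetical names chosen for illustration. It shows one plausible way a crawler might drop URLs matching regular expressions and then let a caller-supplied block narrow the remaining links.

```ruby
require 'uri'

# Hypothetical helper (not part of Medusa): decides which of the links
# found on a page should be queued for crawling.
def links_to_follow(links, skip_patterns: [], &focus)
  candidates = links.map { |link| URI(link) }

  # Exclusion step: drop any URL whose path matches one of the patterns.
  kept = candidates.reject do |uri|
    skip_patterns.any? { |pattern| uri.path =~ pattern }
  end

  # Focus step: let the caller narrow the remaining links, if a block is given.
  kept = focus.call(kept) if focus

  # Return plain strings without duplicates, preserving order.
  kept.map(&:to_s).uniq
end

links = [
  'https://example.com/articles/1',
  'https://example.com/articles/2',
  'https://example.com/login',
  'https://example.com/assets/logo.png',
  'https://example.com/articles/1'
]

selected = links_to_follow(links, skip_patterns: [/\.png\z/, %r{\A/login}]) do |uris|
  uris.select { |uri| uri.path.start_with?('/articles/') }
end

puts selected
```

Keeping exclusion and focusing as separate steps means the regular expressions act as a global filter, while the block can apply page-specific logic to whatever survives.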


Versions:

  1. 1.0.0 - August 17, 2020 (23.0 KB)
  2. 1.0.0.pre.2 - August 14, 2020 (23.0 KB)
  3. 1.0.0.pre.1 - August 06, 2020 (24.0 KB)

Runtime dependencies (3):

moneta ~> 1.3, >= 1.3.0
nokogiri ~> 1.3, >= 1.3.0
robotex ~> 1.0, >= 1.0.0

Authors:

  • Mauro Asprea, Chris Kite

SHA 256 checksum:

36b72004627cc1abf81715777b29c34e67dc3c1f9420311103ea275e6d216733

Total downloads: 4,267

For this version: 1,272

License:

MIT

Required Ruby version: >= 0

Required RubyGems version: > 1.3.1
