Scrapinghub (scrapinghub)

Turn web content into useful data

Location: The Internet

Home Page: https://scrapinghub.com

Twitter: @Scrapinghub

Scrapinghub's repositories

scmongo

MongoDB extensions for Scrapy

Language: Python, Stargazers: 44, Issues: 6, Issues: 0

webpager

Paginating the web

pycon-speakers

Speakers Spider (PyCon 2014 sprint)

navscraper

Vanguard ETF NAV scraper

Language: Python, Stargazers: 8, Issues: 0, Issues: 0
Language: Shell, License: MIT, Stargazers: 7, Issues: 0, Issues: 0
Language: Python, Stargazers: 7, Issues: 0, Issues: 0

disco

A Map/Reduce framework for distributed computing

Language: Erlang, License: BSD-3-Clause, Stargazers: 5, Issues: 0, Issues: 0

python-readability

Fast Python port of arc90's readability tool, updated to match the latest readability.js

Language: HTML, Stargazers: 4, Issues: 0, Issues: 0

vulcand

HTTP proxy that uses etcd as a configuration backend.

License: Apache-2.0, Stargazers: 3, Issues: 0, Issues: 0

pydaybot

Demo bot for Python Day Uruguay 2011

Language: Python, Stargazers: 2, Issues: 0, Issues: 0

streamparse

streamparse lets you run Python code against real-time streams of data. Integrates with Apache Storm.

Language: Python, License: Apache-2.0, Stargazers: 2, Issues: 0, Issues: 0

cld2

Compact Language Detector 2

Language: C++, Stargazers: 1, Issues: 0, Issues: 0

docker-registry

Registry server for Docker (hosting/delivering of repositories and images)

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 0, Issues: 0
License: Apache-2.0, Stargazers: 1, Issues: 0, Issues: 0

logrotate

The logrotate utility is designed to simplify the administration of log files on a system which generates a lot of log files.

Language: C, License: GPL-2.0, Stargazers: 1, Issues: 0, Issues: 0

pkg-opengrok

Ubuntu packaging for OpenGrok

Language: Shell, Stargazers: 1, Issues: 0, Issues: 0

python-intercom

Python wrapper for the Intercom API.

Language: Python, License: NOASSERTION, Stargazers: 1, Issues: 3, Issues: 0

python-memcached

A Python memcached client library.

Language: Python, Stargazers: 1, Issues: 0, Issues: 0

python-wapiti

Python bindings for libwapiti

Language: C, Stargazers: 1, Issues: 0, Issues: 0

storm-docker

Dockerfiles for building a Storm cluster.

Language: Shell, Stargazers: 1, Issues: 0, Issues: 0

backsaver

A git server

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

cedarish

Heroku Cedar-ish Base Image for Docker

Language: Shell, Stargazers: 0, Issues: 0, Issues: 0

deimos

Mesos containerizer hooks for Docker

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

log-courier

Log Courier, a lightweight log shipper with Logstash integration.

Language: Go, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

py-trello

Python API wrapper around Trello's API

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

pyfoo

A Python Wrapper for the Wufoo REST API

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 0, Issues: 0

slugbuilder

Builds Heroku slugs using Docker and buildpacks

Language: Shell, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

slugrunner

Runs Heroku slugs produced by slugbuilder in Docker

Language: Shell, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

tx-keystone-auth

A project to authenticate and authorize access with Keystone

Language: Python, Stargazers: 0, Issues: 0, Issues: 0