Big-Data's repositories
vpdmfModels
All vpdmf models built under our current system.
elasticsearch-river-rabbitmq
RabbitMQ River Plugin for ElasticSearch
pynliner
Python CSS-to-inline-styles conversion tool for HTML using BeautifulSoup and cssutils
stream2es
Another way to stream data into ES (stdin, Wikipedia, and Twitter)
linkedin-scraper
Scrapes the public profile from a LinkedIn page
linkedInScraper
Scrapes public information off of LinkedIn
python-goose
HTML Content / Article Extractor, web scraping library in Python
PythonBooks
Directory of free Python ebooks
subsection-identifier
Turns theoretically structured text into actually structured text.
requests
Python HTTP Requests for Humans™.
Infinit.e
The first Open Source document analysis platform
lapdftextProject
High-level build project for all LAPDF-Text submodules
easy-pie-chart
Lightweight jQuery plugin to render simple, animated, retina-ready pie charts with the HTML5 canvas element
lapdftextServer
A web application resource (*.war) file for the digital library client.
elasticsearch-gui
An AngularJS client for Elasticsearch, packaged as a plugin
pivot.js
Build Pivot Tables from CSV/JSON Data
dstk
A collection of the best open data sets and open-source tools for data science
scrapylib
Collection of Scrapy extensions, middlewares, pipelines & helper functions
markup
The code we use to render README.your_favorite_markup
pattern-matcher
Winning solution of Rubylight/JUG programming contest 1 (pattern matcher)
socket.io
Realtime application framework for Node.js, with HTML5 WebSockets and cross-browser fallback support.
elasticsearch-analysis-stempel
Stempel (Polish) Analysis Plugin for ElasticSearch
parsimonious
The fastest pure-Python PEG parser I can muster
dc.js
Multi-dimensional charting built to work natively with crossfilter, rendered with d3.js
kiji-scoring
A module for applying trained models to score Kiji entities in real-time.
kiji-mapreduce
A framework for MapReduce-based computation over data managed by KijiSchema
spynner
Programmatic web browsing module with AJAX support for Python
elasticsearch-river-solr
Solr River plugin for elasticsearch
python-zombie
A Python driver for Zombie.js (http://zombie.labnotes.org/), a headless browser powered by node.js.