Pari's repositories
redash
Make Your Company Data Driven. Connect to any data source, easily visualize and share your data.
pyxero
Python API for accessing the REST API of the Xero accounting tool.
Open-Data-Catalog
Open Data Catalog is an open data catalog based on Django, Python and PostgreSQL. It was originally developed for OpenDataPhilly.org, a portal that provides access to open data sets, applications, and APIs related to the Philadelphia region. The Open Data Catalog is a generalized version of the original source code with a simple skin. It is intended to display information and links to publicly available data in an easily searchable format. The code also includes options for data owners to submit data for consideration and for registered public users to nominate a type of data they would like to see openly available to the public.
AzureSearchOCR
Sample of how to leverage Optical Character Recognition (OCR) to extract text from images to enable Full Text Search in Azure Search
scalding
A Scala API for Cascading
data-pipeline-samples
This repository hosts sample pipelines
dataduct
DataPipeline for humans.
kinesis-deaggregation
AWS Lambda modules for working with Kinesis Producer Library
arbalest
Arbalest is a Python data pipeline orchestration library for Amazon S3 and Amazon Redshift. It automates data import into Redshift and makes data queryable at scale in AWS.
aegisthus
A Bulk Data Pipeline out of Cassandra
amazon-kinesis-learning
Learning Amazon Kinesis Development
lumify
open source big data integration, analytics, and visualization
mortar-examples
Mortar Project with examples for several different public data sets and data types/formats
redshift-udfs
SQL for many helpful Redshift UDFs, and the scripts for generating and testing those UDFs
mario
Functional, Typesafe, Declarative Data Pipelines
docker-python-deploy
Example project for deployment
LearnDataScience
Open Content for self-directed learning in data science
spark-cs100.1x
Working of CS100.1x, Introduction to Big Data with Apache Spark
05-reproducible-research-assignment-2
Peer assignment 2 from the Coursera data science class
ddl-generator
Guesses table DDL based on data
easyRFM
An easy way to RFM analysis by R
Black-Scholes-Model
R function to compute European price option using Black Scholes Formula.
Kernel_density_ripley
Kernel density estimation with Ripley's Circumferential correction
pitchRx
Tools for scraping MLB Gameday data and Visualizing PITCHf/x
Streaming-Demos
Demos of Plotly's Real-time Streaming API
training-scripts
Scripts to launch cluster used for Strata