MX's starred repositories

private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Language:PythonLicense:Apache-2.0Stargazers:53326Issues:450Issues:1151

timescaledb

An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.

Language:CLicense:NOASSERTIONStargazers:17266Issues:309Issues:2807

Cookbook

The Data Engineering Cookbook

data-engineer-roadmap

Roadmap to becoming a data engineer in 2021

fuzzywuzzy

Fuzzy String Matching in Python

Language:PythonLicense:GPL-2.0Stargazers:9200Issues:259Issues:187

pysheeet

Python Cheat Sheet

Language:PythonLicense:MITStargazers:7964Issues:209Issues:38

pgloader

Migrate to PostgreSQL in a single command!

Language:Common LispLicense:NOASSERTIONStargazers:5256Issues:78Issues:1401

amundsen

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Language:PythonLicense:Apache-2.0Stargazers:4364Issues:234Issues:683

ripme

Downloads albums in bulk

Language:JavaLicense:MITStargazers:3683Issues:149Issues:1279

data-science-on-aws

AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3331Issues:119Issues:222

teslausb

A smart USB drive for Tesla Dashcam - extended storage, auto archive, web viewer

Language:ShellLicense:MITStargazers:1857Issues:98Issues:640

pyrh

Python Framework to make trades with the unofficial Robinhood API

Language:PythonLicense:MITStargazers:1779Issues:140Issues:191

robin_stocks

This is a library to use with Robinhood Financial App. It currently supports trading crypto-currencies, options, and stocks. In addition, it can be used to get real time ticker information, assess the performance of your portfolio, and can also get tax documents, total dividends paid, and more. More info at

Language:PythonLicense:MITStargazers:1676Issues:86Issues:371

mysql_perf_analyzer

MySQL performance monitoring and analysis.

Language:JavaLicense:Apache-2.0Stargazers:1440Issues:151Issues:22

UnusualVolumeDetector

Gets the last 5 months of volume history for every ticker, and alerts you when a stock's volume exceeds 10 standard deviations from the mean within the last 3 days

Language:HTMLLicense:MITStargazers:973Issues:68Issues:30

grouparoo

🦘 The Grouparoo Monorepo - open source customer data sync framework

Language:JavaScriptLicense:MITStargazers:741Issues:18Issues:252
Language:PythonLicense:BSD-3-ClauseStargazers:740Issues:24Issues:26

monosi

Open source data observability platform

Language:PythonLicense:Apache-2.0Stargazers:320Issues:6Issues:50

pvpoke

Open-Source Battle Simulator, Rankings & Team Building for Pokemon GO PvP

Language:JavaScriptLicense:MITStargazers:302Issues:22Issues:169

tickit-data-lake-demo

Resources for video demonstrations and blog posts related to DataOps on AWS

amazon-redshift-developer-guide

This is the documentation for the Amazon Redshift Developer Guide

analytics_pipeline

Code to build a simple analytics data pipeline with Python

airflow-data-quality-demo

A repository of sample code to show data quality checking best practices using Airflow.

classes

Sample code from PythonLab classes

Language:PythonLicense:NOASSERTIONStargazers:55Issues:30Issues:2

AsciiBird

ASCII version of the addictive Flappy Bird game.

Language:CLicense:MITStargazers:45Issues:6Issues:1

magda-config

A simple boilerplate that allows you to quickly set up a Magda instance

Language:HCLLicense:Apache-2.0Stargazers:17Issues:3Issues:10

aws-data-pipeline-developer-guide

The open source version of the AWS Data Pipeline documentation. To provide feedback & requests for changes, submit issues in this repository, or make proposed changes & submit a pull request.

CodeSamples

Sample Code

Language:ScalaLicense:Apache-2.0Stargazers:2Issues:8Issues:0

dc-food-trucks-api

fetching & feeding data to dcfoodtrucks.today

Language:PythonStargazers:1Issues:0Issues:0