Eddie Lin's starred repositories

awesome-scalability

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

License:MITStargazers:55581Issues:0Issues:0

airmail

Lightweight geocoder in pure Rust

Language:RustLicense:Apache-2.0Stargazers:281Issues:0Issues:0
Language:RustLicense:NOASSERTIONStargazers:5271Issues:0Issues:0

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Language:RustLicense:NOASSERTIONStargazers:26947Issues:0Issues:0

r-polars

Bring polars to R

Language:RLicense:NOASSERTIONStargazers:406Issues:0Issues:0

connector-x

Fastest library to load data from DB to DataFrames in Rust and Python

Language:RustLicense:MITStargazers:1827Issues:0Issues:0

open-midinous

A generative meta-logic-based music software

Language:RubyLicense:GPL-3.0Stargazers:61Issues:0Issues:0

moya-techblog

Blog software for code and Photography

Language:CSSLicense:MITStargazers:31Issues:0Issues:0

namematch

Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets

Language:PythonLicense:AGPL-3.0Stargazers:104Issues:0Issues:0

From-0-to-Research-Scientist-resources-guide

Detailed and tailored guide for undergraduate students or anybody want to dig deep into the field of AI with solid foundation.

Stargazers:7395Issues:0Issues:0

coding-interview-university

A complete computer science study plan to become a software engineer.

License:CC-BY-SA-4.0Stargazers:294980Issues:0Issues:0

glances

Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.

Language:PythonLicense:NOASSERTIONStargazers:25347Issues:0Issues:0

ward

Ward is a modern test framework for Python with a focus on productivity and readability.

Language:PythonLicense:MITStargazers:1192Issues:0Issues:0

vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Language:PythonLicense:MITStargazers:8194Issues:0Issues:0

duckdb

DuckDB is an analytical in-process SQL database management system

Language:C++License:MITStargazers:18215Issues:0Issues:0

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Language:PythonLicense:NOASSERTIONStargazers:259354Issues:0Issues:0

whylogs

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2568Issues:0Issues:0

pipelinex

PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more

Language:PythonLicense:NOASSERTIONStargazers:221Issues:0Issues:0

petl

Python Extract Transform and Load Tables of Data

Language:PythonLicense:MITStargazers:1213Issues:0Issues:0

postgis_geocoder

A plug and play geocoder.

Language:ShellLicense:GPL-2.0Stargazers:2Issues:0Issues:0

Minecraft

Simple Minecraft-inspired program using Python and Pyglet

Language:PythonLicense:MITStargazers:5175Issues:0Issues:0

nypd-complaints-data

airflow-dbt-docker workflow

Language:PythonStargazers:1Issues:0Issues:0

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

License:MITStargazers:26118Issues:0Issues:0

flask-react-soft-dashboard

Flask React - Soft Dashboard (Open-source) | AppSeed.us

Language:PythonLicense:NOASSERTIONStargazers:23Issues:0Issues:0

mltrace

Coarse-grained lineage and tracing for machine learning pipelines.

Language:PythonLicense:Apache-2.0Stargazers:465Issues:0Issues:0

ibis

the portable Python dataframe library

Language:PythonLicense:Apache-2.0Stargazers:4368Issues:0Issues:0

OpenMetadata

OpenMetadata is a unified platform for discovery, observability, and governance powered by a central metadata repository, in-depth lineage, and seamless team collaboration.

Language:TypeScriptLicense:Apache-2.0Stargazers:4353Issues:0Issues:0

radian

A 21 century R console

Language:PythonLicense:MITStargazers:1947Issues:0Issues:0

perspective

A data visualization and analytics component, especially well-suited for large and/or streaming datasets.

Language:C++License:Apache-2.0Stargazers:7649Issues:0Issues:0

ml-system-design-pattern

System design patterns for machine learning

License:MITStargazers:2178Issues:0Issues:0