There are 4,262 repositories under the data-science topic.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Deep Learning for humans
scikit-learn: machine learning in Python
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
10 Weeks, 20 Lessons, Data Science for All!
Roadmap to becoming an Artificial Intelligence Expert in 2022
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
📝 An awesome Data Science repository to learn from and apply to real-world problems.
500 AI, machine learning, deep learning, computer vision, and NLP projects with code
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
matplotlib: plotting with Python
Best Practices on Recommendation Systems
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
VIP cheatsheets for Stanford's CS 229 Machine Learning
📺 Discover the latest machine learning / AI courses on YouTube.
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
Transform data, train models, and run SQL with marimo — feels like a next-gen reactive notebook, stored as Git-friendly reproducible Python. Deploy as scripts, pipelines, endpoints, and apps. All from an AI-native editor (or your own).
Code for Machine Learning for Algorithmic Trading, 2nd edition.