There are 636 repositories under data-analysis topic.
scikit-learn: machine learning in Python
Apache Superset is a Data Visualization and Data Exploration Platform
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:
Roadmap to becoming an Artificial Intelligence Expert in 2022
Streamlit — A faster way to build and share data apps.
The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
10 Weeks, 20 Lessons, Data Science for All!
Create UIs for your machine learning model in Python in 3 minutes
GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
Create HTML profiling reports from pandas DataFrame objects
OpenRefine is a free, open source power tool for working with messy data and improving it
Practice your pandas skills!
Open Machine Learning Course
Statsmodels: statistical modeling and econometrics in Python
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational
Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
人工智能学习路线图，整理近200个实战案例与项目，免费提供配套教材，零基础入门，就业实战！包括：Python，数学，机器学习，数据分析，深度学习，计算机视觉，自然语言处理，PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Alluxio, data orchestration for analytics and machine learning in the cloud
Data-Centric Pipelines and Data Versioning
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
cuDF - GPU DataFrame Library
A curated list of awesome R packages, frameworks and software.
PyGWalker: Turn your pandas dataframe into a Tableau-style User Interface for visual analysis
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Web-based SQL editor. Legacy project in maintenance mode.
:zap: A distributed crawler for weibo, building with celery and requests.
Open Source Feature Flagging and A/B Testing Platform
The open source high performance data integration platform built for developers.
利用Python进行数据分析 第二版 (2017) 中文翻译笔记