There are 762 repositories under data-analysis topic.
scikit-learn: machine learning in Python
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Roadmap to becoming an Artificial Intelligence Expert in 2022
The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
10 Weeks, 20 Lessons, Data Science for All!
GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
OpenRefine is a free, open source power tool for working with messy data and improving it
Practice your pandas skills!
Statsmodels: statistical modeling and econometrics in Python
Open Machine Learning Course
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
「开源可视化报表,商业BI替代方案」积木报表是一款类似excel操作风格,在线拖拽完成设计的报表工具。低代码产品的臂膀!功能涵盖: 报表设计、图形报表、打印设计、大屏设计等,完全免费!秉承“简单、易用、专业”的产品理念,极大的降低报表开发难度、缩短开发周期、解决各类报表难题。
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Open Source Feature Flagging and A/B Testing Platform
The open source high performance ELT framework powered by Apache Arrow
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
A code-first agent framework for seamlessly planning and executing data analytics tasks.