There are 905 repositories under the data-analysis topic.
scikit-learn: machine learning in Python
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
10 Weeks, 20 Lessons, Data Science for All!
Roadmap to becoming an Artificial Intelligence Expert in 2022
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
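An artificial intelligence learning roadmap that collects nearly 200 hands-on cases and projects, with free accompanying teaching materials, taking readers from zero background to job-ready practice! Covers popular areas including Python, mathematics, machine learning, data analysis, deep learning, computer vision, natural language processing, PyTorch, tensorflow, machine-learning, deep-learning, data-analysis, data-mining, mathematics, data-science, artificial-intelligence, python, tensorflow, tensorflow2, caffe, keras, pytorch, algorithm, numpy, pandas, matplotlib, seaborn, nlp, and cv.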
Practice your pandas skills!
OpenRefine is a free, open source power tool for working with messy data and improving it
Statsmodels: statistical modeling and econometrics in Python
Open Machine Learning Course
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
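"Data visualization: reports, big screens, data dashboards." 积木报表 (JimuReport) is a reporting tool and data visualization product with an Excel-like operating style and online drag-and-drop design. Features cover report design, big-screen design, print design, graphical reports, dashboard portal design, and more, completely free! Following the product philosophy of "simple, easy to use, professional," it greatly reduces the difficulty of report development, shortens development cycles, and solves all kinds of reporting problems.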
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Open Source Feature Flagging and A/B Testing Platform
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
The open source ELT framework powered by Apache Arrow
A code-first agent framework for seamlessly planning and executing data analytics tasks.
A next-generation curated knowledge sharing platform for data scientists and other technical professions.