There are 923 repositories under the data-analysis topic.
scikit-learn: machine learning in Python
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
10 Weeks, 20 Lessons, Data Science for All!
Roadmap to becoming an Artificial Intelligence Expert in 2022
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open source financial data interface library.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
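Artificial intelligence learning roadmap: a collection of nearly 200 hands-on case studies and projects, with accompanying teaching materials provided free of charge, from zero-background entry to job-ready practice! Covers popular areas including Python, mathematics, machine learning, data analysis, deep learning, computer vision, natural language processing, PyTorch tensorflow machine-learning, deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv, and more.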
Practice your pandas skills!
OpenRefine is a free, open source power tool for working with messy data and improving it
Statsmodels: statistical modeling and econometrics in Python
Open Machine Learning Course
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
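Data visualization: reports, big screens, data dashboards. 积木报表 (JimuReport) is a report tool and data visualization product with an Excel-like operating style and online drag-and-drop design. Features include report design, big-screen design, print design, graphical reports, and dashboard portal design, all completely free! Guided by a product philosophy of "simple, easy to use, professional", it greatly reduces the difficulty of report development, shortens development cycles, and solves all kinds of reporting problems.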
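🎯 Say goodbye to information overload: AI helps you make sense of trending news, with simple public-opinion monitoring and analysis - multi-platform trending-topic aggregation plus an MCP-based AI analysis tool. Monitors 35 platforms (Douyin, Zhihu, Bilibili, Wallstreetcn, Cailian Press, and others) with smart filtering, automatic push notifications, and AI conversational analysis (dig into the news in natural language with 13 tools such as trend tracking, sentiment analysis, and similarity search). Supports push via WeCom, Feishu, DingTalk, Telegram, email, and ntfy; 30-second web deployment, 1-minute mobile notifications, no programming required. Supports Docker deployment ⭐ Let the algorithm work for you and use AI to understand trending topics.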
Open Source Feature Flagging and A/B Testing Platform
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Data pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.