There are 1,877 repositories under data topic.
The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data ๐ฅ
๐ค Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data :bar_chart:
This is a repo with links to everything you'd ever want to learn about data engineering
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
A curated list of awesome big data frameworks, ressources and other awesomeness.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! ๅผๆบ่ดข็ปๆฐๆฎๆฅๅฃๅบ
:orange_book: ไธญๅๆฐๅๅญๅ ธๆฐๆฎๅบใๅ ๆฌๆญๅ่ฏญ๏ผๆ่ฏญ๏ผ่ฏ่ฏญ๏ผๆฑๅญใ
PRQL is a modern language for transforming data โ a simple, powerful, pipelined SQL replacement
:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.
A web interface to create custom vector-based visualizations on top of RAWGraphs core
A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
Interactive Tables and Data Grids for JavaScript
๐ท๏ธ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.
The open source ELT framework powered by Apache Arrow
Countly is a product analytics platform that helps teams track, analyze and act-on their user actions and behaviour on mobile, web and desktop applications.
The home of the CUE language! Validate and define text-based and dynamic configuration
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Browser compatibility data for Web technologies as displayed on MDN
Superduper: End-to-end framework for building custom AI applications and agents.
Data processing for and with foundation models! ๐ ๐ ๐ฝ โก๏ธ โก๏ธ๐ธ ๐น ๐ท