There are 1,789 repositories under data topic.
🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.
📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs
The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:
A curated list of awesome big data frameworks, ressources and other awesomeness.
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
A web interface to create custom vector-based visualizations on top of RAWGraphs core
:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP
A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.
Interactive Tables and Data Grids for JavaScript
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Countly helps you get insights from your application. Available self-hosted or on private cloud.
✨⚡️ A beautiful feature-rich GraphQL Client for all platforms.
This repository contains compatibility data for Web technologies as displayed on MDN
The open source high performance data integration platform built for developers.
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
Mimesis is a robust fake data generator for Python, which provides data for a variety of purposes in a variety of languages.
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
The home of the CUE language! Validate and define text-based and dynamic configuration
A simple, declarative, and composable way to fetch data for React components
The Data Transfer Project makes it easy for people to transfer their data between online service providers. We are establishing a common framework, including data models and protocols, to enable direct transfer of data both into and out of participating online service providers.
Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines.
Kubernetes-native workflow automation platform for complex, mission-critical data and ML processes at scale. It has been battle-tested at Lyft, Spotify, Freenome, and others and is truly open-source.
Smarter YAML front matter parser, used by metalsmith, Gatsby, Netlify, Assemble, mapbox-gl, phenomic, vuejs vitepress, TinaCMS, Shopify Polaris, Ant Design, Astro, hashicorp, garden, slidev, saber, sourcegraph, and many others. Simple to use, and battle tested. Parses YAML by default but can also parse JSON Front Matter, Coffee Front Matter, TOML Front Matter, and has support for custom parsers. Please follow gray-matter's author: https://github.com/jonschlinkert