There are 46 repositories under datasets topic.
A topic-centric list of HQ open datasets.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
pix2code: Generating Code from a Graphical User Interface Screenshot
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Open source annotation tool for machine learning practitioners.
An easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations.
An open source multi-tool for exploring and publishing data
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Dataset format for AI. Build, manage, query & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Resources for deep learning with satellite & aerial imagery
A curated list of awesome JSON datasets that don't require authentication.
Datasets, tools, and benchmarks for representation learning of code.
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
:pencil2: Web-based image segmentation tool for object detection, localization, and keypoints
Colour Science for Python
In-memory tabular data in Julia
C++ Implementation of PyTorch Tutorials for Everyone
Benchmark datasets, data loaders, and evaluators for graph machine learning
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Large datasets for conversational AI
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Open source audio annotation tool for humans™
An extension of Open3D to address 3D Machine Learning tasks
🪐 End-to-end NLP workflows from prototype to production
A large collection of system log datasets for AI-powered log analytics
Community list of transit APIs, apps, datasets, research, and software :bus::star2::train::star2::steam_locomotive:
Papers and Datasets about Point Cloud.
Machine learning datasets used in tutorials on MachineLearningMastery.com
TorchGeo: datasets, transforms, and models for geospatial data
This is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)