There are 46 repositories under datasets topic.
A topic-centric list of HQ open datasets.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
pix2code: Generating Code from a Graphical User Interface Screenshot
Label Studio is a multi-type data labeling and annotation tool with standardized output format
An easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Dataset format for AI. Build, manage, query & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Resources for deep learning with satellite & aerial imagery
A curated list of awesome JSON datasets that don't require authentication.
搜索所有中文NLP数据集,附常用英文NLP数据集
Datasets, tools, and benchmarks for representation learning of code.
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
:pencil2: Web-based image segmentation tool for object detection, localization, and keypoints
Colour Science for Python
In-memory tabular data in Julia
C++ Implementation of PyTorch Tutorials for Everyone
Benchmark datasets, data loaders, and evaluators for graph machine learning
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Large datasets for conversational AI
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Open source audio annotation tool for humans™
Community list of transit APIs, apps, datasets, research, and software :bus::star2::train::star2::steam_locomotive:
Papers and Datasets about Point Cloud.
This is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)