AI-Dataset-and-Tools's repositories
prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
silo-Language-Models
Silo Language Models code repository
DWPose-human-whole-body-pose-estimation
Official implementation of the paper "Effective Whole-body Pose Estimation with Two-stages Distillation"
UnIVAL-Unified-Model-for-Image-Video-Audio-and-Language
Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.
GroundingDINO-PyTorch-models-for-Grounding-DINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
video2dataset
Easily create large video dataset from video urls
petals-100B-language-models
🌸 Run 100B+ language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
unilm-large-scale-pre-trained-models
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
multiannotator-benchmarks
Benchmarking algorithms for assessing quality of data labeled by multiple annotators
mintaka-question-answering-QA-dataset
Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)
BMW-Labeltool-Lite
This repository provides you with an easy-to-use labeling tool for State-of-the-art Deep Learning training purposes. It supports Auto-Labeling.
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Android-Image-Cropper
Image Cropping Library for Android, optimised for Camera / Gallery.
huggingface-datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Arch-Net
Arch-Net: Model Distillation for Architecture Agnostic Model Deployment
acav100m-Audio-Visual-Video-Representation-Learning
ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.
open-covid-19-data
Open source aggregation pipeline for public COVID-19 data, including hospitalization/ICU/ventilator numbers for many countries.
Chinese-Landscape-Painting-Dataset
Dataset used for WACV 2021 paper: "End-to-End Chinese Landscape Painting Creation Using Generative Adversarial Networks"
Open-korean-corpora
Open Korean NLP Dataset Curation for the Users All Around the Globe
Objectron
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
tfjs-models
Pretrained models for TensorFlow.js
cvat-Computer-Vision-Annotation-Tool
Powerful and efficient Computer Vision Annotation Tool (CVAT)
coco-annotator
:pencil2: Web-based image segmentation tool for object detection, localization and keypoints
labelImg
🖍️ LabelImg is a graphical image annotation tool and label object bounding boxes in images
models
A collection of pre-trained, state-of-the-art models in the ONNX format
DarkLabel
Video/Image Labeling and Annotation Tool
deeplabel
A cross-platform image annotation tool for machine learning