AI-Dataset-and-Tools

AI-Dataset-and-Tools's repositories

prompt2model

prompt2model - Generate Deployable Models from Natural Language Instructions

Apache-2.0

silo-Language-Models

Silo Language Models code repository

MIT

DWPose-human-whole-body-pose-estimation

Official implementation of the paper "Effective Whole-body Pose Estimation with Two-stages Distillation"

Apache-2.0

UnIVAL-Unified-Model-for-Image-Video-Audio-and-Language

Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.

Apache-2.0

GroundingDINO-PyTorch-models-for-Grounding-DINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Apache-2.0

CPR-Coach-Cardiopulmonary-Resuscitation-in-emergency-treatment

CPR-Coach

MIT

video2dataset

Easily create large video datasets from video URLs

MIT

petals-100B-language-models

🌸 Run 100B+ language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

MIT

unilm-large-scale-pre-trained-models

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

MIT

multiannotator-benchmarks

Benchmarking algorithms for assessing quality of data labeled by multiple annotators

AGPL-3.0

reward_design_with_llms-Language-Models

mintaka-question-answering-QA-dataset

Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)

CC-BY-4.0

BMW-Labeltool-Lite

This repository provides you with an easy-to-use labeling tool for State-of-the-art Deep Learning training purposes. It supports Auto-Labeling.

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Apache-2.0

Android-Image-Cropper

Image Cropping Library for Android, optimised for Camera / Gallery.

Apache-2.0

Reinforcement-Learning-Datasets

Apache-2.0

huggingface-datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Apache-2.0

Arch-Net

Arch-Net: Model Distillation for Architecture Agnostic Model Deployment

acav100m-Audio-Visual-Video-Representation-Learning

ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.

MIT

open-covid-19-data

Open source aggregation pipeline for public COVID-19 data, including hospitalization/ICU/ventilator numbers for many countries.

Apache-2.0

Chinese-Landscape-Painting-Dataset

Dataset used for WACV 2021 paper: "End-to-End Chinese Landscape Painting Creation Using Generative Adversarial Networks"

Open-korean-corpora

Open Korean NLP Dataset Curation for the Users All Around the Globe

NOASSERTION

Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes.

NOASSERTION
