There are 33 repositories under data-augmentation topic.
A system for quickly generating training data with weak supervision
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
Data augmentation for NLP, presented at EMNLP 2019
自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"
Data Augmentation For Object Detection
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Collection of papers and resources for data augmentation for NLP.
Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Light-weight Single Person Pose Estimator
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Copy-paste augmentation for segmentation and detection tasks
Deep Convolutional Neural Networks for Musical Source Separation
Implementation of the mixup training method
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
Data augmentation tool for images
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
DrQ: Data regularized Q
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
Python package to corrupt arbitrary images.
Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks