zhhao1's repositories
actnn
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
AugLy
A data augmentations library for audio, image, text, and video.
BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
chinese_speech_pretrain
chinese speech pretrained models
covost
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)
DeepSpeech
DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
entmax
The entmax mapping and its loss, a family of sparse softmax alternatives.
KnowledgeDistillation
Knowledge distillation in text classification with pytorch. 知识蒸馏,中文文本分类,教师模型BERT、XLNET,学生模型biLSTM。
LASER
Language-Agnostic SEntence Representations
LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
mdistiller
A Knowledge Distillation Toolbox
Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
ParaGen
ParaGen is a PyTorch deep learning framework for parallel sequence generation.
pulse
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models
PyTorch-Lightning-GAN
Implementations of various GAN architectures using PyTorch Lightning
sacrebleu
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
SemanticMask
The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"
Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
SpecAugment
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
torchaudio-augmentations
Audio Augmentations library for PyTorch
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)