
zhhao1's repositories

fcgcl

Language: Roff, 100

actnn

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Language: Python, License: MIT, 000

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language: Python, License: BSD-2-Clause, 000

AugLy

A data augmentations library for audio, image, text, and video.

Language: Python, License: MIT, 000

BELLE

BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)

Language: HTML, License: Apache-2.0, 000

chinese_speech_pretrain

Chinese speech pretrained models

Language: Shell, 000

covost

CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)

Language: Python, License: NOASSERTION, 000

DeepSpeech

DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language: C++, License: MPL-2.0, 000

entmax

The entmax mapping and its loss, a family of sparse softmax alternatives.

License: MIT, 000

KnowledgeDistillation

Knowledge distillation in text classification with PyTorch. Knowledge distillation for Chinese text classification; teacher models: BERT and XLNet; student model: biLSTM.

Language: Python, 000

LASER

Language-Agnostic SEntence Representations

License: NOASSERTION, 000

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

License: Apache-2.0, 000

LST

Language: Python, 000

mdistiller

A Knowledge Distillation Toolbox

Language: Jupyter Notebook, 000

Open-Llama

The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.

Language: Python, License: MIT, 000

ParaGen

ParaGen is a PyTorch deep learning framework for parallel sequence generation.

Language: Python, License: NOASSERTION, 000

pulse

PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models

000

PyTorch-Lightning-GAN

Implementations of various GAN architectures using PyTorch Lightning

000

R-Drop

000

ReDense

000

sacrebleu

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

License: Apache-2.0, 000

SemanticMask

The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"

000

Semi-supervised-learning

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

License: MIT, 000

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

License: Apache-2.0, 000

SpecAugment

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

000

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

License: MIT, 000

svcca

License: Apache-2.0, 000

torchaudio-augmentations

Audio Augmentations library for PyTorch

License: MIT, 000

vst

Language: Python, 010

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

License: BSD-4-Clause, 000