zhhao1

zhhao1's repositories

Language: Roff | Stargazers: 1 | Issues: 0 | Issues: 0

actnn

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

AugLy

A data augmentations library for audio, image, text, and video.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

BELLE

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

Language: HTML | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

chinese_speech_pretrain

Chinese speech pretrained models

Language: Shell | Stargazers: 0 | Issues: 0 | Issues: 0

covost

CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

DeepSpeech

DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language: C++ | License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

entmax

The entmax mapping and its loss, a family of sparse softmax alternatives.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

KnowledgeDistillation

Knowledge distillation for Chinese text classification with PyTorch. Teacher models: BERT and XLNet; student model: biLSTM.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

LASER

Language-Agnostic SEntence Representations

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

mdistiller

A Knowledge Distillation Toolbox

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

Open-Llama

The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ParaGen

ParaGen is a PyTorch deep learning framework for parallel sequence generation.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

pulse

PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models

Stargazers: 0 | Issues: 0 | Issues: 0

PyTorch-Lightning-GAN

Implementations of various GAN architectures using PyTorch Lightning

Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

sacrebleu

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SemanticMask

The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"

Stargazers: 0 | Issues: 0 | Issues: 0

Semi-supervised-learning

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SpecAugment

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Stargazers: 0 | Issues: 0 | Issues: 0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

torchaudio-augmentations

Audio Augmentations library for PyTorch

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

License: BSD-4-Clause | Stargazers: 0 | Issues: 0 | Issues: 0