Haohe (Leo) Liu / 刘濠赫's repositories
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
voicefixer
General Speech Restoration
audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
voicefixer_main
General Speech Restoration
AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
SemantiCodec-inference
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.
courseProject_Compiler
java implementation of NWPU Compiler course project-西工大编译原理-试点班
youtube-8m-videos-downloader
Download videos from YouTube-8M dataset for testing
kmeans_pytorch
kmeans using PyTorch
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
haoheliu.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
resemble-enhance
AI powered speech denoising and enhancement
video_features
Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.
decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
torchmetrics
Torchmetrics - Machine learning metrics for distributed, scalable PyTorch applications.