haoheliu

followers

following

stars

UoSurrey, Centre for Vision, Speech and Signal Processing (CVSSP)

Guildford GU2 7XH Stag Hill, UK

https://haoheliu.github.io/

Haohe (Leo) Liu / 刘濠赫's repositories

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonNOASSERTION2351 42 102

AudioLDM2

Text-to-Audio/Music Generation

Language:PythonNOASSERTION2168 44 66

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language:PythonMIT1012 24 52

voicefixer

General Speech Restoration

Language:PythonMIT966 16 58

audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Language:PythonMIT281 5 8

voicefixer_main

General Speech Restoration

Language:PythonMIT273 11 18

AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

Language:PythonMIT177 15 34

SemantiCodec-inference

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Language:PythonMIT88 3 1

SemantiCodec

Language:HTML37 6 1

courseProject_Compiler

java implementation of NWPU Compiler course project-西工大编译原理-试点班

Language:Java13 20

youtube-8m-videos-downloader

Download videos from YouTube-8M dataset for testing

Language:Python6 10

kmeans_pytorch

kmeans using PyTorch

Language:Jupyter NotebookMIT4 10

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonMIT2 10

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT2 10

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language:PythonMIT2 10

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:Python1 10

colab_collection

Language:Jupyter Notebook1 20

haoheliu.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:SCSSMIT1 20

mushra_test_2024_April

1 20

nider

Python package to add text to images, textures and different backgrounds

Language:PythonMIT1 10

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonMIT1 10

video_features

Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.

Language:PythonGPL-3.01 10

WavCaps

This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.

Language:Python1 10

CV-VAE

000

decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

Apache-2.0000

Guided-GAN-Visualization

Language:Python010

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION020

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

NOASSERTION000

lfs_test

020

torchmetrics

Torchmetrics - Machine learning metrics for distributed, scalable PyTorch applications.

Language:PythonApache-2.0010