Haohe (Leo) Liu / 刘濠赫 (haoheliu)

Company: University of Surrey, Centre for Vision, Speech and Signal Processing (CVSSP)

Location: Stag Hill, Guildford GU2 7XH, UK

Home Page: https://haoheliu.github.io/

Haohe (Leo) Liu / 刘濠赫's repositories

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language: Python, License: NOASSERTION, Stargazers: 2351, Issues: 42, Issues: 102

AudioLDM2

Text-to-Audio/Music Generation

Language: Python, License: NOASSERTION, Stargazers: 2168, Issues: 44, Issues: 66

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language: Python, License: MIT, Stargazers: 1012, Issues: 24, Issues: 52

voicefixer

General Speech Restoration

Language: Python, License: MIT, Stargazers: 966, Issues: 16, Issues: 58

audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Language: Python, License: MIT, Stargazers: 281, Issues: 5, Issues: 8

voicefixer_main

General Speech Restoration

Language: Python, License: MIT, Stargazers: 273, Issues: 11, Issues: 18

AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

Language: Python, License: MIT, Stargazers: 177, Issues: 15, Issues: 34

SemantiCodec-inference

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.

Language: Python, License: MIT, Stargazers: 88, Issues: 3, Issues: 1

courseProject_Compiler

Java implementation of the NWPU Compiler course project (Northwestern Polytechnical University, Principles of Compilers, pilot class)

Language: Java, Stargazers: 13, Issues: 2, Issues: 0

youtube-8m-videos-downloader

Download videos from the YouTube-8M dataset for testing

Language: Python, Stargazers: 6, Issues: 1, Issues: 0

kmeans_pytorch

K-means using PyTorch

Language: Jupyter Notebook, License: MIT, Stargazers: 4, Issues: 1, Issues: 0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language: Python, License: MIT, Stargazers: 2, Issues: 1, Issues: 0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python, License: MIT, Stargazers: 2, Issues: 1, Issues: 0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in PyTorch

Language: Python, License: MIT, Stargazers: 2, Issues: 1, Issues: 0

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language: Python, Stargazers: 1, Issues: 1, Issues: 0
Language: Jupyter Notebook, Stargazers: 1, Issues: 2, Issues: 0

haoheliu.github.io

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: SCSS, License: MIT, Stargazers: 1, Issues: 2, Issues: 0

nider

Python package to add text to images, textures and different backgrounds

Language: Python, License: MIT, Stargazers: 1, Issues: 1, Issues: 0

resemble-enhance

AI-powered speech denoising and enhancement

Language: Python, License: MIT, Stargazers: 1, Issues: 1, Issues: 0

video_features

Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.

Language: Python, License: GPL-3.0, Stargazers: 1, Issues: 1, Issues: 0

WavCaps

This repository contains metadata for the WavCaps dataset and code for downstream tasks.

Language: Python, Stargazers: 1, Issues: 1, Issues: 0
Stargazers: 0, Issues: 0, Issues: 0

decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 1, Issues: 0

ImageBind

ImageBind: One Embedding Space to Bind Them All

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 2, Issues: 0

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0
Stargazers: 0, Issues: 2, Issues: 0

torchmetrics

Torchmetrics - Machine learning metrics for distributed, scalable PyTorch applications.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0