Yiyun Chen's starred repositories
LivePortrait
Bring portraits to life!
video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
awesome-diffusion-v2v
Awesome diffusion Video-to-Video (V2V). A collection of papers on diffusion model-based video editing, a.k.a. video-to-video (V2V) translation, plus video editing benchmark code.
w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
speechbrain-docs-zh-cn
SpeechBrain Chinese documentation
speechbrain
A PyTorch-based Speech Toolkit
Auto_Cut_Audio
We often have a lot of WAV audio to cut, and we want to cut it without cutting through a word or a complete sentence.
Add_noise_and_rir_to_speech
This code base adds noise from the MUSAN dataset to a clean speech signal at a specified signal-to-noise ratio, and generates far-field speech data using room impulse responses from the BUT Speech@FIT Reverb Database.
speech-vad-demo
Integrates WebRTC's VAD to split audio files
IncrementalVHD_GPE
Official code for the paper "Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark"
chorus-detection
A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that takes a YouTube link and uses a pre-trained CRNN model to detect the chorus sections of the song.
HTS-Audio-Transformer
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
deep-audio-fingerprinting
A repository for my MSc thesis in Data Science & Machine Learning @ NTUA. A deep learning approach to audio fingerprinting for recognizing songs in real time through the microphone.
Video-Frame-Interpolation-Rankings-and-Video-Deblurring-Rankings
Rankings include: ABME AdaFNIO ALANET AMT BiT BVFI CDFI CtxSyn DBVI DeMFI DQBC DRVI EAFI EBME EDC EDENVFI EDSC EMA-VFI FGDCN FILM FLAVR H-VFI IFRNet IQ-VFI JNMR LADDER M2M MA-GCSPA NCM PerVFI PRF ProBoost-Net RIFE RN-VFI SoftSplat SSR ST-MFNet Swin-VFI TDPNet TTVFI UGFI UPR-Net UTI-VFI VFIformer VFIFT VFIMamba VFIT VIDUE VRT
ComfyUI_omost
ComfyUI implementation of Omost