Wei Xu (fseasy)

Company: surreal

Location: Shenzhen

Home Page: https://blog.fseasy.top

Wei Xu's starred repositories

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Jupyter Notebook | License: MIT | Stargazers: 11204 | Issues: 96 | Issues: 337

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 11040 | Issues: 163 | Issues: 224

InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Language: Python | License: Apache-2.0 | Stargazers: 10615 | Issues: 122 | Issues: 207

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language: Python | License: MIT | Stargazers: 7977 | Issues: 150 | Issues: 533

jukebox

Code for the paper "Jukebox: A Generative Model for Music"

Language: Python | License: NOASSERTION | Stargazers: 7721 | Issues: 302 | Issues: 260

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in PyTorch

Language: Python | License: MIT | Stargazers: 7648 | Issues: 32 | Issues: 284

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language: Python | License: Apache-2.0 | Stargazers: 7444 | Issues: 97 | Issues: 1502

point-e

Point cloud diffusion for 3D model synthesis

Language: Python | License: MIT | Stargazers: 6424 | Issues: 224 | Issues: 85

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Language: Python | License: NOASSERTION | Stargazers: 5210 | Issues: 74 | Issues: 194

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language: Python | License: NOASSERTION | Stargazers: 4780 | Issues: 54 | Issues: 131

audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

Language: Python | License: NOASSERTION | Stargazers: 2634 | Issues: 30 | Issues: 52

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Language: Python | License: AGPL-3.0 | Stargazers: 2451 | Issues: 73 | Issues: 209

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language: Python | License: MIT | Stargazers: 1266 | Issues: 36 | Issues: 703

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language: Python | License: NOASSERTION | Stargazers: 1126 | Issues: 63 | Issues: 195

deepl-python

Official Python library for the DeepL language translation API.

Language: Python | License: MIT | Stargazers: 1076 | Issues: 21 | Issues: 100

Language: Jupyter Notebook | License: MIT | Stargazers: 946 | Issues: 23 | Issues: 38

PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos.

Language: Python | License: Apache-2.0 | Stargazers: 833 | Issues: 19 | Issues: 40

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Language: Python | License: MIT | Stargazers: 327 | Issues: 20 | Issues: 29

espnet_model_zoo

ESPnet Model Zoo

Language: Python | License: Apache-2.0 | Stargazers: 243 | Issues: 13 | Issues: 29

URLExtract

URLExtract is a Python class for collecting (extracting) URLs from a given text by locating TLDs.

Language: Python | License: MIT | Stargazers: 239 | Issues: 9 | Issues: 94

LIQE

[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective

Language: Python | License: MIT | Stargazers: 164 | Issues: 1 | Issues: 20

uroman

Universal Romanizer that can convert any Unicode script to Roman (Latin) script

Language: Perl | License: NOASSERTION | Stargazers: 132 | Issues: 12 | Issues: 12

a3t

Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing

Language: Python | License: Apache-2.0 | Stargazers: 83 | Issues: 4 | Issues: 8

NeMo-speech-data-processor

A toolkit for processing speech data and creating speech datasets

Language: Python | License: Apache-2.0 | Stargazers: 67 | Issues: 7 | Issues: 1

3aransia

Transliteration for languages and dialects

Language: Python | License: Apache-2.0 | Stargazers: 40 | Issues: 6 | Issues: 42

CQT_pytorch

PyTorch implementation of the invertible CQT based on non-stationary Gabor filters

Language: Jupyter Notebook | Stargazers: 27 | Issues: 3 | Issues: 6

TimeStretching

PyTorch implementation of Time Stretching in Music using an Autoencoder Network

Language: Jupyter Notebook | Stargazers: 17 | Issues: 0 | Issues: 1