Zunnan Xu (kkakkkka)

kkakkkka

Geek Repo

Company:Tsinghua University

Home Page:kkakkkka.github.io

Github PK Tool:Github PK Tool

Zunnan Xu's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:136309Issues:1052Issues:7542

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:AGPL-3.0Stargazers:24831Issues:174Issues:130

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24174Issues:193Issues:3802

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15101Issues:104Issues:968

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:11256Issues:150Issues:807

External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Language:PythonLicense:MITStargazers:11145Issues:104Issues:79

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6818Issues:59Issues:137

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5928Issues:51Issues:139

ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Language:VueLicense:MITStargazers:5427Issues:22Issues:72

SimSwap

An arbitrary face-swapping framework on images and videos with one single trained model!

Language:PythonLicense:NOASSERTIONStargazers:4340Issues:84Issues:437

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language:Jupyter NotebookLicense:MITStargazers:3182Issues:39Issues:107

sketch

AI code-writing assistant that understands data content

Language:PythonLicense:MITStargazers:2210Issues:18Issues:25

Awesome-Anything

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

Multimodal-GPT

Multimodal-GPT

Language:PythonLicense:Apache-2.0Stargazers:1443Issues:12Issues:17

UnboundedNeRFPytorch

State-of-the-art, simple, fast unbounded / large-scale NeRFs.

Language:PythonLicense:MITStargazers:1327Issues:45Issues:122

ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Language:PythonLicense:Apache-2.0Stargazers:1184Issues:16Issues:174

raft

:rowboat: Raft implementation in Go

Language:GoLicense:UnlicenseStargazers:986Issues:15Issues:10

FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Language:PythonLicense:MITStargazers:763Issues:15Issues:101

lora-svc

singing voice change based on whisper, and lora for singing voice clone

Language:PythonLicense:MITStargazers:608Issues:24Issues:69

CelebV-Text

(CVPR 2023) CelebV-Text: A Large-Scale Facial Text-Video Dataset

Awesome-Parameter-Efficient-Transfer-Learning

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

EmoTalk_release

This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"

Language:PythonLicense:NOASSERTIONStargazers:308Issues:11Issues:28

gRefCOCO

A benchmark dataset for GRES and GREC [CVPR2023 Highlight]

DiffuseStyleGesture

DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility Award)

Language:PythonLicense:MITStargazers:141Issues:7Issues:39

Diffusion-Video-Autoencoders

An official implementation of "Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding" (CVPR 2023) in PyTorch.

Language:PythonLicense:MITStargazers:135Issues:5Issues:4

SelfTalk_release

This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces""

Language:MATLABLicense:NOASSERTIONStargazers:119Issues:8Issues:6

PTUnifier

[ICCV-2023] Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts

SK-VG

[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.

awesome-in-context-learning

A curated list of in-context-learning, including classic and up-to-date papers📜

Stargazers:6Issues:0Issues:0

CleanVsCode

Clean VisualStudio Code's Cache, which could be even more than 5G after 1 week's usage. | 清理VsCode缓存的脚本,一周甚至能清理5G

Language:BatchfileLicense:GPL-3.0Stargazers:5Issues:0Issues:0