kkakkkka

⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet). If you find it useful, please star this project, thanks~

Language: Vue | License: MIT | 5427 22 72

SimSwap

An arbitrary face-swapping framework on images and videos with one single trained model!

Language: Python | License: NOASSERTION | 4340 84 437

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language: Jupyter Notebook | License: MIT | 3182 39 107

sketch

AI code-writing assistant that understands data content

Language: Python | License: MIT | 2210 18 25

Awesome-Anything

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

1694 53 1

Multimodal-GPT

Language: Python | License: Apache-2.0 | 1443 12 17

UnboundedNeRFPytorch

State-of-the-art, simple, fast unbounded / large-scale NeRFs.

Language: Python | License: MIT | 1327 45 122

ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Language: Python | License: Apache-2.0 | 1184 16 174

raft

🚣 Raft implementation in Go

Language: Go | License: Unlicense | 986 15 10

FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Language: Python | License: MIT | 763 15 101

lora-svc

Singing voice conversion based on Whisper, with LoRA for singing voice cloning

Language: Python | License: MIT | 608 24 69

CelebV-Text

(CVPR 2023) CelebV-Text: A Large-Scale Facial Text-Video Dataset

Language: Python | 378 13 25

Awesome-Parameter-Efficient-Transfer-Learning

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

License: MIT | 373 21 4

EmoTalk_release

This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"

Language: Python | License: NOASSERTION | 308 11 28

gRefCOCO

A benchmark dataset for GRES and GREC [CVPR2023 Highlight]

Language: Python | 171 4 6

DiffuseStyleGesture

DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility Award)

Language: Python | License: MIT | 141 7 39

Diffusion-Video-Autoencoders

An official implementation of "Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding" (CVPR 2023) in PyTorch.

Language: Python | License: MIT | 135 5 4

SelfTalk_release

This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces"

Language: MATLAB | License: NOASSERTION | 119 8 6

PTUnifier

[ICCV-2023] Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts

Language: Python | 54 3 7

SK-VG

[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.

24 2 2

awesome-in-context-learning

A curated list of in-context learning papers, both classic and up-to-date 📜

600

CleanVsCode

A script that cleans Visual Studio Code's cache, which can grow beyond 5 GB after one week of use.

Language: Batchfile | License: GPL-3.0 | 500