Pocky's starred repositories

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

Language:PythonStargazers:2839Issues:0Issues:0

dreamoving-project

Official implementation of DreaMoving

License:Apache-2.0Stargazers:1788Issues:0Issues:0

MAG-Edit

MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance

Language:PythonStargazers:81Issues:0Issues:0

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:PythonStargazers:2761Issues:0Issues:0
Language:PythonLicense:MITStargazers:2457Issues:0Issues:0

bioclip

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Language:PythonLicense:NOASSERTIONStargazers:117Issues:0Issues:0

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8695Issues:0Issues:0

w-plus-adapter

[CVPR 2024] When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation

Language:PythonLicense:NOASSERTIONStargazers:98Issues:0Issues:0

papermage

library supporting NLP and CV research on scientific papers

Language:PythonLicense:Apache-2.0Stargazers:641Issues:0Issues:0

facefusion

Next generation face swapper and enhancer

Language:PythonLicense:NOASSERTIONStargazers:16530Issues:0Issues:0

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:PythonLicense:MITStargazers:54715Issues:0Issues:0

SimSwap

An arbitrary face-swapping framework on images and videos with one single trained model!

Language:PythonLicense:NOASSERTIONStargazers:4319Issues:0Issues:0
Language:MATLABStargazers:642Issues:0Issues:0

ziplora-pytorch

Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"

Language:PythonLicense:MITStargazers:478Issues:0Issues:0

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2684Issues:0Issues:0

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonLicense:MITStargazers:3345Issues:0Issues:0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonLicense:NOASSERTIONStargazers:2615Issues:0Issues:0
License:NOASSERTIONStargazers:269Issues:0Issues:0

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4205Issues:0Issues:0

super-gradients

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4434Issues:0Issues:0

clash-verge

A Clash GUI based on tauri. Supports Windows, macOS and Linux.

Language:TypeScriptStargazers:20736Issues:0Issues:0

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:4313Issues:0Issues:0
Language:Jupyter NotebookStargazers:184Issues:0Issues:0

FaRL

FaRL for Facial Representation Learning [Official, CVPR 2022]

Language:PythonLicense:MITStargazers:351Issues:0Issues:0

PHC

Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars

Language:PythonLicense:NOASSERTIONStargazers:369Issues:0Issues:0
Language:PythonStargazers:436Issues:0Issues:0

HyperHuman

[ICLR 2024] Github Repo for "HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion"

Language:HTMLStargazers:481Issues:0Issues:0

AlignProp

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion

Language:PythonLicense:MITStargazers:205Issues:0Issues:0

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2719Issues:0Issues:0

sd-webui-cleaner

An extension for stable-diffusion-webui to remove any object.

Language:JavaScriptLicense:MITStargazers:228Issues:0Issues:0