Beast code in Giters

Pocky's starred repositories

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

Language:Python283900

dreamoving-project

Official implementation of DreaMoving

Apache-2.0178800

MAG-Edit

MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance

Language:Python8100

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:Python276100

bioclip

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Language:PythonNOASSERTION11700

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Language:Jupyter NotebookApache-2.0869500

w-plus-adapter

[CVPR 2024] When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation

Language:PythonNOASSERTION9800

papermage

library supporting NLP and CV research on scientific papers

Language:PythonApache-2.064100

facefusion

Next generation face swapper and enhancer

Language:PythonNOASSERTION1653000

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:PythonMIT5471500

SimSwap

An arbitrary face-swapping framework on images and videos with one single trained model!

Language:PythonNOASSERTION431900

ziplora-pytorch

Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"

Language:PythonMIT47800

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonApache-2.0268400

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonMIT334500

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonNOASSERTION261500

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonMIT420500

super-gradients

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

Language:Jupyter NotebookApache-2.0443400

clash-verge

A Clash GUI based on tauri. Supports Windows, macOS and Linux.

Language:TypeScript2073600

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonNOASSERTION431300

latent-consistency-model-colab

Language:Jupyter Notebook18400

FaRL

FaRL for Facial Representation Learning [Official, CVPR 2022]

Language:PythonMIT35100

PHC

Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars

Language:PythonNOASSERTION36900

HyperHuman

[ICLR 2024] Github Repo for "HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion"

Language:HTML48100

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion

Language:PythonMIT20500

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookApache-2.0271900

sd-webui-cleaner

An extension for stable-diffusion-webui to remove any object.

Language:JavaScriptMIT22800

icech