Finn's repositories
MagicDrive
[ICLR24] Implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
DeepLearing-Interview-Awesome-2024
We'll cover some of the most common Deep Learning Interview Questions and answers and provide detailed answers to help you
shapenet-pointcloud-generator
This repository is for generating complete pointclouds, partial pointclouds, rendered depth maps and rendered rgb images from ShapeNet
leetcode-master
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
instruction-tuned-sd
Code for instruction-tuning Stable Diffusion.
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
llama
Inference code for LLaMA models
LyCORIS
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Paint-by-Example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
ControlNet-v1-1-nightly
Nightly release of ControlNet 1.1
About-Me
Config files for my GitHub profile.
ControlNet
Let us control diffusion models!
PowerBEV
POWERBEV, a novel and elegant vision-based end-to-end framework that only consists of 2D convolutional layers to perform perception and forecasting of multiple objects in BEVs.
InST
Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)
nuscenes-devkit
The devkit of the nuScenes dataset.
pix2video
Code for the paper "Pix2Video: Video Editing using Image Diffusion"
NuScenes-Download-CLI
Download various NuScenes Dataset directly from the terminal
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
FB-BEV
FB-BEV and FB-OCC are vision-centric autonomous driving perception algorithm based on forward-backward view transformation strtegies.
make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch