Finn's repositories
BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
bevfusion
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
ChaGPT-API-Call
Python calls ChatGPT API, multi-turn dialogue support
cross-image-attention
Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"
DDPM_inversion
Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.
DeepLearing-Interview-Awesome-2024
We'll cover some of the most common Deep Learning Interview Questions and answers and provide detailed answers to help you
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
FreeStyle
FreeStyle : Free Lunch for Text-guided Style Transfer using Diffusion Models
FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
FreeU_Diffusers
"FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers
img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
leetcode-master
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
llama3
The official Meta Llama 3 GitHub site
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
MagicDrive
[ICLR24] Implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
plug-and-play
Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
ResShift
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS 2023 Spotlight)
shapenet-pointcloud-generator
This repository is for generating complete pointclouds, partial pointclouds, rendered depth maps and rendered rgb images from ShapeNet
StyleID
[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Vista
A Generalizable World Model for Autonomous Driving