finninmunich

Finn's repositories

BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Language: Python · Apache-2.0 · 0 0 0

bevfusion

[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Language: Python · Apache-2.0 · 0 0 0

ChaGPT-API-Call

Calls the ChatGPT API from Python, with multi-turn dialogue support

Language: Python · 0 0 0

cross-image-attention

Official Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"

MIT · 0 0 0

DDP

Language: Python · 0 0 0

DDPM_inversion

Official PyTorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.

Language: Python · MIT · 0 0 0

DeepLearing-Interview-Awesome-2024

We'll cover some of the most common Deep Learning Interview Questions and answers and provide detailed answers to help you

0 0 0

differential-diffusion

Language: Python · 0 0 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Apache-2.0 · 0 0 0

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook · Apache-2.0 · 0 0 0

FCDiffusion

0 0 0

FreeStyle

FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models

0 0 0

FreeU

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

MIT · 0 0 0

FreeU_Diffusers

"FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers

Apache-2.0 · 0 0 0

img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Language: Python · MIT · 0 0 0

instruct-pix2pix

Language: Python · NOASSERTION · 0 0 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model approaching GPT-4o performance.

MIT · 0 0 0

kohya_ss

Apache-2.0 · 0 0 0

leetcode-master

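代码随想录 LeetCode study guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the hard parts, more than 50 mind maps, and versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer confusing! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀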

0 0 0

llama3

The official Meta Llama 3 GitHub site

NOASSERTION · 0 0 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python · Apache-2.0 · 0 0 0

MagicDrive

[ICLR24] Implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”

Language: Python · 0 0 0

mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

Language: Python · Apache-2.0 · 0 0 0

plug-and-play

Official PyTorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

0 0 0

prompt-to-prompt

Language: Jupyter Notebook · Apache-2.0 · 0 0 0

ResShift

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS 2023 Spotlight)

Language: Python · NOASSERTION · 0 0 0

shapenet-pointcloud-generator

This repository generates complete point clouds, partial point clouds, rendered depth maps, and rendered RGB images from ShapeNet

Language: Python · 0 0 0

StyleID

[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer

0 0 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language: Python · Apache-2.0 · 0 0 0

Vista

A Generalizable World Model for Autonomous Driving

Apache-2.0 · 0 0 0