felixfuu's starred repositories
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
alpaca-lora
Instruct-tune LLaMA on consumer hardware
LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
FasterTransformer
Transformer related optimization, including BERT, GPT
T2I-Adapter
T2I-Adapter
LLMs-In-China
**大模型
co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
consistencydecoder
Consistency Distilled Diff VAE
awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
InstructDiffusion
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
prompt-pretraining
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
HumanBench
This repo is official implementation of HumanBench (CVPR2023)