Beast code in Giters

felixfuu's starred repositories

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Python | MIT | 36680 433 285

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook | Apache-2.0 | 18269 156 467

LLMsPracticalGuide

A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)

8774 181 17

imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch

Language: Python | MIT | 7830 113 299

FasterTransformer

Transformer-related optimization, including BERT, GPT

Language: C++ | Apache-2.0 | 5532 63 623

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language: Python | MIT | 3473 100 159

T2I-Adapter

Language: Python | Apache-2.0 | 3216 41 105

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | BSD-3-Clause | 2930 60 86

LLMs-In-China

** large models

Apache-2.0 | 2524 64 18

co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Language: Jupyter Notebook | NOASSERTION | 2455 26 66

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language: Python | AGPL-3.0 | 2306 390

consistencydecoder

Consistency Distilled Diff VAE

Language: Python | MIT | 2081 22 19

parti

Apache-2.0 | 1524 56 9

awesome-diffusion-categorized

Collection of diffusion model papers categorized by their subareas

835 47 7

MiniGPT-5

Official implementation of the paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Language: Python | Apache-2.0 | 823 12 37

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

736 50 9

Entity

EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation

Language: Jupyter Notebook | NOASSERTION | 669 23 43

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language: Python | Apache-2.0 | 640 10 24

groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Language: Python | 595 30 50