daoyuan98's starred repositories
stable-diffusion
A latent text-to-image diffusion model
sd-webui-roop
roop extension for StableDiffusion web-ui
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
LLMs_interview_notes
该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题
LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
diffusion-rig
Code Release for DiffusionRig (CVPR 2023)
LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
Efficient-LLM-Survey
The Efficiency Spectrum of LLM
MMVP-motion-matrix-based-video-prediction
This is the official repo of MMVP: motion-matrix-based video prediction (ICCV 2023)
Skeleton-in-Context
[CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning
Symbol-LLM
Code for NeurIPS2023 Paper "Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human Activity Reasoning"
CaesarNeRF
This repo is for CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering.