nth2000's starred repositories
AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents
ChartMimic
ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation
TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
Awesome-Chart-Understanding
A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.
T2I-CompBench
[NeurIPS 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
Attend-and-Excite
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
text2image-benchmark
Benchmark for generative image models
Chinese-Mixtral-8x7B
Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)
label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
blended-diffusion
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
Awesome-LLM-hallucination
LLM hallucination paper list
magvit2-pytorch
Implementation of MagViT2 Tokenizer in PyTorch
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models