nth2000's starred repositories

AgentBoard

An Analytical Evaluation Board of Multi-turn LLM Agents

Language: SAS | 21000

ChartMimic

ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation

Language: Python | 6600

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Language: Python | License: NOASSERTION | 129700

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal dialogue model that approaches GPT-4V performance.

Language: Python | License: MIT | 401700

TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Language: Python | License: Apache-2.0 | 48600

Awesome-Chart-Understanding

A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.

10600

OneBit

The homepage of OneBit model quantization framework.

Language: Python | License: MIT | 10900

GraphGPT

[SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"

Language: Python | License: Apache-2.0 | 47900

Awesome-TableReasoning-LLM-Survey

2200

T2I-CompBench

[NeurIPS 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Language: Python | License: MIT | 15900

opencv

Open Source Computer Vision Library

Language: C++ | License: Apache-2.0 | 7688300

Linguistic-Binding-in-Diffusion-Models

Language: Jupyter Notebook | 6600

Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Language: Jupyter Notebook | License: MIT | 65700

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook | License: Apache-2.0 | 836400

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Language: Python | License: Apache-2.0 | 83200

HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Language: Python | License: BSD-3-Clause | 20300