nth2000

nth2000

Geek Repo

Github PK Tool:Github PK Tool

nth2000's starred repositories

AgentBoard

An Analytical Evaluation Board of Multi-turn LLM Agents

Language:SASStargazers:210Issues:0Issues:0

ChartMimic

ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation

Language:PythonStargazers:66Issues:0Issues:0

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Language:PythonLicense:NOASSERTIONStargazers:1297Issues:0Issues:0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:4017Issues:0Issues:0

TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Language:PythonLicense:Apache-2.0Stargazers:486Issues:0Issues:0

Awesome-Chart-Understanding

A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.

Stargazers:106Issues:0Issues:0

OneBit

The homepage of OneBit model quantization framework.

Language:PythonLicense:MITStargazers:109Issues:0Issues:0

GraphGPT

[SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:479Issues:0Issues:0

T2I-CompBench

[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Language:PythonLicense:MITStargazers:159Issues:0Issues:0

opencv

Open Source Computer Vision Library

Language:C++License:Apache-2.0Stargazers:76883Issues:0Issues:0
Language:Jupyter NotebookStargazers:66Issues:0Issues:0

Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Language:Jupyter NotebookLicense:MITStargazers:657Issues:0Issues:0

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8364Issues:0Issues:0

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Language:PythonLicense:Apache-2.0Stargazers:832Issues:0Issues:0

HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Language:PythonLicense:BSD-3-ClauseStargazers:203Issues:0Issues:0

huozi

活字通用大模型

Language:PythonLicense:Apache-2.0Stargazers:309Issues:0Issues:0

text2image-benchmark

Benchmark for generative image models

Language:Jupyter NotebookLicense:MITStargazers:32Issues:0Issues:0

Chinese-Mixtral-8x7B

中文Mixtral-8x7B(Chinese-Mixtral-8x7B)

Language:PythonLicense:Apache-2.0Stargazers:630Issues:0Issues:0

pi-Tuning

Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.

Language:PythonLicense:NOASSERTIONStargazers:32Issues:0Issues:0

LLaMA-Pro

[ACL 2024] Progressive LLaMA with Block Expansion.

Language:PythonLicense:Apache-2.0Stargazers:442Issues:0Issues:0

magicoder

Magicoder: Source Code Is All You Need

Language:PythonLicense:MITStargazers:1932Issues:0Issues:0

label-words-are-anchors

Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Language:PythonLicense:MITStargazers:128Issues:0Issues:0

blended-diffusion

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

Language:Jupyter NotebookLicense:MITStargazers:541Issues:0Issues:0

Awesome-LLM-hallucination

LLM hallucination paper list

License:MITStargazers:237Issues:0Issues:0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonLicense:MITStargazers:484Issues:0Issues:0

GVT

Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".

Language:PythonLicense:Apache-2.0Stargazers:54Issues:0Issues:0

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonLicense:MITStargazers:37529Issues:0Issues:0

LaVIT

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:438Issues:0Issues:0

SEED

Official implementation of SEED-LLaMA (ICLR 2024).

Language:PythonLicense:NOASSERTIONStargazers:518Issues:0Issues:0