Yuxuan Qiao's starred repositories

MG-LLaVA

Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).

Language:PythonLicense:Apache-2.0Stargazers:124Issues:0Issues:0

Prism

A Framework for Decoupling and Assessing the Capabilities of VLMs

Language:PythonLicense:Apache-2.0Stargazers:32Issues:0Issues:0

MathBench

[ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset

License:Apache-2.0Stargazers:67Issues:0Issues:0

Ada-LEval

The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"

Language:PythonStargazers:45Issues:0Issues:0
Stargazers:612Issues:0Issues:0

Agent-FLAN

[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

License:Apache-2.0Stargazers:302Issues:0Issues:0

VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks

Language:PythonLicense:Apache-2.0Stargazers:796Issues:0Issues:0

MixtralKit

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

Language:PythonLicense:Apache-2.0Stargazers:764Issues:0Issues:0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3438Issues:0Issues:0