tdye24

Company: ECNU

Location: Shanghai

Home Page: http://tdye24.github.io

tdye24's starred repositories

llm-inference-benchmark

LLM Inference benchmark

Language: Python | License: MIT | Stargazers: 318 | Issues: 0

LLMs_interview_notes

LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineers.

License: MIT | Stargazers: 222 | Issues: 0

Personalized_PCA

An implementation of personalized PCA

Language: Python | Stargazers: 2 | Issues: 0

llama3-from-scratch

llama3 implementation, one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 12754 | Issues: 0

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 25723 | Issues: 0

Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

License: MIT | Stargazers: 986 | Issues: 0

dap-cl

Official code of "Generating Instance-level Prompts for Rehearsal-free Continual Learning (ICCV 2023)"

Language: Python | License: NOASSERTION | Stargazers: 39 | Issues: 0

LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Language: Python | License: Apache-2.0 | Stargazers: 780 | Issues: 0

llm-action

This project aims to share the technical principles of large language models together with hands-on experience.

Language: HTML | License: Apache-2.0 | Stargazers: 8710 | Issues: 0

Monkey

[CVPR 2024 Highlight] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language: Python | License: MIT | Stargazers: 1668 | Issues: 0

idl_data

OCR Annotations from Amazon Textract for Industry Documents Library

Language: Python | Stargazers: 93 | Issues: 0

MM-NIAH

This is the official implementation of the paper "Needle In A Multimodal Haystack"

Language: Python | Stargazers: 72 | Issues: 0

GAOKAO-Bench

GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.

Language: Python | License: Apache-2.0 | Stargazers: 505 | Issues: 0

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language: Python | License: Apache-2.0 | Stargazers: 1816 | Issues: 0

CHIP2022_MedTable-MedInvoice_CogVLM

Solution for the CHIP2022 medical list and invoice OCR element extraction task from the Alibaba Tianchi algorithm competition

Language: Python | Stargazers: 3 | Issues: 0

DuReader

Baseline Systems of DuReader Dataset

Language: Python | Stargazers: 1121 | Issues: 0

ChiQA

The implementations of various baselines in our CIKM 2022 paper: ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding.

Language: Python | Stargazers: 30 | Issues: 0

Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

License: Apache-2.0 | Stargazers: 219 | Issues: 0

TaiSu

TaiSu (太素) -- a large-scale Chinese multimodal dataset (a 100-million-scale Chinese vision-language pre-training dataset)

Language: Python | License: NOASSERTION | Stargazers: 171 | Issues: 0

CLUEDatasetSearch

Search across all Chinese NLP datasets, with commonly used English NLP datasets also included

Language: Python | Stargazers: 4064 | Issues: 0

screen_qa

The ScreenQA dataset was introduced in the paper "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots". It contains ~86K question-answer pairs collected by human annotators for ~35K screenshots from Rico, and is intended for training and evaluating models capable of screen content understanding via question answering.

License: CC-BY-4.0 | Stargazers: 82 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 34468 | Issues: 0

Awesome-Multimodal-Large-Language-Models

Latest Advances on Multimodal Large Language Models

Stargazers: 11325 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Language: Python | License: MIT | Stargazers: 5204 | Issues: 0

FedPR

[CVPR 2023] Learning Federated Visual Prompt in Null Space for MRI Reconstruction

Language: Python | Stargazers: 42 | Issues: 0

VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 30+ benchmarks

Language: Python | License: Apache-2.0 | Stargazers: 904 | Issues: 0

invoice

Collaboration with wangxupeng (https://github.com/wangxupeng)

Language: C | License: MIT | Stargazers: 1787 | Issues: 0