ding ding (dpimagine)

dpimagine

Geek Repo

Company:bhu

Location:China

Github PK Tool:Github PK Tool

ding ding's starred repositories

CMMLU

CMMLU: Measuring massive multitask language understanding in Chinese

Language:PythonStargazers:686Issues:0Issues:0

mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Language:PythonLicense:NOASSERTIONStargazers:434Issues:0Issues:0

Hallu-PI

The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs".

License:MITStargazers:8Issues:0Issues:0

Counting-Stars

Counting-Stars (★)

Language:Jupyter NotebookLicense:MITStargazers:74Issues:0Issues:0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1509Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36702Issues:0Issues:0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:2731Issues:0Issues:0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonLicense:MITStargazers:13763Issues:0Issues:0

Image-Aesthetics-and-Quality-Assessment

[ACMMM 2023, Official Code] for paper "EAT: An Enhancer for Aesthetics-Oriented Transformers". Official Weights and Demos provided. 目前是地表最强开源美学评估模型之一.

Language:PythonStargazers:107Issues:0Issues:0

ava_downloader

:arrow_double_down: Download AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)

Stargazers:390Issues:0Issues:0

Neural-IMage-Assessment

A PyTorch Implementation of Neural IMage Assessment

Language:PythonLicense:NOASSERTIONStargazers:527Issues:0Issues:0
Language:PythonStargazers:61Issues:0Issues:0

IAA_Tutorial

实验室【外部】美学课题组入门学习材料,加入课题组后,会有更详细的内部学习资料。

Stargazers:37Issues:0Issues:0

Image-Color-Aesthetics-and-Quality-Assessment

[ICCV 2023, Official Code] for paper "Thinking Image Color Aesthetics Assessment: Models, Datasets and Benchmarks". Official Weights and Demos provided. 首个面向图像色彩主观美学评估的数据集、算法和benchmark.

Language:PythonStargazers:144Issues:0Issues:0

TANet-image-aesthetics-and-quality-assessment

[IJCAI 2022, Official Code] for paper "Rethinking Image Aesthetics Assessment: Models, Datasets and Benchmarks". Official Weights and Demos provided. 首个面向多主题场景的美学评估数据集、算法和benchmark.

Language:PythonLicense:Apache-2.0Stargazers:273Issues:0Issues:0

GAOKAO-MM

[ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation

Language:PythonLicense:Apache-2.0Stargazers:34Issues:0Issues:0

MathEval

MathEval is a benchmark dedicated to the holistic evaluation on mathematical capacities of LLMs.

Language:PythonStargazers:59Issues:0Issues:0

Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:8968Issues:0Issues:0

MMBench

Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"

License:Apache-2.0Stargazers:154Issues:0Issues:0

mercury

Convert Jupyter Notebooks to Web Apps

Language:PythonLicense:AGPL-3.0Stargazers:4012Issues:0Issues:0

EffiBench

[NeurIPS 2024] EffiBench: Benchmarking the Efficiency of Automatically Generated Code

Language:PythonStargazers:54Issues:0Issues:0

ianvs

Distributed Synergy AI Benchmarking

Language:PythonLicense:Apache-2.0Stargazers:113Issues:0Issues:0

FlagGems

FlagGems is an operator library for large language models implemented in Triton Language.

Language:PythonLicense:Apache-2.0Stargazers:298Issues:0Issues:0

MMTrustEval

A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)

Language:PythonLicense:CC-BY-SA-4.0Stargazers:93Issues:0Issues:0

amber-data-prep

Data preparation code for Amber 7B LLM

Language:PythonLicense:Apache-2.0Stargazers:79Issues:0Issues:0

MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Language:PythonLicense:NOASSERTIONStargazers:957Issues:0Issues:0

unibench

Python Library to evaluate VLM models' robustness across diverse benchmarks

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:166Issues:0Issues:0

MiniGPT-4-ZH

MiniGPT-4 中文部署翻译 完善部署细节

Language:PythonLicense:BSD-3-ClauseStargazers:858Issues:0Issues:0

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:4944Issues:0Issues:0

bookget

bookget 数字古籍图书下载工具

Language:GoLicense:GPL-3.0Stargazers:1309Issues:0Issues:0