OpenDataLab (opendatalab)

OpenDataLab

opendatalab

Geek Repo

OpenDataLab provides access to numerous significant open-source datasets.

Location:China

Home Page:https://opendatalab.org.cn

Github PK Tool:Github PK Tool

OpenDataLab's repositories

WanJuan1.0

万卷1.0多模态语料

labelU

Data annotation toolbox supports image, audio and video data.

VIGC

AAAI 2024: Visual Instruction Generation and Correction

Language:PythonLicense:Apache-2.0Stargazers:70Issues:3Issues:13

opendatalab-python-sdk

SDK of OpenDataLab - https://opendatalab.org.cn

Language:PythonLicense:MITStargazers:52Issues:2Issues:3

CLIP-Parrot-Bias

Parrot Captions Teach CLIP to Spot Text

Language:PythonLicense:Apache-2.0Stargazers:50Issues:3Issues:2

dsdl-docs

Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)

Language:HTMLLicense:Apache-2.0Stargazers:42Issues:2Issues:0

UniMERNet

UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:40Issues:0Issues:0

HA-DPO

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Language:PythonLicense:Apache-2.0Stargazers:37Issues:2Issues:6

labelU-Kit

Data annotation component library --provided as NPM packages

Language:TypeScriptLicense:Apache-2.0Stargazers:35Issues:5Issues:5

H2RSVLM

H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model

Language:TypeScriptLicense:Apache-2.0Stargazers:24Issues:0Issues:0

MLLM-DataEngine

MLLM-DataEngine: An Iterative Refinement Approach for MLLM

Language:PythonLicense:Apache-2.0Stargazers:22Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:19Issues:3Issues:2

MLS-BRN

[CVPR 2024] 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions

License:Apache-2.0Stargazers:17Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:13Issues:1Issues:0

CHARM

Chinese commonsense benchmark for LLMs

License:Apache-2.0Stargazers:9Issues:0Issues:0

labelU-frontend

LabelU front-end library

Language:TypeScriptLicense:Apache-2.0Stargazers:7Issues:4Issues:0

WanJuan2.0-WanJuan-CC

WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。

Stargazers:5Issues:0Issues:0

allz

A universal command line tool for compression and decompression

Language:PythonLicense:MITStargazers:4Issues:2Issues:0
Language:PythonStargazers:1Issues:3Issues:0
Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0