chupei (e06084)

e06084

Geek Repo

Company:Shanghai AI Lab

Location:Shanghai, China

Home Page:https://little-holmes.com/

Github PK Tool:Github PK Tool

chupei's starred repositories

DocGenome

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

License:CC-BY-4.0Stargazers:81Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:113Issues:0Issues:0

uptrain

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

Language:PythonLicense:Apache-2.0Stargazers:2123Issues:0Issues:0

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Language:PythonLicense:Apache-2.0Stargazers:3768Issues:0Issues:0

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Stargazers:516Issues:0Issues:0

GRUtopia

GRUtopia: Dream General Robots in a City at Scale

Language:PythonLicense:MITStargazers:394Issues:0Issues:0

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:6515Issues:0Issues:0

FAST-VQA-and-FasterVQA

[ECCV2022, TPAMI2023] FAST-VQA, and its extended version FasterVQA.

Language:Jupyter NotebookLicense:MITStargazers:241Issues:0Issues:0
Language:PythonLicense:MITStargazers:649Issues:0Issues:0

leptonai

A Pythonic framework to simplify AI service building

Language:PythonLicense:Apache-2.0Stargazers:2601Issues:0Issues:0

datacomp

DataComp: In search of the next generation of multimodal datasets

Language:PythonLicense:NOASSERTIONStargazers:626Issues:0Issues:0

DataLab

The unified platform for data-related resources.

Language:PythonLicense:Apache-2.0Stargazers:130Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:13372Issues:0Issues:0

llama-fs

A self-organizing file system with llama 3

Language:Jupyter NotebookLicense:MITStargazers:4675Issues:0Issues:0

UltraEval

[ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.

Language:PythonLicense:Apache-2.0Stargazers:195Issues:0Issues:0

llama_parse

Parse files for optimal RAG

Language:PythonLicense:MITStargazers:2056Issues:0Issues:0

Magic-Doc

conversion doc(pdf/html/doc/docx/ppt/pptx)to markdown

Language:PythonLicense:Apache-2.0Stargazers:20Issues:0Issues:0

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:PythonStargazers:9126Issues:0Issues:0

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language:PythonLicense:Apache-2.0Stargazers:868Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:12018Issues:0Issues:0

SoccerDB

SoccerDB: A Large-Scale Database for Comprehensive Video Understanding

Language:PythonStargazers:43Issues:0Issues:0

Agent-FLAN

[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

License:Apache-2.0Stargazers:302Issues:0Issues:0

HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Language:PythonLicense:BSD-3-ClauseStargazers:1219Issues:0Issues:0

Tutorial

LLM&VLM Tutorial

Language:PythonStargazers:1177Issues:0Issues:0

agentlego

Enhance LLM agents with versatile tool APIs

Language:PythonLicense:Apache-2.0Stargazers:320Issues:0Issues:0

doccano

Open source annotation tool for machine learning practitioners.

Language:PythonLicense:MITStargazers:9292Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29232Issues:0Issues:0

nocobase

NocoBase is a scalability-first, open-source no-code/low-code platform for building business applications and enterprise solutions.

Language:TypeScriptLicense:NOASSERTIONStargazers:11370Issues:0Issues:0

nsfw_model

Keras model of NSFW detector

Language:PythonLicense:NOASSERTIONStargazers:1691Issues:0Issues:0

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Language:PythonLicense:MITStargazers:74301Issues:0Issues:0