Tong Li (TongLi3701)

TongLi3701

Geek Repo

Company:Colossal-AI Team @hpcaitech

Location:Shanghai, China

Home Page:https://www.linkedin.com/in/tongli3701/

Github PK Tool:Github PK Tool


Organizations
hpcaitech

Tong Li's starred repositories

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:41116Issues:435Issues:9160

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonLicense:MITStargazers:28780Issues:277Issues:1183

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:14553Issues:62Issues:174

Chat2DB

🔥🔥🔥AI-driven data management platform Over 1 million developers are using Chat2DB

Language:JavaLicense:Apache-2.0Stargazers:14345Issues:103Issues:962

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:12356Issues:77Issues:779

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:12048Issues:90Issues:334

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:11313Issues:78Issues:13

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

Language:TypeScriptLicense:MITStargazers:11227Issues:80Issues:185

searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Language:PythonLicense:AGPL-3.0Stargazers:11170Issues:105Issues:1235

phidata

Build AI Assistants with memory, knowledge and tools.

Language:PythonLicense:MPL-2.0Stargazers:10670Issues:83Issues:141

plandex

AI driven development in your terminal. Designed for large, real-world tasks.

Language:GoLicense:AGPL-3.0Stargazers:10013Issues:83Issues:108

embedchain

Memory for AI agents

Language:PythonLicense:Apache-2.0Stargazers:8977Issues:64Issues:500

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language:HTMLLicense:Apache-2.0Stargazers:7722Issues:52Issues:1055

corenet

CoreNet: A library for training deep neural networks

Language:PythonLicense:NOASSERTIONStargazers:6773Issues:61Issues:19

reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

Language:TypeScriptLicense:Apache-2.0Stargazers:5796Issues:33Issues:73

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Language:PythonLicense:AGPL-3.0Stargazers:4650Issues:58Issues:1900

pandarallel

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Language:PythonLicense:BSD-3-ClauseStargazers:3597Issues:27Issues:218

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language:PythonLicense:NOASSERTIONStargazers:2700Issues:28Issues:257
Language:PythonLicense:Apache-2.0Stargazers:2501Issues:32Issues:24

MoA

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Language:PythonLicense:Apache-2.0Stargazers:2272Issues:29Issues:15

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:1763Issues:21Issues:179

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language:PythonLicense:Apache-2.0Stargazers:1611Issues:26Issues:134

awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language:C++License:Apache-2.0Stargazers:1194Issues:28Issues:147

ChatChat

Chat Chat, your own unified chat and search to AI platform, with a simple and easy to use interface.

Language:TypeScriptLicense:AGPL-3.0Stargazers:1167Issues:16Issues:53

FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Language:PythonLicense:MITStargazers:932Issues:10Issues:46

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:355Issues:9Issues:6

TableLLM

TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Token-level-Direct-Preference-Optimization

Reference implementation for Token-level Direct Preference Optimization(TDPO)

Language:PythonLicense:Apache-2.0Stargazers:74Issues:1Issues:3