Tong Li (TongLi3701)

TongLi3701

Geek Repo

Company:Colossal-AI Team @hpcaitech

Location:Shanghai, China

Home Page:https://www.linkedin.com/in/tongli3701/

Github PK Tool:Github PK Tool


Organizations
hpcaitech

Tong Li's starred repositories

nano-graphrag

A simple, easy-to-hack GraphRAG implementation

Language:PythonStargazers:409Issues:0Issues:0

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

Stargazers:712Issues:0Issues:0

Bora

Bora: Biomedical Generalist Video Generation Model

Language:PythonLicense:BSD-3-ClauseStargazers:155Issues:0Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1589Issues:0Issues:0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonLicense:MITStargazers:15827Issues:0Issues:0

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

Stargazers:990Issues:0Issues:0

Token-level-Direct-Preference-Optimization

Reference implementation for Token-level Direct Preference Optimization(TDPO)

Language:PythonLicense:Apache-2.0Stargazers:83Issues:0Issues:0

awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

License:MITStargazers:1304Issues:0Issues:0

MoA

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Language:PythonLicense:Apache-2.0Stargazers:2482Issues:0Issues:0

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language:PythonLicense:Apache-2.0Stargazers:1795Issues:0Issues:0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:1916Issues:0Issues:0

searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Language:PythonLicense:AGPL-3.0Stargazers:11858Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2623Issues:0Issues:0

reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

Language:TypeScriptLicense:Apache-2.0Stargazers:6097Issues:0Issues:0

FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Language:PythonLicense:MITStargazers:1029Issues:0Issues:0

mem0

The memory layer for Personalized AI

Language:PythonLicense:Apache-2.0Stargazers:19938Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:12623Issues:0Issues:0

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:15780Issues:0Issues:0

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Language:PythonLicense:AGPL-3.0Stargazers:4899Issues:0Issues:0

TableLLM

TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Language:PythonStargazers:115Issues:0Issues:0

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

Language:TypeScriptLicense:MITStargazers:12472Issues:0Issues:0

Chat2DB

🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.

Language:JavaLicense:Apache-2.0Stargazers:14635Issues:0Issues:0

corenet

CoreNet: A library for training deep neural networks

Language:PythonLicense:NOASSERTIONStargazers:6894Issues:0Issues:0

phidata

Build AI Assistants with memory, knowledge and tools.

Language:PythonLicense:MPL-2.0Stargazers:11002Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:41940Issues:0Issues:0

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language:C++License:Apache-2.0Stargazers:1280Issues:0Issues:0

pandarallel

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Language:PythonLicense:BSD-3-ClauseStargazers:3622Issues:0Issues:0

OpenHands

🙌 OpenHands: Code Less, Make More

Language:PythonLicense:MITStargazers:30313Issues:0Issues:0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:12991Issues:0Issues:0

plandex

AI driven development in your terminal. Designed for large, real-world tasks.

Language:GoLicense:AGPL-3.0Stargazers:10271Issues:0Issues:0