Tong Li's starred repositories
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Qwen-Agent
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
distilabel
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
ringattention
Transformers with Arbitrarily Large Context
video_captioning_datasets
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Feishu-On-Leave-Status-Sync
This is a cron job which will periodically sync the on-leave status on Feishu.
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Chinese-Mixtral-8x7B
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)
Machine-Mindset
An MBTI Exploration of Large Language Models
modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
SwiftInfer
Efficient AI Inference & Serving