Daoyuan Chen's repositories
data-juicer
A data-centric text processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
FederatedScope
An easy-to-use federated learning platform
Language:PythonApache-2.0000
Awesome-Pruning
A curated list of neural network pruning resources.
000
minisora
The Mini Sora project aims to explore the implementation path and future development direction of Sora.
Language:PythonApache-2.0000
mlcontests.github.io
A list of public machine learning/data science/AI contests.
GPL-3.0000
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Apache-2.0000