Sixie Yu's starred repositories

lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Language: Python | License: BSD-3-Clause | Stargazers: 1253 | Issues: 0

model2vec

Distill a Small Static Model from any Sentence Transformer

Language: Python | License: MIT | Stargazers: 335 | Issues: 0

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language: Jupyter Notebook | License: MIT | Stargazers: 2623 | Issues: 0

swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Language: Python | License: MIT | Stargazers: 13696 | Issues: 0

mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Language: Python | License: NOASSERTION | Stargazers: 432 | Issues: 0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4599 | Issues: 0

repopack

📦 Repopack is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini.

Language: TypeScript | License: MIT | Stargazers: 1691 | Issues: 0

LLM-Training-Puzzles

What would you do with 1000 H100s...

Language: Jupyter Notebook | License: MIT | Stargazers: 892 | Issues: 0

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language: Python | License: MIT | Stargazers: 2275 | Issues: 0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language: C++ | License: Apache-2.0 | Stargazers: 4225 | Issues: 0

ML-Papers-Explained

Explanations of key concepts in ML

Stargazers: 7271 | Issues: 0

llama-stack

Model components of the Llama Stack APIs

Language: Python | License: MIT | Stargazers: 3796 | Issues: 0

penrose

Create beautiful diagrams just by typing notation in plain text.

Language: TypeScript | License: MIT | Stargazers: 7532 | Issues: 0

RD-Agent

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which lets AI drive data-driven AI.

Language: Python | License: MIT | Stargazers: 919 | Issues: 0

kotaemon

An open-source RAG-based tool for chatting with your documents.

Language: Python | License: Apache-2.0 | Stargazers: 14452 | Issues: 0

mctx

Monte Carlo tree search in JAX

Language: Python | License: Apache-2.0 | Stargazers: 2333 | Issues: 0

minimind

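"Large models": train a small 26M-parameter GPT completely from scratch in 3 hours; a personal GPU is enough for both inference and training!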

Language: Python | License: Apache-2.0 | Stargazers: 2372 | Issues: 0

mem0

The Memory layer for your AI apps

Language: Python | License: Apache-2.0 | Stargazers: 22405 | Issues: 0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language: C++ | License: NOASSERTION | Stargazers: 5525 | Issues: 0

RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 7760 | Issues: 0

llama-stack-apps

Agentic components of the Llama Stack APIs

Language: Python | License: MIT | Stargazers: 3754 | Issues: 0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python | License: MIT | Stargazers: 18222 | Issues: 0

SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

Language: Python | License: Apache-2.0 | Stargazers: 480 | Issues: 0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers: 29414 | Issues: 0

GPTFast

Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 682 | Issues: 0

textgrad

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Language: Python | License: MIT | Stargazers: 1706 | Issues: 0

code

Code for the book "The Elements of Differentiable Programming".

Language: Python | License: Apache-2.0 | Stargazers: 60 | Issues: 0

AISystem

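AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.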

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10863 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 28792 | Issues: 0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | License: Apache-2.0 | Stargazers: 11885 | Issues: 0