Mukai Li's starred repositories
flash-attention
Fast and memory-efficient exact attention
consistency_models
Official repo for consistency models.
GPU-Puzzles
Solve puzzles. Learn CUDA.
tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
ring-flash-attention
Ring attention implementation with flash attention
Apple-Monitor
一个用 Java 实现的 Apple 商店库存监控,支持bark,dingtalk,微信等方式推送实时库存信息。目前支持**和日本地区。An Apple store inventory monitoring implemented in Java, supports bark, dingtalk, WeChat and other methods to push real-time inventory information. Currently supports China and Japan regions.
diffusion-of-thoughts
Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
ChatBridge
ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.