Yifei Zuo's repositories
flash-attention
Fast and memory-efficient exact attention
FlexGen-PurnedInference
Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput generation.
Machine-Learning-Project
Course project for EE3001 Machine Learning
Bend
A massively parallel, high-level programming language
Blockchain-dark-forest-selfguard-handbook
Blockchain dark forest selfguard handbook. Master these, master the security of your cryptocurrency.
candle
Minimalist ML framework for Rust
Compiler-Principle-2023Fall-Lab-Repo-deprecated-
Redesigned course project for Compiler Principle 2023 Fall
COS-ECE470-fa2022
Princeton University - COS/ECE 470 : Principles of Blockchains
crepe
Datalog compiler embedded in Rust as a procedural macro
flashinfer
FlashInfer: Kernel Library for LLM Serving
H3
Language Modeling with the H3 State Space Model
HVM
A massively parallel, optimal functional runtime in Rust
husky
Empowering everyone towards next generation AI and software.
iTransformer
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah
lightning-attention
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
RoboGame-epsilonGoal
A robot control program based on STM32F103ZET6.