Sungjae Lee's starred repositories
Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
deepmd-kit
A deep learning package for many-body potential energy representation and molecular dynamics
MegaMolBART
A deep learning model for small molecule drug discovery and cheminformatics based on SMILES
DL_Compiler
Study Group of Deep Learning Compiler
DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
FasterTransformer
Transformer related optimization, including BERT, GPT
grpc-gateway
gRPC to JSON proxy generator following the gRPC HTTP spec
backend.ai
Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs.
marss.dramsim
A branch of marss with DRAMSim hooks