Dinghow Yang's starred repositories
Megatron-LM
Ongoing research training transformer models at scale
text-generation-inference
Large Language Model Text Generation Inference
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
safetensors
Simple, safe way to store and distribute tensors
tvm_mlir_learn
compiler learning resources collect.
spikingjelly
SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.
Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
codereview.gpt
Reviews your Pull/Merge Requests using ChatGPT
Point-Bind_Point-LLM
Align 3D Point Cloud with Multi-modalities for Large Language Models
segformer-pytorch
Implementation of Segformer, Attention + MLP neural network for segmentation, in Pytorch
llm-code-review
A container GitHub Action to review a pull request by HuggingFace's LLM Model.