pthread's repositories
BEVFormer_tensorrt
BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).
annotated-transformer
An annotated implementation of the Transformer paper.
Language: Jupyter Notebook, License: MIT
Awesome-LLM-Survey
An Awesome Collection for LLM Survey
Efficient-LLM-Inferencing-on-GPUs
Penn CIS 5650 (GPU Programming and Architecture) Final Project
Language: C++, License: MIT
LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language: Python
work
scripts