Gehao Zhang's starred repositories
LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models
mathematical_foundation
Mathematical Foundations for Deep Learning
minimal-light
A simple and elegant Jekyll theme for an academic personal homepage
eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee
eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
FasterTransformer4CodeFuse
High-performance LLM inference based on our optimized version of FasterTransformer
ByteTransformer
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
mac-precision-touchpad
Windows Precision Touchpad Driver Implementation for Apple MacBook / Magic Trackpad
NLPMetrics
Python code for various NLP metrics
compose-spec
The Compose specification
MSMARCO-Document-Ranking
MS MARCO (Microsoft Machine Reading Comprehension) is a large-scale dataset focused on machine reading comprehension, question answering, and passage/document ranking.