Tony Wang's starred repositories
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
model_analyzer
Triton Model Analyzer is a CLI tool that helps you better understand the compute and memory requirements of Triton Inference Server models.
amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
label-studio
Label Studio is a multi-type data labeling and annotation tool with a standardized output format
FasterTransformer
Transformer-related optimization, including BERT and GPT
project-layout
Standard Go Project Layout
llama_index
LlamaIndex is a data framework for your LLM applications
instill-core
🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
awesome-quant
A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)
Understanding-Ethereum-Go-version
Understanding Ethereum: Go-Ethereum Code Analysis
Blockchain-dark-forest-selfguard-handbook
Blockchain dark forest selfguard handbook. Master these, master the security of your cryptocurrency.
retentioneering-tools
Retentioneering: product analytics, data-driven CJM optimization, marketing analytics, web analytics, transaction analytics, graph visualization, process mining, and behavioral segmentation in Python. Predictive analytics over clickstream, AB tests, machine learning, and Markov Chain simulations.