Nam D. Tran's repositories
gemma-cpp-python
A Python wrapper for gemma.cpp
terminalmind
Friendly Terminal Assistant for Developers
common-util-databases
Common util for many type of databases.
hackathon_big_data_2018
Predict Churn
active-learning-hub
All papers, notes and things for Active Learning.
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
ByteTrack
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
mmdetection
OpenMMLab Detection Toolbox and Benchmark
pattern-recognition-and-machine-learning-hub
Notes and Notebooks for the Bishop's book: Pattern Recognition And Machine Learning.
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
trl
Train transformer language models with reinforcement learning.
unsloth
5X faster 60% less memory QLoRA finetuning
YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/