namtranase

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0

trl

Train transformer language models with reinforcement learning.

License: Apache-2.0

unsloth

QLoRA fine-tuning that is 5x faster and uses 60% less memory.

Language: Python | License: Apache-2.0

YOLOX

YOLOX is a high-performance, anchor-free YOLO detector that exceeds YOLOv3 through YOLOv5 and supports MegEngine, ONNX, TensorRT, ncnn, and OpenVINO. Documentation: https://yolox.readthedocs.io/

Language: Python | License: Apache-2.0

Nam D. Tran's repositories

gemma-cpp-python

terminalmind

airsim-UAV-indoor-obstacle-avoidance

airsim-SAR-at-sea-with-UAV

llm.c

users-clustering-based-minhash-lsh

common-util-databases

hackathon_big_data_2018

llama.cpp

namtranase

webwise

active-learning-hub

AutoAWQ

ByteTrack

mmdetection

pattern-recognition-and-machine-learning-hub

system-design-primer

TensorRT-LLM

trl

unsloth

YOLOX