KarhouTam

Jiahao Tan's starred repositories

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonApache-2.032834 476 18304

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION25822 212 230

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.025671 221 4202

spdlog

Fast C++ logging library.

Language:C++NOASSERTION23685 445 2136

llm.c

LLM training in simple, raw C/CUDA

Language:CudaMIT23012 226 131

jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

Language:TypeScriptAGPL-3.021727 124 1682

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

Language:C++MIT16499 434 3378

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT14338 108 347

triton

Development repository for the Triton language and compiler

Language:C++MIT12402 185 1380

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonNOASSERTION9807 160 679

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language:PythonBSD-3-Clause8274 101 1173

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

Language:CNOASSERTION5996 118 231

oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

Language:C++Apache-2.05846 145 966

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

MIT3311 26 81

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

GPL-3.02334 80 6