josephyuzb

followers

following

stars

josephyuzb's starred repositories

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Language:GoMIT80741 485 3750

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++MIT33351 308 1242

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION24447 207 208

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.023688 220 3627

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.013130 91 627

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookNOASSERTION10783 88 298

tpot

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Language:PythonLGPL-3.09611 288 918

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookApache-2.09336 119 129

openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

Language:C++Apache-2.06552 190 2547

h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/

Language:PythonApache-2.03797 47 369

cub

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Language:CudaBSD-3-Clause1660 89 281

megablocks

Language:PythonApache-2.01134 19 50

most-common-american-idioms

A book created by xiaolai with the help of ChatGPT and its TTS

Language:Jupyter Notebook889 5 9

cloudpan189-go

天翼云盘命令行客户端(CLI)，基于GO语言实现

Language:GoApache-2.0605 13 93

Chinese-Mixtral

中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）

Language:PythonApache-2.0569 15 10

Qwen-TensorRT-LLM

Language:PythonMIT549 6 114

rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Language:C++Apache-2.0462 11 76

llmperf-leaderboard

Apache-2.0407 15 10

llm-inference-benchmark

LLM Inference benchmark

Language:PythonMIT299 2 2

tensorflow_hmm

A tensorflow implementation of an HMM layer

Language:Jupyter NotebookApache-2.0286 16 6

ROCR-Runtime

ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime

Language:C++NOASSERTION206 65 106

omniperf

Advanced Profiling and Analytics for AMD Hardware

Language:PythonMIT127 18 182

vllm-rocm

vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.085 2 13

FLASHNN

Language:PythonApache-2.04100

llmperf

Language:C++38 30

rank_dataset

PyTorch Dataset Rank Dataset

Language:PythonApache-2.035 5 2

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.023 30

LLM-System-Requirements

Open-source calculator for LLM system requirements.

Language:PythonMIT2100

llm-inference-benchmark

LLM 推理服务性能测试

Language:Jupyter Notebook21 2 2

tf_benchmarks

A benchmark framework for Tensorflow

Language:PythonApache-2.0100