josephyuzb's starred repositories

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

whisper.cpp

Port of OpenAI's Whisper model in C/C++

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:24447Issues:207Issues:208

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23688Issues:220Issues:3627

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:13130Issues:91Issues:627

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10783Issues:88Issues:298

tpot

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Language:PythonLicense:LGPL-3.0Stargazers:9611Issues:288Issues:918

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9336Issues:119Issues:129

openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

Language:C++License:Apache-2.0Stargazers:6552Issues:190Issues:2547

h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/

Language:PythonLicense:Apache-2.0Stargazers:3797Issues:47Issues:369

cub

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Language:CudaLicense:BSD-3-ClauseStargazers:1660Issues:89Issues:281
Language:PythonLicense:Apache-2.0Stargazers:1134Issues:19Issues:50

most-common-american-idioms

A book created by xiaolai with the help of ChatGPT and its TTS

Language:Jupyter NotebookStargazers:889Issues:5Issues:9

cloudpan189-go

天翼云盘命令行客户端(CLI),基于GO语言实现

Language:GoLicense:Apache-2.0Stargazers:605Issues:13Issues:93

Chinese-Mixtral

中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)

Language:PythonLicense:Apache-2.0Stargazers:569Issues:15Issues:10

rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Language:C++License:Apache-2.0Stargazers:462Issues:11Issues:76

llm-inference-benchmark

LLM Inference benchmark

Language:PythonLicense:MITStargazers:299Issues:2Issues:2

tensorflow_hmm

A tensorflow implementation of an HMM layer

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:286Issues:16Issues:6

ROCR-Runtime

ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime

Language:C++License:NOASSERTIONStargazers:206Issues:65Issues:106

omniperf

Advanced Profiling and Analytics for AMD Hardware

Language:PythonLicense:MITStargazers:127Issues:18Issues:182

vllm-rocm

vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:85Issues:2Issues:13
Language:PythonLicense:Apache-2.0Stargazers:41Issues:0Issues:0
Language:C++Stargazers:38Issues:3Issues:0

rank_dataset

PyTorch Dataset Rank Dataset

Language:PythonLicense:Apache-2.0Stargazers:35Issues:5Issues:2

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23Issues:3Issues:0

LLM-System-Requirements

Open-source calculator for LLM system requirements.

Language:PythonLicense:MITStargazers:21Issues:0Issues:0

llm-inference-benchmark

LLM 推理服务性能测试

Language:Jupyter NotebookStargazers:21Issues:2Issues:2

tf_benchmarks

A benchmark framework for Tensorflow

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0