Yihao Wang's repositories
whisper.cpp
Port of OpenAI's Whisper model in C/C++
speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
cursor
The AI Code Editor
unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
runner-images
GitHub Actions runner images
llama-cpp-python
Python bindings for llama.cpp
llama.cpp
LLM inference in C/C++
mistral.rs
Blazingly fast LLM inference.
ggml
Tensor library for machine learning
optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
Olive
Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.
onnx
Open standard for machine learning interoperability
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
ort
Fast ML inference & training for Rust with ONNX Runtime
cuda-toolkit
GitHub Action to install CUDA
onnxruntime-genai
Generative AI extensions for onnxruntime
langchain
🦜🔗 Build context-aware reasoning applications
zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
executorch
On-device AI across mobile, embedded and edge for PyTorch
ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
excalidraw
Virtual whiteboard for sketching hand-drawn like diagrams
dl-notes
:triangular_ruler: Jekyll theme for building a personal site, blog, project documentation, or portfolio.
zenith
Zenith - sort of like top or htop but with zoom-able charts, CPU, GPU, network, and disk usage