AgainstEntropy

Yihao Wang's repositories

StreamDiffusionIO

Language:PythonApache-2.0100

whisper.cpp

Port of OpenAI's Whisper model in C/C++

MIT000

speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Language:PythonApache-2.0000

cursor

The AI Code Editor

000

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Apache-2.0000

mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

MIT000

LongRAG

Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".

MIT000

runner-images

GitHub Actions runner images

MIT000

llama-cpp-python

Python bindings for llama.cpp

MIT000

trio-ollama

Language:TypeScript000

llama.cpp

LLM inference in C/C++

Language:C++MIT000

mistral.rs

Blazingly fast LLM inference.

MIT000

ggml

Tensor library for machine learning

MIT000

huggingface-repo-vscode

Language:TypeScriptMIT000

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Language:PythonApache-2.0000

Olive

Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.

Language:PythonMIT000