Alex Andonian's starred repositories
whisper.cpp
Port of OpenAI's Whisper model in C/C++
DeepFaceLive
Real-time face swap for PC streaming or video calls
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
mistral.rs
Blazingly fast LLM inference.
torchtitan
A native PyTorch Library for large model training
visual_anagrams
Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"
kerkour.com
(Ab)using technology for fun & profit. Programming, Hacking & Entrepreneurship @ https://kerkour.com
ring-flash-attention
Ring attention implementation with flash attention
arena-hard
Arena-Hard benchmark
ollama-grid-search
A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React.
Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Mixture-of-Depths
Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization(TDPO)
llm-uncertainty
code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"
unified-model-editing
We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.