alexandonian

Alex Andonian's starred repositories

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:CMIT31938 291 1162

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT25860 200 178

DeepFaceLive

Real-time face swap for PC streaming or video calls

Language:PythonGPL-3.023829 339 144

open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Language:SvelteMIT23467 122 1034

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT12071 101 177

phidata

Build AI Assistants with memory, knowledge and tools.

Language:PythonMPL-2.08658 58 111

storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Language:PythonMIT4444 37 26

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonBSD-3-Clause3251 37 288

cohere-toolkit

Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.

Language:TypeScriptMIT2193 25 19

maestro

A framework for Claude Opus to intelligently orchestrate subagents.

Language:Python1921 47 22

mistral.rs

Blazingly fast LLM inference.

Language:RustMIT1645 19 108

torchtitan

A native PyTorch Library for large model training

Language:PythonBSD-3-Clause1161 27 84

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonApache-2.0824 40 54

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonApache-2.0758 17 57

visual_anagrams

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Language:Jupyter NotebookMIT731 10 10

kerkour.com

(Ab)using technology for fun & profit. Programming, Hacking & Entrepreneurship @ https://kerkour.com

Language:RustApache-2.0459 12 15

ring-flash-attention

Ring attention implementation with flash attention

Language:Python369 9 19

PLLaVA

Official repository for the paper PLLaVA

Language:Python356 10 35

MathPile

Generative AI for Math: MathPile

Language:JavaScriptApache-2.0349 8 4

paperetl

📄 ⚙️ ETL processes for medical and scientific papers

Language:PythonApache-2.0320 8 52

arena-hard

Arena-Hard benchmark

Language:Jupyter NotebookApache-2.0200 7 14

ollama-grid-search

A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React.

Language:TypeScriptMIT189 5 16

llamaduo

Language:Jupyter NotebookApache-2.0166 5 7

Mixture-of-depths

Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language:Python81 2 7

SoM-LLaVA

Empowering Multimodal LLMs with Set-of-Mark Prompting and Improved Visual Reasoning Ability.

Language:Python6900

Mixture-of-Depths

Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language:PythonMIT38 4 1

Neuroformer

Language:PythonMIT2800

Token-level-Direct-Preference-Optimization

Reference implementation for Token-level Direct Preference Optimization(TDPO)

Language:PythonApache-2.02700

llm-uncertainty

code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"

Language:Python1900

unified-model-editing

We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.

Language:Python900