Minjune Song's repositories
reka-vibe-eval
Multimodal language model benchmark, featuring challenging examples
whisper-chat
Voice controls for ChatGPT
whisper-stream
Real-time whisper voice to text transcription, in Python
bark
🔊 Text-Prompted Generative Audio Model
ComfyUI
custom comfy
dataset
lora test data
emg
emg
extreme-parkour
Train your parkour robot in less than 20 hours.
kohya_ss
custom kohya
llama
Inference code for LLaMA models
LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
membot
remember this bot
mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
Monolog.ai
Developer information page for Monolog
nanoDiffusion
nanoscale implementation of text and image diffusion models
OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
poker-ai
for comp
speech-decoding
Reimplementation of speech decoding 2022 paper by MetaAI
Splatters
Browse Splatters
stable-diffusion-regularization-images
Stable Diffusion Regularization Images in 512px, 768px and 1024px on 1.5, 2.1 and SDXL 1.0 checkpoints
tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
uroman
Universal Romanizer that can convert any unicode script to roman (latin) script
whisper.cpp
Port of OpenAI's Whisper model in C/C++