Min Jun Kim's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
flash-attention
Fast and memory-efficient exact attention
text-generation-inference
Large Language Model Text Generation Inference
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
mlx-examples
Examples in the MLX framework
alignment-handbook
Robust recipes to align language models with human and AI preferences
CTranslate2
Fast inference engine for Transformer models
RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
RealtimeTTS
Converts text to speech in real time
whisper-plus
WhisperPlus: Faster, Smarter, and More Capable 🚀
self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Models, from Meta AI
aphrodite-engine
Large-scale LLM inference engine
react-native-skottie
▶️ Efficient lottie animations using Skia's Skottie module
VTubeStudio
VTube Studio API Development Page
landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
lightspeedGPT
Use GPT-4 and GPT-3.5 on inputs of unlimited size. Uses multithreading to process multiple chunks in parallel. Useful for tasks like named entity recognition and information extraction on large books, datasets, etc.
landmark-attention-qlora
QLoRA fine-tuning for Landmark Attention: Random-Access Infinite Context Length for Transformers