MikeLuck's starred repositories
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
build-nanogpt
Video+code lecture on building nanoGPT from scratch
AudioNotes
快速提取音视频内容,整理成一份结构化的markdown笔记
UltraPixel
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
florence2-finetuning
Quick exploration into fine tuning florence 2
LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
SenseVoice
Multilingual Voice Understanding Model
cloudflare-docker-proxy
A docker registry proxy run on cloudflare worker.
python-markdownify
Convert HTML to Markdown
RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
piccolo-embedding
code for piccolo embedding model from SenseTime