Pavel Smirnov's starred repositories
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
recurrent-memory-transformer
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
python-mastery
Advanced Python Mastery (course by @dabeaz)
LLaMA-Cult-and-More
Large Language Models for All, 🦙 Cult and More, Stay in touch !
awesome-mlss
List of summer schools in machine learning + related fields across the globe
foundryvtt-docker
An easy-to-deploy Dockerized Foundry Virtual Tabletop server.
dbt-clickhouse
The Clickhouse plugin for dbt (data build tool)
ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
YouTokenToMe
Unsupervised text tokenizer focused on computational efficiency
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
GifCapture
🏇 Gif capture app for macOS
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format