K.G. Miller's starred repositories
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
chat-to-your-database
Chat to your database with AI. An experimental app to test the abilities of LLMs to query SQL databases using natural language.
resemble-enhance
AI powered speech denoising and enhancement
voicefixer_main
General Speech Restoration
h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
TopicalChange
Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
SwiftAudioPlayer
Streaming and realtime audio manipulation with AVAudioEngine
MetalAudioVisualizer
Tutorial on making your first Audio Visualizer in Swift using Metal, Accelerate, and AVAudioEngine!
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
google-research
Google Research
MLB_prediction
Predict Major League Baseball games (win/loss) with machine learning
uWebSockets.js
μWebSockets for Node.js back-ends :metal: