Tom Rogers's starred repositories
private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
flash-attention
Fast and memory-efficient exact attention
trigger.dev
Trigger.dev is the open source background jobs platform for TypeScript.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
evolutionary-model-merge
Official repository of Evolutionary Optimization of Model Merging Recipes
yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
use-whisper
React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in
whisper.rn
React Native binding of whisper.cpp.
MetalSplatter
Render Gaussian Splats using Metal on Apple platforms (iOS/iPhone/iPad, macOS, and visionOS)
selfhealing-action-express
A express server with a self-healing langchain chain github action workflow
whisper-server
streaming speech to text server using Whisper
whisper-stream
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
DensePose_from_WiFi
Using of the WiFi signal in combination with deep learning architectures, commonly used in computer vision, to estimate dense human pose correspondence.
safeguards-plugin
Serverless Framework Plugin to enforce safeguard policies
e2e-test-sns-kinesis-demo
Demo to illustrate how you can include SNS and Kinesis in end-2-end tests