Mark Stewart's starred repositories
system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
paperless-ngx
A community-supported supercharged document management system: scan, index and archive all your documents
fish-speech
SOTA Open Source TTS
translate-shell
:speech_balloon: Command-line translator using Google Translate, Bing Translator, Yandex.Translate, etc.
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
napkin-math
Techniques and numbers for estimating system's performance from first-principles
spf-dkim-dmarc-simplified
Email security is a key part of internet communication. But what are SPF, DKIM, and DMARC, and how do they work? This guide will explain it all in simple terms to make these concepts clearer.
photocrate
Photo library and interactive editor built with Next.js & Cloudinary
xournalpp_htr
Developing handwritten text recognition for Xournal++
ocrs-models
PyTorch models for the ocrs OCR engine
2024-09-noise-storms
Notes and receipts (PCAPs) for TCP and ICMP Noise Storms
rag-tutorials
a series of tutorials implementing rag service with BentoML and LlamaIndex
Handwritten-Text-Recognition-Tesseract-OCR
A Handwritten Text Recognition built with Tensorflow2 & Keras & IAM Dataset, Convolutional Recurrent Neural Network, CTC. Decoder