Chenghao Mou's repositories
text-dedup
All-in-one text de-duplication
touchbar-lyric
Show synced lyric in the touch-bar with BetterTouchTool and NetEase APIs
awesome-data-deduplication
An awesome list of data deduplication use cases, papers, tools, and methods.
chenghaomou.github.io
Personal Blog
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
.trunk
Used by Trunk. Learn more at https://docs.trunk.io/code-quality/ci/get-started/github-integration
agents
A powerful framework for building realtime voice AI agents 🤖🎙️📹
async_task_pipeline
A simple multithread pipeline with async and streaming io
delayed-streams-modeling
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
desktop
Welcome to a calmer internet
edgar-crawler
SEC EDGAR Exhibit Downloader
GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
nnw-allmine-flexoki
Modified NetNewsWire theme based on nnw-allmine with Flexoki colors
paper2speech
Convert a research paper to audio
pipecat
Open Source framework for voice and multimodal conversational AI
pydantic-ai
Agent Framework / shim to use Pydantic with LLMs
quartz
🌱 a fast, batteries-included static-site generator that transforms Markdown content into fully functional websites
remarks
Extract highlights, scribbles, and annotations from PDFs marked with the reMarkable tablet. Export to Markdown and PDF
rmc
Convert to/from v6 .rm files from the reMarkable tablet
rmscene
Read v6 .rm files from the reMarkable tablet
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
tos_datasets
Terms of Service/Privacy Policy Datasets