Neil Stoker's starred repositories

Scrapegraph-ai

Python scraper based on AI

Language: Python | License: MIT | Stargazers: 12888 | Issues: 86 | Issues: 164

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12572 | Issues: 170 | Issues: 505

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 9378 | Issues: 79 | Issues: 107

CopilotKit

A framework for building custom AI Copilots 🤖 in-app AI chatbots, in-app AI Agents, & AI-powered Textareas.

Language: TypeScript | License: MIT | Stargazers: 8570 | Issues: 70 | Issues: 106

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | Stargazers: 7426 | Issues: 45 | Issues: 517

moondream

tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4642 | Issues: 54 | Issues: 98

porcupine

On-device wake word detection powered by deep learning

Language: Python | License: Apache-2.0 | Stargazers: 3627 | Issues: 63 | Issues: 542

matmulfreellm

Implementation for MatMul-free LM.

Language: Python | License: Apache-2.0 | Stargazers: 2779 | Issues: 44 | Issues: 23

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language: C++ | License: Apache-2.0 | Stargazers: 2646 | Issues: 43 | Issues: 399

pipecat

Open Source framework for voice and multimodal conversational AI

Language: Python | License: BSD-2-Clause | Stargazers: 2596 | Issues: 24 | Issues: 60

RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Language: Python | License: MIT | Stargazers: 1353 | Issues: 24 | Issues: 61

tarsier

Vision utilities for web interaction agents 👀

Language: Jupyter Notebook | License: MIT | Stargazers: 1325 | Issues: 7 | Issues: 18

sqlite-vec

Work-in-progress vector search SQLite extension that runs anywhere.

Language: C | License: Apache-2.0 | Stargazers: 1192 | Issues: 46 | Issues: 46

distributed-llama

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

Language: C++ | License: MIT | Stargazers: 1164 | Issues: 19 | Issues: 46

page-assist

Use your locally running AI models to assist you in your web browsing

Language: TypeScript | License: MIT | Stargazers: 1028 | Issues: 12 | Issues: 103

GPT-4V-Act

AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI

Lip2Wav

This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"

Language: Python | License: MIT | Stargazers: 692 | Issues: 27 | Issues: 39

vad

Voice activity detector (VAD) for the browser with a simple API

Language: TypeScript | License: NOASSERTION | Stargazers: 681 | Issues: 11 | Issues: 85

june

Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit

Language: Python | License: MIT | Stargazers: 618 | Issues: 5 | Issues: 7

openWakeWord

An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 566 | Issues: 12 | Issues: 112

paddler

Stateful load balancer custom-tailored for llama.cpp

Language: Go | License: MIT | Stargazers: 467 | Issues: 6 | Issues: 3

Linguflex

Command Your World with Voice

Language: Python | License: MIT | Stargazers: 301 | Issues: 9 | Issues: 8

cq2

Tool for RFCs

Language: TypeScript | License: AGPL-3.0 | Stargazers: 299 | Issues: 7 | Issues: 0

Building-LLM-Powered-Applications

Building Large Language Model Applications, Published by Packt

Language: Jupyter Notebook | License: MIT | Stargazers: 115 | Issues: 5 | Issues: 2

Language: C | License: Apache-2.0 | Stargazers: 78 | Issues: 6 | Issues: 0

gemini-spatial-example

How to use bounding boxes with the Gemini API

stream2sentence

Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.

lambda-bedrock-s3-streaming-rag

Fully serverless streaming RAG application

Language: JavaScript | License: MIT-0 | Stargazers: 25 | Issues: 1 | Issues: 2

Language: JavaScript | License: Apache-2.0 | Stargazers: 2 | Issues: 1 | Issues: 0