Spencer Lord (sal1023)

sal1023

Geek Repo

Github PK Tool:Github PK Tool

Spencer Lord's starred repositories

LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Language:PythonLicense:Apache-2.0Stargazers:2056Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:5773Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:13198Issues:0Issues:0

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Language:CLicense:GPL-3.0Stargazers:4085Issues:0Issues:0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:46803Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:23595Issues:0Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:5986Issues:0Issues:0

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonLicense:MITStargazers:3523Issues:0Issues:0

tinydiarize

Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens

Language:PythonLicense:MITStargazers:429Issues:0Issues:0
Language:PythonLicense:MITStargazers:2993Issues:0Issues:0

scalene

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Language:PythonLicense:Apache-2.0Stargazers:11615Issues:0Issues:0

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:11074Issues:0Issues:0

whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Language:PythonLicense:MITStargazers:1836Issues:0Issues:0

transformers.js

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!

Language:JavaScriptLicense:Apache-2.0Stargazers:11067Issues:0Issues:0

tabby

Self-hosted AI coding assistant

Language:RustLicense:NOASSERTIONStargazers:21277Issues:0Issues:0

watlings

Learn WebAssembly by writing small programs!

Language:JavaScriptLicense:UnlicenseStargazers:1627Issues:0Issues:0

asr-sd-pipeline

Speech recognition & diarisation solution with text alignment, deployed in AML pipelines

Language:PythonLicense:MITStargazers:81Issues:0Issues:0

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CLicense:NOASSERTIONStargazers:2022Issues:0Issues:0

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:3376Issues:0Issues:0

stable-ts

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Language:PythonLicense:MITStargazers:1498Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:CLicense:MITStargazers:34688Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:11587Issues:0Issues:0

whispering

Streaming transcriber with whisper

Language:PythonLicense:MITStargazers:683Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:11604Issues:0Issues:0

porcupine

On-device wake word detection powered by deep learning

Language:PythonLicense:Apache-2.0Stargazers:3702Issues:0Issues:0

usb_4_mic_array

ReSpeaker 4 Mic Array with builtin VAD, DOA, AEC, Beamforming & NS

Language:PythonLicense:Apache-2.0Stargazers:141Issues:0Issues:0

mic_array

DOA, VAD and KWS for ReSpeaker Microphone Array

Language:PythonLicense:Apache-2.0Stargazers:287Issues:0Issues:0

avs

python implementation of alexa voice service app, 支持DuerOS

Language:PythonLicense:NOASSERTIONStargazers:195Issues:0Issues:0

ec

Echo Canceller, part of Voice Engine project

Language:CLicense:GPL-3.0Stargazers:244Issues:0Issues:0

seeed-voicecard

2 Mic Hat, 4 Mic Array, 6-Mic Circular Array Kit, and 4-Mic Linear Array Kit for Raspberry Pi

Language:CLicense:GPL-3.0Stargazers:480Issues:0Issues:0