Beast code in Giters

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language:C++MPL-2.02487300

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Apache-2.0152500

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++MIT2982000

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonMIT3415800

pixel_ring

RGB LED library for ReSpeaker 4 Mic Array, ReSpeaker V2 & ReSpeaker USB 6+1 Mic Array

Language:Python5900

deeplearning_ai_books

deeplearning.ai（吴恩达老师的深度学习课程笔记及资源）

Language:HTML1774800

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language:PythonMIT2940900

Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

Language:C++Apache-2.02194700

chroma

the AI-native open-source embedding database

Language:RustApache-2.01391700

whisper_mic

Project that allows one to use a microphone with OpenAI whisper.

Language:PythonMIT67900

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonMIT1066200

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++MIT3346700

srs

SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.

Language:C++MIT2499400

speech-to-text-wavenet

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow

Language:PythonApache-2.0392900

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT564200

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonBSD-2-Clause1032100

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Language:Python236000

MaixPy-v1

MicroPython for K210 RISC-V, let's play with edge AI easier

Language:PythonNOASSERTION167600