Anlim

Location: Beijing

Anlim's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python, License: Apache-2.0, Stargazers: 130142, Issues: 0

supervisor

Supervisor process control system for Unix (supervisord)

Language: Python, License: NOASSERTION, Stargazers: 8381, Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python, License: Apache-2.0, Stargazers: 8339, Issues: 0

pyusb

Easy USB access for Python

Language: Python, License: BSD-3-Clause, Stargazers: 2110, Issues: 0

noisereduce

Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Language: Jupyter Notebook, License: MIT, Stargazers: 1372, Issues: 0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python, License: Apache-2.0, Stargazers: 36003, Issues: 0

espnet_model_zoo

ESPnet Model Zoo

Language: Python, License: Apache-2.0, Stargazers: 243, Issues: 0

ros2

The Robot Operating System is a meta operating system for robots.

Stargazers: 3361, Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python, License: Apache-2.0, Stargazers: 8182, Issues: 0

agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Language: Python, License: Apache-2.0, Stargazers: 5033, Issues: 0

MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

Language: Python, License: NOASSERTION, Stargazers: 34677, Issues: 0

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language: C++, License: MPL-2.0, Stargazers: 24873, Issues: 0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License: Apache-2.0, Stargazers: 1525, Issues: 0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language: C++, License: MIT, Stargazers: 29820, Issues: 0

llama_index

LlamaIndex is a data framework for your LLM applications

Language: Python, License: MIT, Stargazers: 34158, Issues: 0

pixel_ring

RGB LED library for ReSpeaker 4 Mic Array, ReSpeaker V2 & ReSpeaker USB 6+1 Mic Array

Language: Python, Stargazers: 59, Issues: 0

deeplearning_ai_books

deeplearning.ai (notes and resources for Andrew Ng's deep learning courses)

Language: HTML, Stargazers: 17748, Issues: 0

Coursera-ML-AndrewNg-Notes

Personal notes on Andrew Ng's machine learning course

Language: HTML, Stargazers: 31179, Issues: 0

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language: Python, License: MIT, Stargazers: 29409, Issues: 0

Paddle

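PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training, and cross-platform deployment, for deep learning and machine learning)

Language: C++, License: Apache-2.0, Stargazers: 21947, Issues: 0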

chroma

The AI-native open-source embedding database

Language: Rust, License: Apache-2.0, Stargazers: 13917, Issues: 0

whisper_mic

A project that lets you use a microphone with OpenAI Whisper.

Language: Python, License: MIT, Stargazers: 679, Issues: 0

faster-whisper

Faster Whisper transcription with CTranslate2

Language: Python, License: MIT, Stargazers: 10662, Issues: 0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language: C++, License: MIT, Stargazers: 33467, Issues: 0

srs

SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.

Language: C++, License: MIT, Stargazers: 24994, Issues: 0

speech-to-text-wavenet

Speech-to-Text-WaveNet: End-to-end sentence-level English speech recognition based on DeepMind's WaveNet and TensorFlow

Language: Python, License: Apache-2.0, Stargazers: 3929, Issues: 0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook, License: MIT, Stargazers: 5642, Issues: 0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python, License: BSD-2-Clause, Stargazers: 10321, Issues: 0

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Language: Python, Stargazers: 2360, Issues: 0

MaixPy-v1

MicroPython for the K210 RISC-V chip, making edge AI easier to play with

Language: Python, License: NOASSERTION, Stargazers: 1676, Issues: 0