Anjos (whaozl)

Location: Shanghai, China

Home Page: blog.csdn.net/zhulinniao

Anjos's repositories

cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

whisper-plus

WhisperPlus: Advancing Speech-to-Text Processing 🚀

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

AhoCorasickDoubleArrayTrie

An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.

Language: Java | Stargazers: 0 | Issues: 0 | Issues: 0

ASR-decoder

An ASR decoder and graph-building (make graph) project

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

CapsWriter-Offline

CapsWriter: a bare-bones but handy offline edition, a voice input tool for PC

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

commonvoice-th

Kaldi recipe for training on the Common Voice corpus in Thai

Language: Shell | Stargazers: 0 | Issues: 0 | Issues: 0

github1s

One second to read GitHub code with VS Code.

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

HelloGitHub

:octocat: Sharing interesting, beginner-friendly open-source projects on GitHub (5th anniversary)

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

javacpp

The missing bridge between Java and native C++

Language: Java | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Leaderboard

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

LLaSM

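The first open-source, commercially usable dialogue model to support Chinese-English bilingual speech-text multimodal conversation. Convenient speech input greatly improves the experience of using large models that take text as input, while avoiding the cumbersome pipeline of ASR-based solutions and the errors it may introduce.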

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

NPTEL2020-Indian-English-Speech-Dataset

NPTEL2020: Speech2Text dataset for Indian-English Accent

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

Python-Wrapper-for-World-Vocoder

A Python wrapper for the high-quality vocoder "World"

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Recorder

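HTML5 JS audio recording in mp3, wav, ogg, webm, and amr formats. Supports some browsers on PC, Android, and iOS, Hybrid Apps (Android and iOS app source code provided), and WeChat. Provides ASR speech-to-text, an H5 voice call and chat example, and DTMF encoding and decoding.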

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

riva-asrlib-decoder

Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

sherpa

Speech-to-text server framework with next-gen Kaldi

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

sherpa-onnx

Real-time speech recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, and Kotlin

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

speech_dataset

A speech recognition dataset

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TMSpeech

A slacking-off tool for Tencent Meeting

Language: C# | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Whisper-Finetune

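Fine-tune the Whisper speech recognition model. Supports training on data without timestamps, data with timestamps, and data without speech. Accelerated inference, with support for Web deployment, Windows desktop deployment, and Android deployment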

Language: C | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0