ChengweiBian's starred repositories

Chinese-Text-Classification-Pytorch

中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。

Language:PythonLicense:MITStargazers:5182Issues:0Issues:0

Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Language:PythonLicense:Apache-2.0Stargazers:2382Issues:0Issues:0

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Language:PythonLicense:AGPL-3.0Stargazers:2445Issues:0Issues:0

wordninja

Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.

Language:PythonLicense:MITStargazers:768Issues:0Issues:0

music-genre-recognition

Musical genre recognition using a CNN

Language:PythonLicense:WTFPLStargazers:20Issues:0Issues:0

AutoLyrixAlign

Pre-trained model and script to automatically align lyrics to polyphonic audio

Stargazers:101Issues:0Issues:0

forced-alignment-tools

A collection of links and notes on forced alignment tools

Language:PythonLicense:NOASSERTIONStargazers:856Issues:0Issues:0

AlignmentDuration

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.

Language:PythonLicense:AGPL-3.0Stargazers:56Issues:0Issues:0

deepcorrect

Text and Punctuation correction with Deep Learning

Language:PythonLicense:GPL-3.0Stargazers:129Issues:0Issues:0

crnn

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

Language:LuaLicense:MITStargazers:2047Issues:0Issues:0

CHINESE-OCR

[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别

Language:PythonStargazers:2901Issues:0Issues:0

speech-to-text-benchmark

speech to text benchmark framework

Language:PythonLicense:Apache-2.0Stargazers:594Issues:0Issues:0

TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language:Jupyter NotebookLicense:MPL-2.0Stargazers:9043Issues:0Issues:0

speech

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Language:PythonLicense:Apache-2.0Stargazers:736Issues:0Issues:0

speech-to-text-wavenet

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow

Language:PythonLicense:Apache-2.0Stargazers:3928Issues:0Issues:0

py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Language:C++License:Apache-2.0Stargazers:170Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:59752Issues:0Issues:0

keras

Deep Learning for humans

Language:PythonLicense:Apache-2.0Stargazers:61331Issues:0Issues:0

End-to-end-ASR-Pytorch

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Language:PythonLicense:MITStargazers:1174Issues:0Issues:0

masr

中文语音识别; Mandarin Automatic Speech Recognition;

Language:PythonStargazers:1836Issues:0Issues:0

kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.

Language:PythonLicense:BSD-2-ClauseStargazers:1065Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:10628Issues:0Issues:0

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Language:PythonLicense:MITStargazers:120Issues:0Issues:0

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language:C++License:MPL-2.0Stargazers:24791Issues:0Issues:0

tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Language:PythonLicense:NOASSERTIONStargazers:2160Issues:0Issues:0

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Language:PythonLicense:MITStargazers:2093Issues:0Issues:0

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Language:PythonLicense:GPL-3.0Stargazers:7644Issues:0Issues:0

Attention-OCR

Visual Attention based OCR

Language:PythonLicense:MITStargazers:1109Issues:0Issues:0

chinese_ocr

CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

Language:PythonLicense:Apache-2.0Stargazers:2735Issues:0Issues:0

chineseocr

yolo3+ocr

Language:PythonLicense:MITStargazers:5875Issues:0Issues:0