
ChengweiBian's starred repositories

Chinese-Text-Classification-Pytorch

Chinese text classification with TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, and Transformer; based on PyTorch, ready to use out of the box.

Language: Python, License: MIT, 518200

Kashgari

Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification; it includes Word2Vec, BERT, and GPT2 language embeddings.

Language: Python, License: Apache-2.0, 238200

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Language: Python, License: AGPL-3.0, 244500

wordninja

Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.

Language: Python, License: MIT, 76800

music-genre-recognition

Musical genre recognition using a CNN

Language: Python, License: WTFPL, 2000

AutoLyrixAlign

Pre-trained model and script to automatically align lyrics to polyphonic audio

10100

forced-alignment-tools

A collection of links and notes on forced alignment tools

Language: Python, License: NOASSERTION, 85600

Lyrics-to-audio alignment system based on machine learning algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of the durations of musical notes. The phonetic models are classified with an MLP deep neural network.

Language: Python, License: AGPL-3.0, 5600

deepcorrect

Text and punctuation correction with deep learning

Language: Python, License: GPL-3.0, 12900

crnn

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

Language: Lua, License: MIT, 204700

CHINESE-OCR

[python3.6] Natural-scene text detection implemented with tf; variable-length scene-text OCR recognition implemented with ctpn+crnn+ctc in keras/pytorch.

Language: Python, 290100

speech-to-text-benchmark

Speech-to-text benchmark framework

Language: Python, License: Apache-2.0, 59400

TTS

Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language: Jupyter Notebook, License: MPL-2.0, 904300

speech

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Language: Python, License: Apache-2.0, 73600

speech-to-text-wavenet

Speech-to-Text-WaveNet: End-to-end sentence-level English speech recognition based on DeepMind's WaveNet and TensorFlow

Language: Python, License: Apache-2.0, 392800

py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Language: C++, License: Apache-2.0, 17000

tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++, License: Apache-2.0, 5975200

keras

Deep Learning for humans

Language: Python, License: Apache-2.0, 6133100

End-to-end-ASR-Pytorch

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.

Language: Python, License: MIT, 117400

masr

Chinese speech recognition; Mandarin Automatic Speech Recognition.

Language: Python, 183600

kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

Language: Python, License: BSD-2-Clause, 106500

PaddleSpeech

Easy-to-use speech toolkit including self-supervised learning models, SOTA/streaming ASR with punctuation, streaming TTS with text frontend, speaker verification system, end-to-end speech translation, and keyword spotting. Won the NAACL2022 Best Demo Award.

Language: Python, License: Apache-2.0, 1062800
