xiexukang / WeTextProcessing

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Text Normalization & Inverse Text Normalization

1. How To Use

$ git clone https://github.com/wenet-e2e/WeTextProcessing.git
$ cd WeTextProcessing
$ python normalize.py --text "text to be normalized"
$ python inverse_normalize.py --text "text to be denormalized"

2. TN Pipeline

Please refer to TN.README

3. ITN Pipeline (Coming soon...)

Please refer to ITN.README

Acknowledge

  1. Thank the authors of foundational libraries like OpenFst & Pynini.
  2. Thank NeMo team & NeMo open-source community.
  3. Thank Zhenxiang Ma, Jiayu Du, and SpeechColab organization.
  4. Referred Pynini for reading the FAR, and printing the shortest path of a lattice in the C++ runtime.
  5. Referred TN of NeMo for the data to build the tagger graph.
  6. Referred ITN of chinese_text_normalization for the data to build the tagger graph.

About

License:Apache License 2.0


Languages

Language:Python 57.7%Language:C++ 32.2%Language:CMake 8.6%Language:C 1.5%