Mddct / neural-lm-deprecated

Focuses on language model fusion for speech recognition.

(Deprecated; will be reimplemented in JAX.) Under development; may not work until the whole pipeline is done.

neural-lm

Note

When a language model is used, wide beam searches often yield incomplete transcripts. With narrow beams, the problem is less visible because of implicit hypothesis pruning.

Check whether the same problem appears in CTC+LM fusion.
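
One commonly cited contributor to this effect is that every extra token adds a negative LM log-probability, so a fused score can prefer a truncated hypothesis. The sketch below is a toy illustration only: the function `fused_score`, the weights, and all numbers are made up for the example and are not taken from this repository. The per-token length bonus is a standard remedy, not something this project is known to use.

```python
def fused_score(am_logp, lm_logp, length, lm_weight=0.5, length_bonus=0.0):
    """Shallow-fusion style score (hypothetical helper).

    am_logp      -- acoustic model log-probability of the hypothesis
    lm_logp      -- language model log-probability of the hypothesis
    length       -- number of output tokens
    lm_weight    -- interpolation weight for the LM (assumed value)
    length_bonus -- reward per emitted token (assumed value)
    """
    return am_logp + lm_weight * lm_logp + length_bonus * length


# Invented numbers: the full transcript fits the audio better, but its
# ten tokens accumulate a much larger LM cost than the four-token one.
full = {"am_logp": -4.0, "lm_logp": -30.0, "length": 10}
truncated = {"am_logp": -9.0, "lm_logp": -12.0, "length": 4}

for bonus in (0.0, 1.0):
    scores = {
        "full": fused_score(length_bonus=bonus, **full),
        "truncated": fused_score(length_bonus=bonus, **truncated),
    }
    winner = max(scores, key=scores.get)
    print(f"length_bonus={bonus}: {scores} -> {winner}")
```

With no length bonus the truncated hypothesis scores higher; with a bonus of 1.0 per token the full one does. A narrow beam may never keep the truncated candidate alive long enough for this comparison to happen, which is consistent with the pruning remark above.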

TODO

  • adaptive softmax for large vocabularies (because the official PyTorch implementation does not work with TorchScript)
  • ONNX and TorchScript support
  • GRU
  • RNN with tied embeddings
  • GRU fusion in the WeNet runtime CTC prefix beam search
  • Transformer-XL with cache
  • Transformer-XL with cache for fusion
  • MWER training with LM fusion
  • etc.
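
For orientation, the CTC prefix beam search item can be sketched in plain Python as follows. This is a minimal, generic version of the algorithm and is not the WeNet runtime implementation; the `lm_score(prefix, token)` hook and the `lm_weight` parameter are hypothetical names introduced here to show where an LM (for example a GRU) could be fused in.

```python
import math
from collections import defaultdict

NEG_INF = float("-inf")


def log_add(*xs):
    """Numerically stable log(sum(exp(x) for x in xs))."""
    m = max(xs)
    if m == NEG_INF:
        return NEG_INF
    return m + math.log(sum(math.exp(x - m) for x in xs))


def ctc_prefix_beam_search(log_probs, beam_size=4, blank=0,
                           lm_score=None, lm_weight=0.0):
    """log_probs: list of frames, each a list of per-token log-probabilities.
    lm_score(prefix, token) -> log p(token | prefix); hypothetical hook.
    Returns (token list, fused log score) pairs, best first."""

    def extend_lm(prefix, tok, lm):
        return lm + (lm_score(prefix, tok) if lm_score else 0.0)

    # prefix -> (log p ending in blank, log p ending in non-blank, LM log score)
    beam = {(): (0.0, NEG_INF, 0.0)}
    for frame in log_probs:
        nxt = defaultdict(lambda: [NEG_INF, NEG_INF, 0.0])
        for prefix, (pb, pnb, lm) in beam.items():
            for tok, lp in enumerate(frame):
                if tok == blank:
                    # Blank keeps the prefix unchanged.
                    e = nxt[prefix]
                    e[0] = log_add(e[0], pb + lp, pnb + lp)
                    e[2] = lm
                elif prefix and tok == prefix[-1]:
                    # Repeat without a blank in between collapses into the prefix.
                    e = nxt[prefix]
                    e[1] = log_add(e[1], pnb + lp)
                    e[2] = lm
                    # Repeat after a blank extends the prefix.
                    e = nxt[prefix + (tok,)]
                    e[1] = log_add(e[1], pb + lp)
                    e[2] = extend_lm(prefix, tok, lm)
                else:
                    # A different token always extends the prefix.
                    e = nxt[prefix + (tok,)]
                    e[1] = log_add(e[1], pb + lp, pnb + lp)
                    e[2] = extend_lm(prefix, tok, lm)
        # Prune to the best prefixes by CTC score plus weighted LM score.
        ranked = sorted(
            nxt.items(),
            key=lambda kv: log_add(kv[1][0], kv[1][1]) + lm_weight * kv[1][2],
            reverse=True,
        )
        beam = {p: tuple(v) for p, v in ranked[:beam_size]}
    return [(list(p), log_add(pb, pnb) + lm_weight * lm)
            for p, (pb, pnb, lm) in beam.items()]


# Two frames over the vocabulary {0: blank, 1: "a"}.
frames = [[math.log(0.4), math.log(0.6)]] * 2
print(ctc_prefix_beam_search(frames))
```

The LM score is kept separate from the CTC probabilities so that the fusion weight only affects ranking and pruning, not the prefix probability recursion itself. A real runtime would also cache LM states per prefix instead of calling the LM once per candidate extension.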

Languages

Python 94.7%, C++ 3.3%, CMake 2.0%