xuducan / CVTE_chain_model_finetune

Fine-tune the chain model based on the CVTE open-source model without training any GMM for frame alignment

CVTE_chain_model_finetune

  • fine-tune a chain model based on the CVTE open-source model, using AISHELL-1 data
  • use the CVTE open-source GMM for frame alignment

Explanation

  • CVTE supplies a chain model trained on more than 2000 hours of audio data
  • CVTE supplies a 3-gram LM trained on 1000 GB of text
  • this project does not require training any GMM models
  • this project supports online CMVN, since "apply-cmvn-online" is used during both training and decoding
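
Online CMVN normalizes each frame using only frames that have already been seen, so the same normalization can be applied during streaming decoding. The following is a minimal pure-Python sketch of sliding-window mean normalization, not Kaldi's implementation: the function name `online_cmn` and the `window` parameter are hypothetical, and variance normalization is left out. The actual "apply-cmvn-online" tool also takes global CMVN statistics as input; that part is omitted here.

```python
from collections import deque


def online_cmn(frames, window=600):
    """Subtract from each frame the mean of the most recent `window` frames.

    `frames` is a list of equal-length feature vectors (lists of floats).
    The current frame is included in its own window, so no future frames
    are ever needed. Illustrative only; not Kaldi's algorithm.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    if not frames:
        return []

    dim = len(frames[0])
    history = deque()            # frames currently inside the window
    running_sum = [0.0] * dim    # per-dimension sum over `history`
    normalized = []

    for frame in frames:
        history.append(frame)
        for d in range(dim):
            running_sum[d] += frame[d]

        # Drop the oldest frame once the window is full.
        if len(history) > window:
            oldest = history.popleft()
            for d in range(dim):
                running_sum[d] -= oldest[d]

        count = len(history)
        normalized.append([frame[d] - running_sum[d] / count for d in range(dim)])

    return normalized
```

Because the mean is computed causally, the first frames of an utterance are normalized with very little context, which is one reason a real system would combine the window with prior statistics.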

Install

  • kaldi

Usage

  • run "bash run_train_ft.sh" to prepare the data, train the LM (optional; you can also use the CVTE open-source HCLG) and fine-tune the chain model
  • run "bash run_test.sh" to decode the test data using the fine-tuned model with apply-cmvn-online

Workflow

  • prepare the lexicon if you train the LM on your own data (optional; you can also use the CVTE open-source HCLG)
  • prepare the training data
  • prepare the phone sets and questions, and compile L (optional; you can also use the CVTE open-source HCLG)
  • train the LM using kaldi_lm or SRILM (optional; you can also use the CVTE open-source HCLG)
  • make the graph (optional; you can also use the CVTE open-source HCLG)
  • extract 40-dimensional fbank features
  • generate speed-perturbed data (for alignment) and align it with the CVTE open-source GMM (tri6b)
  • get the alignments as lattices
  • generate volume-perturbed data (for training)
  • prepare the files needed for fine-tuning, such as the model, den.fst, phone_lm.fst and the tree
  • fine-tune the chain model for a few epochs with a small learning rate
  • decode the test data using the fine-tuned model with apply-cmvn-online
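
The two perturbation steps in the workflow change the audio in different ways: speed perturbation resamples the waveform so that it becomes shorter or longer, while volume perturbation only scales its amplitude. The sketch below illustrates both in pure Python. It is not the code this project runs: the function names are hypothetical, the speed factors 0.9, 1.0 and 1.1 are an assumption based on the usual 3-way speed perturbation, and linear interpolation stands in for a proper resampler.

```python
def speed_perturb(samples, factor):
    """Resample `samples` so that playback is `factor` times faster.

    factor > 1 shortens the signal, factor < 1 lengthens it.
    Uses linear interpolation between neighbouring samples (illustrative only).
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    if not samples:
        return []

    last = len(samples) - 1
    n_out = int(len(samples) / factor)
    out = []
    for i in range(n_out):
        pos = i * factor              # fractional position in the input
        lo = min(int(pos), last)      # sample at or before `pos`
        hi = min(lo + 1, last)        # next sample, clamped at the end
        frac = pos - lo
        out.append((1.0 - frac) * samples[lo] + frac * samples[hi])
    return out


def volume_perturb(samples, scale):
    """Multiply every sample by `scale`; the duration is unchanged."""
    return [s * scale for s in samples]


if __name__ == "__main__":
    wave = [float(i) for i in range(100)]
    for factor in (0.9, 1.0, 1.1):    # assumed 3-way speed factors
        print(factor, len(speed_perturb(wave, factor)))
```

Since speed perturbation changes the number of frames, the perturbed copies have to be aligned separately, which is why the workflow generates them before the alignment step. Volume perturbation leaves the timing intact, so it can be applied afterwards without invalidating the alignments.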

Result (AISHELL-1 test data)

  • CER 11.10%
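
For readers unfamiliar with the metric: the character error rate (CER) is the total number of character substitutions, insertions and deletions needed to turn the hypotheses into the references, divided by the total number of reference characters. The following is a minimal sketch of that computation, assuming references and hypotheses are plain strings without spaces; the function names are hypothetical and this is not the scoring script used to obtain the number above.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (unit cost per edit)."""
    prev = list(range(len(hyp) + 1))   # distances for the empty reference prefix
    for i, r in enumerate(ref, start=1):
        cur = [i]                      # deleting i reference characters
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            cur.append(min(prev[j] + 1,          # deletion
                           cur[j - 1] + 1,       # insertion
                           prev[j - 1] + cost))  # match or substitution
        prev = cur
    return prev[-1]


def cer(pairs):
    """CER in percent over an iterable of (reference, hypothesis) pairs."""
    pairs = list(pairs)
    errors = sum(edit_distance(ref, hyp) for ref, hyp in pairs)
    total = sum(len(ref) for ref, _ in pairs)
    if total == 0:
        raise ValueError("references contain no characters")
    return 100.0 * errors / total
```

Errors are pooled over the whole test set before dividing, so long utterances weigh more than short ones; averaging per-utterance rates would give a different number.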
