Refer to the paper "FSPEN: An Ultra-Lightweight Network for Real Time Speech Enhancement".
Note that there are some parameter-setting mistakes in the original paper, so we modified some parameters to make the model run successfully, for example:
- the number of sub-bands in each group is set to {8, 7, 6, 7, 6};
- the linear op in feature merge is set to (66, 32);
- the linear op in feature split is set to (32, 66).
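
As a rough illustration, the modified settings listed above could be collected in one place and sanity-checked before building the model. This is only a sketch: the class name `FSPENOverrides`, its field names, and the `check` method are hypothetical and are not taken from the paper or from this repository. It also assumes that each pair is read as (input size, output size) and that the feature split layer is meant to mirror the feature merge layer.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class FSPENOverrides:
    """Hypothetical container for the modified parameters (names are illustrative)."""

    # Number of sub-bands in each group, as listed above.
    sub_bands_per_group: Tuple[int, ...] = (8, 7, 6, 7, 6)
    # Assumed to mean (input size, output size) of the linear op.
    feature_merge_linear: Tuple[int, int] = (66, 32)
    feature_split_linear: Tuple[int, int] = (32, 66)

    @property
    def total_sub_bands(self) -> int:
        """Sum of the sub-band counts over all groups."""
        return sum(self.sub_bands_per_group)

    def check(self) -> None:
        """Raise ValueError if the settings are inconsistent with each other."""
        if any(n <= 0 for n in self.sub_bands_per_group):
            raise ValueError("every group must contain at least one sub-band")
        merge_in, merge_out = self.feature_merge_linear
        # Assumption: feature split reverses the width change made by feature merge.
        if self.feature_split_linear != (merge_out, merge_in):
            raise ValueError("feature split should mirror feature merge")


cfg = FSPENOverrides()
cfg.check()
print(cfg.total_sub_bands)  # 8 + 7 + 6 + 7 + 6 = 34
```

Keeping the values in a frozen dataclass makes accidental edits fail loudly, and the `check` call catches a mismatch between the merge and split sizes before any layers are created.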