Machine-Tom / bertsum-chinese-LAI

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

issues when processes json_data to bert_data with chinese

4floorer opened this issue · comments

There is a issue in data_ builder_LAI.py at line 78. In the inner function named "_rouge_clean", the regex expression r'[^a-zA-Z0-9 ]' is for the English and have issue for chineses texts.