sagorbrur / bnlp

BNLP is a natural language processing toolkit for the Bengali language.

CleanText remove_punctuations() prints corpus.punctuations unnecessarily

treeDweller98 opened this issue

Applying CleanText on datasets floods output with garbage. Please remove the print statement on line 61 from bnlp/bnlp/cleantext/clean.py.

Until then, a temporary workaround might be to suppress print output while applying CleanText, as described in "How to block calls to print?".
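One way to do this is sketched below, using only the Python standard library. It redirects `sys.stdout` to the null device for the duration of a single call. Note that `noisy_clean` is a hypothetical stand-in for whatever CleanText call is printing, and `call_quietly` is a helper name made up for this example; neither is part of BNLP.

```python
import contextlib
import os


def noisy_clean(text):
    # Hypothetical stand-in for a cleaning call that prints as a side effect.
    # It is NOT the BNLP implementation; swap in the real call when using this.
    print("!\"#$%&'()*+,-./")
    return text.strip()


def call_quietly(func, *args, **kwargs):
    """Call func and discard anything it writes to sys.stdout."""
    with open(os.devnull, "w") as devnull, contextlib.redirect_stdout(devnull):
        return func(*args, **kwargs)


sample = "  আমি বাংলায় গান গাই  "
# The print inside noisy_clean is swallowed; the return value is unaffected.
cleaned = call_quietly(noisy_clean, sample)
```

When applying this over a dataset, wrapping each call (or the whole loop) in `call_quietly` should keep the output clean. It only affects text written through `sys.stdout`, and stdout is restored as soon as the call returns, so your own print statements outside the wrapper still work.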

Hi @treeDweller98,
Thanks for pointing out the problem.
I will remove the print statement, but the fix will only take effect in the upcoming version.
For now, you can use the method you shared.