hi-primus / optimus

:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Home Page:https://hi-optimus.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Too much logs printed. It's at least 10x slower.

Pcosmin opened this issue · comments

Version: 2.2.32
The profiling process print a lot of useless logs which makes the process very slow. I think you should remove the print from optimus/infer.py:309, where it prints the dtypes:
"print(dtypes)"

With "print(dtypes)" it took a very long time, I waited for over 10 minutes to print all the dtypes, and it didn't finish to print all of them. After 10 minutes I just killed the process.
Without "print(dtypes)" it took 96 seconds for the whole profiling process to be done.

Issue-Label Bot is automatically applying the label feature_request to this issue, with a confidence of 0.55. Please mark this comment with 👍 or 👎 to give our bot feedback!

Links: app homepage, dashboard and code for this bot.

Thanks for the report @Pcosmin. Could you send PR?

Hi @argenisleon . I sent the PR: #1064 . Please let me know if there are other changes that I should do.

Thanks @Pcosmin. Merged