microsoft / ContextualSP

Multiple paper open-source codes of the Microsoft Research Asia DKI group

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Potential performance issue: .apply slow in pandas below 1.5 version

TendouArisu opened this issue · comments

Issue Description:

Hello.
I have discovered a performance degradation in the .apply function of pandas version below 1.5. And I notice parts of the repository depends on pandas below 1.5 such as robustness_of_text_to_sql/CTA/requirements.txt. I am not sure whether this performance problem in pandas will affect this repository. I found some discussions on pandas GitHub related to this issue, including #44172 and #45404.
I also found that poset_decoding/traversal_path_prediction/MatchZoo-py/matchzoo/data_pack/data_pack.py used the influenced api. There may be more files using the influenced api and more parts using pandas below 1.5.

Suggestion

I would recommend considering an upgrade to a different version of pandas >= 1.5 or exploring other solutions to optimize the performance of .apply.
Any other workarounds or solutions would be greatly appreciated.
Thank you!