automl / CAAFE

Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).

Home Page:http://priorlabs.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

bug in preprocessing.py

zyf505 opened this issue · comments

Lines 23-25: the mapping isn't created correctly, key and value are in reversed order.
Should be something like

            mappings[col] = {v: i for i, v in 
                enumerate(df_train[col].astype("category").cat.categories)
            }

Fixed this bug! Thank you for pointing it out!