capeprivacy / cape-python

Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark

See what Transformations are Linked to What Data Types

kjam opened this issue

Is your feature request related to a problem? Please describe.
During exploratory data analysis (EDA), it would be useful to see which transformations can be applied to which Series/columns, so that I know how to approach the problem.

Describe the solution you'd like
There are several ways this could be implemented; it would be worth exploring a few and investigating how others have solved this problem. One way, off the top of my head, is to take the dtypes of a Pandas or interactive Spark session dataframe as input and display a list of transforms that might apply. Another idea is to pass in a Series or Spark column to a show_available_transforms method (or something with a better name).
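
A minimal sketch of the dtype-based idea, under stated assumptions: the registry below and its transform labels ("rounding", "tokenization", and so on) are hypothetical placeholders rather than Cape Python's actual API, and the prefix matching on dtype names is only one rough way to group types. The function takes a mapping of column name to dtype name so that the same lookup could serve both Pandas and Spark.

```python
from typing import Dict, List, Mapping

# Hypothetical registry: which transform labels apply to which kind of data.
# These labels are illustrative only and are not taken from Cape Python.
TRANSFORMS_BY_KIND: Dict[str, List[str]] = {
    "numeric": ["rounding", "perturbation"],
    "datetime": ["truncation", "perturbation"],
    "string": ["tokenization"],
}

# Hypothetical transforms that make sense for any column, whatever its type.
ANY_TYPE_TRANSFORMS: List[str] = ["redaction"]

# Dtype-name prefixes, checked in order. "interval" comes first so that it is
# not mistaken for an integer type by the "int" prefix.
_KIND_PREFIXES = (
    ("other", ("interval",)),
    ("numeric", ("int", "uint", "float", "double", "bigint",
                 "smallint", "tinyint", "decimal")),
    ("datetime", ("datetime", "timestamp", "date")),
    ("string", ("object", "string", "str")),
)


def classify_dtype(dtype_name: str) -> str:
    """Map a dtype name such as 'int64' or 'timestamp' to a coarse kind."""
    name = dtype_name.lower()
    for kind, prefixes in _KIND_PREFIXES:
        if name.startswith(prefixes):
            return kind
    return "other"


def show_available_transforms(dtypes: Mapping[str, str]) -> Dict[str, List[str]]:
    """Return, for each column, the transform labels that might apply."""
    return {
        column: TRANSFORMS_BY_KIND.get(classify_dtype(dtype_name), [])
        + ANY_TYPE_TRANSFORMS
        for column, dtype_name in dtypes.items()
    }


if __name__ == "__main__":
    example = {"age": "int64", "name": "object", "joined": "datetime64[ns]"}
    for column, transforms in show_available_transforms(example).items():
        print(f"{column}: {', '.join(transforms)}")
```

With Pandas, a suitable mapping can be built with `df.dtypes.astype(str).to_dict()`; with Spark, `dict(df.dtypes)` gives column names mapped to type strings. Treating Pandas `object` columns as strings is a simplification, since such columns can hold arbitrary Python objects.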

Describe alternatives you've considered
One can look this up in the documentation and the docstrings.

Additional context
I can share a notebook of what I am thinking!