sryza / aas

Code to accompany Advanced Analytics with Spark from O'Reilly Media

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Chapter 3: Convert PySpark DataFrame to Pandas

tanthiamhuat opened this issue · comments

image

image

Is it because recommendations is of type array, it creates some issue when need to convert to Pandas?
How do we overcome this?

This is fine, it's a warning that arrow optimizations do not apply to this case, but that's normal.

yup, it is a warning.. but I am not sure why I am not allowed to print it out, as that variable 'top_prediction_pandas' is not defined. However, its below code with the variable 'top_prediction_pandas' runs fine.
image

There is a typo here - top_prediction_pandas vs top_predictions_pandas

oic, thank you, we can close this issue.