A set of pyspark utility functions.
Example:
import pyspark_util as psu
data = [(1, 2, 3)]
columns = ['a', 'b', 'c']
df = spark.createDataFrame(data, columns)
prefixed = psu.prefix_columns(df, 'x')
prefixed.show()
Output:
+---+---+---+
|x_a|x_b|x_c|
+---+---+---+
| 1| 2| 3|
+---+---+---+
pip install pyspark-util
docker-compose build
docker-compose up -d
docker exec psu-cnt ./tools/lint.sh
docker exec psu-cnt ./tools/test.sh