apache / datafusion-ballista

Apache DataFusion Ballista Distributed Query Engine

Home Page:https://datafusion.apache.org/ballista

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Use StreamWriter instead of FileWriter

Dandandan opened this issue · comments

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
It is redundant to store schema information in the shuffle files, as we already have them as part of the execution plan / query stages.

Describe the solution you'd like
Use IPC streams instead of file format.

Describe alternatives you've considered

Additional context