Will it create many small files?
katty0924 opened this issue · comments
katty0924 commented
Hi everyone, I recently started using the HDFS sink connector. If data is continuously written to Kafka, will the connector generate lots of small files, which are not good for the HDFS NameNode?
Jordan Moore commented
It depends on your output format, flush size, and partitioner. In my experience, it is possible to generate files that are several GB in size.
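
To get a feel for how the flush size drives file count, here is a rough back-of-the-envelope sketch. It assumes files are committed only when `flush.size` records have been written for a Kafka partition, that records are spread evenly across partitions, that everything lands in a single output partition, and that compression and format overhead are ignored. The function names and the traffic numbers are made up for illustration and are not part of the connector.

```python
import math


def estimate_file_size_bytes(flush_size: int, avg_record_bytes: int) -> int:
    """Rough size of one committed file, ignoring compression and format overhead."""
    return flush_size * avg_record_bytes


def estimate_files_per_day(records_per_day: int, flush_size: int, kafka_partitions: int) -> int:
    """Rough file count per day, assuming an even spread and size-based commits only."""
    records_per_partition = records_per_day / kafka_partitions
    return kafka_partitions * math.ceil(records_per_partition / flush_size)


if __name__ == "__main__":
    # Hypothetical workload: 1,000 records/s, about 1 KB each, 4 Kafka partitions.
    records_per_day = 86_400_000
    for flush_size in (1_000, 1_000_000):
        size = estimate_file_size_bytes(flush_size, 1_000)
        files = estimate_files_per_day(records_per_day, flush_size, 4)
        print(f"flush.size={flush_size}: ~{size} bytes per file, ~{files} files/day")
```

Under these assumptions, a small flush size gives tens of thousands of roughly 1 MB files per day, while a large one gives a few dozen files of roughly 1 GB each. Time-based rotation and a partitioner that splits output into many directories would both increase the file count, which this sketch does not model.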