confluentinc / kafka-connect-hdfs

Kafka Connect HDFS connector

Will it create many small files?

katty0924 opened this issue · comments

Hi everyone, I recently started using the HDFS sink connector, and I am worried that if data is continuously written to Kafka, the connector will generate many small files, which are not good for the HDFS NameNode. Is that the case?

It depends on your output format, flush size, and partitioner. In my experience, it is possible to generate files that are several GB in size.
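
As a rough illustration, here is a minimal sketch of a sink configuration aimed at fewer, larger files. The property names are the connector's standard settings, but the connector name, topic name, HDFS URL, and all numeric values are placeholder assumptions, not recommendations; check them against the documentation for your connector version and tune them to your throughput.

```properties
# Hypothetical example; names and values below are placeholders.
name=hdfs-sink-example
connector.class=io.confluent.connect.hdfs.HdfsSinkConnector
tasks.max=1
topics=example_topic
hdfs.url=hdfs://namenode.example:8020

# Number of records written before a file is committed.
# A larger value generally means fewer, bigger files.
flush.size=1000000

# Optional time-based rotation (in ms) in addition to the record-count threshold.
rotate.interval.ms=3600000

# Output format; Avro is shown here, other formats produce different file sizes.
format.class=io.confluent.connect.hdfs.avro.AvroFormat

# partitioner.class is left at its default. A finer-grained partitioner
# spreads records over more directories, which tends to mean more, smaller files.
```

A very small `flush.size` is the usual cause of many small files, so that is the first setting worth checking.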