Aiven-Open / tiered-storage-for-apache-kafka

RemoteStorageManager for Apache Kafka® Tiered Storage

Disk space not released

funky-eyes opened this issue · comments

What can we help you with?

I found that after the broker has been running for a while, disk space is not actually released. I am using KRaft cluster mode and I don't know how to troubleshoot it.
[screenshot]
Why is this happening? I found that after I restarted the node, the disk space was freed up.

Where would you expect to find this information?

[screenshot]
I found a very large number of files that were deleted but are still being referenced, so their space is not freed.

I can clearly see this file in S3, but locally it still hasn't been cleaned up!
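
For reference, on Linux the deleted-but-still-referenced files can also be listed from inside the broker's JVM by reading /proc. This is only a diagnostic sketch (it assumes a Linux /proc filesystem and has nothing to do with the plugin itself); it shows the same entries that lsof marks as "(deleted)":

```java
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class DeletedFds {
    public static void main(String[] args) throws Exception {
        // Inspect the current JVM's descriptors; run this inside the broker
        // process (or simply use lsof against the broker's PID instead).
        long pid = ProcessHandle.current().pid();
        try (DirectoryStream<Path> fds = Files.newDirectoryStream(Paths.get("/proc/" + pid + "/fd"))) {
            for (Path fd : fds) {
                try {
                    // On Linux, a descriptor whose file was unlinked resolves
                    // to "/path/to/file (deleted)".
                    String target = Files.readSymbolicLink(fd).toString();
                    if (target.endsWith("(deleted)")) {
                        System.out.println(fd.getFileName() + " -> " + target);
                    }
                } catch (Exception ignored) {
                    // The descriptor may have been closed between listing and reading it.
                }
            }
        }
    }
}
```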

Hi @funky-eyes
By "not cleaned up" you mean they exist as "<old_file_name>.deleted"?

Hi @funky-eyes By "not cleaned up" you mean they exist as "<old_file_name>.deleted"?

They no longer exist in the directory. Can you see the picture I sent? The deleted file is still being referenced, so the disk space is not released.

Hi @funky-eyes By "not cleaned up" you mean they exist as "<old_file_name>.deleted"?

Could this be due to the operating system? I've noticed that after a while the disk space is actually freed, but it takes minutes or even tens of minutes before that happens!

I created another topic with remote.storage.enable=false and retention.ms=180000, and when its segments are cleaned up on disk, the disk space is freed almost in real time.
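
For context, here is one way such a comparison topic could be created with the Kafka Admin client; this is just a sketch, and the topic name, partition count, replication factor, and bootstrap address are placeholders for my setup:

```java
import java.util.Collections;
import java.util.Map;
import java.util.Properties;

import org.apache.kafka.clients.admin.Admin;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.NewTopic;

public class CreateComparisonTopic {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // placeholder

        try (Admin admin = Admin.create(props)) {
            // A topic without tiered storage and with a 3-minute retention,
            // used only to compare how quickly disk space is reclaimed.
            NewTopic topic = new NewTopic("retention-test", 3, (short) 1)
                    .configs(Map.of(
                            "remote.storage.enable", "false",
                            "retention.ms", "180000"));
            admin.createTopics(Collections.singleton(topic)).all().get();
        }
    }
}
```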

[screenshot]
I waited for more than ten hours and the space still hasn't been freed, which is significantly different from the behaviour of a topic without tiered storage: some files that no longer exist on disk are still being referenced by Kafka.

I also reported this issue to the Kafka community: https://issues.apache.org/jira/browse/KAFKA-16378

We're looking into this, trying to first understand if it's the plugin's or broker's problem.

We're looking into this, trying to first understand if it's the plugin's or broker's problem.

Thank you very much for looking into this. The cluster I deployed runs in KRaft mode, built and deployed from the latest main branch. I found that as soon as I run jcmd <pid> GC.run, the occupied disk space is released immediately, but when no GC happens some files are not released, and there is no error-level output in the logs.

And I'm using the S3 tiered storage implementation.
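
To illustrate why forcing a GC helps: an InputStream that is opened on a segment file and never closed keeps its file descriptor alive, so deleting the file does not free its blocks until the stream is closed, or until garbage collection lets the JVM's cleanup close it. A minimal standalone sketch of that mechanism (not plugin code, and the sizes and names are arbitrary):

```java
import java.io.FileInputStream;
import java.nio.file.Files;
import java.nio.file.Path;

public class DeletedButOpen {
    public static void main(String[] args) throws Exception {
        // Create a dummy "segment" of about 64 MiB.
        Path path = Files.createTempFile("segment", ".log");
        Files.write(path, new byte[64 * 1024 * 1024]);

        // Open the file and "forget" to close the stream.
        FileInputStream leaked = new FileInputStream(path.toFile());

        // Delete the file: the directory entry is gone, but the open
        // descriptor still pins the blocks, so df does not go down and
        // lsof reports the file as "(deleted)".
        Files.delete(path);
        System.out.println("Deleted, but the space is still held; press Enter to force a GC");
        System.in.read();

        // Dropping the reference and forcing a GC lets the JVM's cleanup
        // close the descriptor (the same effect as jcmd <pid> GC.run),
        // and only then does the kernel release the space.
        leaked = null;
        System.gc();
        Thread.sleep(1000);
        System.out.println("Space released once the descriptor was closed");
    }
}
```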

Seems to really be an issue in the plugin. Will be fixed in #516

Seems to really be an issue in the plugin. Will be fixed in #516

Thanks, I'll pull it down later and recompile it locally for testing.

Seems to really be an issue in the plugin. Will be fixed in #516

I understand that the purpose of this PR is to introduce a ClosableInputStreamHolder, which uniformly handles the closing of all InputStreams generated during the copyLogSegmentData phase, ensuring that the streams are correctly closed. Is my understanding correct?

@ivanyu I have confirmed that this issue has been fixed by #516

I understand that the purpose of this PR is to introduce a ClosableInputStreamHolder, which uniformly handles the closing of all InputStreams generated during the copyLogSegmentData phase, ensuring that the streams are correctly closed. Is my understanding correct?

Yeah, that's correct. We forgot to close those streams and, through them, the open files. They lingered open until Java's internal cleanup machinery kicked in and closed the files.
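
A rough sketch of that pattern, for anyone following along: every stream opened while preparing a segment upload is registered in a holder, and closing the holder closes them all, even if the upload fails part-way. The names here are hypothetical and the actual class in #516 may differ:

```java
import java.io.Closeable;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of the pattern, not the class added in #516.
final class InputStreamHolder implements Closeable {
    private final List<InputStream> streams = new ArrayList<>();

    // Register a newly opened stream so it cannot be forgotten.
    <T extends InputStream> T add(final T stream) {
        streams.add(stream);
        return stream;
    }

    // Close every registered stream, collecting any failures.
    @Override
    public void close() throws IOException {
        IOException first = null;
        for (final InputStream stream : streams) {
            try {
                stream.close();
            } catch (final IOException e) {
                if (first == null) {
                    first = e;
                } else {
                    first.addSuppressed(e);
                }
            }
        }
        if (first != null) {
            throw first;
        }
    }
}
```

Used in a try-with-resources block around copyLogSegmentData, such a holder would release the descriptors as soon as the upload finishes instead of waiting for a GC.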

I understand that the purpose of this PR is to introduce a ClosableInputStreamHolder, which uniformly handles the closing of all InputStreams generated during the copyLogSegmentData phase, ensuring that the streams are correctly closed. Is my understanding correct?

Yeah, that's correct. We forgot to close those streams and, through them, the open files. They lingered open until Java's internal cleanup machinery kicked in and closed the files.

Thank you for your eagerness to help.