moby / moby

The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems

Home Page: https://mobyproject.org/


docker does not remove btrfs subvolumes when destroying container

phemmer opened this issue · comments

I receive the following error when deleting a container which created a btrfs subvolume (as happens when you run docker in docker).

# docker run --rm fedora:20 sh -c 'yum -y -q install btrfs-progs && btrfs subvolume create /test'
Public key for lzo-2.08-1.fc20.x86_64.rpm is not installed
Public key for e2fsprogs-libs-1.42.8-3.fc20.x86_64.rpm is not installed
Importing GPG key 0x246110C1:
 Userid     : "Fedora (20) <fedora@fedoraproject.org>"
 Fingerprint: c7c9 a9c8 9153 f201 83ce 7cba 2eb1 61fa 2461 10c1
 Package    : fedora-release-20-3.noarch (@fedora-updates/$releasever)
 From       : /etc/pki/rpm-gpg/RPM-GPG-KEY-fedora-20-x86_64
Create subvolume '//test'
FATA[0033] Error response from daemon: Cannot destroy container c9badf5fc87bb9bfb50a3ee6e5e7c840476bd704e62404c9136aab4d27239d1e: Driver btrfs failed to remove root filesystem c9badf5fc87bb9bfb50a3ee6e5e7c840476bd704e62404c9136aab4d27239d1e: Failed to destroy btrfs snapshot: directory not empty 

Info:

# docker info
Containers: 22
Images: 47
Storage Driver: btrfs
Execution Driver: native-0.2
Kernel Version: 3.13.2-gentoo
Operating System: Gentoo/Linux
CPUs: 8
Total Memory: 15.64 GiB
Name: whistler
ID: RL3I:O6RS:UJRN:UU74:WAGE:4X5B:T2ZU:ZRSU:BN6Q:WN7L:QTPM:VCLN
Username: phemmer
Registry: [https://index.docker.io/v1/]
WARNING: No swap limit support

# docker version
Client API version: 1.16
Go version (client): go1.3.3
OS/Arch (client): linux/amd64
Server version: 1.4.1
Server API version: 1.16
Go version (server): go1.3.3
Git commit (server): 5bc2ff8

@jfrazelle didn't you look into this? Did you get anywhere good? :)

still looking, I really want to fix this, but it is a permissions thing, because we are making the exact same syscall as btrfs-tools

but this is a duplicate issue

duplicate of #7773, closing, lmk tho if you believe differently

actually I like yours better, I'm going to close the other

This is something else, @jfrazelle. 😄

This is sub-subvolumes with BTRFS (i.e., docker-in-docker artifacts, etc.). This is why we had to add sort -r to our nuke script (so that it'd delete the inner subvolumes before the outer ones).
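The ordering trick can be sketched with plain directories (no btrfs needed): deeper paths sort lexically after their parents, so reversing the sort lists children before parents, which is the same property the nuke script relies on to delete inner subvolumes first. The paths below are invented for the demo:

```shell
# Demo of the sort -r ordering trick using ordinary directories.
# Reversing the lexical sort yields children before parents,
# i.e. a safe deletion order for nested subvolumes.
mkdir -p /tmp/demo_subvols/outer/inner
find /tmp/demo_subvols -mindepth 1 -type d | sort -r
# prints .../outer/inner before .../outer; for real subvolumes you
# would pipe this into:  xargs -r -n1 btrfs subvolume delete
```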

working on a port

but if someone else who is super good at C wants to do it go ahead :P

@tianon What are you using for a nuke script now?

Awesome, thank you!

Still hitting this. Are you planning to address this anytime soon?

I had problem with this today (BTRFS seemed out of space because of it) with version 1.9.1

Any progress?

@pkajaba I had to manually remove the nested subvolumes using btrfs directly.

I'm having this problem too.

Does anyone know of a workaround for it? The btrfs filesystem seems to be corrupted for me, and I need to at least make the machine's filesystem work somehow.


To anyone else having to deal with this, I had to do something similar to @TomasTomecek

┌[root@lovell] 
└[/var/lib/docker]> btrfs subvolume delete btrfs/subvolumes/*

^that worked for me ;)

Anyone know a way to distinguish the orphan subvolumes? I don't want to blow away my running containers.
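One hedged way to frame this (a sketch, not a vetted tool): treat it as a set difference between the subvolume directory names on disk and the IDs docker still knows about, using comm(1). The IDs below are placeholders; as noted further down this thread, image layers also live under btrfs/subvolumes, so use any such list for inspection only, never blind deletion:

```shell
# Set-difference sketch with comm(1): print lines only in the first file.
# Both inputs must be sorted. The IDs are placeholders standing in for
# subvolume directory names and docker-known layer IDs respectively.
printf 'aaa\nbbb\nccc\n' > /tmp/on_disk.txt  # e.g. ls /var/lib/docker/btrfs/subvolumes | sort
printf 'bbb\n' > /tmp/known.txt              # e.g. IDs from docker's metadata, sorted
comm -23 /tmp/on_disk.txt /tmp/known.txt     # candidates to inspect, NOT to auto-delete
```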

I encountered the issue in v1.13 in my CI environment. I eventually figured out that it occurred when multiple Jenkins jobs were doing docker system prune simultaneously on a btrfs filesystem. Is there any doc or information regarding thread-safety of the docker daemon?

@thechane
I wrote a script that traverses the docker structures, finds which directories are orphaned, and optionally deletes them. If it'd be useful to people I can put it up on github and share here. I've only done basic testing on Oracle Linux but it should apply to other OSes.

Fix embedded in "docker clean" would be nice

still hitting this bug. btrfs/subvolumes grows very fast.

@hipposareevil if you can post that script it would be super useful please :)

@johnharris85 Turns out that my script would negatively affect the system as it was deleting subvolumes that were tied to images. So if you tried to remove an image, it would be broken. It would work while you were running things, but then later when manipulating images you'd be hosed. :|

OK thanks @hipposareevil, any way to filter out the subvolumes that were being used by images as well? Mind posting the script anyway (even as a gist) so I can hack on it?

This is still an issue 😢

my subvolumes on btrfs grew to 222 GB

I've deactivated btrfs for now and I'm using overlay2

with docker 1.13+ you have the command below that can help:
docker system prune -f

Since people here say this still happens, maybe it might be a good idea to reopen this ticket?

@huegelc Does the overlay2 driver work on a btrfs file system? The documentation says only ext4 and xfs are supported.

Using btrfs commands I could remove those sub-volumes e.g. :

btrfs subvolume delete eb669bae4f4aa17f3c432d956f481146e4ac77e3f1803fee15e1f2b17787510d-init

@devopxy yea that works, but it doesn't solve the underlying problem... what I usually do is uninstall docker, then wipe /var/lib/docker completely with all files and btrfs subvolumes inside, then reinstall.
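For anyone following that route, here is a hedged outline of the full reset. The service name and paths assume a systemd host, and a dry-run guard is included because this destroys all docker state:

```shell
#!/bin/sh
# Full-reset outline as described above. DESTROYS ALL docker state when
# DRY_RUN=0; by default it only prints the commands it would run.
DRY_RUN=1
run() { if [ "$DRY_RUN" = 1 ]; then echo "would run: $*"; else "$@"; fi; }

run systemctl stop docker
# nested subvolumes must be deleted before a plain rm -rf can succeed
run sh -c 'btrfs subvolume delete /var/lib/docker/btrfs/subvolumes/*'
run rm -rf /var/lib/docker
run systemctl start docker   # docker recreates /var/lib/docker on start
```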

> Using btrfs commands I could remove those sub-volumes e.g. :
>
> btrfs subvolume delete eb669bae4f4aa17f3c432d956f481146e4ac77e3f1803fee15e1f2b17787510d-init

Thanks @devopxy, that worked for me too.
This command deletes all sub-volumes present in the current directory:
btrfs subvolume delete *

Just ran into this today and had to clean it out. There doesn't appear to be any real solution...is there?

Anyway, it seems that btrfs needs some periodic cleaning operations.
On my side, I use this function in my .bashrc to do it:

unalias btrfsCleanup 2>/dev/null
btrfsCleanup() {
    echo "btrfsCleanup"
    sudo btrfs fi show
    sudo btrfs fi df /
    sudo btrfs fi usage /
    sudo btrfs balance start -dusage=80 /
    sudo btrfs scrub start -d /
    sleep 120
    sudo btrfs fi df /var
    sudo btrfs fi usage /var
    sudo btrfs balance start -dusage=80 /var
    sudo btrfs scrub start -d /var
    sleep 120
    sudo btrfs fi df /var
    sudo btrfs fi usage /var
    echo "Done"
}


Please reopen and fix #9939!

I just had to cleanup due to the same reason!

Latest news: ticket #38207 is trying to get this one reopened; @thaJeztah needs someone who can reproduce the error and provide all the details asked for by the issue template.
I think there are many more listeners on this thread, so if someone is still experiencing this issue, there might be a chance.

Experiencing the same issue.

# docker system prune -f
Total reclaimed space: 0B
# du -sh /var/lib/docker/btrfs/subvolumes
16G	/var/lib/docker/btrfs/subvolumes

The following helps:

pushd /var/lib/docker/btrfs/subvolumes/
btrfs subvolume delete *
popd

> The following helps:
>
> pushd /var/lib/docker/btrfs/subvolumes/
> btrfs subvolume delete *
> popd

This may break your docker build cache and other stuff

> This may break your docker build cache and other stuff

What would you suggest?

ONLY WHEN ceph -s reports 'totally normal, everything you care about is running and all is perfect' -- do
docker system prune -a --volumes
If you do it under any other ceph operating condition -- not good.
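@hcoin's condition can be scripted as a simple guard — a sketch only, assuming `ceph health` prints a line containing HEALTH_OK on a healthy cluster. The prune command is echoed rather than executed here, so nothing destructive runs by accident:

```shell
# Guarded-prune sketch: only proceed when ceph reports HEALTH_OK.
# (Assumption: "ceph health" output contains HEALTH_OK when healthy.)
if ceph health 2>/dev/null | grep -q HEALTH_OK; then
  echo "cluster healthy; safe to run: docker system prune -a --volumes"
else
  echo "ceph not healthy (or absent); skipping prune"
fi
```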

@hcoin, This indeed helps:

$ docker system prune -a --volumes
WARNING! This will remove:
  - all stopped containers
  - all networks not used by at least one container
  - all volumes not used by at least one container
  - all images without at least one container associated to them
  - all build cache

Are you sure you want to continue? [y/N] y
Deleted Volumes:
...
Deleted Images:
...
Total reclaimed space: 5.033GB

docker system prune -a --volumes

does nothing for me and reports 0GB cleaned up. I have 22GB of subvolumes on a system with 16 running containers. It's a very low storage RPi system and I'm not sure what to do here without just nuking the /var/lib/docker folder and starting again.