Description
Background
I had this issue with 27.5.1 and now with 29.3.1 as well.
I've had a set of containers running in unRAID for a while now, but recently I noticed my services would go down randomly, often after a couple hours. Trying to docker ps resulted in a message saying it could not connect to the socket and asking whether the daemon was running.
It had indeed died.
Checking the error logs didn't reveal that much, but the debug logs gave me some hints: before the crash, I saw an unusual number of healthcheck commands being performed, particularly to my CrowdSec container; but that does not seem to be entirely correlated.
The only thing I know is that it is connected to healthchecks.
Things I tried and seemed not to have any influence in this:
- Using only bridge networks
- Using a macvlan network
- Changing backing filesystem (zfs, btrfs)
- Changing storage driver (zfs, btrfs, overlay2)
- Upgrading from unRAID 7.2.4 (Docker 27.5.1) to 7.3.0 (Docker 29.3.1)
- Not mounting the Docker socket to any containers
- Changing RAM - also tested with memtest86+ 8.00 with 16 passes
- Changing Motherboard
- Changing my boot device
Disabling the healthchecks has gotten rid of the issue 100%. It's now been running for about a week whereas previously I got at most 24h.
Reproduce
docker run my containers with healthcheck
- wait
- it crashes
Expected behavior
Docker Daemon should not die because of a healthcheck, no matter how bad the healthcheck might be.
docker version
Client:
Version: 29.3.1
API version: 1.54
Go version: go1.25.8
Git commit: c2be9cc
Built: Wed Mar 25 16:12:39 2026
OS/Arch: linux/amd64
Context: default
Server: Docker Engine - Community
Engine:
Version: 29.3.1
API version: 1.54 (minimum version 1.40)
Go version: go1.25.8
Git commit: f78c987a
Built: Wed Mar 25 16:14:56 2026
OS/Arch: linux/amd64
Experimental: false
containerd:
Version: v2.2.2
GitCommit: 301b2dac98f15c27117da5c8af12118a041a31d9
runc:
Version: 1.3.4
GitCommit: v1.3.4-0-gd6d73eb
docker-init:
Version: 0.19.0
GitCommit: de40ad0
docker info
Client:
Version: 29.3.1
Context: default
Debug Mode: false
Plugins:
buildx: Docker Buildx (Docker Inc.)
Version: v0.32.1
Path: /usr/libexec/docker/cli-plugins/docker-buildx
compose: Docker Compose (Docker Inc.)
Version: v2.40.3
Path: /usr/lib/docker/cli-plugins/docker-compose
Server:
Containers: 38
Running: 38
Paused: 0
Stopped: 0
Images: 32
Server Version: 29.3.1
Storage Driver: overlay2
Backing Filesystem: zfs
Supports d_type: true
Using metacopy: false
Native Overlay Diff: true
userxattr: false
Logging Driver: json-file
Cgroup Driver: cgroupfs
Cgroup Version: 2
Plugins:
Volume: local
Network: bridge host ipvlan macvlan null overlay
Log: awslogs fluentd gcplogs gelf journald json-file local splunk syslog
CDI spec directories:
/etc/cdi
/var/run/cdi
Swarm: inactive
Runtimes: io.containerd.runc.v2 runc
Default Runtime: runc
Init Binary: docker-init
containerd version: 301b2dac98f15c27117da5c8af12118a041a31d9
runc version: v1.3.4-0-gd6d73eb
init version: de40ad0
Security Options:
seccomp
Profile: builtin
cgroupns
Kernel Version: 6.18.20-Unraid
Operating System: Unraid OS 7.3 x86_64
OSType: linux
Architecture: x86_64
CPUs: 16
Total Memory: 62.58GiB
Name: NAS
ID: 7842bc14-0b75-4c17-af87-949347eb783b
Docker Root Dir: /var/lib/docker
Debug Mode: false
Experimental: false
Insecure Registries:
::1/128
127.0.0.0/8
Live Restore Enabled: false
Product License: Community Engine
Firewall Backend: iptables
WARNING: No swap limit support
Additional Info
Also posting here some of the logs from when the crashes happened and a redacted debug log:
The crashes don't all seem to be that related, but they all stopped when I turned off all healthchecks here.
Here are all the containers I'm running as of today. All images pulled just now:
IMAGE COMMAND CREATED STATUS
lscr.io/linuxserver/bazarr:latest "/init" 4 hours ago Up 4 hours
ghcr.io/immich-app/immich-server:v2.7.2 "tini -- /bin/bash -…" 4 hours ago Up 4 hours
ghcr.io/immich-app/immich-machine-learning:v2.7.2-openvino "tini -- python -m i…" 4 hours ago Up 4 hours
amir20/dozzle:latest "/dozzle" 4 hours ago Up 4 hours
ghcr.io/goauthentik/server:2026.2.2 "dumb-init -- ak ser…" 10 hours ago Up 10 hours
ghcr.io/goauthentik/server:2026.2.2 "dumb-init -- ak wor…" 10 hours ago Up 10 hours
lscr.io/linuxserver/code-server:latest "/init" 29 hours ago Up 29 hours
lscr.io/linuxserver/plex "/init" 29 hours ago Up 29 hours
lscr.io/linuxserver/qbittorrent:latest "/init" 42 hours ago Up 42 hours
lscr.io/linuxserver/radarr:latest "/init" 42 hours ago Up 42 hours
lscr.io/linuxserver/prowlarr:latest "/init" 42 hours ago Up 42 hours
stirlingtools/stirling-pdf "tini -- /scripts/in…" 42 hours ago Up 42 hours
lscr.io/linuxserver/sonarr "/init" 3 days ago Up 3 days
lscr.io/linuxserver/nextcloud:33.0.2 "/init" 5 days ago Up 5 days
redis:8.6.2-alpine "docker-entrypoint.s…" 5 days ago Up 5 days
redis:8.6.2-alpine "docker-entrypoint.s…" 5 days ago Up 5 days
gotenberg/gotenberg:8.29.1 "/usr/bin/tini -- go…" 5 days ago Up 5 days
apache/tika:3.3.0.0 "/bin/sh -c 'exec ja…" 5 days ago Up 5 days
crowdsecurity/crowdsec:v1.7.7 "/bin/bash /docker_s…" 5 days ago Up 9 hours
lscr.io/linuxserver/socket-proxy:latest "/docker-entrypoint.…" 5 days ago Up 5 days
redis:8.6.2-alpine "docker-entrypoint.s…" 5 days ago Up 42 hours
redis:8.6.2-alpine "docker-entrypoint.s…" 5 days ago Up 5 days
collabora/code:25.04.9.4.1 "/start-collabora-on…" 5 days ago Up 5 days
traefik:3.6.12 "/entrypoint.sh trae…" 10 days ago Up 42 hours
ghcr.io/seerr-team/seerr:latest "docker-entrypoint.s…" 11 days ago Up 6 days
santiagosayshey/profilarr:latest "/entrypoint.sh guni…" 11 days ago Up 6 days
lscr.io/linuxserver/lidarr:latest "/init" 11 days ago Up 6 days
flaresolverr/flaresolverr "/usr/bin/dumb-init …" 11 days ago Up 6 days
ghcr.io/paperless-ngx/paperless-ngx:2.20.13 "/init" 11 days ago Up 6 days
postgres:16.13-alpine "docker-entrypoint.s…" 11 days ago Up 6 days
postgres:16.13-alpine "docker-entrypoint.s…" 11 days ago Up 6 days
ghcr.io/immich-app/postgres:14-vectorchord0.4.3-pgvectors0.2.0 "/usr/local/bin/immi…" 11 days ago Up 6 days
valkey/valkey:9 "docker-entrypoint.s…" 11 days ago Up 6 days
postgres:16.13-alpine "docker-entrypoint.s…" 11 days ago Up 6 days
maxmindinc/geoipupdate:latest "/usr/bin/entry.sh" 11 days ago Up 6 days
qmcgaw/gluetun:v3 "/gluetun-entrypoint" 11 days ago Up 6 days
ghcr.io/jarvis2f/telegram-files:latest "/sbin/tini -- /bin/…" 2 weeks ago Up 6 days
c4illin/convertx:latest "bun run dist/src/in…" 2 weeks ago Up 6 days
Description
Background
I had this issue with 27.5.1 and now with 29.3.1 as well.
I've had a set of containers running in unRAID for a while now, but recently I noticed my services would go down randomly, often after a couple hours. Trying to
docker psresulted in a message saying it could not connect to the socket and asking whether the daemon was running.It had indeed died.
Checking the error logs didn't reveal that much, but the debug logs gave me some hints: before the crash, I saw an unusual number of healthcheck commands being performed, particularly to my CrowdSec container; but that does not seem to be entirely correlated.
The only thing I know is that it is connected to healthchecks.
Things I tried and seemed not to have any influence in this:
Disabling the healthchecks has gotten rid of the issue 100%. It's now been running for about a week whereas previously I got at most 24h.
Reproduce
docker runmy containers with healthcheckExpected behavior
Docker Daemon should not die because of a healthcheck, no matter how bad the healthcheck might be.
docker version
Client: Version: 29.3.1 API version: 1.54 Go version: go1.25.8 Git commit: c2be9cc Built: Wed Mar 25 16:12:39 2026 OS/Arch: linux/amd64 Context: default Server: Docker Engine - Community Engine: Version: 29.3.1 API version: 1.54 (minimum version 1.40) Go version: go1.25.8 Git commit: f78c987a Built: Wed Mar 25 16:14:56 2026 OS/Arch: linux/amd64 Experimental: false containerd: Version: v2.2.2 GitCommit: 301b2dac98f15c27117da5c8af12118a041a31d9 runc: Version: 1.3.4 GitCommit: v1.3.4-0-gd6d73eb docker-init: Version: 0.19.0 GitCommit: de40ad0docker info
Additional Info
Also posting here some of the logs from when the crashes happened and a redacted debug log:
The crashes don't all seem to be that related, but they all stopped when I turned off all healthchecks here.
Here are all the containers I'm running as of today. All images pulled just now: