文档简介:
可能原因
排查方法
检查磁盘空间是否不足
Normal Killing 39s (x735 over 15h) kubelet, 10.179.80.31 Killing container with id docker://apigateway:Need to kill Pod
检查是否存在 “i” 文件属性
现象描述
A file with the 'i' attribute cannot be modified: it cannot be deleted or renamed,
no link can be created to this file and no data can be written to the file.
Only the superuser or a process possessing the CAP_LINUX_IMMUTABLE
capability can set or clear this attribute.
Sep 27 14:37:21 VM_0_7_centos kubelet[14109]: E0927 14:37:21.922965
14109 remote_runtime.go:250] RemoveContainer "19d837c77a3c294052a
99ff9347c520bc8acb7b8b9a9dc9fab281fc09df38257" from runtime service
failed: rpc error: code = Unknown desc = failed to remove container
"19d837c77a3c294052a99ff9347c520bc8acb7b8b9a9dc9fab281fc09df38257":
Error response from daemon: container 19d837c77a3c294052a99ff9347c520
bc8acb7b8b9a9dc9fab281fc09df38257: driver "overlay2" failed to remove
root filesystem: remove /data/docker/overlay2/b1aea29c590aa9abda79f7
cf3976422073fb3652757f0391db88534027546868/diff/usr/bin/bash: operation not permitted
Sep 27 14:37:21 VM_0_7_centos kubelet[14109]: E0927 14:37:21.923027
14109 kuberuntime_gc.go:126] Failed to remove container "19d837c77a3
c294052a99ff9347c520bc8acb7b8b9a9dc9fab281fc09df38257": rpc error:
code = Unknown desc = failed to remove container "19d837c77a3c294052a
99ff9347c520bc8acb7b8b9a9dc9fab281fc09df38257": Error response from daemon:
container 19d837c77a3c294052a99ff9347c520bc8acb7b8b9a9dc9fab281fc09df38257:
driver "overlay2" failed to remove root filesystem: remove /data/docker
/overlay2/b1aea29c590aa9abda79f7cf3976422073fb3652757f0391db88534027546868
/diff/usr/bin/bash: operation not permitted
解决方法
chattr -i /data/docker/overlay2/b1aea29c590aa9abda79f7cf3976422073fb3652757f0391db88534027546868/diff/usr/bin/bash
检查是否存在 Docker 17 版本 bug
现象描述
Warning FailedSync 3m (x408 over 1h) kubelet, 10.179.80.31 error determining
status: rpc error: code = DeadlineExceeded desc = context deadline exceeded
解决方法
检查是否存在 Finalizers
现象描述
解决方法
检查是否存在低版本 kubelet list-watch 的 bug
Dockerd 与 containerd 状态不同步
现象描述
aufs au_opts_verify:1597:dockerd[5347]: dirperm1 breaks the protection by the permission bits on the lower branch
Sep 18 10:19:49 VM-1-33-ubuntu dockerd[4822]: time="2019-09-18T10:19:49.
903943652+08:00" level=error msg="Failed to log msg \"\" for logger json-file:
write /opt/docker/containers/54922ec8b1863bcc504f6dac41e40139047f7a84ff09175d
2800100aaccbad1f/54922ec8b1863bcc504f6dac41e40139047f7a84ff09175d2800100
aaccbad1f-json.log: no space left on device"
问题分析
$ docker-container-ctr --namespace moby --address /var/run/docker/containerd
/docker-containerd.sock task ls |grep a9a1785b81343c3ad2093ad973f4f8e52dbf54823b8bb089886c8356d4036fe0
a9a1785b81343c3ad2093ad973f4f8e52dbf54823b8bb089886c8356d4036fe0 30639 STOPPED