There is a known intermittent bug with docker which breaks some of
its functions, such as downloading images [1].
The details are being investigated, but most likely docker.service
start occasionally fails to create all the subfolders required
in /var/lib/docker. The workaround is a service restart.
With this change, there is a short wait time after which docker
health is checked and if the check fails the service is restarted.
Note the required subfolders are created almost immediately, so
the wait can be short.
Still, pmon tolerance is slightly increased to allow the repair
mechanism a couple retries before stepping in.
[1] https://bugs.launchpad.net/ubuntu/+source/docker.io/+bug/1950751
Test Plan:
PASS With pmon turned off and a sleep time of 10 secs,
deleted /var/lib/docker/tmp and restarted docker.
Then deleted /var/lib/docker/tmp dir during the 'sleep 10',
observed that an automatic '/bin/systemctl restart
docker.service' is triggered, docker is restarted and /tmp
recreated successfully.
PASS With pmon service up and using the proposed time intervals,
restarted docker service successfully without interference
between the two mechanisms
PASS Completed the following operations:
- AIO-SX install/bootstrap/unlock
- lock/unlock
- sudo reboot
with the following results:
- /var/lib/docker has all sub-directories
- applications applied
- docker service running
- pulled hello-world image
- no alarms
- no 'download failed' error messages in daemon.log
Partial-Bug: 1999182
Signed-off-by: Leonardo Fagundes Luz Serrano <Leonardo.FagundesLuzSerrano@windriver.com>
Change-Id: Ide2d214ea3c7efb3f2a24327c11ae55f90d5a9ce