Currently image-pull-progress-deadline is set to 2m. For larger images or slow pulls from busy registries, this is too short and needs to be increased. As it stands, pods end up in ImagePullBackOff when the cluster is upgraded and many images are pulled at the same time.
Increasing it to at least 5m would fix image pulls when a cluster is upgraded and restarted at the same time.
The problem is that when the cluster ends up pulling many images simultaneously (e.g., several large ones), they all end up timing out.
The issue happens with both the Docker registry and the GitLab registry. I tested with images of ~2GB; pulling 5 of them simultaneously on a droplet takes about 4-5 minutes.
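For a rough sense of the numbers, here is a small Python sketch of the arithmetic behind that test. It assumes the pulls share the node's bandwidth evenly and, as a simplification, that a whole batch of pulls has to finish inside the deadline. The function names are made up for illustration, and 1 GB is treated as 1000 MB.

```python
def aggregate_throughput_mb_s(image_gb: float, count: int, total_seconds: float) -> float:
    """Combined download rate implied by a measured parallel pull."""
    return image_gb * 1000 * count / total_seconds


def images_within(deadline_seconds: float, image_gb: float, throughput_mb_s: float) -> int:
    """How many images of this size finish inside the deadline at that combined rate.

    Simplified model (an assumption): all pulls share the link and the whole
    batch must complete before the deadline.
    """
    return int(throughput_mb_s * deadline_seconds // (image_gb * 1000))


if __name__ == "__main__":
    # Figures from the test above: 5 images of ~2GB in 4-5 minutes (270s midpoint).
    rate = aggregate_throughput_mb_s(image_gb=2, count=5, total_seconds=270)
    print(f"combined rate: about {rate:.0f} MB/s")
    for label, deadline in (("2m", 120), ("5m", 300)):
        print(f"{label}: {images_within(deadline, 2, rate)} image(s) of 2GB fit")
```

Under those assumptions, a 2m window leaves room for far fewer 2GB images than the 5 that an upgrade tried to pull at once, while 5m covers the batch I measured.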