Cilium & Kubernetes DO Cluster Offering

Posted March 5, 2019 3.2k views

I’ve been playing with the Kubernetes cluster offering for the past month. Running 1.13.3-do.0 in SFO2, I noticed that the Cilium pods and the coordinator would crash once in a while. Because Cilium controls the overlay network, these crashes cause routing outages. Starting this past Friday, I had to recycle the pods because the routing failures left the Kubernetes API unable to execute changes. This past weekend, the Cilium pods' failure rate escalated, and the failures have now halted the cluster multiple times.

2 answers

Hey Peter, do you happen to have some more information on the crash itself (e.g. Cilium log)? Thanks!

I will post logs with the next occurrence.

It may relate to the following:

  • Which version of Cilium are you running? AFAIK, the latest from DO should be 1.4.1, which should have the above two fixed. In any case, logs would help in looking at the failure you’re seeing. Thanks!
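
For anyone gathering the two things asked for in this thread (the Cilium version and the logs from a crashed agent), here is a minimal sketch of one way to collect them. It assumes `kubectl` is installed and pointed at the cluster, that the Cilium agent pods run in the `kube-system` namespace, and that they carry the `k8s-app=cilium` label; none of that is confirmed in this thread, so adjust as needed. The helper names are hypothetical and only for illustration.

```python
import subprocess

NAMESPACE = "kube-system"    # assumption: namespace where the Cilium agents run
SELECTOR = "k8s-app=cilium"  # assumption: label carried by the Cilium agent pods


def list_pods_cmd():
    """Command that lists Cilium pods, one 'pod/<name>' entry per line."""
    return ["kubectl", "-n", NAMESPACE, "get", "pods", "-l", SELECTOR, "-o", "name"]


def previous_logs_cmd(pod):
    """Command that fetches logs from the previous (crashed) container of a pod."""
    return ["kubectl", "-n", NAMESPACE, "logs", pod, "--previous"]


def version_cmd(pod):
    """Command that runs 'cilium version' inside the agent pod."""
    return ["kubectl", "-n", NAMESPACE, "exec", pod, "--", "cilium", "version"]


def run(cmd):
    """Run a command and return its stdout; a failing command yields what it printed."""
    result = subprocess.run(cmd, capture_output=True, text=True, check=False)
    return result.stdout


def collect():
    """Gather the version and previous-container logs for every Cilium pod."""
    report = []
    for entry in run(list_pods_cmd()).split():
        pod = entry.split("/", 1)[-1]  # strip the 'pod/' prefix from '-o name' output
        report.append(f"== {pod} ==")
        report.append(run(version_cmd(pod)))
        report.append(run(previous_logs_cmd(pod)))
    return "\n".join(report)
```

Calling `print(collect())` on a machine with cluster access prints one section per pod. The `--previous` flag only returns output for a container that has already restarted at least once, so a pod that has not crashed yet will show an empty log section.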