Best Practices in Monitoring a Kubernetes Cluster With Prometheus, Grafana and Loki

Published on April 12, 2022
By Kim Schlesinger

Developer Advocate

About the Talk

If you’re responsible for a Kubernetes cluster, you need to know how to monitor its health and troubleshoot problems. Learn how to collect metrics from your cluster, set up alerts, and send notifications to the right people when something goes wrong.

If you’re looking for a managed Kubernetes hosting service, check out our simple, managed Kubernetes service built for growth.

What You’ll Learn

  • Setting up monitoring and logging using Prometheus, Grafana, and Loki
  • The five key Kubernetes health metrics to monitor
  • Setting up Alertmanager to send alerts to an on-call engineer
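
As a rough illustration of what collecting metrics from Prometheus can look like in practice, here is a minimal Python sketch. It is not taken from the talk. It assumes a Prometheus server reachable at a hypothetical address, uses Prometheus's instant-query endpoint (`/api/v1/query`) and its built-in `up` metric, and parses a hand-written sample response rather than real cluster data. The function names are made up for this example.

```python
import json
from typing import List
from urllib.parse import urlencode


def build_query_url(base_url: str, promql: str) -> str:
    """Build the instant-query URL for the Prometheus HTTP API."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': promql})}"


def down_targets(response_body: str) -> List[str]:
    """Return the `instance` label of every series whose sample value is 0."""
    payload = json.loads(response_body)
    if payload.get("status") != "success":
        raise ValueError(f"query failed: {payload.get('error', 'unknown error')}")
    return [
        series["metric"].get("instance", "<unknown>")
        for series in payload["data"]["result"]
        # Each sample is [unix_timestamp, "value-as-string"].
        if float(series["value"][1]) == 0.0
    ]


# Hypothetical sample, shaped like an instant-query response for `up`.
# The instances and values are invented for illustration only.
SAMPLE_RESPONSE = json.dumps({
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {"metric": {"__name__": "up", "instance": "10.0.0.1:9100", "job": "node"},
             "value": [1649750400.0, "1"]},
            {"metric": {"__name__": "up", "instance": "10.0.0.2:9100", "job": "node"},
             "value": [1649750400.0, "0"]},
        ],
    },
})

if __name__ == "__main__":
    # The host name below is an assumption, not a real endpoint.
    print(build_query_url("http://prometheus.example:9090", "up"))
    print(down_targets(SAMPLE_RESPONSE))
```

In a real setup, the response body would come from an HTTP GET against the built URL, and alerting on unreachable targets would normally be handled by Prometheus alerting rules and Alertmanager rather than by a script like this.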

This Talk Is Designed For

Anyone who is setting up production Kubernetes clusters.

Prerequisites

  • You’ve connected to a Kubernetes cluster with the command-line tool kubectl.
  • You’ve deployed workloads in a Kubernetes cluster.

Resources

Prometheus Setup Stack

Loki Setup Stack

Kubernetes Monitoring Stack

About the author

Kim Schlesinger
Developer Advocate

I'm a developer advocate at DigitalOcean focusing on Kubernetes and other Cloud Native technologies.

Both Prometheus Setup Stack and Loki Setup Stack Resources links are broken.

Is it a good idea to set up monitoring in the same cluster that we are trying to monitor?

What if the cluster itself is down/busy/overwhelmed and we cannot connect to it?

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.