GCP Checklist 6: Logging, Monitoring and Alerting (maintaining reliability)

Grace
Google Cloud - Community
1 min read · Dec 15, 2018

When it comes to maintaining reliability in your systems, understanding how they typically behave is crucial. Once you understand the typical behaviour, you will be in a position to identify anomalies and act upon them. This requires setting up an appropriate framework for logging, monitoring and alerting.
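To make "understand the typical behaviour, then spot the anomalies" a little more concrete, here is a minimal sketch in Python. It assumes you already have a baseline of measurements; the latency numbers, the function name `find_anomalies` and the threshold of three standard deviations are all illustrative assumptions, not something GCP prescribes.

```python
from statistics import mean, stdev


def find_anomalies(baseline, recent, threshold=3.0):
    """Return the values in `recent` that sit more than `threshold`
    standard deviations away from the mean of `baseline`."""
    mu = mean(baseline)
    sigma = stdev(baseline)
    if sigma == 0:
        # A perfectly flat baseline: anything different is unusual.
        return [x for x in recent if x != mu]
    return [x for x in recent if abs(x - mu) / sigma > threshold]


# Hypothetical request latencies in milliseconds.
baseline_latencies_ms = [102, 98, 101, 99, 100, 103, 97, 100]
recent_latencies_ms = [101, 99, 250, 100]

anomalies = find_anomalies(baseline_latencies_ms, recent_latencies_ms)
print(anomalies)  # [250]
```

A real system would usually lean on the monitoring service for this kind of comparison, but the idea is the same: the baseline defines "typical", and anything far outside it is worth a look.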

Logging: you need to collect and analyse logs to look for application anomalies and to audit your applications and environments.
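Logs are much easier to collect and analyse when each entry is structured. The sketch below uses only the Python standard library to write one JSON object per log line. The field names (`severity`, `message`, `logger`), the `context` key and the `checkout` logger are assumptions chosen for illustration; check what your log collector expects before copying them.

```python
import json
import logging
import sys


class JsonFormatter(logging.Formatter):
    """Format each log record as a single line of JSON."""

    def format(self, record):
        entry = {
            "severity": record.levelname,
            "message": record.getMessage(),
            "logger": record.name,
        }
        # Optional structured fields passed via extra={"context": {...}}.
        context = getattr(record, "context", None)
        if context:
            entry.update(context)
        return json.dumps(entry)


def build_logger(name, stream=sys.stdout):
    """Create a logger that writes JSON lines to the given stream."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.handlers.clear()
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.addHandler(handler)
    logger.propagate = False
    return logger


# Hypothetical application logger and event.
logger = build_logger("checkout")
logger.warning(
    "payment retry",
    extra={"context": {"order_id": "A-123", "attempt": 2}},
)
```

Because every entry carries the same keys, you can filter on things like `order_id` or `severity` later instead of grepping free text.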

Monitoring: this is closely related to logging, and the two often go hand in hand. A typical monitoring solution consists of some way to collect metrics, dashboards to view the status of your systems and applications, and a way to send alerts. You need to instrument your system to provide meaningful metrics.
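As a rough illustration of "collect metrics, then alert on them", here is a small in-memory sketch. The `Metrics` class, the metric names and the 5% error-rate threshold are all hypothetical; in practice you would send these measurements to your monitoring service and define the alert there.

```python
import time
from collections import defaultdict


class Metrics:
    """A tiny in-memory metrics registry (illustrative only)."""

    def __init__(self):
        self.counters = defaultdict(int)
        self.timings = defaultdict(list)

    def increment(self, name, amount=1):
        self.counters[name] += amount

    def observe(self, name, seconds):
        self.timings[name].append(seconds)


def handle_request(metrics, succeed):
    """Simulated request handler instrumented with a counter and a timer."""
    start = time.perf_counter()
    metrics.increment("requests_total")
    try:
        if not succeed:
            raise RuntimeError("simulated failure")
    except RuntimeError:
        metrics.increment("requests_failed")
    finally:
        metrics.observe("request_seconds", time.perf_counter() - start)


def check_alert(metrics, max_error_rate=0.05):
    """Return an alert message if the error rate is above the threshold."""
    total = metrics.counters["requests_total"]
    rate = metrics.counters["requests_failed"] / total if total else 0.0
    if rate > max_error_rate:
        return f"ALERT: error rate {rate:.0%} exceeds {max_error_rate:.0%}"
    return None


metrics = Metrics()
for succeed in [True, True, True, False, True, True, True, True, True, False]:
    handle_request(metrics, succeed)

print(check_alert(metrics))  # ALERT: error rate 20% exceeds 5%
```

The point is the shape rather than the code: the handler emits measurements, and a separate rule decides when those measurements deserve someone's attention.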

GCP has logging and monitoring services that are available as part of the platform.

Here are some references that are a good place to start:

https://cloud.google.com/logging/docs/

https://cloud.google.com/monitoring/audit-logging

https://cloud.google.com/monitoring/docs/

https://cloud.google.com/monitoring/alerts/using-alerting-ui

https://cloud.google.com/solutions/design-patterns-for-exporting-stackdriver-logging

https://cloudplatform.googleblog.com/2018/03/best-practices-for-working-with-Google-Cloud-Audit-Logging.html

https://cloud.google.com/blog/products/management-tools/building-a-more-reliable-infrastructure-with-new-stackdriver-tools-and-partners

And here’s your checklist:

A list of all the checklists in the series can be found here.

