You are running a production application on Compute Engine. You want to monitor the key metrics of CPU, Memory, and Disk I/O time. You want to ensure that the metrics are visible by the team and will be explorable if an issue occurs. What should you do? (Choose 2)
A. Set up logs-based metrics based on your application logs to identify errors.
B. Export key metrics to a Google Cloud Function and then analyze them for outliers.
C. Set up alerts in Stackdriver Monitoring for key metrics breaching defined thresholds.
D. Create a Dashboard with key metrics and indicators that can be viewed by the team.
E. Export key metrics to BigQuery and then run hourly queries on the metrics to identify outliers.
Disclaimer
This is a practice question. There is no guarantee of coming this question in the certification exam.
Answer
C, D
Explanation
A. Set up logs-based metrics based on your application logs to identify errors.
(There are easier ways to get metrics like CPU, Memory, etc. Log based metrics will give application logs.)
B. Export key metrics to a Google Cloud Function and then analyze them for outliers.
(Extra efforts and not exact solution. External to Stackdriver/Operations stack.)
C. Set up alerts in Stackdriver Monitoring for key metrics breaching defined thresholds.
(There is no ask in the question for alerting.)
D. Create a Dashboard with key metrics and indicators that can be viewed by the team.
(Requirement to have metrics visible to the team. The Operations Dashboard is convenient, built in, and configuratble.)
E. Export key metrics to BigQuery and then run hourly queries on the metrics to identify outliers.
(Again, extra efforts. Running hourly queries delays things. Ideally, we want all analytics in Stackdriver/Operations UI.)