Skip to content

Glossary

Quick Lookup

Common terms used in this module:

  • System Monitoring - observing system behavior using operational metrics
  • Signal - a derived metric designed to summarize system behavior
  • Anomaly - a value that falls outside expected limits
  • System State - an overall interpretation of system health
  • Operational Decision - an action taken in response to system behavior

System Monitoring

The practice of observing system behavior using measurements. Monitoring systems help detect issues early and maintain reliability. Examples:

  • request volume
  • error rates
  • response times

Signal

A signal is a derived metric that helps summarize system behavior. Signals help analysts understand how a system is performing. Signals are often easier to interpret than raw measurements. Examples:

  • error rate
  • average latency
  • throughput

Anomaly

An anomaly is a value that falls outside the expected range. Anomalies may indicate problems or unusual system conditions. Examples:

  • unusually high error rate
  • unusually slow response time
  • unexpected drop in requests

System State

A summary description of the system’s overall condition. System state helps translate monitoring data into operational insight. Examples include:

  • STABLE
  • DEGRADED
  • FAILURE

Operational Decision

An action taken in response to monitoring information. Examples include:

  • investigating system errors
  • scaling infrastructure
  • rolling back a deployment

Continuous intelligence systems support operational decisions by providing timely information about system behavior.