Set up and Manage Canary Tests

Pulse automates canary tests to continuously monitor the real-time performance of key data services such as HDFS, Hive, and Impala. These tests track critical metrics such as data transfer rates, query execution times, and error rates, helping you detect performance issues before they affect end users.

Available Canary Actions

You can configure and run specific canary operations for the services mentioned below.

HDFS Canary Operations

  • hdfs-canary-create-read-delete-file
  • hdfs-canary-list-dir

Hive Canary Operations

  • hive-canary-create-delete-table
  • hive-canary-select

Impala Canary Operations

  • impala-canary-create-delete-table
  • Impala-canary-select

How It Works

You can configure and schedule these canary actions to run automatically as cron jobs.

Pulse collects key performance metrics such as latency, throughput, and error rates from services, stores them in the Pulse database, and visualizes the data through intuitive dashboards on the Admin > Dashplot Studio page in the Pulse UI.

This provides continuous insight into system behavior and helps ensure consistent cluster performance and stability.

Benefits of Canary Tests

  • Early Detection: Identify performance degradations before they impact end users.
  • Service Reliability: Continuously validate the responsiveness and stability of HDFS, Hive, and Impala.
  • Operational Insights: Measure throughput, latency, and error rates to optimize cluster performance.
  • Automation: Reduce manual monitoring by scheduling automated canary tests.
  • Historical Trends: Track long-term performance patterns to detect anomalies and plan capacity effectively.

Next Steps

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard