Pulse automates canary tests to continuously monitor the real-time performance of key data services such as HDFS, Hive, and Impala. These tests track critical metrics such as data transfer rates, query execution times, and error rates, helping you detect performance issues before they affect end users.
Available Canary Actions
You can configure and run specific canary operations for the services mentioned below.
HDFS Canary Operations
- hdfs-canary-create-read-delete-file
- hdfs-canary-list-dir
Hive Canary Operations
- hive-canary-create-delete-table
- hive-canary-select
Impala Canary Operations
- impala-canary-create-delete-table
- Impala-canary-select
How It Works
You can configure and schedule these canary actions to run automatically as cron jobs.
Pulse collects key performance metrics such as latency, throughput, and error rates from services, stores them in the Pulse database, and visualizes the data through intuitive dashboards on the Admin > Dashplot Studio page in the Pulse UI.
This provides continuous insight into system behavior and helps ensure consistent cluster performance and stability.
Benefits of Canary Tests
- Early Detection: Identify performance degradations before they impact end users.
- Service Reliability: Continuously validate the responsiveness and stability of HDFS, Hive, and Impala.
- Operational Insights: Measure throughput, latency, and error rates to optimize cluster performance.
- Automation: Reduce manual monitoring by scheduling automated canary tests.
- Historical Trends: Track long-term performance patterns to detect anomalies and plan capacity effectively.
Next Steps
For details about enabling the canary tests, see Enable Canary Test Actions.
For configuration details, see:
After configuration, you can view performance trends. For details, see Visualize Canary Test Performance Metrics.