Kudu Dashboard
The Kudu Dashboard shows metrics in the form of summary panels and charts covering Master and Tablet Server health, Block Cache and RPC performance, and RowSet activity and transaction summary to help you understand service status and operational activity.
Master and Tablet Server Health Overview
Summary Panel
This panel summarizes the health and status of Kudu’s Master and Tablet Servers, including server counts, data directory usage, and Raft leadership roles. It helps you quickly assess availability and storage conditions.
Metric | Description |
---|---|
Number of Master Servers | Shows the number of master servers currently registered and running in the cluster. |
Number of Tablet Servers | Shows the number of tablet servers currently registered and running in the cluster. |
Master Server | This panel shows the count and status of data directory paths on the master process.
|
Tablet Server | This panel shows the count and status of data directory paths on the tablet process.
|
Number of Raft Leaders | Master: Shows the number of master replicas that are Raft leaders. Tablet: Shows the number of tablet replicas that are Raft leaders. |
Charts
The charts display CPU and memory usage for Kudu Master and Tablet Servers, helping you track resource consumption and identify potential bottlenecks on each node.
Metric | Description |
---|---|
CPU Master Server | Shows the CPU time spent processing Kudu Master processes on each node. |
CPU Tablet Server | Shows the CPU time spent processing Kudu Tablet processes on each node. |
Memory Master Server | Shows the memory usage of master server processes on each node. |
Memory Table Server | Shows the memory usage of tablet server processes on each node. |
Block Cache and RPC Performance Summary
Summary Panel
This panel displays the number of block cache insertions and lookup operations performed on the tablet server.
Metric | Description |
---|---|
Master Block Cache Inserts | Shows the number of blocks added to the master’s block cache. |
Master Block Cache Lookups | Shows number of times the master block cache was accessed. |
Tserver Block Cache Inserts | Shows the number of blocks added to the tablet server’s block cache. |
Tserver Block Cache Lookups | Shows the number of times the tablet server block cache was accessed. |
Charts
The charts show RPC connection rates and high-percentile latency metrics for Master and Tablet servers, helping you identify network delays and server responsiveness issues.
Metric | Description |
---|---|
RPC Rates | Shows the number of incoming TCP connections made to the RPC server. |
RPC Latency | Shows the 99.99th percentile RPC queue time (in microseconds) for Tablet and Master servers. |
RowSet Activity and Transaction Summary
Summary Panel
This panel indicates the number of active rowsets (on-disk or in-memory segments) in each tablet, helping to understand data fragmentation and compaction needs.
Metric | Description |
---|---|
RowSet Memory Size | Shows the amount of memory currently used by in-memory RowSets. |
RowSet Disk Size | Shows the total disk space occupied by RowSets persisted on disk. |
Running Rowset Compactions | Shows the number of RowSet compaction processes currently in progress. |
Rows Deleted | Shows the number of rows marked for deletion across the dataset. |
Charts
The charts display row-level operations and transaction activity, helping you monitor data changes, track slow scans, and assess overall write and read performance across nodes.
Metric | Description |
---|---|
Rows Inserted | Shows the number of row insert operations over time for each node. |
Rows Updated | Shows the count of row updates happening across nodes during the selected time range. |
Rows Deleted | Tracks how many rows were deleted on each node over time. |
Total Write Transactions Inflight | Indicates the number of ongoing write transactions at a given time. |
Total Transactions In-flight | Represents all inflight (active) transactions both read and write per node. |
Slow Scans | Shows the number of scan operations that exceed the configured latency threshold. |