LLAP Query Details

The LLAP Query Details page contains the following panels.

  • Summary
  • Query Trends
  • Recommendations
  • Query
  • YARN Diagnostics
  • MapReduce Stats
  • Query Execution Metrics
  • Query Plan and DAG

The summary panel displays the following information.

Field NameDescription
UserThe name of the user that executed the job.
StateThe state of the job that can be one of the following: Created, Initialized, Compiled, Running, Finished, Exception, or Unknown.
DurationThe duration of the query execution.
Start TimeThe time at which the query execution started.
End TimeThe time at which the query execution ended.
# of VerticesThe number of vertices.
HDFS Data ReadThe amount of HDFS data read.
HDFS WrittenThe amount of HDFS data written to an output file format.
Application IDThe ID of the application.

The Query Trends panel displays a chart showing the pattern of jobs running at a particular time, based on the following factors.

MetricDescription
Elapsed TimeThe time taken to run the jobs at a particular time.
VCoresThe number of VCores consumed to execute the query within a timeframe.
MemoryThe amount of memory used to execute the query within a timeframe.

Comparing Runs

Click Compare Runs to compare different runs of the query. Select the runs that you want to compare. You can choose from upto 10 previous runs of the query. The metrics that are different are highlighted and displayed at the very top of the comparison result.

Query

The Query panel displays the query along with the Join details and the following details of table(s) used in the query.

ColumnDescription
Table NameThe table used in the query.
Filter expressionThe expression used in query filtering.
Total RowsThe total number of rows in the table.
Output RowsThe number of rows returned on executing the query.

Tables with Small Files

If any of the table accessed by the query has small files attached to it, you can see those table names with an yellow border. The legend at the top of the Query table indicates the same.

YARN Diagnostics

This panel displays the following diagnostics metrics of YARN.

Column NameDescription
Start TimeThe time at which the YARN application started.
End TimeThe time at which the YARN application ended.
StateThe state of the YARN application. The state can be one of the following: Created, Initialized, Compiled, Running, Finished, Exception, or Unknown.
MessageThe diagnostic message in the YARN application.
Message CountThe number of diagnostic messages.

The following details are displayed for jobs in a YARN container.

Note: A row contains data for a minute of the selected duration.

Column NameDescription
TimeThe minute at which the job is executed.
Preempted MBThe amount of processes that need priority to run the job.
Preempted VCoresThe number of VCores that need priority to run the job.
Allocated MBThe amount of memory allocated to the query (in Mb).
Avg MemoryThe average a mount of memory used.
Avg VCoreThe average amount of VCores used.
Running ContainersThe number of containers running in the query.
Queue Usage %The amount of queue usage (in %).
Cluster Usage %The amount of cluster usage (in %).
StateThe state of the query using YARN application. The state can be one of the following: Created, Initialized, Compiled, Running, Finished, Exception, or Unknown.
MessageThe diagnostic message.

Map Reduce Stats

This tile displays the statistics of processing of large data sets on a worker node. You can monitor the statistics of following processes by elapsed time.

  • Mappers
  • Reducers

These statistics can be sorted by Duration and Start Time.

Query Execution Metrics

The Query Execution Metrics panel displays the following set of metrics.

Metric TypeMetric NameDescription
TaskOutput BytesThe amount of output bytes written to a file format while executing the query at a given time.
Output Bytes With OverheadThe amount of additional output bytes.
Output Bytes PhysicalThe amount of output bytes written to physical storage.
Input Records ProcessedThe number of input records processed.
Input Split Length BytesThe size of logical division of query records.
Shuffle Phase TimeThe time taken to shuffle a query phase.
Output RecordsThe number of output records.
Additional Spills Bytes WrittenThe additional size of spilled records written (in bytes).
Spilled RecordsThe number of spilled records.
Shuffle BytesThe number of shuffle bytes the query uses.
DAG MetricsTotal Launched TasksThe number of tasks launched in DAG.
Rack Local TasksThe number of tasks that are local to the rack.
Data Local TasksThe number of tasks that are local to the data.
# of Killed TasksThe number of tasks that terminated.
# of Succeeded TasksThe number of tasks that completed successfully.
Application Master Cpu TimeThe time taken by the CPU in application master.
Application Master GC TimeThe time taken by the GC in the application master.
Wall Clock TimeThe time elapsed in completing tasks in DAG.
LLAP IO MetricsNum Vector BatchesThe number of vector batches for processing.
HDFS TimeThe time taken by HDFS to process LLAP queries.
Consumer TimeThe time taken by a consumer to process LLAP queries.
Cache Hit BytesThe size of cache hits (in bytes).
Rows EmittedThe number of rows emitted by LLAP queries.
Total I/O TimeThe time utilized in input and output operations.
Metadata Cache HitThe number of hits of metadata cache.

Query Plan and DAG

The panel displays the distribution of query logic in the form of a DAG and a physical execution plan.

Plan

Plan is a logical representation of how the Tez engine executes the query, where a query is broken into different logical plans.

DAG

The Direct Acyclic Graph (DAG) is an execution graph that displays a flow diagram of the compiled SQL queries. This graph is a work scheduling graph with finite elements connected in edges and vertices. The order of execution of the jobs in DAG is specified by the directions of the edges in the graph. The graph is acyclic as it has no loops or cycles.

DAG

DAG

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard