Impala Queries

Click Impala → Queries to view the list of queries. By default, queries that completed execution over the past 24 hours are displayed.

There are two ways to view the metrics. By default, the metrics are shown in the Group By view. However, you can choose to group them by User or Resource Pool.

Default View

The following table displays the default view of the query metrics with detailed information.

MetricsDescription
Query IDThe ID of the query
DatabaseThe name of the database
Resource PoolThe name of the resource pool
DurationThe duration taken to run the query
VarianceThe difference between the actual and the estimated duration
StateThe state of the query
UserThe name of the user who initiated the query
Query TypeThe type of the query
CoordinatorThe caches metadata for all table partitions and data files
#RowThe number of rows in the query
Thread CPU TimeThe sum of the CPU time used by all threads of the query.
Per Node Peak Memory UsageThe peak memory usage per node.
HDFS Aggr Read MemoryThe sum of HDFS bytes read from memory by the query.
Start TimeThe Query start time
Completion TimeThe query completion time
Statistics CorruptRepresents the number of tables or partitions with corrupt or unusable statistics. Corrupt stats can mislead the query planner, potentially resulting in inefficient query execution plans.
Statistics MissingIndicates the number of tables or partitions involved in the query that are missing table or column statistics. Lack of statistics can lead to suboptimal query plans, as the query planner relies on these stats for cost-based optimization.

Group by User or Resource Pool View

You can group the metrics by User or Resource Pool to view them accordingly.

Perform the following steps to group the metrics:

  1. Click Group-By drop-down menu in the top right corner of the page, the drop-down menu is displayed.

  2. Select the type of grouping you want to display from the following list:

    1. User
    2. Resource Pool
  3. The details by selected type is grouped in the table.

  4. (Optional)To ungroup the queries, select Ungroup.

Group by Resource Pool and User

Group by Resource Pool and User

The following tables provides description of the metrics displayed for each job group:

MetricsDescription
Group NameThe name of the group.
Query CountThe total number of queries
FinishedThe total number of completed queries
FailedThe total number of failed queries associated by a particular group.
RunningThe total number of running queries associated by a particular group.
Avg DurationThe average time taken to execute a query by a particular group.
Total DurationThe total time taken to execute a query by a particular group.
Avg WaitingsThe average wait time for a query
Total WaitingsThe total waiting time for a query

Click on the group row to view the details of the jobs in that group.

Group Details

Group Details

Search Option

Perform the following steps to search for a record in the search bar.

  1. Select a search parameter. Pulse provides you multiple search parameters.
  2. Select a suitable operator for your parameter.
  3. Enter the search term and hit the enter key.

Important

For String Data type fields, you can use the =~ operator. This operator allows you to search strings by entering partial string. You can also use regular expressions to search a required value.

Adding/Removing Columns

To apply filter to the column of the table perform the following:

  1. Click the table settings icon. The Select Columns window is displayed.
  2. Select/deselect the checkbox from the the available list of columns names. By default all the checkboxes is selected.
  3. Click Apply.
  4. (Optional) You can search for the name of the column using the search box.

The following metrics are displayed for each query:

MetricDescription
Query ID

The ID of the query. The following action can be performed on the Query ID:

  • Click on the Query ID to view details of the query on the Impala Query Details page.
  • Click , to view the incident groups for the query. The exponent indicates the number of groups of query which it belongs.
  • To display the concurrency of queries, click the ... icon and click View Concurrency.
  • To copy the query ID, click the icon when you hover over the query ID. The query ID is copied to the clipboard.
DatabaseThe database on which the query was run.
Resource PoolThe name of the resource pool used by the Query.
DurationThe total time taken by the query to run.
VarianceVariance indicates if a job is slower than previous runs. It is calculated as (avgValue - currentValue)/stdDeviation.
StateThe state of the query, whether Running, Succeeded, Exception, or Finished. Click on the status to view details on the Impala Query Details page.
UserThe user that ran the query.
Query TypeThe type of the query, whether DDL, DML, or Query.
CoordinatorThe name of coordinator node used by the query.
# of RowsThe number of rows produced by the query.
Thread CPU TimeThe sum of the CPU time used by all threads of the query.
Per Node Peak Memory UsageThe peak memory usage per node.
HDFS Aggr Read MemoryThe sum of HDFS bytes read from memory by the query.
Start TimeThe date and time at which the query started running.
Completion TimeThe date and time at which the query completed.

Incidents

To hide or display the Incident details panel, click the Incidents button. By default, the Incident details are displayed.

You can view the incidents that were raised for the queries in the Incidents panel. Select an incident group to filter the queries displayed in the table.

Impala Query Incidents Panel

Impala Query Incidents Panel

View incidents

Click View Incident to view the list of incidents in the Incidents window. Click an incident in the Incident panel to view its details. The following table provides details of the data displayed in the Incidents window:

NameDescription
NameName of the incident group.
DurationDisplays the start date and time and the end date and time of the incident.
HostName of the host.
StatusDisplays the status, whether it is raised or cleared. To change the status from raised to cleared, click Clear.

To view the details of an incident click . The details of the incidents is displayed. To hide the details, click .

Impala Query Incident Details

Impala Query Incident Details

Advanced Search Options

The search field in the Impala Queries Page supports Influx-like query statements. For example, if you want to search for a query, use the format parametername:value. However, you need to know the full name of the parameter to be able to use it. For example, the name of the parameter visible in the UI is User. However, internally, the parameter name is effective_user. Hence, if you want to search for queries with User = Mark, you need to type effective_user:Mark.

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard