MapReduce Queries

Click MapReduceQueries to view the list of MapReduce queries. The Queries page consists of a filter panel, a Sankey Diagram, and charts that display information about queries that you can group by Users or Queries.

Sankey Chart

The Sankey diagram displays the flow of jobs from a selected queue to users and to the queries.

You can view the Sankey chart by the following distributions:

  • Duration
  • Mappers
  • Reducers
  • GC Time
  • Reducer Time Avg
  • Reducer Time Max
  • Shuffle Time Avg
  • Shuffle Time Max
  • Sort Time Avg
  • Sort Time Max

The default time range is Last 24 hrs. To view statistics from a custom date range, click the icon and select a time frame and timezone of your choice.

Query Details

The following metrics are displayed for each query group:

MetricDescription
Group NameThe name of the group filtered by User or Queries.
Query CountThe number of queries in the group.
Failed QueriesThe number of queries that failed to execute.
Running QueriesThe number of currently running queries.
Avg MemoryThe average amount of memory consumed by a query group.
Total MemoryThe total memory consumed by a query group.
Avg VCoreThe average number of VCores used by a query group.
Total VCoreThe total number of VCores used by a query group.
Avg DurationThe average time taken to execute queries in a query group.
Total DurationThe total time taken to execute queries in a query group.

To group queries by User or Queries, perform the following:

  1. Click Group By, drop-down menu is displayed.
  2. Select User, if you want to group the queries by user. Select Queries, if you want to group by queries. Select Queue to group the queries by their Queue. Select The details by selected type is grouped.
  3. (Optional)To ungroup the queries, click Ungroup.

The following metrics are displayed for each query:

Click a query ID to view more details about the query in the MapReduce Query Details page.

MetricDescription
IDThe ID of a query. To copy the ID, click the icon next to the ID that appears when you hover over the ID.
UserThe user who executed the query.
QueueThe queue that the query belongs to.
DurationThe time taken to execute the query.
StatusThe final state of the query run by the user. The state can be: Failed, Finished, Compiled.
Start TimeThe time at which the query started.
Completion TimeThe time at which the query completed execution.
MemoryThe amount of memory consumed in executing the query.
VCoresThe number of VCores used in query processing.
HiveServer2The IP address of the installed host.

Applying Filter

To filter the data you want to view in the table, click the filter drop-down menu. The following are the drop-down menu options:

  • User
  • Queue
  • Status
  • Hiveserver2

To apply the filter, perform the following:

  1. Click on the filter option to which you want to apply the filter. A drop-down list is displayed.
  2. Select from the list. You can select one or more than one. The table details for the selected data is displayed.
  3. (Optional) To search from the list, type the name in the search box.
  4. (Optional) Click Reset to remove the applied filter.

Advanced Search Options

The search field in the MapReduce Queries page supports Influx-like query statements. For example, if you want to search for a query, use the format parametername:value. However, you need to know the full name of the parameter to be able to use it. For example, the name of the parameter visible in the UI is User. However, internally, the parameter name is effective_user. Hence, if you want to search for queries with User = Mark, you need to type effective_user:Mark.

Click Reset to clear the applied filter.

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard