MapReduce Queries
Click MapReduce → Queries to view the list of MapReduce queries. The Queries page consists of a filter panel, a Sankey Diagram, and charts that display information about queries that you can group by Users or Queries.
Sankey Chart
The Sankey diagram displays the flow of jobs from a selected queue to users and to the queries.
You can view the Sankey chart by the following distributions:
- Duration
- Mappers
- Reducers
- GC Time
- Reducer Time Avg
- Reducer Time Max
- Shuffle Time Avg
- Shuffle Time Max
- Sort Time Avg
- Sort Time Max
The default time range is Last 24 hrs. To view statistics from a custom date range, click the icon and select a time frame and timezone of your choice.
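As a rough illustration of what the Sankey diagram represents, the sketch below sums a chosen distribution along the queue → user and user → query flows. The record fields (`queue`, `user`, `query`, `duration`) and the sample values are assumptions made for this example; they are not the product's internal schema or API.

```python
from collections import defaultdict

# Hypothetical job records; field names and values are illustrative only.
jobs = [
    {"queue": "default", "user": "mark", "query": "q1", "duration": 120},
    {"queue": "default", "user": "mark", "query": "q2", "duration": 30},
    {"queue": "default", "user": "ana", "query": "q3", "duration": 50},
]


def sankey_links(jobs, metric):
    """Sum `metric` along queue -> user and user -> query edges."""
    links = defaultdict(float)
    for job in jobs:
        links[(job["queue"], job["user"])] += job[metric]
        links[(job["user"], job["query"])] += job[metric]
    return dict(links)


# Each key is a (source, target) pair; each value is the width of that flow.
links = sankey_links(jobs, "duration")
```

Switching the distribution (for example, from Duration to Mappers) corresponds to passing a different `metric` key, provided the records carry that field.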
Query Details
The following metrics are displayed for each query group:
Metric | Description |
---|---|
Group Name | The name of the group filtered by User or Queries. |
Query Count | The number of queries in the group. |
Failed Queries | The number of queries that failed to execute. |
Running Queries | The number of currently running queries. |
Avg Memory | The average amount of memory consumed by a query group. |
Total Memory | The total memory consumed by a query group. |
Avg VCore | The average number of VCores used by a query group. |
Total VCore | The total number of VCores used by a query group. |
Avg Duration | The average time taken to execute queries in a query group. |
Total Duration | The total time taken to execute queries in a query group. |
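The group metrics in the table above can be read as simple aggregations over the queries in each group. The following is a minimal sketch under that assumption; the field names (`memory_mb`, `vcores`, `duration_s`), the `Running` status value, and the sample records are hypothetical and only serve the example.

```python
from collections import defaultdict

# Hypothetical query records; field names and values are illustrative only.
queries = [
    {"user": "mark", "status": "Finished", "memory_mb": 2048, "vcores": 4, "duration_s": 120},
    {"user": "mark", "status": "Failed", "memory_mb": 1024, "vcores": 2, "duration_s": 30},
    {"user": "ana", "status": "Running", "memory_mb": 512, "vcores": 1, "duration_s": 50},
]


def group_metrics(queries, key):
    """Aggregate per-group counts, totals, and averages for the given key."""
    groups = defaultdict(list)
    for q in queries:
        groups[q[key]].append(q)

    rows = {}
    for name, members in groups.items():
        count = len(members)
        total_memory = sum(q["memory_mb"] for q in members)
        total_vcore = sum(q["vcores"] for q in members)
        total_duration = sum(q["duration_s"] for q in members)
        rows[name] = {
            "query_count": count,
            "failed_queries": sum(q["status"] == "Failed" for q in members),
            "running_queries": sum(q["status"] == "Running" for q in members),
            "avg_memory": total_memory / count,
            "total_memory": total_memory,
            "avg_vcore": total_vcore / count,
            "total_vcore": total_vcore,
            "avg_duration": total_duration / count,
            "total_duration": total_duration,
        }
    return rows


rows = group_metrics(queries, "user")
```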
To group queries by User, Queries, or Queue, perform the following:
- Click Group By. A drop-down menu is displayed.
- Select User to group the queries by user, Queries to group them by query, or Queue to group them by queue. The details are grouped by the selected type.
- (Optional) To ungroup the queries, click Ungroup.
Click a query ID to view more details about the query in the MapReduce Query Details page.
The following metrics are displayed for each query:
Metric | Description |
---|---|
ID | The ID of a query. To copy the ID, click the icon. |
User | The user who executed the query. |
Queue | The queue that the query belongs to. |
Duration | The time taken to execute the query. |
Status | The final state of the query run by the user. The state can be Failed, Finished, or Compiled. |
Start Time | The time at which the query started. |
Completion Time | The time at which the query completed execution. |
Memory | The amount of memory consumed in executing the query. |
VCores | The number of VCores used in query processing. |
HiveServer2 | The IP address of the host on which HiveServer2 is installed. |
Applying Filter
To filter the data you want to view in the table, click the filter drop-down menu. The following are the drop-down menu options:
- User
- Queue
- Status
- HiveServer2
To apply the filter, perform the following:
- Click the filter option that you want to apply. A drop-down list is displayed.
- Select one or more values from the list. The table displays the details for the selected values.
- (Optional) To search from the list, type the name in the search box.
- (Optional) Click Reset to remove the applied filter.
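The filtering steps above can be pictured as keeping only the table rows whose values match the selections. This is a minimal sketch of that idea, not the product's implementation; the row layout and the sample values are assumptions made for the example.

```python
# Hypothetical table rows keyed by the filter names listed above.
table = [
    {"User": "mark", "Queue": "default", "Status": "Finished", "HiveServer2": "10.0.0.1"},
    {"User": "mark", "Queue": "etl", "Status": "Failed", "HiveServer2": "10.0.0.1"},
    {"User": "ana", "Queue": "default", "Status": "Finished", "HiveServer2": "10.0.0.2"},
]


def apply_filters(rows, selected):
    """Keep rows whose value for every filtered column is among the selected values.

    `selected` maps a filter name to a set of chosen values. A filter with an
    empty set is ignored, so an empty mapping returns every row (like Reset).
    """
    return [
        row
        for row in rows
        if all(row[col] in values for col, values in selected.items() if values)
    ]


# Several values may be selected for one filter; filters combine with AND.
finished = apply_filters(table, {"User": {"mark", "ana"}, "Status": {"Finished"}})
everything = apply_filters(table, {})
```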
Advanced Search Options
The search field on the MapReduce Queries page supports Influx-like query statements. To search for a query, use the format parametername:value. You need to know the full internal name of the parameter, which can differ from the name shown in the UI. For example, the parameter shown in the UI as User is named effective_user internally, so to search for queries with User = Mark, type effective_user:Mark.
Click Reset to clear the applied filter.
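The parametername:value format can be sketched as a small helper that translates a UI label to its internal parameter name before building the search term. Only the User → effective_user mapping comes from this page; the helper name `search_term` and the lookup table are hypothetical, and other internal names are deliberately left out because they are not documented here.

```python
# Only this mapping is documented on this page; add others once you have
# confirmed their internal names.
UI_TO_INTERNAL = {"User": "effective_user"}


def search_term(ui_label, value):
    """Build a `parametername:value` search string from a UI label."""
    try:
        name = UI_TO_INTERNAL[ui_label]
    except KeyError:
        raise ValueError(f"internal name for {ui_label!r} is not known") from None
    return f"{name}:{value}"


term = search_term("User", "Mark")
```

Typing the resulting string into the search field is equivalent to typing effective_user:Mark directly.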