Monitoring the Application Metrics

Application metrics such as the application name, type, ID, final status (for example, Succeeded or Failed), user, and memory and CPU usage help you understand an application in depth.

To access the Application Explorer page and analyze YARN workloads, do the following:

  1. Log in to the Pulse UI and select YARN from the left navigation menu.
  2. On the YARN pane, select Application Explorer. The Application Explorer page appears with workload details.

Functionalities

Functionality: Description
Timestamp: Choose a preset time period (for example, Today, Last 12 hours, or Last 3 Months), or choose a custom date and time, and then click Apply.
RM Live View: Enable this option to fetch a real-time overview of the cluster's resource utilization and application status in YARN.
Searching Jobs: Build a query from metric fields such as ID, Type, Variance, and Name, with corresponding values, to search for application details.
Save a Search Query: Click the Save Search icon to store a query for future use.
Saved Search Queries: Use the search drop-down menu to view and select previously saved queries.
Modify Table Columns: Click the Settings icon to search for columns, add or remove them, and rearrange their order as needed.
Refresh the Results: Click the Refresh icon to retrieve real-time data.

Searching the Application Details

You can perform the following steps to search for a record.

  1. On the Application Explorer page, navigate to the Search bar.
  2. In the Search bar, select a search parameter. Pulse provides multiple search parameters.
  3. Select a suitable operator, enter the search term, and press Enter.
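
A search made of a parameter, an operator, and a search term can be pictured as a simple filter over application records. The following is a minimal Python sketch of that idea, not Pulse's query engine. The record fields, the operator names, and the `search` helper are all hypothetical and chosen only for illustration.

```python
import operator

# Hypothetical operators; the operators Pulse actually offers may differ.
OPERATORS = {
    "=": operator.eq,
    "!=": operator.ne,
    ">": operator.gt,
    "<": operator.lt,
    "contains": lambda value, term: str(term).lower() in str(value).lower(),
}


def search(records, parameter, op, term):
    """Return the records whose `parameter` value satisfies `op` against `term`."""
    compare = OPERATORS[op]
    return [r for r in records if parameter in r and compare(r[parameter], term)]


# Made-up sample records, for illustration only.
apps = [
    {"ID": "application_1_0001", "Type": "SPARK", "User": "alice", "Variance": 1.2},
    {"ID": "application_1_0002", "Type": "MAPREDUCE", "User": "bob", "Variance": 3.5},
    {"ID": "application_1_0003", "Type": "SPARK", "User": "bob", "Variance": 0.9},
]

spark_apps = search(apps, "Type", "=", "SPARK")
slow_apps = search(apps, "Variance", ">", 2)
```

Here `spark_apps` holds the two SPARK records and `slow_apps` holds the single record whose Variance exceeds 2.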

Saving a Search Query

You can save a search query to run again later.

  1. On the Application Explorer page, navigate to the Search bar.
  2. Enter a search query and click the Save Search icon (+).
  3. Enter a name for the query and click Save.

Saved searches are available in the Search drop-down menu.

Checking the Application Details

To view the details of a running application, click its ID.

Checking the Recommendations

For jobs of every application type you submit, you can view recommendations to optimize the performance of your jobs. These recommendations are based on jobs submitted by applications in the YARN queue of your Hadoop cluster.

To view recommendations for the optimization of jobs, do the following.

  1. Click the Recommendations button at the top. The Recommendations tile is displayed.
  2. The Recommendations tile groups the recommendations into three categories. Click a category to filter the results accordingly.
    1. Low Max Memory: This recommendation indicates applications that are using significantly less memory than allocated. Allocating excess memory to tasks that do not fully use it leads to inefficient resource usage and reduced cluster throughput. Reducing the maximum memory allocated for such tasks can improve overall resource utilization.
    2. Task Runtime Skews: This refers to tasks within an application that have significantly varying runtimes. Some tasks may finish quickly, while others take much longer.
    3. Small Tasks: This represents tasks that process only a small amount of data, resulting in inefficient task execution due to overhead costs (for example, task setup time and I/O operations).
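
To make the three categories concrete, the following Python sketch shows one way an application could be matched against them. It is an illustration under stated assumptions, not the logic Pulse uses: every threshold, the `recommend` function, and its input parameters are hypothetical.

```python
from statistics import median

# All thresholds below are illustrative assumptions, not Pulse's actual rules.
LOW_MEMORY_USE_PERCENT = 50.0         # used memory below this share of the allocation
SKEW_RATIO = 3.0                      # slowest task runtime vs. median task runtime
SMALL_TASK_BYTES = 16 * 1024 * 1024   # tasks reading less than this count as small
SMALL_TASK_SHARE = 0.5                # share of small tasks that triggers the flag


def recommend(memory_use_percent, task_runtimes_sec, task_input_bytes):
    """Return the recommendation categories that apply to one application."""
    categories = []
    if memory_use_percent < LOW_MEMORY_USE_PERCENT:
        categories.append("Low Max Memory")
    if task_runtimes_sec:
        typical = median(task_runtimes_sec)
        if typical > 0 and max(task_runtimes_sec) / typical >= SKEW_RATIO:
            categories.append("Task Runtime Skews")
    if task_input_bytes:
        small = sum(1 for size in task_input_bytes if size < SMALL_TASK_BYTES)
        if small / len(task_input_bytes) >= SMALL_TASK_SHARE:
            categories.append("Small Tasks")
    return categories


# An application using 30% of its memory, with one straggler task and tiny inputs.
flags = recommend(30.0, [10, 12, 11, 60], [1024, 2048, 512, 4096])
```

With these made-up inputs, the application falls into all three categories. An application that uses most of its memory, has evenly sized tasks, and reads large inputs would return an empty list.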

Checking the Incidents

To view incidents in the Application Explorer, do the following.

  1. Click the Incidents button at the top. An incident tile is displayed. Each incident is displayed in the following format: number of incidents, name of the service for which the incident was reported, View Incidents link. For example, "2 Critical Incidents, Hadoop Service, View Incidents" means that two critical incidents were raised by the Hadoop service.
  2. Click View Incidents to learn more. The Incidents window is displayed.

Incidents Format

The Incidents page displays the list of incidents raised by a service. Each incident is displayed in the following format.

(Incident name) (Start time and end time of the incident) (Host on which the incident was raised) (Status of the incident: cleared or raised)

The following image is an example of the incident format.

Incident Details

To view the incident details, click on the incident name to expand the details.

The Details and Alert tabs are displayed.

  • The Details tab displays all details of the incident.
  • The Alert tab lets you edit the alert raised by the incident. Click the Edit button to edit the fields of the alert.
  • From the incident details window, you can also do the following to view the date and time at which the incident was raised and evaluated.
    1. Click Raised at to expand detailed information on the date and time at which the incident was raised.
    2. Click Evaluated at to expand detailed information on the incident evaluation.

Metrics

By default, the dashboard table displays the following metrics for jobs run by applications in YARN.

Option: Description
ID: The ID of the application.
Variance: A value that indicates how many times slower the application is running.
Type: The type of application.
Final Status: The status of the application when the associated job completes execution. The final status can be one of the following: Succeeded, Failed, Killed, Undefined, or Unknown.
Name: The name of the job.
User: The name of the user executing the job.
avgAppMemory: The average amount of memory consumed by the job.
avgAppVcores: The average number of vCores used by the application.
Duration: The time taken to complete the job.
Queue: The name of the queue the application resides in.
Started: The time at which the job in the application started.
Ended: The time at which the job in the application ended.
maxContainerCore: The maximum number of container cores used.
maxAppMemory: The maximum amount of memory allocated to the application (in MB).
cpuUsePercent: The amount of CPU used (in %) by the application.
avgContainerCore: The average number of container cores used.
avgContainerMemoryMb: The average amount of container memory used (in MB).
maxMemoryMb: The maximum amount of memory used (in MB).
memoryUsePercent: The amount of memory used (in %).
containers: The number of containers used by the application.
hosts: The list of hosts the application executed jobs on.
memoryMbSec: The rate of memory utilization.
coresSec: The rate of core utilization.
totalTime: The total time taken to execute the job.
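
In YARN's own application reports, the comparable fields memorySeconds and vcoreSeconds are aggregates: the memory (in MB) or vCores allocated, multiplied by the number of seconds they were held. The Python sketch below assumes that memoryMbSec and coresSec follow the same convention, which this page does not confirm. The `ContainerUsage` record and the `aggregate_usage` helper are hypothetical and are not part of Pulse or YARN.

```python
from dataclasses import dataclass


@dataclass
class ContainerUsage:
    """Hypothetical per-container record; not a Pulse or YARN type."""
    memory_mb: int
    vcores: int
    seconds: float


def aggregate_usage(containers):
    """Sum memory-seconds and vcore-seconds across an application's containers."""
    memory_mb_sec = sum(c.memory_mb * c.seconds for c in containers)
    cores_sec = sum(c.vcores * c.seconds for c in containers)
    return memory_mb_sec, cores_sec


# Made-up containers: 2 GB with 1 vCore for 60 s, and 4 GB with 2 vCores for 30 s.
containers = [
    ContainerUsage(memory_mb=2048, vcores=1, seconds=60),
    ContainerUsage(memory_mb=4096, vcores=2, seconds=30),
]
memory_mb_sec, cores_sec = aggregate_usage(containers)
```

Under this assumption, a short job that holds a lot of memory and a long job that holds little can report the same memoryMbSec value, so the metric is best read together with Duration.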

The metrics table might display the following icons for a record. Click these icons to learn more.

  • The alert button displays the service(s) of the application that might require attention.
  • The tachometer button displays more details of the selected application ID.
  • The three horizontal dots button displays the following options.

Adding and Removing Columns

To add a column to or remove a column from the Application Explorer table, do the following:

  1. In the top-right corner of the table, click the table settings icon. A Select Column window appears.
  2. From the available list of columns, select the column name you want to add to the table or deselect a column name you want to remove from the table.
  3. Click Apply. The table is modified.

Checking the Concurrency

Click the three horizontal dots button and select View Concurrency. This displays the list of jobs that were running at the same time as the selected application.
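
Running at the same time can be read as an interval-overlap check on the Started and Ended times. The Python sketch below illustrates that reading. It is not how Pulse computes concurrency, and the `concurrent_apps` helper and the record keys are hypothetical.

```python
def concurrent_apps(selected, apps):
    """Return the apps whose run interval overlaps that of `selected`.

    Each app is a dict with the hypothetical keys "id", "started", and "ended"
    (epoch seconds). Two intervals overlap when each one starts before the
    other one ends.
    """
    return [
        app for app in apps
        if app["id"] != selected["id"]
        and app["started"] < selected["ended"]
        and selected["started"] < app["ended"]
    ]


# Made-up records, for illustration only.
apps = [
    {"id": "app_1", "started": 100, "ended": 200},
    {"id": "app_2", "started": 150, "ended": 250},
    {"id": "app_3", "started": 200, "ended": 300},
    {"id": "app_4", "started": 50, "ended": 120},
]
running_with_app_1 = [a["id"] for a in concurrent_apps(apps[0], apps)]
```

In this example, app_2 and app_4 overlap with app_1. app_3 starts at the moment app_1 ends, so the strict comparison does not count it as concurrent.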

Checking the Logs

To view a record's status log, click the Log icon in the ID column of the desired record, as shown below.
