The Flink monitoring capability in Pulse enables you to track component health, review system logs, and identify issues across Flink services in real time.
This capability enables you to:
- Monitor the status of the Flink Job History Server component.
- Detect component failures early to maintain service uptime.
- Analyze Flink logs to troubleshoot job and system issues quickly.
- Correlate logs across Flink services for faster root-cause analysis.
Check Flink Component Status
You can monitor the status of key Flink components or services to ensure system stability.
- In the Nodes > Summary page, check the status of the Flink Job History Server component.
- For details, see Check Hadoop Service Status.
Check Flink Log Data
You can analyze Flink logs in Pulse to troubleshoot errors, query failures, or performance bottlenecks.
View Component-Specific Logs
View and analyze logs for individual Flink components to troubleshoot issues, monitor performance, and understand system behavior.
- In the Logs > Application Logs, search and review log details for the Flink Job History Server component.
- The log details shown in Messages are grouped into the following categories for easier analysis.
- History Server Logs: Information about completed jobs and their execution history.
- Task Manager Logs: Details about task-level execution, performance, and failures.
- Job Manager Logs: Events related to job submission, scheduling, and coordination.
View Application-Specific Logs
View and analyze logs for a specific Flink application to identify job-related issues or failures directly from the Applications page in Pulse.
- In the Pulse UI, go to Flink > Applications.
- On the Applications page, select the Logs icon next to the Final Status column to view logs specific to that application.
This view helps you identify and troubleshoot job-level issues within a specific Flink application.
For more details, see Troubleshoot Using Application Logs.