Configure CDH Hive and Sqoop

This page describes how to place Pulse Hive hook JARs, update Hive and Sqoop configurations, and enable metrics collection for CDH clusters.

Configure CDH Hive for Pulse

Hook Version Mapping (CDH Hive)

Distro VersionHive VersionTez VersionPulse Hook Jar Name
CDH 6.2.x2.1.x2.1.1-cdh6.2.1ad-hive-hook__2.1.1__cdh6.2.1-assembly-2.0.0.jar
CDH 6.3.42.1.x2.1.1-cdh6.3.4ad-hive-hook__cdh__3.0.0-assembly-2.0.0.jar

Place Hive Hook JARs

  1. Obtain the Hive hook JARs from the Acceldata team (refer to the table above).
  2. Place the JARs on all edge nodes and HiveServer2 nodes under:
Bash
Copy
  1. Ensure the hook directory is readable and executable by all users.

Update Hive Environment with Hook JAR

In CM, search for:

  • Gateway Client Environment Advanced Configuration Snippet (Safety Valve) for hive-env.sh

Add:

Bash
Copy

Update Hive Site Properties

In CM, search for:

  • Hive Client Advanced Configuration Snippet (Safety Valve) for hive-site.xml
  • HiveServer2 Advanced Configuration Snippet (Safety Valve) for hive-site.xml

Switch to XML view and add:

Bash
Copy

Update Pulse Details in Hive Site Properties

In the Advanced hive-site XML section, add the following properties:

Bash
Copy

Restart Hive Services

  • Restart the affected Hive components.
  • Deploy the updated client configuration.

Configure CDH Sqoop for Pulse

  1. Place the hook JAR in the classpath libraries of the Sqoop client on the designated edge nodes.
  2. Ensure that the Sqoop client can access the Pulse hook JAR, as configured in the Hive steps above.

Result

CDH Hive and Sqoop are configured with Pulse hook JARs and metric sinks. Pulse captures metrics and Hive query statistics from your CDH 5.x/6.x cluster.

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard