Title
Create new category
Edit page index title
Edit category
Edit link
Configure Pulse to Monitor Standalone Spark
This document provides you a step by step process on how to install single Pulse instance for multiple Spark Standalone clusters.
Pre-requisites
Ensure the following are present:
- Spark hosts: Refer to steps 1 and 2 mentioned below the note.
- Zookeeper hosts files: Refer to step 3 mentioned below the note.
- Log locations
- Spark history server locations
- Certificates (if any for Spark history server)
- Docker version
Prerequisites for enabling (TLS) HTTPS for Pulse Web UI Configuration using:
- Certificate File: cert.crt
- Certificate Key: cert.key
- CA Certificate: ca.crt (optional)
- Decide whether to keep the HTTP port (Default: 4000) open or not
- Decide on which port to use (default: 443)
- Obtain the fully qualified domain names (FQDN) for the Spark Master URLs for both clusters and include them in the
spark_<clustername>.hostsfile. The Spark hosts file should be structured as follows:
xxxxxxxxxxSparkMasterURLList: - <http/s>://<Alias/FQDN of the Spark Master 1>:<Spark Master UI Port> - <http/s>://<Alias/FQDN of the Spark Master 2>:<Spark Master UI Port>SparkWorkerURLList: - <http/s>://<Alias/FQDN of the Spark Worker 1>:<Spark Worker Port> - <http/s>://<Alias/FQDN of the Spark Worker 2>:<Spark Worker Port>- Retrieve the fully qualified domain names (FQDN) for the Spark History Server URLs for both clusters. When requested, provide the URL in the following format:
xxxxxxxxxx<http/s>://<Alias/FQDN of the Spark History Server URL>:<Spark History Server URL>- Obtain the fully qualified domain names (FQDN) for the Zookeeper Server URLs for both clusters and place them in the
zk_<clustername>.hostsfile. The Zookeeper Hosts file should adhere to the following format:
xxxxxxxxxx<http/s>://<Alias/FQDN for the Zookeeper Server>:<Zookeeper Server Port>- Retrieve the log locations for the application and deployment logs, as well as the
SPARK_HOMEdirectory for both clusters. - Ensure that the Docker version is >= 20.10.x.
Uninstallation
To uninstall agents, perform the following:
- To uninstall agents, you must run the
hystaller uninstallcommand through their ansible setup. - You must remove the Pulse Spark Hook Jars from the locations along with the related configurations from the Spark master and worker nodes.
- Acceldata team must then perform the following steps using the command below to backup and uninstall the existing Pulse application.
- Create a backup directory:
mkdir -p /data01/backup - As a backup, copy the entire
configandworkdirectories:cp -R $AcceloHome/config /data01/backup/cp -R $AcceloHome/work /data01/backup/`` - Uninstall the existing Pulse setup by running the following command:
accelo uninstall local
- Create a backup directory:
OUTPUT
[root@nifihost1:data01 (ad-default)]$ accelo uninstall local✗ You're about to uninstall the local AccelData setup. This will also DELETE all persistent data from the current node. However, NONE of the remote no✔ You're about to uninstall the local AccelData setup. This will also DELETE all persistent data from the current node. However, NONE of the remote no✔ You're about to uninstall the local AccelData setup. This will also DELETE all persistent data from the current node. However, NONE of the remote noYou're about to uninstall the local AccelData setup. This will also DELETE all persistent data from the current node. However, NONE of the remote nodes will be affected. Please confirm your action [y/n]: : yWARN: Gauntlet is running in dry run mode. Disable this to delete indices from elastic and purge data from mongo DBUninstalling the AccelData components from local machine ...Executing this action will remove all files, folders, docker containers, docker images, and the entire Acceldata directory.
- Logout of the terminal session.
Download and Load Binaries and Docker Images
To download and load binaries and Docker images, perform the following:
When downloading the Pulse all-in-one TAR file, extract the hystaller binary directly from the package.
- Download the jars, hystaller, accelo binaries, and docker images from the download links provided by the Acceldata team.
- Move the Docker images and jars into the following directory:
xxxxxxxxxxmkdir -p /data01/images- Copy the Binaries and Tar files into the
/data01/imagesfolder.
xxxxxxxxxxcp </path/to/binaries/tar> /data01/images- Change the directory
xxxxxxxxxxcd /data01/images- Extract the single tar file
xxxxxxxxxxtar xvf <name_of_tar_file>.tarOUTPUT
xxxxxxxxxx[root@nifihost1 images]# tar xvf pulse-333-beta.tar./ad-alerts.tgz./ad-connectors.tgz./ad-dashplots.tgz./ad-database.tgz./ad-deployer.tgz./ad-director.tgz./ad-elastic.tgz./ad-events.tgz./ad-fsanalyticsv2-connector.tgz./ad-gauntlet.tgz./ad-graphql.tgz./ad-hydra.tgz./ad-impala-connector.tgz./ad-kafka-0-10-2-connector.tgz./ad-kafka-connector.tgz./ad-ldap.tgz./ad-logsearch-curator.tgz./ad-logstash.tgz./ad-notifications.tgz./ad-oozie-connector.tgz./ad-pg.tgz./ad-pulsemon-ui.tgz./ad-recom.tgz./ad-sparkstats.tgz./ad-sql-analyser.tgz./ad-streaming.tgz./ad-vminsert.tgz./ad-vmselect.tgz./ad-vmstorage.tgz./accelo.linux./admon./hystaller- To load the Docker images, execute the following command:
xxxxxxxxxxls -1 *.tgz | xargs --no-run-if-empty -L 1 docker load -i- Check if all the images are loaded to the server using the following command:
xxxxxxxxxxdocker images | grep 4.x.xReplace 4.x.x with the Pulse version you want to install.
Configure the Cluster
To configure the cluster in Pulse, perform the following:
- Validate all the host files.
- Create the
acceldatadirectory by running the following command:
xxxxxxxxxxcd /data01/mkdir -p acceldata- Place the
accelobinary in this/data01/acceldatadirectory:
xxxxxxxxxxcp </path/to/accelo/binary> /data01/acceldata- Rename the
accelo.linuxbinary toaccelo.
xxxxxxxxxxmv /data01/acceldata/accelo.linux accelochmod +x /data01/acceldata/accelo- Change the directory:
xxxxxxxxxxcd /data01/acceldata/accelo- Run the following command to perform
accelo init:
xxxxxxxxxx./accelo init- Enter appropriate answers when prompted.
- When the Spark master is available, you can add the following parameter in the /etc/profile.d/ad.sh file to sync the Spark worker list from the Spark master URL.
xxxxxxxxxxSYNC_SPARK_MASTER=true- Run the following command to source the
ad.shfile:
xxxxxxxxxxsource /etc/profile.d/ad.sh- Run the
initcommand to provide the Pulse version:
xxxxxxxxxxaccelo initOUTPUT
xxxxxxxxxx[root@nifihost1:~ (ad-default)]$ accelo initEnter the AccelData ImageTag: : 4.x.x✓ Done, AccelData Init Successful.Replace 4.x.x with the Pulse version you want to install.
- Run
accelo infocommand as follows:
xxxxxxxxxxaccelo infoOUTPUT
xxxxxxxxxx[root@nifihost1:~ (ad-default)]$ accelo infoWARN: Gauntlet is running in dry run mode. Disable this to delete indices from elastic and purge data from mongo DB ___ ____________________ ____ ___ _________ / | / ____/ ____/ ____/ / / __ \/ |/_ __/ | / /| |/ / / / / __/ / / / / / / /| | / / / /| | / ___ / /___/ /___/ /___/ /___/ /_/ / ___ |/ / / ___ |/_/ |_\____/\____/_____/_____/_____/_/ |_/_/ /_/ |_| Accelo CLI Version: 4.x.xAccelo CLI Build Hash: 8ba4727f11e5b3f3902547585a37611b6ec74e7cAccelo CLI Build ID: 1700746329Accelo CLI Builder ID: ZEdjMmxrYUdGdWRGOWhZMk5sYkdSaEVLCg==Accelo CLI Git Branch Hash: TXdLaTlCVDFBdE56STNvPQo=AcceloHome: /data01/acceldataAcceloStack: ad-defaultAccelData Registry: 191579300362.dkr.ecr.us-east-1.amazonaws.com/acceldataAccelData ImageTag: 4.x.xActive Cluster Name: NotFoundAcceloConfig Mongo DB Retention days: 15AcceloConfig Mongo DB HDFS Reports Retention days: 15AccelConfig TSDB Retention days: 31dNumber of AccelData stacks found in this node: 0Replace 4.x.x with the Pulse version you want to install.
- To configure the cluster in Pulse, run the
config clustercommand:
xxxxxxxxxxaccelo config cluster- Provide the correct information when prompted. The output must appear as follows:
[root@nifihost1:acceldata (ad-default)]$ accelo config clusterINFO: Configuring the cluster ...INFO: Using default API Version v10 for CM APIIs the 'Database Service' up and running? [y/n]: : nWARN: Gauntlet is running in dry run mode. Disable this to delete indices from elastic and purge data from mongo DB✔ Stand-Alone✔ SparkEnter Your Cluster's Display Name: : spark341Enter the cluster name to use (MUST be all lowercase & unique): : spark341ERROR: stat /data01/acceldata/.activecluster: no such file or directoryINFO: Creating Post dirs.Enter the hosts file path for Spark-On-StandAlone (MUST formatted, one IP/host per line): : spark_spark341.hostsThe hostname for the spark worker node is : : kafka2.ops.iti.acceldata.devThe hostname for the spark worker node is : : kafka1.ops.iti.acceldata.dev✔ The hostname for the spark worker node is : : nifihost2.ops.iti.acceldata.dev█Is Zookeeper installed in the cluster: [Y/N]: YEnter the hosts file path for Zookeeper Hosts (MUST formatted, one IP/host per line): : spark341_zookeeper.hostsEnter the Spark History URL (with http/https): : https://10.90.6.169:18480✔ The hostname for the spark history server is : : nifihost2.ops.iti.acceldata.dev█INFO: min-reports is set to default value 10INFO: Purging old config files✓ acceldata.conf file generated successfully.INFO: Creating post config filesINFO: Writing the .dist filesINFO: Clustername : spark341INFO: Performing PreCheck of FilesINFO: Setting the active clusterWARN: Cannot find the pulse.yaml file, getting the values from acceldata.conf fileCreating hydra inventory✔ SSH Key Algorithm used (RSA/DSA)?: : RSA█Which user should connect over SSH: : rootSSH private key file path for connecting to hosts: : /root/.ssh/id_rsanifihost1.ops.iti.acceldata.dev is the hostname of the Pulse Server, Is this correct? [Y/N]: : y✔ Enter the JMX Port for zookeeper_server: : 8989█✔ Enter the JMX Port for zookeeper_server: : 8989█Enter the JMX Port for zookeeper_server: : 8989✔ Would you like to enable NTP Stats? [y/n]: : y█Would you like to setup LogSearch? [y/n]: : y? Select the logs for components that are installed/enabled in your target cluster: spark_application, zookeeper, syslog, kern, spark_jobhistoryserver, spark_master, spark_worker✓ Generated the vars.yml file successfullyConfiguring notifications✓ Generated the notifications.yml file successfullyConfiguring notifications✓ Generated the actions notifications.yml file successfullyINFO: Please run 'accelo deploy core' to deploy APM core using this configuration.[root@nifihost1:acceldata (ad-default)]$- Run the
config cluster commandfor all the clusters and provide the appropriate answers when prompted.
[root@nifihost1:acceldata (ad-default)]$ accelo config clusterINFO: Configuring the cluster ...INFO: Using default API Version v10 for CM API✔ Is the 'Database Service' up and running? [y/n]: : n█WARN: Gauntlet is running in dry run mode. Disable this to delete indices from elastic and purge data from mongo DB✔ Stand-Alone✔ SparkEnter Your Cluster's Display Name: : spark330Enter the cluster name to use (MUST be all lowercase & unique): : spark330Enter the hosts file path for Spark-On-StandAlone (MUST be formatted in pulse host file format): : spark_330.hostsThe hostname for the spark worker node is : : sac03.acceldata.dvlThe hostname for the spark worker node is : : sac02.acceldata.dvlIs Zookeeper installed in the cluster: [Y/N]: YEnter the hosts file path for Zookeeper Hosts (MUST formatted, one IP/host per line): : zookeeper.hosts✔ Enter the Spark History URL (with http/https): : http://sac01.acceldata.dvl:18080█INFO: min-reports is set to default value 10INFO: Purging old config files✓ acceldata.conf file generated successfully.INFO: Creating post config filesINFO: Writing the .dist filesINFO: Clustername : spark330INFO: Performing PreCheck of FilesINFO: Setting the active clusterCreating hydra inventorySSH Key Algorithm used (RSA/DSA)?: : RSA✔ Which user should connect over SSH: : root█SSH private key file path for connecting to hosts: : /root/.ssh/id_rsanifihost1.ops.iti.acceldata.dev is the hostname of the Pulse Server, Is this correct? [Y/N]: : yEnter the JMX Port for zookeeper_server: : 8989✔ Would you like to enable NTP Stats? [y/n]: : y█Would you like to setup LogSearch? [y/n]: : y? Select the logs for components that are installed/enabled in your target cluster: syslog, kern, spark_jobhistoryserver, spark_master, spark_worker, spark_application, zookeeper✓ Generated the vars.yml file successfullyConfiguring notifications✓ Generated the notifications.yml file successfullyConfiguring notifications✓ Generated the actions notifications.yml file successfullyINFO: Please run 'accelo deploy core' to deploy APM core using this configuration.[root@nifihost1:acceldata (ad-default)]$- Run the
config cluster commandfor Nifi Stand-Alone and select standalone > nifi.
xxxxxxxxxx[root@nifihost1:acceldata (ad-default)]$ accelo config clusterINFO: Configuring the cluster ...INFO: Using default API Version v10 for CM APIIs the 'Database Service' up and running? [y/n]: : nWARN: Gauntlet is running in dry run mode. Disable this to delete indices from elastic and purge data from mongo DB✔ Stand-Alone✔ NifiEnter Your Cluster's Display Name: : nifisa✔ Enter the cluster name to use (MUST be all lowercase & unique): : nifisa█INFO: Creating Post dirs.INFO: Getting the Nifi Host ListEnter the hosts file path for Nifi (One Nifi URL per line, Must be formatted): : nifi.hostsDiscovered NIFI Hosts: ✓ nifihost1.ops.iti.acceldata.devWould you like to continue with the above NIFI nodes? [y/n]: : YINFO: min-reports is set to default value 10INFO: Purging old config files✓ acceldata.conf file generated successfully.INFO: Creating post config filesINFO: Writing the .dist filesINFO: Clustername : nifisaINFO: Performing PreCheck of FilesINFO: Setting the active clusterCreating hydra inventory✔ SSH Key Algorithm used (RSA/DSA)?: : RSA█Which user should connect over SSH: : root✔ SSH private key file path for connecting to hosts: : /root/.ssh/id_rsa█nifihost1.ops.iti.acceldata.dev is the hostname of the Pulse Server, Is this correct? [Y/N]: : yWould you like to enable NTP Stats? [y/n]: : yWould you like to enable NTP Stats? [y/n]: : yWould you like to setup LogSearch? [y/n]: : y? Select the logs for components that are installed/enabled in your target cluster: syslog, nifi, kern✓ Generated the vars.yml file successfullyConfiguring notifications✓ Generated the notifications.yml file successfullyConfiguring notifications✓ Generated the actions notifications.yml file successfullyINFO: Please run 'accelo deploy core' to deploy APM core using this configuration.[root@nifihost1:acceldata (ad-default)]$Copy the License
Place the license file provided by the Acceldata team in the work directory as shown below:
xxxxxxxxxxcp </path/to/license> /data01/acceldata/workDeploy Pulse Core Components
Deploy the Pulse core components by running the following command:
xxxxxxxxxxaccelo deploy coreThe output must appear as follows:
[root@nifihost1:acceldata (ad-default)]$ accelo deploy coreERROR: Cannot connect to DB, Because: cannot connect to mongodbWARN: Gauntlet is running in dry run mode. Disable this to delete indices from elastic and purge data from mongo DBHave you verified the acceldata config file at '/data01/acceldata/config/acceldata_spark341.conf' ? [y/n]: : y✓ accelo.yml file found and parsed✓ AcceloEvents - events.json file found and parsed✓ acceldata conf file found and parsed✓ .dist file found and parsed✓ hydra_hosts.yml file found and parsed✓ vars.yml file found and parsed✓ alerts notification.yml file found and parsed✓ actions notification.yml file found and parsed✓ alerts default-endpoints.yml file found and parsed✓ override.yml file found and parsed✓ gauntlet_mongo_spark341.yml file found and parsed✓ gauntlet_elastic.yml file found and parsedINFO: No existing AccelData networks found. Current stack 'ad-default' is missing.INFO: Trying to create a new network ..INFO: If you're setting up AccelData for the first time give 'y' to the below.Would you like to initiate DB with the config file '/data01/acceldata/config/acceldata'? [y/n]: : yCreating group monitors [================================================================================================>-------------------] 83.33%INFO: Pushing the hydra_hosts.yml to mongodbDeployment Completed [==============================================================================================>--------------------] 81.82% 28s✓ Done, Core services deployment completed.Now, you can access the AccelData APM Server at the configured port of this node.To deploy the AccelData addons, Run './accelo deploy addons'Deploy Add-ons
To deploy the Pulse add-ons, run the code below and select the required components for Spark standalone:
xxxxxxxxxxaccelo deploy addonsThe output must appear as follows:
[root@nifihost1:acceldata (ad-default)]$ accelo deploy addonsWARN: Gauntlet is running in dry run mode. Disable this to delete indices from elastic and purge data from mongo DBINFO: Active Cluster: spark341? Select the components you would like to install: Alerts (Agents MUST be configured), Core Connectors, Dashplot, Director (Agents MUST be configured), HYDRA, LogSearch, NotificationsStarting the deployment ..Completed [==============================================================================================================================] 137.50% 29s✓ Done, Addons deployment completed.Configure Alerts Notifications
To configure alerts notifications, perform the following:
- Set the active cluster by running the following command:
xxxxxxxxxxaccelo set- Configure the alerts notifications using the following command:
OUTPUT
- Set cluster2 as the active cluster:
- Configure the alerts for second cluster:
- Set cluster3 as the active cluster:
- Configure the alerts for the third cluster:
- Restart the alerts notifications:
OUTPUT
Database Push Configuration
Run the following command to push config to db:
Configure the Override
- Change the dir to
work/<clustername>.
- Modify the
override.ymlfile.
- Paste the below config in the file.
Do the above steps for all clusters.
Deploy the Pulse Agents
Install the new Pulse version x.x.x agents on all cluster nodes. Make a copy of the new hystaller file to /tmp or any executable location on all cluster nodes and then run the following command on all cluster nodes.
Change the following code snippet as per your environment
Reconfig Cluster
- After completing the edits to the override files as outlined above, the next step is to run the following command:
OUTPUT
- DB Push Config
Adding Edge Nodes for Monitoring
These are edge nodes that are not the part of the spark standalone cluster.
- Change the dir to
work/<clustername>.
- Modify the
hydra_hosts_override.ymlfile.
- Add the following code to add a host to a already existing host for pulse to monitor:
- Run the
accelo reconfig clustercommand for clusters with edge nodes that require monitoring by Pulse. Alternatively, for comprehensive coverage, perform a reconfig cluster on all clusters.
- Check the
hydra_hosts.ymlfile which will now contain the new hosts as well. For example:
Configure Gauntlet
Updating the Gauntlet Crontab Duration
- Check if the
ad-core.ymlfile is present or not by running the following command:
- If the file above is not present, then generate it by:
- Edit the
ad-core.ymlfile
a. Open the file:
b. Update the CRON_TAB_DURATION env variable in the ad-gauntlet section:
This makes gauntlet run every 2 days at midnight.
c. The updated file will look something like this:
d. Save the file.
- Restart gauntlet service by running the command:
Updating the Gauntlet Dry Run Mode
- Check if the
ad-core.ymlfile is present or not by running the following command:
- If the file above is not present, then generate it by:
- Edit the
ad-core.ymlfile.
a. Open the file.
b. Update the DRY_RUN_ENABLE env variable in the ad-gauntlet section:
This will make the gauntlet delete the order elastic indices and mongo db data.
c. The updated file will look something like this:
d. Save the file.
- Restart gauntlet service by running the command:
Updating MongoDB Cleanup and Compaction Frequency in Hours
By default, when dry run is disabled MongoDB cleanup and compaction will run once a day. To configure the frequency, follow the steps listed below.
- Run the following command:
- Answer the following prompts, if you’re unsure about how many days you wish to retain. Then proceed with the default values.
- When the following prompt comes up, specify the hours of the day during which you would like MongoDB clean up and compaction to run. The value must be a CSV of hours as per the 24 hour time notation.
- Run the following command. When gauntlet runs the next time, MongoDB clean up and compaction will run at the specified hours, once per hour.
Enabling (TLS) HTTPS for Pulse Web UI Configuration
For details on the configuration, see Enable Native SSL/TLS for Pulse Web UI.
Set Up LDAP for Pulse UI
For details on the configuration, see Set up LDAP for Pulse UI.
Spark Jars Placements and Spark Config Changes
Perform the following steps for all the Spark Cluster Nodes:
- Add the following configuration in the
metrics.propertiesfile for Spark TimeSeries data:
- Add the following configuration in the
spark-defaults.conffile for the Events data:
- Take the
ad-spark-hook.jarfile and put it in the following dir:
- Restart all the Spark services.
DotLog Download
We have introduced a feature that allows downloading of service logs in .log format. This file is not the original server log but an xlsx sheet merged into a .log format.
Perform the following to add a configurable parameter to enable this feature:
- Insert the dotLogFileDownload parameter into the feature flags property of the
ad-graphqlsection found at the file path: $Acceldata_Home/config/docker/ad-core.yml.
- Restart the ad-graphl service using the following command:
New Search Bar
Perform the following to enable new search options:
- Locate the “ad-graphql“ section in file $Acceldata_Home/config/docker/ad-core.yml and under the “environment“ key, add the following line:
- Restart the ad-graphl service using the following command:
Does a user in the Spark Standalone environment still see the Spark option in the left menu even after their access has been revoked from the role?
Create a different role that does not have Spark permission and assign that role to the user. Alternatively, you can leave it as is because even if the Spark entry is visible in the left navigation, the user will not be able to access it if access has been revoked from their role.
Are non-admin users in the Spark Standalone environment able to access Spark even though they have the appropriate role permissions for accessing Spark?
In the role edit window, click on "Select All" just below the Page permissions. Then, remove any permissions that you do not wish to grant and save the role. Any user assigned to this role should now have access to Spark in the Spark Standalone environment.
What is the reason for the absence of the Oozie workflow link between the Oozie workflow and the application ID in PULSE for a Spark job?
The Spark job's Application ID is generated by the Oozie service. It will only appear in the Pulse UI if it is available in Oozie's Web Service UI. If it is not present in the Oozie's Web Service UI, it will not be displayed in Pulse.
For additional help, contact www.acceldata.force.com OR call our service desk +1 844 9433282
Copyright © 2026