xDP Documentation
xDP
Get Started
Deployment Guide
Requirement Guide
xCentral - Platform Management
xStore Catalog and Metadata Management
xCompute - Compute Layer
xObserve - Observability Layer
Data Management
Applications
SQL (Trino)
Notebooks
Developer Guide
Migration Guide
Title
Message
Create new category
What is the title of your new category?
Edit page index title
What is the title of the page index?
Edit category
What is the new title of your category?
Edit link
What is the new title and URL of your link?
Spark Jobs
Summarize Page
Copy Markdown
Open in ChatGPT
Open in Claude
Connect to Cursor
Connect to VS Code
Spark Jobs
Spark Jobs in xDP provides a managed environment for creating, running, and monitoring Apache Spark applications. It handles cluster submission and resource management so your team can focus on building data pipelines.
Key Concepts
| Concept | Description |
|---|---|
| Job Definition | A reusable template for a Spark application — container image, executable script or JAR, resource requirements, and Spark configuration. |
| Job Run | A single execution of a Job Definition. Each run has its own lifecycle, logs, and metrics. |
| Execution Configuration | The full set of parameters applied to a run: runtime arguments, driver/executor resources, and Spark property overrides. |
Jobs Listing
From the side navigation, go to Spark > Spark Jobs to see all jobs across clusters:
Filter by Job Type, Plugin Type, Date Range, and Cluster. Use the ⋯ Actions menu on any row to Edit, Clone, Run Now, or Delete a job.
Capabilities
- Create and manage Spark Python, Java, and Notebook jobs from a single interface.
- Submit jobs to Kubernetes without writing
spark-submitcommands. - Monitor every run with execution timelines, live log streaming, and resource metrics.
- Launch directly into the Spark History Server or Live Spark UI for deep diagnostics.
Prerequisites
- A running xDP Compute Cluster.
- Your Spark application packaged as a Docker image in a registry accessible from the cluster.
- A user role with permissions to create and run jobs.
Best Practices
- Use naming conventions such as
{source}_{target}_{purpose}_jobfor discoverability. - Use versioned image tags (e.g.,
my-app:1.2.3) instead oflatestfor repeatable deployments. - Tune resources iteratively — start conservative, analyze runs with the Spark History Server, then adjust.
- Declare all Data Store Dependencies so xDP can manage credentials and track lineage automatically.
VariableType to search · ESC to discard
GlossaryType to search · ESC to discard
InsertType to search · ESC to discard
No matches
Last updated on
Was this page helpful?
Next to read:
Spark Job ManageFor additional help, contact our Support Team!
©2026, Acceldata Inc — All Rights Reserved.
Discard Changes
Do you want to discard your current changes and overwrite with the template?
Archive Synced Block
Message
Create new Template
What is this template's title?
Delete Template
Message