Configuring ODP with GCS

In order to configure access to your GCS buckets from your cluster, start by adding the GCS connector jar. Ensure to include the gcs-connector-hadoop3-2.2.16-shaded.jar, as it not only contains the classes and resources for the GCS Connector for Hadoop but also includes its dependencies.

Once you have downloaded the jar mentioned above to your Hadoop cluster, proceed with the following configuration steps to include the gcs-connector jar into the classpath for the relevant files.

Configuration Steps for Respective Classpaths

Hadoop-env.sh

Bash
Copy

Hive-env.sh

Bash
Copy

Mapreduce.application.classpath

Bash
Copy

Core-Site.xml

Bash
Copy

Configure Access using Core-Site.xml

Core-Site.xml

Bash
Copy

Configure Access using Hadoop Credential JCEKS

Bash
Copy

After successfully crafting the JCEKS file, you can validate its functionality by attempting to access your GCS buckets, as shown in the code below.

Bash
Copy

To make the above changes work, you must start the required components on your cluster.

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard
  Last updated