Data Collection and Environment Preparation

Before deploying a cluster, you must have the following information:

  • The fully qualified domain name (FQDN) of each host in your system. The Ambari Cluster Install wizard also supports using IP addresses. To check or verify the FQDN of a host, run: hostname -f
  • The list of components you want to set up on each host.
  • The base directories you want to use as mount points for storing:
    • NameNode data
    • DataNode data
    • Secondary NameNode data
    • Kafka data
    • YARN data
    • ZooKeeper data
    • Various log, pid, and db files, depending on your install type
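
When you collect host names for many machines, it can help to screen the list before you start the wizard. The following Python sketch is one possible way to do that, not part of Ambari: the function names classify_host and local_fqdn are hypothetical, and the FQDN test is a simple syntactic check (at least two dot-separated labels of letters, digits, and hyphens), not a DNS lookup.

```python
import ipaddress
import re
import socket

# One DNS label: 1-63 letters, digits, or hyphens, with no leading or trailing hyphen.
_LABEL = re.compile(r"(?!-)[A-Za-z0-9-]{1,63}(?<!-)")


def classify_host(name: str) -> str:
    """Return 'ip', 'fqdn', or 'not-fqdn' for a host identifier (hypothetical helper)."""
    try:
        ipaddress.ip_address(name)
        return "ip"
    except ValueError:
        pass  # not an IP address; fall through to the name check

    labels = name.rstrip(".").split(".")
    if (
        len(name) <= 253
        and len(labels) >= 2
        and all(_LABEL.fullmatch(label) for label in labels)
    ):
        return "fqdn"
    return "not-fqdn"  # for example, a short host name such as "node1"


def local_fqdn() -> str:
    """Return the name Python resolves for this host, similar in spirit to hostname -f."""
    return socket.getfqdn()
```

For example, classify_host("node1.example.com") returns 'fqdn', classify_host("10.0.0.1") returns 'ip', and classify_host("node1") returns 'not-fqdn'. The value returned by local_fqdn() depends on the host's resolver configuration, so treat the output of hostname -f on the host itself as the reference.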

You must use base directories that provide persistent storage locations for your components and your Hadoop data. Installing components in locations that may be removed from a host can result in cluster failure or data loss. For example, do not use /tmp in a base directory path.
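
The /tmp rule above can be checked mechanically against a list of planned base directories. The following Python sketch is an illustration under stated assumptions: the function name uses_volatile_root is hypothetical, the rule is interpreted as "the directory is /tmp or located under /tmp", and the example paths are placeholders. It normalizes ".." segments but does not resolve symbolic links, so a link that points into /tmp is not detected.

```python
import posixpath
from pathlib import PurePosixPath


def uses_volatile_root(path: str, volatile_roots=("/tmp",)) -> bool:
    """Return True if an absolute path is one of volatile_roots or lies beneath one.

    volatile_roots defaults to /tmp only, the single location named in this guide;
    extending the tuple is an assumption you make for your own environment.
    """
    if not path.startswith("/"):
        raise ValueError(f"base directory must be an absolute path: {path!r}")

    # Collapse "." and ".." segments so /data/../tmp/x is treated as /tmp/x.
    candidate = PurePosixPath(posixpath.normpath(path))

    for root in volatile_roots:
        root_path = PurePosixPath(root)
        if candidate == root_path or root_path in candidate.parents:
            return True
    return False


# Placeholder base directories, used here only to show the check.
planned = ["/grid/0/hadoop/hdfs/data", "/tmp/hadoop/hdfs/namenode"]
rejected = [p for p in planned if uses_volatile_root(p)]
```

With the placeholder list shown, rejected contains only /tmp/hadoop/hdfs/namenode. A directory such as /data/tmp is accepted, because it is not located under /tmp itself.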
