What are the configuration files in Hadoop?

Free Online Certification Courses – Learn Today. Lead Tomorrow. Forums Apache Hadoop What are the configuration files in Hadoop?

Viewing 2 reply threads
  • Author
    Posts
    • #4855
      DataFlair TeamDataFlair Team
      Spectator

      List out Hadoop configuration files?
      Name different configuration files in Hadoop?

    • #4856
      DataFlair TeamDataFlair Team
      Spectator

      Configuration Files are the files which are located in the extracted tar.gz file in the etc/hadoop/ directory.
      All Configuration Files in Hadoop are listed below,

      1) HADOOP-ENV.sh->>It specifies the environment variables that affect the JDK used by Hadoop Daemon (bin/hadoop). We know that Hadoop framework is wriiten in Java and uses JRE so one of the environment variable in Hadoop Daemons is $Java_Home in Hadoop-env.sh.

      2) CORE-SITE.XML->>It is one of the important configuration files which is required for runtime environment settings of a Hadoop cluster.It informs Hadoop daemons where the NAMENODE runs in the cluster. It also informs the Name Node as to which IP and ports it should bind.

      3) HDFS-SITE.XML->>It is one of the important configuration files which is required for runtime environment settings of a Hadoop. It contains the configuration settings for NAMENODE, DATANODE, SECONDARYNODE. It is used to specify default block replication. The actual number of replications can also be specified when the file is created,

      4) MAPRED-SITE.XML->>It is one of the important configuration files which is required for runtime environment settings of a Hadoop. It contains the configuration settings for MapReduce . In this file, we specify a framework name for MapReduce, by setting the MapReduce.framework.name.

      5) Masters->>It is used to determine the master Nodes in Hadoop cluster. It will inform about the location of SECONDARY NAMENODE to Hadoop Daemon.
      The Mater File on Slave node is blank.

      6) Slave->>It is used to determine the slave Nodes in Hadoop cluster.
      The Slave file at Master Node contains a list of hosts, one per line.
      The Slave file at Slave server contains IP address of Slave nodes.

      Follow the link to learn more about configuration files in Hadoop

    • #4858
      DataFlair TeamDataFlair Team
      Spectator

      One more important file if we are running Hadoop in V2 (i.e.) YARN Mode.

      yarn-site.xml:
      This file contains the configuration settings related to YARN . For example, it contains settings for Node Manager, Resource Manager, Containers, and Application Master.

      Some example properties:

      1) Where to store Node Manager data in local FS

      <property>
          <name>yarn.nodemanager.local-dirs</name>
          <value>/var/lib/hadoop-yarn/cache/${user.name}/nm-local-dir</value>
      </property>

      2) Where to store Node Manager logs in local FS

      <property>
          <name>yarn.nodemanager.log-dirs</name>
          <value>/var/log/hadoop-yarn/containers</value>
      </property>

      3) Where to store Resource Manager data in local FS

      <property>
          <name>yarn.resourcemanager.local-dirs</name>
          <value>/var/lib/hadoop-yarn/cache/${user.name}/rm-local-dir</value>
      </property>

      4) Where to store Resource Manager logs in local FS

      <property>
          <name>yarn.resourcemanager.log-dirs</name>
          <value>/var/log/hadoop-yarn/resource-manager</value>
      </property>
Viewing 2 reply threads
  • You must be logged in to reply to this topic.