Prerequisite: Basic knowledge of Hadoop and Ansible.

In this task, we will see how to configure an entire Hadoop cluster using an Ansible playbook. But first, we have to install Ansible on our OS. I am using Red Hat Enterprise Linux 8 (RHEL 8) for this practical. To install Ansible, you can use either of the commands below:

i) yum install ansible -y

ii) pip3 install ansible

To configure a Hadoop cluster, we first have to install Java, because Hadoop is written in Java and needs a JVM to run. Thus, we first need to download Java (JDK) and then Hadoop on our Master/Controller node, on which Ansible is installed.
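To make this step concrete, here is a minimal playbook sketch for getting Java and Hadoop onto the nodes. It is not my exact code: the package name `java-1.8.0-openjdk-devel`, the archive path `/root/hadoop.tar.gz` and the install location `/opt` are assumptions for illustration, so adjust them to the JDK and Hadoop versions you actually downloaded.

```yaml
---
# Sketch only: package name, paths and file names are assumptions.
- name: Install Java and Hadoop on every node
  hosts: all
  become: true
  vars:
    hadoop_archive: /root/hadoop.tar.gz   # hypothetical path on the controller
    hadoop_parent_dir: /opt               # hypothetical install location
  tasks:
    - name: Install a JDK from the configured repositories
      ansible.builtin.dnf:
        name: java-1.8.0-openjdk-devel    # assumed package; match your Hadoop version
        state: present

    - name: Copy the Hadoop archive from the controller and unpack it
      ansible.builtin.unarchive:
        src: "{{ hadoop_archive }}"       # read from the controller by default
        dest: "{{ hadoop_parent_dir }}"   # must already exist on the node
```

Saved as, say, `install.yml` (a hypothetical file name), it could be run with `ansible-playbook install.yml` once your inventory is in place.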

Then, you can follow the video below, which walks through setting up the entire cluster and gives a practical demonstration of this task. In my case, I have configured my Controller node as the Hadoop Master and the other 2 Managed Nodes as Slaves, which contribute their storage to the Master.

You can also refer to the GitHub URL attached below to see my entire YAML code.
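A layout like this (one master on the controller plus two slaves) could be described to Ansible with a YAML inventory along the following lines. This is only a sketch: the group names, host names and IP addresses are hypothetical placeholders, not the ones from my setup.

```yaml
# inventory.yml - sketch with hypothetical names and addresses
all:
  children:
    hadoop_master:
      hosts:
        controller:
          ansible_connection: local   # the master is the Ansible controller itself
    hadoop_slaves:
      hosts:
        slave1:
          ansible_host: 192.0.2.11    # placeholder address
        slave2:
          ansible_host: 192.0.2.12    # placeholder address
```

With groups like these, a play can target `hadoop_master` for the master-side configuration and `hadoop_slaves` for the storage nodes. Running `ansible-inventory -i inventory.yml --graph` is a quick way to check that the grouping is what you expect.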





Aditya Pande
