ASSIGNMENT
Hands-on with HDFS
Task: Install Hadoop in pseudo-distributed mode or use an online simulator.
Upload and retrieve a sample file using HDFS commands.
Deliverable: Screenshots of steps + command list.
Evaluation Criteria: Execution, clarity of explanation.
Steps to Install Hadoop in Pseudo-Distributed Mode (Conceptual with
Command Examples):
1. Install Java: Hadoop requires Java (Hadoop 3.3.x runs on Java 8 or 11).
Let's assume you have it installed. You can check with:
```bash
java -version
```
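If the command fails, you can install OpenJDK 11 first. A minimal sketch for a Debian/Ubuntu system (assumed here; package names differ on other distros):
```bash
# Install OpenJDK 11 (Debian/Ubuntu package name)
sudo apt-get update
sudo apt-get install -y openjdk-11-jdk
```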
2. Download and Extract Hadoop:
Let's say you've downloaded the Hadoop binary release (e.g.,
`hadoop-3.3.6.tar.gz`) to your home directory.
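If you don't have the tarball yet, one way to fetch it is from the Apache archive (the URL below follows the archive's standard layout; adjust the version as needed):
```bash
# Download the Hadoop 3.3.6 binary release from the Apache archive
wget https://archive.apache.org/dist/hadoop/common/hadoop-3.3.6/hadoop-3.3.6.tar.gz
```
Then extract it: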
```bash
tar -xzvf hadoop-3.3.6.tar.gz
cd hadoop-3.3.6
```
3. Set Environment Variables: You'll need to configure your `~/.bashrc`
or `~/.zshrc` file. Add the following lines (adjust the path if your Hadoop
directory is different):
```bash
export HADOOP_HOME=/home/$USER/hadoop-3.3.6
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
```
Then, apply the changes:
```bash
source ~/.bashrc
```
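To confirm the variables took effect, check that the `hadoop` command is now on your PATH:
```bash
# Prints the Hadoop version banner if HADOOP_HOME and PATH are set correctly
hadoop version
```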
4. Edit the Hadoop Configuration Files: Navigate to the `etc/hadoop`
directory within your Hadoop installation. You'll need to edit a few key
files:
`hadoop-env.sh`: Set the `JAVA_HOME` variable.
```bash
nano etc/hadoop/hadoop-env.sh
```
Add or uncomment a line similar to the following (adjust to your actual Java path):
```bash
export JAVA_HOME=/usr/lib/jvm/java-11-openjdk-amd64
```
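If you're not sure where your JVM lives, one way to find the path (assuming `java` is on your PATH) is:
```bash
# Follow symlinks to the real java binary; JAVA_HOME is this path
# with the trailing /bin/java removed
readlink -f "$(which java)"
```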
`core-site.xml`: Configure the default HDFS NameNode address.
```bash
nano etc/hadoop/core-site.xml
```
Add the following within the `<configuration>` tags:
```xml
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
```
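After saving, you can sanity-check that Hadoop picks the value up (this reads the config files directly, so the daemons don't need to be running yet):
```bash
# Should print hdfs://localhost:9000
hdfs getconf -confKey fs.defaultFS
```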
`hdfs-site.xml`: Configure the replication factor and the NameNode/DataNode
storage directories. (Note: `/tmp` is wiped on reboot, so pick a persistent
path for anything beyond a quick test.)
```bash
nano etc/hadoop/hdfs-site.xml
```
Add the following within the `<configuration>` tags:
```xml
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>/tmp/hadoop-data</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<value>/tmp/hadoop-name</value>
</property>
```
`mapred-site.xml`: Configure the MapReduce execution framework. In Hadoop
3.x this file already exists; on older 2.x releases you may need to copy the
template first:
```bash
# The cp is only needed if mapred-site.xml does not already exist
cp etc/hadoop/mapred-site.xml.template etc/hadoop/mapred-site.xml
nano etc/hadoop/mapred-site.xml
```
Add the following within the `<configuration>` tags:
```xml
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
```
`yarn-site.xml`: Configure YARN's auxiliary services and the ResourceManager
hostname.
```bash
nano etc/hadoop/yarn-site.xml
```
Add the following within the `<configuration>` tags:
```xml
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.resourcemanager.hostname</name>
<value>localhost</value>
</property>
```
5. Format the NameNode: This initializes the HDFS file system. Run it only
once on a fresh installation (re-formatting wipes HDFS metadata):
```bash
hdfs namenode -format
```
6. Start Hadoop Services:
```bash
start-dfs.sh
start-yarn.sh
```
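To verify that all the daemons came up, `jps` (bundled with the JDK) lists the running Java processes:
```bash
# Expect NameNode, DataNode, SecondaryNameNode,
# ResourceManager, and NodeManager in the output
jps
```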
7. Access Hadoop Web UIs (optional but useful for monitoring):
NameNode: `http://localhost:9870` (Hadoop 3.x; older 2.x versions use
`http://localhost:50070`)
ResourceManager: `http://localhost:8088`
Upload and Retrieve a Sample File Using HDFS Commands:
First, create a sample file on your local filesystem:
```bash
echo "This is a sample file for Hadoop HDFS." > sample.txt
```
Now, let's use HDFS commands:
1. Create a directory in HDFS (optional but good practice):
```bash
hdfs dfs -mkdir /user/$USER/input
```
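If the parent directories don't exist yet (typical right after formatting), add `-p` to create them along the way:
```bash
# -p creates /user and /user/$USER as needed, like mkdir -p locally
hdfs dfs -mkdir -p /user/$USER/input
```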
2. Upload the local file to HDFS:
```bash
hdfs dfs -put sample.txt /user/$USER/input/
```
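`-copyFromLocal` is an equivalent alternative that makes the intent explicit (it errors if the source isn't a local file):
```bash
hdfs dfs -copyFromLocal sample.txt /user/$USER/input/
```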
3. List the files in the HDFS directory:
```bash
hdfs dfs -ls /user/$USER/input/
```
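You can also print the file's contents straight from HDFS without downloading it:
```bash
hdfs dfs -cat /user/$USER/input/sample.txt
```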
4. Retrieve the file from HDFS to your local machine:
```bash
hdfs dfs -get /user/$USER/input/sample.txt retrieved_sample.txt
```
5. Verify the contents of the retrieved file:
```bash
cat retrieved_sample.txt
```
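As a final sanity check, confirm the round trip was lossless:
```bash
# No output means the uploaded and retrieved files are identical
diff sample.txt retrieved_sample.txt
```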
6. Stop the Hadoop services when you're finished:
```bash
stop-yarn.sh
stop-dfs.sh
```