Running Jar Program

Steps to run a Hadoop Jar file using Eclipse

1. Download and open Eclipse (version: Mars) in Linux.
2. Create a Java project.
3. Create a Java class named WordCount as follows:

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

    // Mapper: splits each input line into tokens and emits a (word, 1) pair per token.
    public static class TokenizerMapper
            extends Mapper<Object, Text, Text, IntWritable> {

        private final static IntWritable one = new IntWritable(1);
        private Text word = new Text();

        public void map(Object key, Text value, Context context
                        ) throws IOException, InterruptedException {
            StringTokenizer itr = new StringTokenizer(value.toString());
            while (itr.hasMoreTokens()) {
                word.set(itr.nextToken());
                context.write(word, one);
            }
        }
    }

    // Reducer (also used as the combiner): sums the counts for each word.
    public static class IntSumReducer
            extends Reducer<Text, IntWritable, Text, IntWritable> {

        private IntWritable result = new IntWritable();

        public void reduce(Text key, Iterable<IntWritable> values,
                           Context context
                           ) throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable val : values) {
                sum += val.get();
            }
            result.set(sum);
            context.write(key, result);
        }
    }

    // Driver: configures and submits the job; args[0] is the HDFS input path,
    // args[1] is the HDFS output path.
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "word count");
        job.setJarByClass(WordCount.class);
        job.setMapperClass(TokenizerMapper.class);
        job.setCombinerClass(IntSumReducer.class);
        job.setReducerClass(IntSumReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
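
To make the data flow concrete (this trace is illustrative, not captured program output), consider the two sample input lines used later in this document. TokenizerMapper emits one (word, 1) pair per token:

"Hello World Bye World"        ->  (Hello, 1) (World, 1) (Bye, 1) (World, 1)
"Hello Hadoop Goodbye Hadoop"  ->  (Hello, 1) (Hadoop, 1) (Goodbye, 1) (Hadoop, 1)

IntSumReducer (also used as the combiner) then sums the values for each key, producing the final counts: Bye 1, Goodbye 1, Hadoop 2, Hello 2, World 2.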

4. Right-click the project, select Run As > Run Configurations, and create a new launch configuration under Java Application. Give the configuration a name, select the project and its main class, click Apply, and then Close to close the window.
5. Right-click the project and click Export. Select Runnable JAR file under Java. Select the launch configuration created above and the destination directory (e.g., dest/wordcount.jar) in which to store the jar file.
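
As a quick check (assuming the example destination above), the exported jar should now be present on the local file system:

$ ls dest/
wordcount.jar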
6. Make sure that the $jps command lists all the running daemons, as follows (a sample $jps output is shown after this list):

NodeManager
NameNode
DataNode
SecondaryNameNode
ResourceManager
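
A typical listing looks like this (the process IDs are illustrative and will differ on your machine):

$ jps
4368 NameNode
4542 DataNode
4763 SecondaryNameNode
4921 ResourceManager
5103 NodeManager
5420 Jps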

7. Create the input directory in HDFS as follows:

$hdfs dfs -mkdir -p /user/dbda/input
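
Optionally, confirm that the directory was created:

$ hdfs dfs -ls /user/dbda

/user/dbda/input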

8. Copy the local directory containing the input files into HDFS as follows:

$hdfs dfs -copyFromLocal <local file directory> /user/dbda/input
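
For example, if the input files sit in a local directory named ~/wordcount_input (a hypothetical path used here only for illustration), the command would be:

$ hdfs dfs -copyFromLocal ~/wordcount_input/* /user/dbda/input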

9. Run the Hadoop program using the jar file as follows:

$yarn jar <jar path>/wordcount.jar /user/dbda/input /user/dbda/output

The program starts running.

10. To see the results, list the contents of the output directory (see the listing below). It has two files, _SUCCESS and part-r-00000.
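
A listing of the output directory should show the two files:

$ hdfs dfs -ls /user/dbda/output

/user/dbda/output/_SUCCESS
/user/dbda/output/part-r-00000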
11. The results can be displayed by using the following command:

$hdfs dfs -cat /user/dbda/output/part-r-00000

The result will be shown in the command window.


12. The results can also be downloaded using a browser.
13. Open the browser at the URL localhost:50070. The NameNode information page will be displayed.
14. Go to the Utilities tab and select Browse the file system. Navigate to the input and output folders created in HDFS. The input data as well as the generated output can be downloaded.
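
Alternatively, the output can be copied from HDFS to the local file system on the command line (the local destination directory here is just an example):

$ hdfs dfs -get /user/dbda/output ./wordcount_output
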
Steps to run a Hadoop Jar file using Command Line

Compile WordCount.java and create a jar:

$ javac -cp $(hadoop classpath) WordCount.java

$ jar cf wc.jar WordCount*.class
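
Compilation produces three class files (the two inner classes are compiled into separate files), which is why the jar command above uses the WordCount*.class wildcard:

$ ls WordCount*.class

WordCount.class
WordCount$IntSumReducer.class
WordCount$TokenizerMapper.class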

Assuming that:

- /user/dbda/wordcount/input is the input directory in HDFS
- /user/dbda/wordcount/output is the output directory in HDFS

Sample text files as input:

$ hdfs dfs -ls /user/dbda/wordcount/input/

/user/dbda/wordcount/input/file01
/user/dbda/wordcount/input/file02

$ hdfs dfs -cat /user/dbda/wordcount/input/file01

Hello World Bye World

$ hdfs dfs -cat /user/dbda/wordcount/input/file02

Hello Hadoop Goodbye Hadoop

Run the application:

$ yarn jar wc.jar WordCount /user/dbda/wordcount/input /user/dbda/wordcount/output

Output:

$ hdfs dfs -cat /user/dbda/wordcount/output/part-r-00000

Bye 1
Goodbye 1
Hadoop 2
Hello 2
World 2
