0% found this document useful (0 votes)

75 views10 pages

5-Practicas+BigData Trabajar Hdfs

This document provides an overview of HDFS commands and conducting basic operations in HDFS such as: 1. Using the hdfs dfs command to view, create, copy, move, and delete files and directories in HDFS. 2. Exploring how HDFS stores file data across multiple blocks and nodes for replication and reliability. 3. Generating sample text and large files in HDFS to demonstrate how HDFS partitions files across blocks and nodes. 4. Performing basic file operations like copying, moving and deleting files within HDFS similar to Linux commands.

Uploaded by

Christiam Niño

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

75 views10 pages

5-Practicas+BigData Trabajar Hdfs

Uploaded by

Christiam Niño

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Apasoft Training

Prácticas BigData
1. Prácticas con HDFS
1.1. Comando hdfs dfs
• Ejecutar el comando “hdfs dfs”. Este comando permite trabajar con los ficheros
de HDFS.
• Casi todas las opciones son similares a los comandos “Linux”
hdfs dfs
Usage: hadoop fs [generic options]
[-appendToFile <localsrc> ... <dst>]
[-cat [-ignoreCrc] <src> ...]
[-checksum <src> ...]
[-chgrp [-R] GROUP PATH...]
[-chmod [-R] <MODE[,MODE]... | OCTALMODE> PATH...]
[-chown [-R] [OWNER][:[GROUP]] PATH...]
[-copyFromLocal [-f] [-p] [-l] [-d] <localsrc> ... <dst>]
[-copyToLocal [-f] [-p] [-ignoreCrc] [-crc] <src> ... <localdst>]
[-count [-q] [-h] [-v] [-t [<storage type>]] [-u] [-x] <path> ...]
[-cp [-f] [-p | -p[topax]] [-d] <src> ... <dst>]
[-createSnapshot <snapshotDir> [<snapshotName>]]
[-deleteSnapshot <snapshotDir> <snapshotName>]
[-df [-h] [<path> ...]]
[-du [-s] [-h] [-x] <path> ...]
[-expunge]
[-find <path> ... <expression> ...]
[-get [-f] [-p] [-ignoreCrc] [-crc] <src> ... <localdst>]
[-getfacl [-R] <path>]
[-getfattr [-R] {-n name | -d} [-e en] <path>]
[-getmerge [-nl] [-skip-empty-file] <src> <localdst>]
[-help [cmd ...]]
[-ls [-C] [-d] [-h] [-q] [-R] [-t] [-S] [-r] [-u] [<path> ...]]
[-mkdir [-p] <path> ...]
[-moveFromLocal <localsrc> ... <dst>]
[-moveToLocal <src> <localdst>]
[-mv <src> ... <dst>]

www.apasoft-training.com 1
Apasoft Training

[-put [-f] [-p] [-l] [-d] <localsrc> ... <dst>]

[-renameSnapshot <snapshotDir> <oldName> <newName>]
[-rm [-f] [-r|-R] [-skipTrash] [-safely] <src> ...]
[-rmdir [--ignore-fail-on-non-empty] <dir> ...]
[-setfacl [-R] [{-b|-k} {-m|-x <acl_spec>} <path>]|[--set <acl_spec> <path>]]
[-setfattr {-n name [-v value] | -x name} <path>]
[-setrep [-R] [-w] <rep> <path> ...]
[-stat [format] <path> ...]
[-tail [-f] <file>]
[-test -[defsz] <path>]
[-text [-ignoreCrc] <src> ...]
[-touchz <path> ...]
[-truncate [-w] <length> <path> ...]
[-usage [cmd ...]]

Generic options supported are:

-conf <configuration file> specify an application configuration file
-D <property=value> define a value for a given property
-fs <file:///|hdfs://namenode:port> specify default filesystem URL to use, overrides
'fs.defaultFS' property from configurations.
-jt <local|resourcemanager:port> specify a ResourceManager
-files <file1,...> specify a comma-separated list of files to be copied to the
map reduce cluster
-libjars <jar1,...> specify a comma-separated list of jar files to be included
in the classpath
-archives <archive1,...> specify a comma-separated list of archives to be
unarchived on the compute machines

The general command line syntax is:

command [genericOptions] [commandOptions]
• Vamos a ver el contenido de nuestr HDFS. En principio debe estar vacío
hdfs dfs -ls /
• También podemos ver que está vacío desde la web de administración en el menú
Utilities  Browse the File System

www.apasoft-training.com 2
Apasoft Training

• Vamos a crear un nuevo directorio

hdfs dfs -mkdir /datos
• Comprobar que existe
hdfs dfs -ls /
Found 1 items
drwxr-xr-x - hadoop supergroup 0 2018-01-06 18:31 /datos
• Podemos verlo en la página WEB

• Creamos un fichero en el directorio /tmp con alguna frase

echo "Esto es una prueba" >/tmp/prueba.txt
• Copiarlo al HDFS, en concreto al directorio /datos. Usamos el comando “put”
hdfs dfs -put /tmp/prueba.txt /datos

www.apasoft-training.com 3
Apasoft Training

• Comprobar su existencia
hdfs dfs -ls /datos
Found 1 items
-rw-r--r-- 1 hadoop supergroup 19 2018-01-06 18:34 /datos/prueba.txt
• También podemos verlo en la página web. Podemos comprobar el tipo de
replicación que tiene y el tamaño correspondiente.

• Visualizar su contenido
hdfs dfs -cat /datos/prueba.txt
Esto es una prueba
• Vamos a comprobar lo que ha creado a nivel de HDFS
• Vamos a la página WEB y pulsamos en el nombre del fichero.
• Debe aparecer algo parecido a lo siguiente

www.apasoft-training.com 4
Apasoft Training

• Vemos que solo ha creado un bloque, ya que el BLOCK SIZE por defecto de
HDFS es 128M y por lo tanto nuestro pequeño fichero solo genera uno.
• Además, nos dice el BLOCK_ID y también los nodos donde ha creado las
réplicas. Como tenemos un replication de 1, solo aparece el nodo1. Cuando
veamos la parte del cluster completo veremos más nodos
• Volvemos al sistema operativo y nos vamos al directorio siguiente.
Evidentemente el subdirectorio BP-XXXX será distinto en tu caso. Se
corresponde con el Block Pool ID que genera de forma automática Hadoop.
/datos/datanode/current/BP-344905797-192.168.56.101-
1515254230192/current/finalized
• Dentro de este subdirectorio, Hadoop irá creando una estructura de
subdirectorios donde albergará los bloques de datos, don el formato
subdirN/subdirN, en este caso subdir0/subdir0.
• Entramos en él.
cd subdir0/
cd subdir0/
ls -l
total 8
-rw-rw-r--. 1 hadoop hadoop 19 ene 6 18:34 blk_1073741825
-rw-rw-r--. 1 hadoop hadoop 11 ene 6 18:34 blk_1073741825_1001.meta
• Podemos comprobar que hay dos ficheros con el mismo BLOCK_ID que
aparece en la página WEB.
o Uno contiene los datos

www.apasoft-training.com 5
Apasoft Training

o El otro contiene metadatos

• Podemos comprobarlo si visualizamos el contenido
cat blk_1073741825
Esto es una prueba
• Evidentemente, cuando tengamos ficheros muy grandes o que no sean texto, esto
no es de ninguna utilidad. Solo lo hacemos para entender bien HDFS.
• Vamos a crear otro ejemplo con un fichero grande
• Lanzamos este comando para generar un fichero de 1G en /tmp, llamado
fic_grande.dat, lleno de ceros (el comando dd de Linux permite hacer esto entre
otras muchas cosasI)
dd if=/dev/zero of=/tmp/fic_grande.dat bs=1024 count=1000000
1000000+0 registros leídos
1000000+0 registros escritos
1024000000 bytes (1,0 GB) copiados, 5,1067 s, 201 MB/s
• Lo subimos al directorio /datos de nuestro HDFS
hdfs dfs -put /tmp/fic_grande.dat /datos
• Podemos comprobar en la página web que ha creado múltiples bloques de
128MB

www.apasoft-training.com 6
Apasoft Training

• Si comprobamos de nuevo el directorio subdir0 podemos ver los bloques

correspondientes
ls -l
total 1007852
-rw-rw-r--. 1 hadoop hadoop 19 ene 6 18:34 blk_1073741825
-rw-rw-r--. 1 hadoop hadoop 11 ene 6 18:34 blk_1073741825_1001.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741826
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741826_1002.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741827
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741827_1003.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741828
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741828_1004.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741829
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741829_1005.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741830
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741830_1006.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741831
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741831_1007.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741832
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741832_1008.meta
-rw-rw-r--. 1 hadoop hadoop 84475904 ene 6 19:00 blk_1073741833
-rw-rw-r--. 1 hadoop hadoop 659975 ene 6 19:00 blk_1073741833_1009.meta

• Vamos a crear otro directorio llamado “practicas”

hdfs dfs -mkdir /practicas
• Copiamos prueba.txt desde datos a prácticas
hdfs dfs -cp /datos/prueba.txt /practicas/prueba.txt
• Comprobamos el contenido
hdfs dfs -ls /practicas
Found 1 items
-rw-r--r-- 1 hadoop supergroup 19 2018-01-06 19:08 /practicas/prueba.txt
• Borramos el fichero
hdfs dfs -rm /practicas/prueba.txt
Deleted /practicas/prueba.txt
• Vemos que los comandos son muy parecidos a Linux

www.apasoft-training.com 7
Apasoft Training

1.2. Nuestro primer proceso Hadoop

• Vamos a ejecutar nuestro primer trabajo hadoop. Luego veremos con más detalle
esto.
• Hadoop tiene una serie de ejemplos que se encuentran en el fichero siguiente
(recordad el número de versión)
/opt/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.9.0.jar

• Para lanzar un proceso hadoop Map Reduce usamos el comando

hadoop jar librería.jar proceso
• En este caso, si queremos ver los programas que hay en ese “jar” ponemos lo
siguiente, sin poner el comando final
hadoop jar /opt/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-
examples-2.9.0.jar
An example program must be given as the first argument.
Valid program names are:
aggregatewordcount: An Aggregate based map/reduce program that counts the
words in the input files.
aggregatewordhist: An Aggregate based map/reduce program that computes the
histogram of the words in the input files.
bbp: A map/reduce program that uses Bailey-Borwein-Plouffe to compute exact
digits of Pi.
dbcount: An example job that count the pageview counts from a database.
distbbp: A map/reduce program that uses a BBP-type formula to compute exact bits
of Pi.
grep: A map/reduce program that counts the matches of a regex in the input.
join: A job that effects a join over sorted, equally partitioned datasets
multifilewc: A job that counts words from several files.
pentomino: A map/reduce tile laying program to find solutions to pentomino
problems.
pi: A map/reduce program that estimates Pi using a quasi-Monte Carlo method.
randomtextwriter: A map/reduce program that writes 10GB of random textual data
per node.
randomwriter: A map/reduce program that writes 10GB of random data per node.
secondarysort: An example defining a secondary sort to the reduce.
sort: A map/reduce program that sorts the data written by the random writer.
sudoku: A sudoku solver.
teragen: Generate data for the terasort
terasort: Run the terasort
teravalidate: Checking results of terasort
wordcount: A map/reduce program that counts the words in the input files.

www.apasoft-training.com 8
Apasoft Training

wordmean: A map/reduce program that counts the average length of the words in
the input files.
wordmedian: A map/reduce program that counts the median length of the words in
the input files.
wordstandarddeviation: A map/reduce program that counts the standard deviation
of the length of the words in the input files.
• Vemos que hay un comando llamado “wordcount”.
• Permite contar las palabras que hay en uno o varios ficheros.
• Creamos un par de ficheros con palabras (algunas repetidas) y lo guardamos en
ese directorio
hdfs dfs -put /tmp/palabras.txt /practicas
hdfs dfs -put /tmp/palabras1.txt /practicas
• Lanzamos el comando
hadoop jar /opt/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-
examples-2.9.0.jar wordcount /practicas /salida1
INFO mapreduce.Job: Counters: 38
File System Counters
FILE: Number of bytes read=812740
FILE: Number of bytes written=1578775
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=211
HDFS: Number of bytes written=74
HDFS: Number of read operations=25
HDFS: Number of large read operations=0
HDFS: Number of write operations=5
Map-Reduce Framework
Map input records=2
Map output records=16
Map output bytes=147
Map output materialized bytes=191
Input split bytes=219
Combine input records=16
Combine output records=16
Reduce input groups=10
Reduce shuffle bytes=191
Reduce input records=16
Reduce output records=10
Spilled Records=32
Shuffled Maps =2

www.apasoft-training.com 9
Apasoft Training

Failed Shuffles=0
Merged Map outputs=2
GC time elapsed (ms)=131
CPU time spent (ms)=0
Physical memory (bytes) snapshot=0
Virtual memory (bytes) snapshot=0
Total committed heap usage (bytes)=549138432
Shuffle Errors
BAD_ID=0
CONNECTION=0
IO_ERROR=0
WRONG_LENGTH=0
WRONG_MAP=0
WRONG_REDUCE=0
File Input Format Counters
Bytes Read=84
File Output Format Counters
Bytes Written=74

• Podemos ver el contenido del directorio

hdfs dfs -ls /salida
Found 2 items
-rw-r--r-- 1 hadoop supergroup 0 2015-04-20 07:52 /salida/_SUCCESS
-rw-r--r-- 1 hadoop supergroup 74 2015-04-20 07:52 /salida/part-r-00000
[hadoop@localhost ~]$ hadoop fs -cat /salida/part-r-00000
Esto 1
con 2
el 2
es 2
esto 1
fichero 2
primer 1
prueba 2
segundo 1
una 2

www.apasoft-training.com 10

Apache Hadoop
No ratings yet
Apache Hadoop
3 pages
Hadoop HDFS Commands
No ratings yet
Hadoop HDFS Commands
1 page
Hadoop Hdfs Commands
No ratings yet
Hadoop Hdfs Commands
2 pages
Big Data Cheat Sheet
No ratings yet
Big Data Cheat Sheet
12 pages
Hadoop 1
No ratings yet
Hadoop 1
15 pages
Hadoop
No ratings yet
Hadoop
4 pages
Data Storage Data Processing: Hadoop Distributed File System (HDFS) Mapreduce
No ratings yet
Data Storage Data Processing: Hadoop Distributed File System (HDFS) Mapreduce
35 pages
Hadoop HDFS Setup and Commands Guide
No ratings yet
Hadoop HDFS Setup and Commands Guide
7 pages
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
No ratings yet
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
74 pages
Hadoop Linux Commands
No ratings yet
Hadoop Linux Commands
8 pages
HDFS Commnads
No ratings yet
HDFS Commnads
26 pages
Hadoop
No ratings yet
Hadoop
6 pages
Lista de Comandos HDFS
No ratings yet
Lista de Comandos HDFS
8 pages
1 Hdfs Notes
No ratings yet
1 Hdfs Notes
38 pages
Extreme Computing Lab Exercises Session One: 1 Getting Started
No ratings yet
Extreme Computing Lab Exercises Session One: 1 Getting Started
6 pages
Ai&Ml (Bdamanual)
No ratings yet
Ai&Ml (Bdamanual)
24 pages
BDA Record
No ratings yet
BDA Record
34 pages
Week 1 in Terminal
No ratings yet
Week 1 in Terminal
10 pages
PDC All Labs
100% (1)
PDC All Labs
129 pages
BDH Record - Merged
No ratings yet
BDH Record - Merged
47 pages
HDFS Commands - Revised
No ratings yet
HDFS Commands - Revised
6 pages
Unit 2-HDFS SGS
No ratings yet
Unit 2-HDFS SGS
29 pages
Exp-2 Hadoop Commands
No ratings yet
Exp-2 Hadoop Commands
6 pages
2335 m4 Demo1 v1 b54 kwf9d75
No ratings yet
2335 m4 Demo1 v1 b54 kwf9d75
8 pages
Command
No ratings yet
Command
1 page
Lab2 BigData-HDFSp
No ratings yet
Lab2 BigData-HDFSp
4 pages
3 Hadoop
No ratings yet
3 Hadoop
40 pages
HDFS
No ratings yet
HDFS
18 pages
Big Data Questions MQC
No ratings yet
Big Data Questions MQC
9 pages
Exp 1-2
No ratings yet
Exp 1-2
9 pages
2 HDFS Commands
No ratings yet
2 HDFS Commands
7 pages
Hadoop File System: CSC 369 Distributed Computing Alexander Dekhtyar
No ratings yet
Hadoop File System: CSC 369 Distributed Computing Alexander Dekhtyar
5 pages
Hadoop Assignement Sumit 241111 133837
No ratings yet
Hadoop Assignement Sumit 241111 133837
13 pages
COMMAND Line Interface
No ratings yet
COMMAND Line Interface
26 pages
9 Practicas+BigData MapReduce
No ratings yet
9 Practicas+BigData MapReduce
6 pages
How To Set Up A Hadoop Cluster in Docker
No ratings yet
How To Set Up A Hadoop Cluster in Docker
13 pages
Hafs Commands
No ratings yet
Hafs Commands
17 pages
HDFS Commands Guide for Beginners
No ratings yet
HDFS Commands Guide for Beginners
36 pages
Hadoop Setup & File Management Guide
No ratings yet
Hadoop Setup & File Management Guide
16 pages
HDFS Commands Updated
No ratings yet
HDFS Commands Updated
87 pages
Practical 1 - 1 - Hadoop Commands
No ratings yet
Practical 1 - 1 - Hadoop Commands
3 pages
BDM Hdfs
No ratings yet
BDM Hdfs
37 pages
HDFS Command
No ratings yet
HDFS Command
15 pages
Unit II Hadoop and Map Reduce Overview
No ratings yet
Unit II Hadoop and Map Reduce Overview
136 pages
HDFS
No ratings yet
HDFS
6 pages
Basic HDFS Commands
No ratings yet
Basic HDFS Commands
7 pages
HDFS Commands
No ratings yet
HDFS Commands
15 pages
Lab2 BD
No ratings yet
Lab2 BD
20 pages
Big Data Ia Answers
No ratings yet
Big Data Ia Answers
14 pages
Hadoop Commands
No ratings yet
Hadoop Commands
5 pages
HDFS File System Shell Guide
No ratings yet
HDFS File System Shell Guide
10 pages
Hadoop Tutorial
No ratings yet
Hadoop Tutorial
13 pages
BDA UNIT - 3 Updated
No ratings yet
BDA UNIT - 3 Updated
25 pages
Hadoop HDFS Commands Guide
No ratings yet
Hadoop HDFS Commands Guide
2 pages
Dsa Practical File
No ratings yet
Dsa Practical File
16 pages
C21053 Jay Vijay Karwatkar-Big Data Analytics & Visualization
No ratings yet
C21053 Jay Vijay Karwatkar-Big Data Analytics & Visualization
210 pages
Prácticas Bigdata: 1. Lanzar Un Proceso Mapreduce Contra El Cluster
No ratings yet
Prácticas Bigdata: 1. Lanzar Un Proceso Mapreduce Contra El Cluster
3 pages
Comandos Hive SQL
100% (1)
Comandos Hive SQL
5 pages
Borrar CACHE
No ratings yet
Borrar CACHE
1 page
Oracle Linux 7.x Installation Guide
No ratings yet
Oracle Linux 7.x Installation Guide
11 pages
Document 376700.1 PDF
No ratings yet
Document 376700.1 PDF
26 pages
PostgreSQL CREATE PROCEDURE Guide
No ratings yet
PostgreSQL CREATE PROCEDURE Guide
4 pages
Web Sraping
No ratings yet
Web Sraping
11 pages
MySQL - Day 02 - DDL Commands
No ratings yet
MySQL - Day 02 - DDL Commands
13 pages
Lecture 5 - Functional Forms of Linear Regression Models - Lin-Log Model
No ratings yet
Lecture 5 - Functional Forms of Linear Regression Models - Lin-Log Model
6 pages
Mongodb Cheat Sheet: by Via
No ratings yet
Mongodb Cheat Sheet: by Via
1 page
Dea C01
No ratings yet
Dea C01
7 pages
Advanced SQL for Data Analysts
No ratings yet
Advanced SQL for Data Analysts
78 pages
S N Technique Description Remarks: © Yoda Learning Solutions Author: Rishabh Pugalia - Updated: 02-Dec-19 - V 5.0
No ratings yet
S N Technique Description Remarks: © Yoda Learning Solutions Author: Rishabh Pugalia - Updated: 02-Dec-19 - V 5.0
2 pages
M. ADs
No ratings yet
M. ADs
15 pages
Europass CV Template
100% (1)
Europass CV Template
2 pages
Website: Vce To PDF Converter: Facebook: Twitter:: C2090-616.Vceplus - Premium.Exam.63Q
No ratings yet
Website: Vce To PDF Converter: Facebook: Twitter:: C2090-616.Vceplus - Premium.Exam.63Q
20 pages
Darshan - BA Assignment
No ratings yet
Darshan - BA Assignment
10 pages
UmakantBabasahebJadhav - Dot Net Developer
No ratings yet
UmakantBabasahebJadhav - Dot Net Developer
6 pages
Gaurav Misra: Education
No ratings yet
Gaurav Misra: Education
1 page
Expert Veri Ed, Online, Free.: Custom View Settings Question #77
No ratings yet
Expert Veri Ed, Online, Free.: Custom View Settings Question #77
2 pages
Ims-Db Training Class 09
No ratings yet
Ims-Db Training Class 09
18 pages
SQL
No ratings yet
SQL
10 pages
Akshay - Dayath - SAP S - 4 HANA CONSULTANT
No ratings yet
Akshay - Dayath - SAP S - 4 HANA CONSULTANT
3 pages
2-3-4 Tree Is A Self-Balancing
No ratings yet
2-3-4 Tree Is A Self-Balancing
6 pages
Pega 7: Platform Support Guide
No ratings yet
Pega 7: Platform Support Guide
48 pages
Pharmacy Management - Merged
No ratings yet
Pharmacy Management - Merged
31 pages
Sqlmap Cheat Sheet
No ratings yet
Sqlmap Cheat Sheet
12 pages
A Comparative Study of Classifying Legal Documents With Neural Networks
No ratings yet
A Comparative Study of Classifying Legal Documents With Neural Networks
9 pages
Anti-Virus Exclusions With 8.5
No ratings yet
Anti-Virus Exclusions With 8.5
3 pages
CSC 202 File Processing
No ratings yet
CSC 202 File Processing
54 pages
Ibm PROJECT 1 1 Output
No ratings yet
Ibm PROJECT 1 1 Output
10 pages
BDA Unit-1
No ratings yet
BDA Unit-1
31 pages
Handwritten Signature ID Project
No ratings yet
Handwritten Signature ID Project
24 pages
Professional Synopsis:: Kunal Anarse
No ratings yet
Professional Synopsis:: Kunal Anarse
2 pages
3rd PL - SQL Interview Questions (2022) - Javatpoint
No ratings yet
3rd PL - SQL Interview Questions (2022) - Javatpoint
17 pages

5-Practicas+BigData Trabajar Hdfs

Uploaded by

5-Practicas+BigData Trabajar Hdfs

Uploaded by

Apasoft Training

[-put [-f] [-p] [-l] [-d] <localsrc> ... <dst>]

Generic options supported are:

The general command line syntax is:

• Vamos a crear un nuevo directorio

• Creamos un fichero en el directorio /tmp con alguna frase

o El otro contiene metadatos

• Si comprobamos de nuevo el directorio subdir0 podemos ver los bloques

• Vamos a crear otro directorio llamado “practicas”

1.2. Nuestro primer proceso Hadoop

• Para lanzar un proceso hadoop Map Reduce usamos el comando

• Podemos ver el contenido del directorio

You might also like