### HDFS Pseudo-Distributed Configuration Tutorial
In a pseudo-distributed setup, all Hadoop daemons run on a single machine, each in its own Java process, simulating a small cluster. This configuration is useful for testing and development.
#### Setting Up Environment Variables
Ensure that Java and SSH are installed, since both are required by the Hadoop components. Set the `JAVA_HOME` variable in the `.bashrc` file:
```bash
export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64/
export PATH=$PATH:$JAVA_HOME/bin
```
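Hadoop's start scripts also expect passwordless SSH to localhost. If `ssh localhost` still prompts for a password, the key setup recommended in the Hadoop single-node documentation looks like this:
```bash
# Generate a passphrase-less key and authorize it for the local account
ssh-keygen -t rsa -P '' -f ~/.ssh/id_rsa
cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
chmod 0600 ~/.ssh/authorized_keys
```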
#### Configuring core-site.xml
Edit `$HADOOP_HOME/etc/hadoop/core-site.xml` and add the `fs.defaultFS` property, which specifies the default filesystem URI. Pointing it at localhost indicates that this is a local HDFS instance running in pseudo-distributed mode[^1]:
```xml
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://localhost:9000</value>
  </property>
</configuration>
```
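As a quick check that Hadoop picks up this setting, the `hdfs getconf` helper prints the value resolved from the configuration files (no daemons need to be running):
```bash
# Print the filesystem URI that HDFS resolves from core-site.xml
$HADOOP_HOME/bin/hdfs getconf -confKey fs.defaultFS
# Expected output: hdfs://localhost:9000
```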
#### Formatting NameNode
Before starting any daemons, format the NameNode with the command-line tool included in the Hadoop distribution[^2]. Run the following from a terminal:
```bash
$HADOOP_HOME/bin/hdfs namenode -format
```
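Note that the format step writes to whichever directory `dfs.namenode.name.dir` resolves to at the time you run it, so if you intend to set that property (next step), do so before formatting. After a successful format, the metadata directory contains a `current/` subfolder; the path below reuses the placeholder from the next step:
```bash
# A freshly formatted NameNode metadata directory contains current/
# with fsimage and VERSION files
ls /path/to/name/directory/current
```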
#### Adjusting hdfs-site.xml File
Edit hdfs-site.xml in the same `etc/hadoop` folder, setting the replication factor (1 is sufficient on a single node) and the local directories the NameNode and DataNode use for metadata and block storage. The HDFS directories for temporary data (`/tmp`) and the Hive warehouse are created later, once the daemons are running.
```xml
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
  <property>
    <name>dfs.namenode.name.dir</name>
    <value>/path/to/name/directory</value>
  </property>
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>/path/to/data/directory</value>
  </property>
</configuration>
```
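The `/path/to/...` values above are placeholders; whichever local paths you substitute must be writable by the account that runs the Hadoop daemons. A minimal sketch, reusing the placeholder paths:
```bash
# Create the local NameNode/DataNode storage directories and give
# ownership to the account that runs the Hadoop daemons
sudo mkdir -p /path/to/name/directory /path/to/data/directory
sudo chown -R "$USER:$USER" /path/to/name/directory /path/to/data/directory
```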
#### Starting Daemons
Start the HDFS and YARN daemons by running the scripts in the `sbin` subdirectory of the installation root:
```bash
$HADOOP_HOME/sbin/start-dfs.sh
$HADOOP_HOME/sbin/start-yarn.sh
```
Verify that the daemons started successfully by checking the web interfaces: the NameNode UI on port 50070 and the ResourceManager UI on port 8088. (On Hadoop 3.x the NameNode UI moves to port 9870.)
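You can also confirm from the shell that all the daemon processes came up; `jps`, which ships with the JDK, lists the running Java processes:
```bash
# Each daemon runs in its own JVM, so all of these should appear:
# NameNode, DataNode, SecondaryNameNode, ResourceManager, NodeManager
jps
```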
#### Creating Directories Required By Hive
If you plan to run Apache Hive on top of this HDFS instance, create the directories it expects and grant group write permission so users can write data to them:
```bash
$HADOOP_HOME/bin/hadoop fs -mkdir -p /tmp
$HADOOP_HOME/bin/hadoop fs -mkdir -p /user/hive/warehouse
$HADOOP_HOME/bin/hadoop fs -chmod g+w /tmp
$HADOOP_HOME/bin/hadoop fs -chmod g+w /user/hive/warehouse
```
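To verify the result, list the directories back and check that the group-write bit is set:
```bash
# Both entries should show group write permission, e.g. drwxrwxr-x
$HADOOP_HOME/bin/hadoop fs -ls -d /tmp /user/hive/warehouse
```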