遇到的问题:
一、从oracle数据库向hbase导入数据时报错,Connection reset
解决:
在import命令中加入-D mapred.child.java.opts="\-Djava.security.egd=/dev/urandom" 。
参考:http://www.cnblogs.com/dalu610/p/6423820.html;https://stackoverflow.com/questions/2327220/oracle-jdbc-intermittent-connection-issue/;http://ifeve.com/jvm-random-and-entropy-source/;http://blog.csdn.net/hguisu/article/details/27305435
二、从千万级数据量大表中导入数据到hbase时,内存空间不足问题,
解决:对hadoop增加配置
参见,https://zh.hortonworks.com/blog/how-to-plan-and-configure-yarn-in-hdp-2-0/