18: Environment Preparation for an Offline Installation of CDH 5.14 on CentOS 7.4 (Alibaba Cloud)

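1: Downloading the core packages:

A CDH installation needs two kinds of packages; pick the CDH version that matches your operating system:
1. repo-as-tarball: Cloudera Manager (server and agent, closed source)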
http://archive.cloudera.com/cm5/repo-as-tarball/5.14.0/

http://archive.cloudera.com/cm5/repo-as-tarball/5.14.0/cm5.14.0-centos7.tar.gz

2. parcels (Cloudera's own compressed package format, bundling Hadoop, ZooKeeper, Hive, Spark 1.6 and so on)
http://archive.cloudera.com/cdh5/parcels/5.14.0/

http://archive.cloudera.com/cdh5/parcels/5.14.0/CDH-5.14.0-1.cdh5.14.0.p0.24-el7.parcel
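http://archive.cloudera.com/cdh5/parcels/5.14.0/CDH-5.14.0-1.cdh5.14.0.p0.24-el7.parcel.sha1 (checksum file)
http://archive.cloudera.com/cdh5/parcels/5.14.0/manifest.json (metadata describing the parcels)

The two files are large, roughly 3 GB, so first install a download tool on the CentOS host and download straight to the Alibaba Cloud machine; downloading to your own computer and uploading afterwards takes much longer:
yum install lynx
lynx http://XXX
wget http://XXX    # wget also works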
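
Since a checksum file ships next to the parcel, it is worth comparing the two before going any further. The following is a minimal sketch, not part of the original steps: the function name verify_parcel is made up for illustration, and it assumes the .sha1 file holds the hex digest as its first field.

```shell
# verify_parcel PARCEL SHA_FILE
# Succeeds when the SHA-1 digest of PARCEL equals the first field of SHA_FILE.
# (verify_parcel is a hypothetical helper name, not a Cloudera tool.)
verify_parcel() {
    vp_expected="$(awk '{print $1; exit}' "$2")"
    vp_actual="$(sha1sum "$1" | awk '{print $1}')"
    [ -n "$vp_expected" ] && [ "$vp_expected" = "$vp_actual" ]
}

# Example, run in the download directory:
# verify_parcel CDH-5.14.0-1.cdh5.14.0.p0.24-el7.parcel \
#     CDH-5.14.0-1.cdh5.14.0.p0.24-el7.parcel.sha1 && echo "checksum OK"
```

A mismatch usually means the download was interrupted, in which case fetching the parcel again is cheaper than debugging a failed distribution later.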

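2: Preparation: setting up the system environment (root account)

Installing as root is the best option. If that is not possible, create a hadoop user with sudo privileges: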

adduser hadoop
passwd hadoop
vi /etc/sudoers    # or: visudo

Add an entry for the new user, modeled on the existing root entry:
#Allow root to run any commands anywhere
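
As an illustration of what "modeled on the root entry" can look like, the sketch below writes a one-line rule for the new user into a separate file instead of editing /etc/sudoers by hand. The helper name write_sudoers_entry and the drop-in path /etc/sudoers.d/hadoop are assumptions made for this example; the rule itself mirrors the usual `root ALL=(ALL) ALL` line.

```shell
# write_sudoers_entry USER FILE
# Writes "USER ALL=(ALL) ALL" to FILE and restricts its permissions.
# (write_sudoers_entry is a hypothetical helper for this example.)
write_sudoers_entry() {
    printf '%s\tALL=(ALL)\tALL\n' "$1" > "$2"
    chmod 440 "$2"
}

# Example, as root; visudo -cf only checks the syntax of the given file:
# write_sudoers_entry hadoop /etc/sudoers.d/hadoop
# visudo -cf /etc/sudoers.d/hadoop
```

Checking the file with visudo before logging out matters, because a syntax error in a sudoers file can lock every sudo user out.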

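2.1: Set the hostname:

Rename each machine so the cluster is easier to maintain and manage:

hostnamectl set-hostname hadoop001    # CentOS 7; on CentOS 6 edit /etc/sysconfig/network instead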
vi /etc/hosts
cat /etc/hosts
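
Every node needs the same name-to-address mapping in /etc/hosts. The sketch below appends an entry only when the name is not mapped yet, so it can be re-run safely. The helper name add_host and the 192.168.0.x addresses are placeholders invented for this example; substitute the real addresses of your own machines.

```shell
# add_host HOSTS_FILE IP NAME
# Appends "IP NAME" to HOSTS_FILE unless NAME already appears as a host name.
# (add_host is a hypothetical helper; the IPs below are placeholders.)
add_host() {
    if ! grep -qE "[[:space:]]$3([[:space:]]|\$)" "$1" 2>/dev/null; then
        printf '%s %s\n' "$2" "$3" >> "$1"
    fi
}

# Example, as root on every node:
# add_host /etc/hosts 192.168.0.101 hadoop001
# add_host /etc/hosts 192.168.0.102 hadoop002
# add_host /etc/hosts 192.168.0.103 hadoop003
```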

2.2: Passwordless SSH between the nodes:

On the master node, under /root/, set up the keys and verify them as follows:

ssh-******
scp id_rsa.pub [email protected]:/root/.ssh/id_rsa.pub2/3
cat id_rsa.pub id_rsa.pub2 id_rsa.pub3 >> authorized_keys
scp authorized_keys [email protected]:/root/.ssh/
chmod 600 /root/.ssh/authorized_keys
chmod 700 -R /home/hadoop/.ssh    # only when installing as the hadoop user

ssh hadoop001 date && ssh hadoop002 date && ssh hadoop003 date
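
The merge step above (collect every node's public key into one authorized_keys file, then tighten its permissions) can be sketched as a small re-runnable helper. The name merge_keys is made up for this example; it only illustrates the append-and-chmod logic and does not replace the key generation or scp steps.

```shell
# merge_keys AUTH_FILE PUBKEY...
# Appends each public key to AUTH_FILE unless the identical line is already
# present, then sets the 600 mode that sshd expects.
# (merge_keys is a hypothetical helper for this example.)
merge_keys() {
    mk_auth="$1"
    shift
    touch "$mk_auth"
    for mk_pub in "$@"; do
        while IFS= read -r mk_line || [ -n "$mk_line" ]; do
            [ -n "$mk_line" ] || continue
            grep -qxF -- "$mk_line" "$mk_auth" || printf '%s\n' "$mk_line" >> "$mk_auth"
        done < "$mk_pub"
    done
    chmod 600 "$mk_auth"
}

# Example, on the master node inside /root/.ssh:
# merge_keys authorized_keys id_rsa.pub id_rsa.pub2 id_rsa.pub3
```

Skipping lines that are already present keeps the file from growing each time a node is re-keyed or the step is repeated.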

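2.3: Turn off the firewall:

1 Stop the firewall
Run: service iptables stop
Verify: service iptables status
2 Disable the firewall at boot
Run: chkconfig iptables off
Verify: chkconfig --list | grep iptables
(On a stock CentOS 7 system the firewall service is firewalld; the equivalents there are systemctl stop firewalld and systemctl disable firewalld.)
Disable SELinux (takes effect after a reboot):
$>vi /etc/selinux/config
SELINUX=disabled
Flush the firewall rules:
iptables -L    list the rules to see whether any remain
iptables -F    flush them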
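
The SELinux edit has to be repeated on every node, so it can be scripted instead of done in vi. This is a minimal sketch assuming GNU sed (the default on CentOS); the helper name disable_selinux_in is invented for this example.

```shell
# disable_selinux_in CONFIG_FILE
# Rewrites the SELINUX= line to "disabled"; the SELINUXTYPE= line is left
# alone because the pattern requires "=" directly after "SELINUX".
# (disable_selinux_in is a hypothetical helper for this example.)
disable_selinux_in() {
    sed -i 's/^SELINUX=.*/SELINUX=disabled/' "$1"
}

# Example, as root:
# disable_selinux_in /etc/selinux/config
# setenforce 0    # switch to permissive mode until the next reboot
# getenforce      # show the current mode
```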

2.4: Install Oracle Java:

Install Java under the /usr/java/ directory,
then edit /etc/profile (vi /etc/profile) and add the following:
export JAVA_HOME=/usr/java/jdk1.8.0_45
export PATH=.:$JAVA_HOME/bin:$PATH

Verify as follows:

which java
/usr/java/jdk1.8.0_45/bin/java

2.5: Install and configure MySQL (master node)

[[email protected] software]# clear
[[email protected] software]# mysql -uroot -p
Enter password:
Welcome to the MySQL monitor. Commands end with ; or \g.
Your MySQL connection id is 1392
Server version: 8.0.13 MySQL Community Server - GPL

Copyright (c) 2000, 2018, Oracle and/or its affiliates. All rights reserved.

Oracle is a registered trademark of Oracle Corporation and/or its
affiliates. Other names may be trademarks of their respective
owners.

Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.

mysql>

2.6: Check Python:

There are quite a few pitfalls here, so stick strictly to the following recommendation:
CentOS 6.x: Python 2.6.x
CentOS 7.x: Python 2.7.x

Python 2.7.5 (default, Aug 4 2017, 00:39:18)
[GCC 4.8.5 20150623 (Red Hat 4.8.5-16)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
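
To run the same check on every node without opening the interpreter, the version string can be tested directly. A small sketch follows; the function name python_is_27 is made up for this example, and it only covers the CentOS 7.x case from the recommendation above.

```shell
# python_is_27 VERSION_STRING
# Succeeds for strings such as "Python 2.7.5", fails for anything else.
# (python_is_27 is a hypothetical helper for this example.)
python_is_27() {
    case "$1" in
        "Python 2.7."*) return 0 ;;
        *) return 1 ;;
    esac
}

# Example; Python 2 prints its version to stderr, hence 2>&1:
# python_is_27 "$(python -V 2>&1)" && echo "Python 2.7 OK"
```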

2.7: Time zone and clock synchronization:

Alibaba Cloud hosts already have this configured, so nothing needs to be done.

2.8: Disable transparent huge pages:

For reference, see http://blog.51cto.com/8026776/2074544

echo never > /sys/kernel/mm/transparent_hugepage/defrag
echo never > /sys/kernel/mm/transparent_hugepage/enabled

echo 'echo never > /sys/kernel/mm/transparent_hugepage/defrag'>>  /etc/rc.local
echo 'echo never > /sys/kernel/mm/transparent_hugepage/enabled'>>  /etc/rc.local
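
To confirm the setting took effect, read the files back: the kernel marks the active value with square brackets, for example `always madvise [never]`. The sketch below extracts that value; the helper name thp_setting is invented for this example.

```shell
# thp_setting FILE
# Prints the active value of a transparent_hugepage control file,
# i.e. the word enclosed in square brackets.
# (thp_setting is a hypothetical helper for this example.)
thp_setting() {
    sed -n 's/.*\[\([a-z+]*\)\].*/\1/p' "$1"
}

# Example:
# thp_setting /sys/kernel/mm/transparent_hugepage/enabled
# thp_setting /sys/kernel/mm/transparent_hugepage/defrag
```

One thing to check after a reboot: on CentOS 7 the rc.local file is not executable by default, so the lines appended to it may not run until `chmod +x /etc/rc.d/rc.local` has been applied.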

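2.9: Swap (disk space used as memory): no more than 16 GB at most

echo 'vm.swappiness = 10' >> /etc/sysctl.conf
sysctl -p    # apply the change

vm.swappiness ranges from 0 to 100
0 does not disable swap; it makes the kernel as reluctant as possible to swap
100 makes the kernel swap as aggressively as possible

For cluster workloads with strict real-time requirements: swappiness=0; jobs are allowed to fail, after which you quickly add memory or raise the parameters and restart the job
For cluster workloads without strict real-time requirements: swappiness=10/30; jobs must not fail, so they are allowed to run slowly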
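
Appending with echo adds a second line every time the command is repeated. A sketch of a re-runnable variant is shown below, assuming GNU sed; the helper name set_swappiness is made up for this example.

```shell
# set_swappiness VALUE CONF_FILE
# Replaces an existing vm.swappiness line in CONF_FILE, or appends one
# when none exists, so repeated runs leave a single entry.
# (set_swappiness is a hypothetical helper for this example.)
set_swappiness() {
    if grep -q '^vm\.swappiness' "$2" 2>/dev/null; then
        sed -i "s/^vm\.swappiness.*/vm.swappiness = $1/" "$2"
    else
        printf 'vm.swappiness = %s\n' "$1" >> "$2"
    fi
}

# Example, as root:
# set_swappiness 10 /etc/sysctl.conf && sysctl -p
# cat /proc/sys/vm/swappiness    # value currently in effect
```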