当前位置:   article > 正文

Hadoop自动化安装shell脚本

Hadoop自动化安装shell脚本

之前写过一些如何安装Cloudera Hadoop的文章,安装hadoop过程中,最开始是手动安装apache版本的hadoop,其次是使用Intel的IDH管理界面安装IDH的hadoop,再然后分别手动和通过cloudera manager安装hadoop,也使用bigtop-util yum方式安装过apache的hadoop。

安装过程中参考了很多网上的文章,解压缩过cloudera的cloudera-manager-installer.bin,发现并修复了IDH shell脚本中关于puppt的自认为是bug的一个bug,最后整理出了一个自动安装hadoop的shell脚本,脚本托管在github上面: hadoop-install

hadoop安装文章

博客中所有关于安装hadoop的文章列出如下:

  1. 【笔记】Hadoop安装部署

  2. 手动安装Cloudera Hive CDH

  3. 手动安装Cloudera HBase CDH

  4. 手动安装Cloudera Hadoop CDH

  5. 安装impala过程

  6. 从yum安装Cloudera CDH集群

  7. 通过Cloudera Manager安装CDH

hadoop-install

hadoop-install上脚本,all-in-one-install.sh是在一个节点上安装hdfs、hive、yarn、zookeeper和hbase,编写该脚本是为了在本机(fedora19系统)上调试mapreduce、hive和hbase;cluster-install.sh是在多个节点上安装hadoop集群,同样目前完成了hdfs、hive、yarn、zookeeper和hbase的自动安装。

脚本片段

IDH安装脚本中有一些写的比较好的shell代码片段,摘出如下,供大家学习。

检测操作系统版本

  1. ( grep -i "CentOS" /etc/issue > /dev/null ) && OS_DISTRIBUTOR=centos
  2. ( grep -i "Red[[:blank:]]*Hat[[:blank:]]*Enterprise[[:blank:]]*Linux" /etc/issue > /dev/null ) && OS_DISTRIBUTOR=rhel
  3. ( grep -i "Oracle[[:blank:]]*Linux" /etc/issue > /dev/null ) && OS_DISTRIBUTOR=oel
  4. ( grep -i "Asianux[[:blank:]]*Server" /etc/issue > /dev/null ) && OS_DISTRIBUTOR=an
  5. ( grep -i "SUSE[[:blank:]]*Linux[[:blank:]]*Enterprise[[:blank:]]*Server" /etc/issue > /dev/null ) && OS_DISTRIBUTOR=sles
  6. ( grep -i "Fedora" /etc/issue > /dev/null ) && OS_DISTRIBUTOR=fedora
  7. major_revision=`grep -oP '\d+' /etc/issue | sed -n "1,1p"`
  8. minor_revision=`grep -oP '\d+' /etc/issue | sed -n "2,2p"`
  9. OS_RELEASE="$major_revision.$minor_revision"

修改root密码

echo 'redhat'|passwd root --stdin

修改dns

  1. # Set up nameservers.
  2. # http://ithelpblog.com/os/linux/redhat/centos-redhat/howto-fix-couldnt-resolve-host-on-centos-redhat-rhel-fedora/
  3. # http://stackoverflow.com/a/850731/1486325
  4. echo "nameserver 8.8.8.8" | tee -a /etc/resolv.conf
  5. echo "nameserver 8.8.4.4" | tee -a /etc/resolv.conf

修改操作系统时区

cp /usr/share/zoneinfo/Asia/Shanghai /etc/localtime

修改hosts文件

  1. cat > /etc/hosts <<EOF
  2. 127.0.0.1 localhost
  3. 192.168.56.121 cdh1
  4. 192.168.56.122 cdh2
  5. 192.168.56.123 cdh3
  6. EOF

去掉b文件中包括a文件的内容

grep -vf a b >result.log

修改file-max

  1. echo -e "Global file limit ..."
  2. rst=`grep "^fs.file-max" /etc/sysctl.conf`
  3. if [ "x$rst" = "x" ] ; then
  4. echo "fs.file-max = 727680" >> /etc/sysctl.conf || exit $?
  5. else
  6. sed -i "s:^fs.file-max.*:fs.file-max = 727680:g" /etc/sysctl.conf
  7. fi

生成ssh公要

  1. [ ! -d ~/.ssh ] && ( mkdir ~/.ssh ) && ( chmod 600 ~/.ssh )
  2. yes|ssh-keygen -f ~/.ssh/id_rsa -t rsa -N "" && ( chmod 600 ~/.ssh/id_rsa.pub )

ssh设置无密码登陆

  1. set timeout 20
  2. set host [lindex $argv 0]
  3. set password [lindex $argv 1]
  4. set pubkey [exec cat /root/.ssh/id_rsa.pub]
  5. set localsh [exec cat ./config_ssh_local.sh]
  6. #spawn ssh-copy-id -i /root/.ssh/id_rsa.pub root@$host
  7. spawn ssh root@$host "
  8. umask 022
  9. mkdir -p /root/.ssh
  10. echo \'$pubkey\' > /root/.ssh/authorized_keys
  11. echo \'$localsh\' > /root/.ssh/config_ssh_local.sh
  12. cd /root/.ssh/; sh config_ssh_local.sh
  13. "
  14. expect {
  15. timeout exit
  16. yes/no {send "yes\r";exp_continue}
  17. assword {send "$password\r"}
  18. }
  19. expect eof
  20. #interact

配置JAVA_HOME

  1. ### JAVA_HOME ###
  2. if [ -f ~/.bashrc ] ; then
  3. sed -i '/^export[[:space:]]\{1,\}JAVA_HOME[[:space:]]\{0,\}=/d' ~/.bashrc
  4. sed -i '/^export[[:space:]]\{1,\}CLASSPATH[[:space:]]\{0,\}=/d' ~/.bashrc
  5. sed -i '/^export[[:space:]]\{1,\}PATH[[:space:]]\{0,\}=/d' ~/.bashrc
  6. fi
  7. echo "" >>~/.bashrc
  8. echo "export JAVA_HOME=/usr/java/latest" >>~/.bashrc
  9. echo "export CLASSPATH=.:\$JAVA_HOME/lib/tools.jar:\$JAVA_HOME/lib/dt.jar">>~/.bashrc
  10. echo "export PATH=\$JAVA_HOME/bin:\$PATH" >> ~/.bashrc
  11. alternatives --install /usr/bin/java java /usr/java/latest 5
  12. alternatives --set java /usr/java/latest
  13. source ~/.bashrc

格式化集群

su -s /bin/bash hdfs -c 'yes Y | hadoop namenode -format >> /tmp/format.log 2>&1'

创建hadoop目录

  1. su -s /bin/bash hdfs -c "hadoop fs -chmod a+rw /"
  2. while read dir user group perm
  3. do
  4. su -s /bin/bash hdfs -c "hadoop fs -mkdir -R $dir && hadoop fs -chmod -R $perm $dir && hadoop fs -chown -R $user:$group $dir"
  5. echo "."
  6. done << EOF
  7. /tmp hdfs hadoop 1777
  8. /tmp/hadoop-yarn mapred mapred 777
  9. /var hdfs hadoop 755
  10. /var/log yarn mapred 1775
  11. /var/log/hadoop-yarn/apps yarn mapred 1777
  12. /hbase hbase hadoop 755
  13. /user hdfs hadoop 777
  14. /user/history mapred hadoop 1777
  15. /user/root root hadoop 777
  16. /user/hive hive hadoop 777
  17. EOF

hive中安装并初始化postgresql

  1. yum install postgresql-server postgresql-jdbc -y >/dev/null
  2. chkconfig postgresql on
  3. rm -rf /var/lib/pgsql/data
  4. rm -rf /var/run/postgresql/.s.PGSQL.5432
  5. service postgresql initdb
  6. sed -i "s/max_connections = 100/max_connections = 600/" /var/lib/pgsql/data/postgresql.conf
  7. sed -i "s/#listen_addresses = 'localhost'/listen_addresses = '*'/" /var/lib/pgsql/data/postgresql.conf
  8. sed -i "s/shared_buffers = 32MB/shared_buffers = 256MB/" /var/lib/pgsql/data/postgresql.conf
  9. sed -i "s/127.0.0.1\/32/0.0.0.0\/0/" /var/lib/pgsql/data/pg_hba.conf
  10. sudo cat /var/lib/pgsql/data/postgresql.conf | grep -e listen -e standard_conforming_strings
  11. rm -rf /usr/lib/hive/lib/postgresql-jdbc.jar
  12. ln -s /usr/share/java/postgresql-jdbc.jar /usr/lib/hive/lib/postgresql-jdbc.jar
  13. su -c "cd ; /usr/bin/pg_ctl start -w -m fast -D /var/lib/pgsql/data" postgres
  14. su -c "cd ; /usr/bin/psql --command \"create user hiveuser with password 'redhat'; \" " postgres
  15. su -c "cd ; /usr/bin/psql --command \"CREATE DATABASE metastore owner=hiveuser;\" " postgres
  16. su -c "cd ; /usr/bin/psql --command \"GRANT ALL privileges ON DATABASE metastore TO hiveuser;\" " postgres
  17. su -c "cd ; /usr/bin/psql -U hiveuser -d metastore -f /usr/lib/hive/scripts/metastore/upgrade/postgres/hive-schema-0.10.0.postgres.sql" postgres
  18. su -c "cd ; /usr/bin/pg_ctl restart -w -m fast -D /var/lib/pgsql/data" postgres

总结

更多脚本,请关注github:hadoop-install,你可以下载、使用并修改其中代码!

----EOF-----

转载: http://blog.javachen.com/2013/08/02/hadoop-install-script/


声明:本文内容由网友自发贡献,不代表【wpsshop博客】立场,版权归原作者所有,本站不承担相应法律责任。如您发现有侵权的内容,请联系我们。转载请注明出处:https://www.wpsshop.cn/w/天景科技苑/article/detail/880010
推荐阅读
相关标签
  

闽ICP备14008679号