【大数据基础】hbase安装与配置详解_Hadoop

4条回答

summer

2楼 · 2020-12-04 18:58

一、前提条件

JDK1.7+

Hadoop 2.5.0+

Zookeeper 3.4.5+

下面介绍一下Zookeeper的安装：

（1）到Zookeeper官网下载Zookeeper最新安装包，

下载地址：http://mirrors.shu.edu.cn/apache/zookeeper/zookeeper-3.4.11/zookeeper-3.4.11.tar.gz

（2）下载完成后使用命令 tar -zxvf zookeeper-3.4.11.tar.gz进行解压

（3）然后切换到其conf目录下

使用vim zoo.cfg新建一个配置文件。内容如下：

# The number of milliseconds of each ticktickTime=2000# The number of ticks that the initial# synchronization phase can takeinitLimit=10# The number of ticks that can pass between# sending a request and getting an acknowledgementsyncLimit=5# the directory where the snapshot is stored.# do not use /tmp for storage, /tmp here is just# example sakes.dataDir=/home/chenjie/zookeeper-3.4.11/tmp# the port at which the clients will connectclientPort=2181# the maximum number of client connections.# increase this if you need to handle more clients#maxClientCnxns=60## Be sure to read the maintenance section of the# administrator guide before turning on autopurge.## http://zookeeper.apache.org/doc/current/zookeeperAdmin.html#sc_maintenance## The number of snapshots to retain in dataDir#autopurge.snapRetainCount=3# Purge task interval in hours# Set to "0" to disable auto purge feature#autopurge.purgeInterval=1server.1=pc2:2888:3888server.2=pc3:2888:3888server.3=pc4:2888:3888

其中的主要配置项有

dataDir=/home/chenjie/zookeeper-3.4.11/tmp

server.1=pc2:2888:3888server.2=pc3:2888:3888server.3=pc4:2888:3888

dataDir是指定zookeeper的数据目录，在此我设置在了zookeeper根目录下的tmp目录下（此目录不存在，需要使用mkdir tmp创建）

类似于

server.1=pc2:2888:3888

是配置zookeeper的节点，有多少个节点就配置多少个server，依次为server.1、server.2。。。pc2、pc3、pc4为各节点的主机名。

（4）配置完成后，切换到刚才配置的dataDir下，在此目录下新建一个myid文件（使用命令 vim myid）

进入vim后按i进入编辑模式，输入当前节点的编号(server.1的编号为1，依次类推)，按esc推出编辑模式，输入shift + : 然后输入wq回车保存退出。

（5）使用命令scp -r zookeeper-3.4.11 chenjie@pc3:/home/chenjie将刚才配置好的zookeeper发送到另外两个节点上。

【然后修改另外两台节点上的dataDir/myid中的id为它们各自的id】

（6）切换到bin目录下，使用命令启动zookeeper

如果出现启动不了的情况，记得关闭防火墙

因为防火墙没有关闭。关闭防火墙：

#查看防火墙状态

sudo service iptables status

#关闭防火墙
sudo service iptables stop
#查看防火墙开机启动状态
sudo chkconfig iptables --list
#关闭防火墙开机启动
sudo chkconfig iptables off

二、安装HBase

（1）下载HBase安装包并解压

https://mirrors.tuna.tsinghua.edu.cn/apache/hbase/1.2.6/hbase-1.2.6-bin.tar.gz

tar -zxvf hbase-1.2.6-bin.tar.gz

向hbase-env.sh中添加：

export JAVA_HOME=/usr/lib/jvm/java-8-oracle

export HBASE_MANAGES_ZK=false

修改hbase-site.xml为

hbase.zookeeper.quorumpc2,pc3,pc4The directory shared by RegionServers.hbase.rootdirhdfs://pc2:9000/hbaseThe directory shared by RegionServers.hbase.cluster.distributedtrueThe mode the cluster will be in. Possible values arefalse: standalone and pseudo-distributed setups with managed Zookeepertrue: fully-distributed with unmanaged Zookeeper Quorum (see hbase-env.sh)    hbase.master.info.port60010

其中，

hbase.zookeeper.quorum为zookeeper的节点主机名

hbase.rootdir为hbase的存储根目录，设为hadoopHDFS根目录下的hbase

修改regionservers为（regionserver为从结点）

pc3

pc4

使用scp -r hbase-1.2.6 chenjie@pc3:/home/chenjie

scp -r hbase-1.2.6 chenjie@pc4:/home/chenjie

将pc2上配置好的hbase发送到pc3、pc4上

（3）切换到bin目录下使用./start-hbase.sh启动hbase

（4）使用hbase shell进入hbase的命令行窗口

使用命令create 'test1', {NAME=>'cf',VERSIONS=>3,COMPRESSION=>'gz'}创建一张表

分别执行

put 'test1','rowkey1','cf:q1','r1-v1'
put 'test1','rowkey1','cf:q2','r1-v2'
put 'test1','rowkey2','cf:q1','r2-v1'
put 'test1','rowkey2','cf:q2','r2-v2'
put 'test1','rowkey2','cf:q3','r2-v3'

插入四条记录

使用命令scan 'test1'查看表数据

(5)登录master的web管理界面

http://192.168.1.112:60010

我是大脸猫

3楼 · 2020-12-06 20:05

在开发中，HBASE的安装一般都是有相关人员安装与维护，这里只是学习的写笔记而已。
一、上传解压：将需要安装的jar包上传解压到指定目录。
二、修改配置：下图是需要修改的配置文件的内容

配置文件：hbase-env.sh，配置Javahome以及HBASE的zk该为false。

以及配置hbase-site.xml文件：

hbase.tmp.dir/opt/cdh5.7.6/hbase-1.2.0-cdh5.7.6/data/tempDatahbase.rootdirhdfs://bigdata-hpsk01.huadian.com:8020/hbasehbase.cluster.distributedtruehbase.zookeeper.quorumbigdata-hpsk01.huadian.comhbase.master.check.compressiontruehbase.regionserver.codecsnappy,lz4

三、启动
启动之前的准备
zookeeper启动：bin/zkServer.sh start
hdfs启动：启动主节点
           bin/hbase-daemon.sh start master
   启动从节点： bin/hbase-daemon.sh start regionserver
四、验证是否成功
       CDH版本： http://bigdata-hpsk01.huadian.com:60010
       Apache： http://bigdata-hpsk01.huadian.com:16010

五、HBASE shell CURD ：下面是测试是否安装成功的操作
   进入Shell交互：bin/hbase shell
   创建表： create "order","info"
   查看表列表： list
   插入数据：
       put "表名","rowKey"   ,   "列簇：列名"           ,           值
       put "order","00:00:21_314295453511152","info:date","2015-04-21 00:00:21"
       put "order","00:00:21_314295453511152","info:order_id","314295453511152"
   查询数据： get "表名","rowKey"
       get "order","00:00:21_314295453511152"
   扫描表的数据： scan "order"
           在测试的时候使用，生产环境千万不能用

赵小刀

4楼 · 2020-12-09 10:00

hbase有本地模式和分布式模式

hbase-site.xml配置

hbase.tmp.dir

本地文件系统tmp目录，一般配置成bailocal模式的设置一下，但是最好还是需要设置一下，因为很多文件都会默认设置成它下面的

线上配置