1.0 Pros and cons
Pro: simple to use, no configuration required.
Con: supports only a single session.
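1.1 Preparation: the machine where Hive is installed must already have a Hadoop environment (an installation directory and the HADOOP_HOME environment variable).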
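The prerequisite above can be checked before going further. The following is only a minimal sketch: the function name `check_hadoop_env` is hypothetical, and it assumes the Hadoop launcher lives at `bin/hadoop` under the installation directory.

```shell
# Hypothetical helper: verify the Hadoop prerequisite before installing Hive.
check_hadoop_env() {
    # HADOOP_HOME must be set and non-empty
    if [ -z "${HADOOP_HOME:-}" ]; then
        echo "HADOOP_HOME is not set" >&2
        return 1
    fi
    # Assumption: the install directory contains an executable bin/hadoop
    if [ ! -x "$HADOOP_HOME/bin/hadoop" ]; then
        echo "no executable found at $HADOOP_HOME/bin/hadoop" >&2
        return 1
    fi
    echo "Hadoop found at $HADOOP_HOME"
}
```

Calling `check_hadoop_env` in the shell that will run the installation returns a non-zero status if either condition fails.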
1.2 Installation: simply unpack a Hive release archive.
Upload the package: apache-hive-2.1.1-bin.tar.gz
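# Unpack the archive
tar -zxvf apache-hive-2.1.1-bin.tar.gz -C /opt/
# Rename (run from the /opt/ directory)
cd /opt/
mv apache-hive-2.1.1-bin/ hive
# Configure environment variables
vi /etc/profile
# Append at the end of the file
export HIVE_HOME=/opt/hive
export PATH=$PATH:$HIVE_HOME/bin

1.4 Configure Hive's configuration file: hive-env.sh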
# Set HADOOP_HOME to point to a specific hadoop install directory
# Use your own Hadoop installation directory here
HADOOP_HOME=/opt/hadoop
# Hive Configuration Directory can be controlled by:
export HIVE_CONF_DIR=/opt/hive/conf

1.5 Start the Hive service (in practice this means starting HDFS and YARN first)
start-dfs.sh
start-yarn.sh
hive
Logging initialized using configuration in jar:file:/usr/local/hive-1.2.1/lib/hive-common-1.2.1.jar!/hive-log4j.properties
hive>

2.1 Pros and cons
Pro: supports multiple sessions.
Con: requires extra configuration, and database software must be installed as well.
Prerequisite: MySQL is installed on one node of the distributed cluster (for example hadoop02). Reference: Installing MySQL (Linux, CentOS 7).
2.2 Install MySQL and grant remote access
GRANT ALL PRIVILEGES ON *.* TO root@'%' IDENTIFIED BY '123456' WITH GRANT OPTION;
FLUSH PRIVILEGES;
GRANT ALL PRIVILEGES ON *.* TO root@'localhost' IDENTIFIED BY '123456' WITH GRANT OPTION;
FLUSH PRIVILEGES;

2.3 Configure hive-site.xml
<configuration>
  <!-- MySQL connection string -->
  <property>
    <name>javax.jdo.option.ConnectionURL</name>
    <value>jdbc:mysql://hadoop02:3306/hive?createDatabaseIfNotExist=true</value>
    <description>JDBC connect string for a JDBC metastore</description>
  </property>
  <!-- MySQL JDBC driver class -->
  <property>
    <name>javax.jdo.option.ConnectionDriverName</name>
    <value>com.mysql.jdbc.Driver</value>
    <description>Driver class name for a JDBC metastore</description>
  </property>
  <!-- MySQL login user -->
  <property>
    <name>javax.jdo.option.ConnectionUserName</name>
    <value>root</value>
    <description>username to use against metastore database</description>
  </property>
  <!-- MySQL login password -->
  <property>
    <name>javax.jdo.option.ConnectionPassword</name>
    <value>123456</value>
    <description>password to use against metastore database</description>
  </property>
</configuration>

2.4 Upload a MySQL JDBC driver jar into the lib directory of the Hive installation (do not forget this step)
mysql-connector-java-5.1.28-bin.jar
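Since a missing driver jar is easy to overlook, a quick check can help. This is a sketch only: the function name `find_mysql_driver` is hypothetical, and the default directory assumes Hive is installed at /opt/hive as in the earlier steps.

```shell
# Hypothetical helper: list MySQL connector jars under a Hive lib directory.
# Usage: find_mysql_driver [lib_dir]   (defaults to $HIVE_HOME/lib)
find_mysql_driver() {
    lib_dir="${1:-${HIVE_HOME:-/opt/hive}/lib}"
    found=1
    for jar in "$lib_dir"/mysql-connector-java-*.jar; do
        # If the glob matched nothing, the pattern itself is returned; skip it
        [ -e "$jar" ] || continue
        echo "$jar"
        found=0
    done
    # Exit status 0 if at least one jar was found, 1 otherwise
    return "$found"
}
```

Running `find_mysql_driver` prints each matching jar and returns a non-zero status when the lib directory contains no MySQL connector.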
2.5 Add HADOOP_HOME and HIVE_HOME to the system environment variables in /etc/profile
2.6 Run source /etc/profile (remember to source the file every time you change the environment variables)
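To confirm that sourcing the profile took effect, one option is to test whether the Hive bin directory is now an entry of PATH. The helper name `path_contains` is hypothetical, and the sketch assumes a POSIX-style shell.

```shell
# Hypothetical helper: succeed if directory $1 appears as an entry of PATH.
path_contains() {
    # Wrap PATH in colons so the first and last entries match the same pattern
    case ":$PATH:" in
        *":$1:"*) return 0 ;;
        *)        return 1 ;;
    esac
}
```

For example, `path_contains "$HIVE_HOME/bin" || echo "HIVE_HOME/bin is missing from PATH"` reports a profile that was edited but not yet sourced.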
Important: you must run schematool before starting Hive.
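# Hive 2.x.x and later require initializing the metastore schema with schematool
# (run this from the bin directory of the Hive installation)
schematool -initSchema -dbType mysql
# Or, from the Hive installation directory
bin/schematool -initSchema -dbType mysql

2.7 Start Hive and test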
hive
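Instead of opening the interactive prompt, the test can also be run non-interactively. The sketch below uses the `-e` option of the hive command line to execute one statement and exit; the function name `hive_smoke_test` is hypothetical.

```shell
# Hypothetical smoke test: run a single HiveQL statement non-interactively.
hive_smoke_test() {
    # Fail early if the hive launcher is not on PATH
    if ! command -v hive >/dev/null 2>&1; then
        echo "hive not found on PATH" >&2
        return 1
    fi
    # -e executes the quoted statement and then exits
    hive -e 'show databases;'
}
```

If the metastore connection in hive-site.xml is misconfigured, this call is where the error would typically surface.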