博客
关于我
强烈建议你试试无所不能的chatGPT,快点击我
HDP上安装impala
阅读量:6214 次
发布时间:2019-06-21

本文共 6207 字,大约阅读时间需要 20 分钟。

hot3.png

Impala是Cloudera公司主导开发的新型查询系统,它提供SQL语义,能查询存储在Hadoop的HDFS和HBase中的PB级大数据。Impala提供更快的查询速度,性能上号称比Hive快3~10倍。Impala是开源的,但一般都是通过cloudera manager或者在CDH版本上安装,今天主要介绍的是在HDP版本上的安装。

版本

Impala对于Hadoop的版本要求很高,现在说明一下当前安装的版本信息

Impala 2.5

HDP 2.2.8.0 基于Hadoop2.6

安装步骤

1. 在/etc/yum.repo.d 中创建impala.repo

[cloudera-cdh5]# Packages for Cloudera's Distribution for Hadoop, Version 5, on RedHat or CentOS 6 x86_64name=Cloudera's Distribution for Hadoop, Version 5baseurl=https://archive.cloudera.com/cdh5/redhat/6/x86_64/cdh/5/gpgkey =https://archive.cloudera.com/cdh5/redhat/6/x86_64/cdh/RPM-GPG-KEY-cloudera    gpgcheck = 1

2. 执行yum命令

yum install  impala-server impala-catalog impala-state-store impala-shell

3.更改impala链接库信息

ln -sf  /usr/hdp/2.2.8.0-3150/hadoop/hadoop-common.jar /usr/lib/impala/lib/hadoop-common.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop/hadoop-annotations.jar /usr/lib/impala/lib/hadoop-annotations.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop/hadoop-auth.jar /usr/lib/impala/lib/hadoop-auth.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop/hadoop-aws.jar /usr/lib/impala/lib/hadoop-aws.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-hdfs/hadoop-hdfs.jar /usr/lib/impala/lib/hadoop-hdfs.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-mapreduce/hadoop-mapreduce-client-common.jar /usr/lib/impala/lib/hadoop-mapreduce-client-common.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-mapreduce/hadoop-mapreduce-client-core.jar /usr/lib/impala/lib/hadoop-mapreduce-client-core.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-mapreduce/hadoop-mapreduce-client-jobclient.jar /usr/lib/impala/lib/hadoop-mapreduce-client-jobclient.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-mapreduce/hadoop-mapreduce-client-shuffle.jar /usr/lib/impala/lib/hadoop-mapreduce-client-shuffle.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-yarn/hadoop-yarn-api.jar /usr/lib/impala/lib/hadoop-yarn-api.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-yarn/hadoop-yarn-client.jar /usr/lib/impala/lib/hadoop-yarn-client.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-yarn/hadoop-yarn-common.jar /usr/lib/impala/lib/hadoop-yarn-common.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-yarn/hadoop-yarn-server-applicationhistoryservice.jar /usr/lib/impala/lib/hadoop-yarn-server-applicationhistoryservice.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-yarn/hadoop-yarn-server-common.jar /usr/lib/impala/lib/hadoop-yarn-server-common.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-yarn/hadoop-yarn-server-nodemanager.jar /usr/lib/impala/lib/hadoop-yarn-server-nodemanager.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-yarn/hadoop-yarn-server-resourcemanager.jar /usr/lib/impala/lib/hadoop-yarn-server-resourcemanager.jarln -sf  /usr/hdp/2.2.8.0-3150/hadoop-yarn/hadoop-yarn-server-web-proxy.jar /usr/lib/impala/lib/hadoop-yarn-server-web-proxy.jarln -sf /usr/hdp/2.2.8.0-3150/hadoop/lib/native/libhadoop.so /usr/lib/impala/lib/libhadoop.soln -sf /usr/hdp/2.2.8.0-3150/hadoop/lib/native/libhadoop.so.1.0.0 /usr/lib/impala/lib/libhadoop.so.1.0.0ln -sf /usr/hdp/2.2.8.0-3150/usr/lib/libhdfs.so /usr/lib/impala/lib/libhdfs.soln -sf /usr/hdp/2.2.8.0-3150/usr/lib/libhdfs.so.0.0.0 /usr/lib/impala/lib/libhdfs.so.0.0.0ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-ant.jar /usr/lib/impala/lib/hive-ant.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-beeline.jar /usr/lib/impala/lib/hive-beeline.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-common.jar /usr/lib/impala/lib/hive-common.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-exec.jar /usr/lib/impala/lib/hive-exec.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-hbase-handler.jar /usr/lib/impala/lib/hive-hbase-handler.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-metastore.jar /usr/lib/impala/lib/hive-metastore.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-serde.jar /usr/lib/impala/lib/hive-serde.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-service.jar /usr/lib/impala/lib/hive-service.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-shims-common.jar /usr/lib/impala/lib/hive-shims-common.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-shims-scheduler.jar /usr/lib/impala/lib/hive-shims-scheduler.jar ln -sf /usr/hdp/2.2.8.0-3150/hive/lib/hive-shims.jar /usr/lib/impala/lib/hive-shims.jar ln -sf /usr/hdp/2.2.8.0-3150/zookeeper/zookeeper.jar /usr/lib/impala/lib/zookeeper.jar

4. 增加JAVA_HOME

vi /etc/default/bigtop-utils 

增加 EXPORT JAVA_HOME= /usr/jdk64/jdk1.7.0_67

5. 修改配置文件

  增加 /etc/hadoop/conf/core-site.xml

dfs.client.read.shortcircuit
true
dfs.client.read.shortcircuit.skip.checksum
false
dfs.datanode.hdfs-blocks-metadata.enabled
true

  增加 /etc/hadoop/conf/hdfs-site.xml

dfs.datanode.hdfs-blocks-metadata.enabled
true
dfs.block.local-path-access.user
impala
dfs.client.file-block-storage-locations.timeout.millis
60000

  执行拷贝

cp /etc/hadoop/conf/*.xml /etc/impala/confcp /etc/hive/conf/hive-site.xml /etc/impala/conf

  否则报错

E0721 15:52:54.265014 11246 impala-server.cc:247] Unsupported default filesystem. The default filesystem must be a DistributedFileSystem but the configured default filesystem is LocalFileSystem. fs.defaultFS (file:///) might be set incorrectly.ERROR: block location tracking is not properly enabled because  - dfs.datanode.hdfs-blocks-metadata.enabled is not enabled.  - dfs.client.file-block-storage-locations.timeout.millis is too low. It should be at least 10 seconds.

6. 修改 /etc/default/impala

   修改IMPALA_CATALOG_SERVICE_HOST,IMPALA_STATE_STORE_HOST 两个变量为你启动catalog,state_store两个服务的host

7. 从网上下载 htrace-core-3.0.4.jar改名htrace-core-2.00.jar 替换 /usr/lib/impala/lib/htrace-core-2.00.jar

否则会报错

E0711 13:55:00.451584  6925 impala-server.cc:247] NoClassDefFoundError: org/htrace/TraceCAUSED BY: ClassNotFoundException: org.htrace.TraceE0711 13:55:00.451653  6925 impala-server.cc:249] Aborting Impala Server startup due to improper configuration

8 一个host启动 catalog, state_store

   所有host启动 impalad

 

提示:

   安装过程中需要学会查看/var/log/impala的日志,可以查看当前的错误信息

 

我们已经编写好可以用的ambari组件进行impala安装,组件地址见下面的链接

https://github.com/cas-bigdatalab/ambari-impala-service

转载于:https://my.oschina.net/u/2277929/blog/716230

你可能感兴趣的文章
一个强大专业的响应式jQuery幻灯效果插件:Royal Slider
查看>>
Formal Languages and Compilers-LL(1),FIRST and FOLLOW
查看>>
ThinkPHP源码学习---is_ssl() 函数 判断是否SSL协议
查看>>
PyOdps在交互式环境下的使用,让探索ODPS数据更容易些
查看>>
聊聊netflix的EurekaHttpClient
查看>>
聊聊flink Table的select操作
查看>>
单臂路由
查看>>
作业 第一章
查看>>
第一次作业
查看>>
我的友情链接
查看>>
康盛系列产品工程师php
查看>>
Excel利用超链接打开指定文件夹
查看>>
Elasticsearch走上专案专有化 AWS:将另开辟开放版
查看>>
J盘无法访问RAW,里面的数据怎样找到
查看>>
硬盘的读写原理
查看>>
photoshop技术
查看>>
python编写员工信息表
查看>>
2018-3-30 Linux学习笔记
查看>>
VM系统安装
查看>>
Squid的ACL访问控制及反向代理
查看>>