Preparing the Apache Hive Environment

How to:

Configure the Java CLASSPATH

The following components are needed to use the adapter for Hadoop/Hive:

Java. To use this JDBC-based adapter, you must have Java installed. Version 1.8 or later is recommended. Java can be downloaded from http://www.java.com.
The location of Java must be specified in an environment variable.

If you are using Linux, add a line to your profile with the location where Java is installed. For example:
```
export JAVA_HOME=/usr/lib/jvm/jre-1.8.0
```
If you have JDK installed:
```
export JAVA_HOME=/usr/lib/jvm/jdk-1.8.0
```
If you are using Windows, right-click Computer and select Properties. If using Windows 10, click Settings in the Start menu and search for Environment. Click Edit the system environment variables to open the System Properties dialog box. Click the Advanced tab and click Environment Variables. Add the locations to your PATH variable. For example:
```
C:\Program Files\Java\jdk8\bin\server;C:\Program Files\Java\jdk8\bin;
```
JDBC Drivers. The Apache JDBC driver is distributed as a jar file, which is included in a Hadoop distribution. If you are installing the server on a node or an edge node of your Hadoop cluster, you can point to the hive-jdbc-<version>-standalone.jar file as described in the next section. If you are installing the server on some other system, copy the file to a location of your choice.
If you are using the Cloudera/Simba JDBC driver, download the driver as described in step 3 below.

Procedure: How to Configure the Java CLASSPATH

The location of the Hadoop and Hive jar files must be specified to the server. If you are running the server on a node or an edge node of your Hadoop cluster, you can specify their location. If you are running the server on another system, copy the files listed below to some location on your system and specify their location.

This can be done in the system CLASSPATH or in the DataMigrator or WebFOCUS Reporting Server IBI_CLASSPATH variable as follows:

From the Web Console menu bar, select Workspace

or

From the Data Management Console, expand the Workspace folder.
Expand the Java Services folder. Right-click DEFAULT and click Properties.

The Java Services Configuration page opens.
Expand Class path.
In the IBI_CLASSPATH box, enter the full location of the Hive and Hadoop files shown below, where hive_home is where Hive is installed and hadoop_home is where Hadoop is installed. You must type them explicitly and cannot use $HIVE_HOME. The file names must be entered one per line.

If you are installing the adapter on a different system than where Hadoop and Hive are installed, copy the jar files to a location on that system.
Note: For a server running on Windows, use Windows syntax for directory names. For example:
```
C:\jdbc\hive-jdbc-<version>-standalone.jar
```
For Apache Hive:

Enter the full path to the location of the JDBC client jar on your system:
```
hive-jdbc-<version>-standalone.jar
```
The following jar files may also be required when using the HTPP transport, connecting to a Kerberos enabled cluster, or using HBase as a source.
```
hadoop-common.jar
hadoop-auth.jar
```
For Cloudera/Simba:
1. Download the JDBC driver from Cloudera from the Downloads page at http://www.cloudera.com/downloads.html.
2. From the Database Drivers section of the page, click the Hive JDBC Driver Downloads link.
3. Select your operating system and version from the drop-down menus, and click Get It Now.
This downloads a file called hive_jdbc_2.5.20.zip, which contains two .zip files. (Note that these numbers will change with each new release.) Unzip the Cloudera_HiveJDBC41_2.5.20.nnnn.zip to a location on your system.

Note: The 41 in the file name indicates JDBC 4.1, which supports Java 1.7 or later.

These 15 jar files comprise the Cloudera/Simba JDBC driver. Add the names of all of them to your CLASSPATH or IBI_CLASSPATH, as described above in this step.
Scroll down and click the Save and Restart Java Services button.