Preparing the Apache Hive Environment

How to:

The following components are needed to use the adapter for Hadoop/Hive:

Procedure: How to Configure the Java CLASSPATH

The location of the Hadoop and Hive jar files must be specified to the server. If you are running the server on a node or an edge node of your Hadoop cluster, you can specify their location. If you are running the server on another system, copy the files listed below to some location on your system and specify their location.

This can be done in the system CLASSPATH or in the DataMigrator or WebFOCUS Reporting Server IBI_CLASSPATH variable as follows:

  1. From the Web Console menu bar, select Workspace

    or

    From the Data Management Console, expand the Workspace folder.

  2. Expand the Java Services folder. Right-click DEFAULT and click Properties.

    The Java Services Configuration page opens.

  3. Expand Class path.

    In the IBI_CLASSPATH box, enter the full location of the Hive and Hadoop files shown below, where hive_home is where Hive is installed and hadoop_home is where Hadoop is installed. You must type them explicitly and cannot use $HIVE_HOME. The file names must be entered one per line.

    If you are installing the adapter on a different system than where Hadoop and Hive are installed, copy the jar files to a location on that system.

    Note: For a server running on Windows, use Windows syntax for directory names. For example:
    C:\jdbc\hive-jdbc-<version>-standalone.jar

    For Apache Hive:

    Enter the full path to the location of the JDBC client jar on your system:

    hive-jdbc-<version>-standalone.jar

    The following jar files may also be required when using the HTPP transport, connecting to a Kerberos enabled cluster, or using HBase as a source.

    hadoop-common.jar
    hadoop-auth.jar

    For Cloudera/Simba:

    1. Download the JDBC driver from Cloudera from the Downloads page at http://www.cloudera.com/downloads.html.
    2. From the Database Drivers section of the page, click the Hive JDBC Driver Downloads link.
    3. Select your operating system and version from the drop-down menus, and click Get It Now.

    This downloads a file called hive_jdbc_2.5.20.zip, which contains two .zip files. (Note that these numbers will change with each new release.) Unzip the Cloudera_HiveJDBC41_2.5.20.nnnn.zip to a location on your system.

    Note: The 41 in the file name indicates JDBC 4.1, which supports Java 1.7 or later.

    These 15 jar files comprise the Cloudera/Simba JDBC driver. Add the names of all of them to your CLASSPATH or IBI_CLASSPATH, as described above in this step.

  4. Scroll down and click the Save and Restart Java Services button.

WebFOCUS

Feedback