Configuring Connections: Cloudera
When you create a QuerySurge Connection, the Connection Wizard guides you through the process. Different types of QuerySurge connections require different types of information.
For a Cloudera Connection, you will need the following information (check with a DBA or other knowledgeable resource in your organization):
- Login credentials (username and password)
- Hive Server hostname
- Port (10000 is the default port)
- Hive Database name
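For orientation, the hostname, port, and database name above are the pieces that normally make up a HiveServer2-style JDBC URL. The Connection Wizard assembles the connection from the fields you fill in, so the sketch below is only illustrative. The helper name `build_hive_jdbc_url` and the example host are hypothetical, and the `jdbc:hive2://host:port/database` layout is the common HiveServer2 convention, which you should confirm against your driver documentation.

```python
def build_hive_jdbc_url(host: str, database: str, port: int = 10000) -> str:
    """Assemble a HiveServer2-style JDBC URL from the connection details.

    Hypothetical helper for illustration; QuerySurge builds the connection
    from the Wizard fields and does not require you to write this yourself.
    """
    if not host or not database:
        raise ValueError("host and database are required")
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    # 10000 is the default Hive Server port mentioned in the list above.
    return f"jdbc:hive2://{host}:{port}/{database}"


if __name__ == "__main__":
    # Hypothetical host and database names, for illustration only.
    print(build_hive_jdbc_url("hive.example.com", "default"))
```

Credentials are deliberately left out of the URL here, since the Wizard collects the username and password in separate fields.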
Launch the Connection Wizard
- Log in to QuerySurge as an Admin user.
- To configure a Connection, select Configuration > Connection in the Administrative View tree (at the left).
- Click on the Add button at the bottom left of the page to launch the Connection Wizard. Click Next.
Note: Check the Advanced Mode checkbox for access to advanced features.
- Provide a name for your connection. Select "Cloudera Hive (Simba)" as the Data Source.
Note: The Cloudera driver is not bundled with QuerySurge, so you will need to obtain it and install it on your Agents. The driver can be downloaded from the Cloudera download page. See Adding a JDBC Driver to your QuerySurge Agent for instructions on deploying the driver jars.
Note: The Connection Wizard expects the JDBC 4.1 drivers, so extract the HiveJDBC41.jar from the download to get the driver jar file to deploy to your Agent(s).
- Once you have selected your Data Source, the Wizard will tell you what information you are likely to need in order to create your Connection. Once you have collected this information, you are ready to click Next.
- Provide the connection information for your database. This includes the Hive server hostname, the port (the default port will populate automatically), the Hive database name, and the login credentials. Click Next.
Required fields for your Connection Type are marked with an asterisk (*).
- Click on Test Connection.
Note: You must have an Agent running with the driver for this Connection deployed in order to test the Connection.
- Save the Connection.
- Congratulations! You’ve created a QuerySurge Connection. Again, make sure that you have deployed the driver for this Connection to all your QuerySurge Agents. (See Adding a JDBC Driver to your QuerySurge Agent for instructions.)
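If a connection test fails on one Agent but not another, one possible cause is that the driver jar on that Agent is missing or is a different version. The sketch below shows one way to check whether a jar file contains a given driver class. The function name `jar_contains_class` is hypothetical, and the class name in the usage example (`com.cloudera.hive.jdbc41.HS2Driver`) and the jar path are assumptions to confirm against the driver documentation for your driver version.

```python
import zipfile


def jar_contains_class(jar_path: str, class_name: str) -> bool:
    """Return True if the jar at jar_path contains the given Java class.

    A jar is a zip archive, so a class such as a.b.C is stored as the
    entry a/b/C.class.
    """
    entry = class_name.replace(".", "/") + ".class"
    with zipfile.ZipFile(jar_path) as jar:
        return entry in jar.namelist()


if __name__ == "__main__":
    # Assumed jar location and driver class name; adjust both for your install.
    jar = "HiveJDBC41.jar"
    driver = "com.cloudera.hive.jdbc41.HS2Driver"
    try:
        print(jar_contains_class(jar, driver))
    except FileNotFoundError:
        print(f"{jar} not found in the current directory")
```

This only inspects the jar file itself; it does not confirm that the Agent has loaded the driver, which the Test Connection step covers.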
Note: Most Hive connections use Kerberos authentication. The connection setup for Kerberos is considerably more complex than the procedure shown here. Follow the instructions in the driver documentation for this setup; for specifics as they apply to QuerySurge, see this article (Windows; for Linux installations, the procedure is parallel).
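To give a sense of what the Kerberos setup adds, the sketch below appends Kerberos-related properties to a HiveServer2-style JDBC URL. It is a minimal illustration, not the full procedure: the helper name `build_kerberos_hive_url`, the host, and the realm are hypothetical, and the property names (`AuthMech`, `KrbRealm`, `KrbHostFQDN`, `KrbServiceName`) are those commonly used by the Simba-based Cloudera Hive driver, so verify them against the documentation for your driver version. Ticket configuration and other Kerberos prerequisites are outside this sketch.

```python
def build_kerberos_hive_url(
    host: str,
    database: str,
    realm: str,
    service_name: str = "hive",
    port: int = 10000,
) -> str:
    """Build a HiveServer2-style JDBC URL with Kerberos properties appended.

    Hypothetical helper; property names are assumptions to check against
    the driver documentation.
    """
    properties = {
        "AuthMech": "1",             # assumed: 1 selects Kerberos in this driver
        "KrbRealm": realm,           # Kerberos realm of the Hive server
        "KrbHostFQDN": host,         # fully qualified Hive server hostname
        "KrbServiceName": service_name,
    }
    # Properties follow the database name as semicolon-separated key=value pairs.
    suffix = ";".join(f"{key}={value}" for key, value in properties.items())
    return f"jdbc:hive2://{host}:{port}/{database};{suffix}"


if __name__ == "__main__":
    # Hypothetical host and realm, for illustration only.
    print(build_kerberos_hive_url("hive.example.com", "default", "EXAMPLE.COM"))
```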