
Configuring multiple Dataproc connections on a single Hive Server

Applies to: Kyvos Enterprise, Kyvos Cloud (SaaS on AWS), Kyvos AWS Marketplace, Kyvos Azure Marketplace, Kyvos GCP Marketplace, Kyvos Single Node Installation (Kyvos SNI)


You can specify Dataproc connections for processing semantic models and reading data for GCP clusters. You can also configure multiple connections on a single Hive server.

To set up or view a Dataproc connection, perform the following steps. 

  1. From the Toolbox, click Setup, then Connections.

  2. From the Actions menu ( ⋮ ), click Add Connection.

  3. Enter a name for the connection, or select an existing connection from the Connection list.

  4. Configure the settings as described in the table below, then click the Test button at the top left to validate the connection settings.

  5. If the connection is valid, click the Save button.

Note

To restrict user access to data, you can create multiple Dataproc connections on the same Hive server, each using a different user's credentials in the User Name and Password fields. Connection-level and table-level access rights are respected in all operations, such as table listing, preview, process, and profile entities.
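The idea in this note can be sketched in Python as a planning aid. This is only an illustration under stated assumptions: Kyvos connections are created through the Connections page, and the `DataprocConnection` class, the helper functions, the host name, the cluster name, and the user names below are all hypothetical. The check that no user is repeated on one server is likewise an assumption about what makes a plan useful, not a documented rule.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class DataprocConnection:
    """Mirrors a few fields of the connection form; not a Kyvos API object."""
    name: str
    cluster_name: str
    hive_jdbc_url: str
    user_name: str


def group_by_hive_server(connections):
    """Map each Hive Server JDBC URL to the connections planned on it."""
    grouped = defaultdict(list)
    for conn in connections:
        grouped[conn.hive_jdbc_url].append(conn)
    return dict(grouped)


def check_plan(connections):
    """Return a list of problems found in the plan (empty if none)."""
    problems = []
    # Connection names must be unique.
    names = [c.name for c in connections]
    for name in sorted(set(names)):
        if names.count(name) > 1:
            problems.append(f"connection name not unique: {name}")
    # Assumption: each connection on one Hive server should use a different user.
    for url, conns in group_by_hive_server(connections).items():
        users = [c.user_name for c in conns]
        for user in sorted(set(users)):
            if users.count(user) > 1:
                problems.append(f"user {user} repeated on {url}")
    return problems


# Hypothetical values, for illustration only.
HIVE_URL = "jdbc:hive2://hive.example.internal:10000/default"
PLAN = [
    DataprocConnection("dataproc_finance", "demo-cluster", HIVE_URL, "finance_user"),
    DataprocConnection("dataproc_sales", "demo-cluster", HIVE_URL, "sales_user"),
]

if __name__ == "__main__":
    for message in check_plan(PLAN) or ["plan looks consistent"]:
        print(message)
```

Each entry in such a plan would then be entered as its own connection, with the same Hive Server JDBC URL and a different User Name and Password.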

  1. To refresh the connections, click the Actions menu ( ⋮ ) at the top of the Connections column and select Refresh.

  2. Enter the connection details as follows:

Parameter/Field

Comments/Description

Name 

Enter a unique name for the connection.

Category 

Select the Process option. 

Provider 

 Select the Dataproc option.

Dataproc Cluster Name 

 Enter the Dataproc cluster name that you want to configure for this connection.

Hive Authentication

Select the Password option if you want to make an authenticated Hive Server connection. 
NOTE: Select the None option to make a connection without authentication. 

User Name 

Enter your Dataproc account user name. 

Password  

Enter your Dataproc account password.

Is Data Process

The checkbox is selected by default, as this connection will be used for processing semantic models.

History server Url 

Enter the History Server URL of your GCP cluster.

Livy server Url

Enter the Livy Server URL. 

Use as source

Select the checkbox to use this connection for reading data (creating datasets) on which the semantic model will be created.

Hive Server JDBC Url 

Enter the Hive Server JDBC URL. 

Is Default SQL Engine 

Select the checkbox to use the connection for raw data querying.

Default SQL Engine 

Select the Hive option. 

Properties

Click Properties to view or set properties.
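Before entering the URL fields from the table above, it can help to check that each value is well formed. The sketch below is a minimal example, assuming a single-host HiveServer2 JDBC URL of the form `jdbc:hive2://host:port/database` and plain HTTP(S) URLs for the Livy and History Server fields. The function names and host names are hypothetical, and the ports shown are common defaults that may differ on your cluster. It does not contact any server; the Test button remains the way to validate a connection.

```python
from urllib.parse import urlsplit


def check_hive_jdbc_url(url):
    """Return (host, port) for a single-host HiveServer2 JDBC URL.

    Raises ValueError if the URL does not start with jdbc:hive2:// or has no host.
    """
    prefix = "jdbc:hive2://"
    if not url.startswith(prefix):
        raise ValueError(f"expected a URL starting with {prefix!r}: {url!r}")
    # Drop the leading "jdbc:" so the rest parses as hive2://host:port/db.
    parts = urlsplit(url[len("jdbc:"):])
    if not parts.hostname:
        raise ValueError(f"no host in {url!r}")
    return parts.hostname, parts.port  # port is None when omitted


def check_http_url(url):
    """Return (host, port) for an http or https URL, or raise ValueError."""
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https") or not parts.hostname:
        raise ValueError(f"expected an http(s) URL with a host: {url!r}")
    return parts.hostname, parts.port


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    print(check_hive_jdbc_url("jdbc:hive2://hive.example.internal:10000/default"))
    print(check_http_url("http://livy.example.internal:8998"))
```

A value that fails these checks is likely to fail the connection test as well, so correcting it first saves a round trip.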

Copyright Kyvos, Inc. All rights reserved.