Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Applies to: (tick) Kyvos Enterprise  (tick) Kyvos Cloud (SaaS on AWS) (tick) Kyvos AWS Marketplace

...

Datasets identify the data used from a data source so that Kyvos can access and work with the data. You can register a dataset or a folder. Kyvos supports advanced datatypes like maps, arrays, and structs in HCatalog tables. You can fetch these with the help of a formula step in datasets using two formula functions to extract these data types.  You can also apply some formatting and filters to the file when you register it. Formatting standardizes the appearance of data such as how dates are displayed. Filtering allows you to exclude data from the original source file that you don't plan to use.

If your instance of Kyvos is configured via the portal.properties file to support it; you can create a dataset with a Presto connection. However, you can't create a dataset or semantic model process using this dataset.

When you specify Lookup as you register the dataset, the complete data is processed for both full and incremental processes regardless of the data source type. See Using lookup to learn more.

You can set up properties to control how Kyvos handles data. 

Panel
panelIconIdatlassian-info
panelIcon:info:
bgColor#FFFAE6

Important

  • Your account must have the appropriate security access to the underlying databases and tables to do some of the tasks related to these files.

  • When processing a semantic model using Spark, you must not end the SQL query on the dataset with a semicolon.

You can avoid the dependency of creating a dataset by writing SQL code when registering a dataset. 

...

Also refer to the section Common actions to learn more about the actions that you can perform on the datasets.