Skip to main content

Data Assets

Overview

Data assets are pointers to remote data storage locations that workflows use to read from or write to. Rather than configuring storage paths directly in each node, you create data assets once and reference them across multiple workflows. This centralizes storage configuration and simplifies credential management.

Supported Data Asset Types

Delta Table

Delta tables stored directly on cloud storage (S3, GCS, Azure Blob Storage). Use this type when you want to write data in Delta Lake format to your own storage buckets without going through a data platform like Databricks.

Storage ProviderPath SchemeRequired Credential
AWS S3s3a://AWS Access Key
Google Cloud Storagegs://GCP Service Account
Azure Blob Storageabfs://Azure Storage Key

Databricks Unity Catalog Table (DATABRICKS_TABLE)

Tables managed through Databricks Unity Catalog. Use this type when you want Databricks to handle storage, governance, and metadata management. Requires a configured Databricks Integration.

Managing Data Assets

Accessing Data Assets

Navigate to Data Assets in the left sidebar to view and manage your data assets.

Data Assets Sidebar

Creating Data Assets

  1. Click Create Data Asset in the top right
  2. Select the asset type (Delta Table or Databricks Table)
  3. Configure the storage location and credentials
  4. Provide a name and optional description
  5. Click Create to save

Data Asset Operations

From the Data Assets page, you can:

  • View all existing data assets with their type and storage location
  • Edit asset name and description
  • Delete assets that are no longer needed
  • Filter by asset type using the dropdown

Using Data Assets in Workflows

Reference data assets in sink nodes to write processed data to storage:

When configuring a sink node, use the Select a Saved Table dropdown to choose from your data assets instead of manually entering storage paths.