Azure Blob Storage Reader Activity

Reads a file from Azure Blob Storage and exposes it as a view.

Note

Authentication credentials are set as azure-account-name and azure-account-key under the job's additional parameters.
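As a rough illustration of the two credentials, the sketch below collects them into a parameter mapping. Only the key names azure-account-name and azure-account-key come from this page; the helper names, the dictionary shape, and the sample account name are hypothetical. The second helper returns the property name that the Hadoop Azure (wasb) connector uses for an account key. Whether this activity maps the credentials to that property internally is not stated here, so treat it as an assumption.

```python
def build_azure_job_parameters(account_name: str, account_key: str) -> dict:
    """Return job additional parameters holding the storage credentials.

    The dict shape is an assumption; only the two key names come from the docs.
    """
    if not account_name or not account_key:
        raise ValueError("both account name and account key are required")
    return {
        "azure-account-name": account_name,
        "azure-account-key": account_key,
    }


def hadoop_account_key_property(account_name: str) -> str:
    """Name of the Hadoop Azure (wasb) property that carries an account key."""
    return f"fs.azure.account.key.{account_name}.blob.core.windows.net"


# Hypothetical values; never hard-code a real account key in job definitions.
params = build_azure_job_parameters("mystorageaccount", "<your-account-key>")
```

Keeping the credentials in job-level parameters rather than in the path means the same activity definition can be reused across storage accounts.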

Parameters

| Name | Description | Type | Default |
|---|---|---|---|
| Title | Title of the activity. It is displayed on the designer. | text | |
| Path | File path in wasb[s]://<containername>@<accountname>.blob.core.windows.net/<path> format. | text | |
| Format | File format. See Supported File Formats below. | enum | |
| Schema | Optional schema definition in Avro schema format. | text | |
| As | Name of the DataRow view linked to the file. | text | |
| Options | Additional options. For example, the delimiter for a tab-delimited CSV file can be specified as [delimiter,\t]. | key-value | |
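The parameters above can be put together roughly as follows. This is a minimal sketch, not the product's API: the helper names, the dictionary used to hold the activity settings, and all sample values (container, account, file, schema, view name) are hypothetical. Only the parameter names, the wasb[s] path format, and the delimiter option come from this page.

```python
import json
import re

# Pattern derived from the Path format described above.
WASB_PATTERN = re.compile(
    r"^wasbs?://(?P<container>[^@/]+)@(?P<account>[^./]+)"
    r"\.blob\.core\.windows\.net/(?P<path>.+)$"
)


def build_wasb_path(container: str, account: str, path: str, secure: bool = True) -> str:
    """Build a wasb[s]:// path in the format the Path parameter expects."""
    scheme = "wasbs" if secure else "wasb"
    return f"{scheme}://{container}@{account}.blob.core.windows.net/{path.lstrip('/')}"


def parse_wasb_path(url: str) -> dict:
    """Split a wasb[s]:// path into container, account, and path parts."""
    match = WASB_PATTERN.match(url)
    if match is None:
        raise ValueError(f"not a valid wasb[s] path: {url}")
    return match.groupdict()


# Example Avro record schema for the optional Schema parameter (hypothetical fields).
schema = {
    "type": "record",
    "name": "Trip",
    "fields": [
        {"name": "id", "type": "long"},
        {"name": "city", "type": ["null", "string"], "default": None},
    ],
}

# Hypothetical representation of the activity settings as a plain dict.
activity = {
    "Title": "Read trips",
    "Path": build_wasb_path("raw", "mystorageaccount", "trips/2024/trips.csv"),
    "Format": "CSV",
    "Schema": json.dumps(schema),
    "As": "trips",
    # Written as the two characters \t, as in the option example above; whether
    # the tool expects this escape or a literal tab character is an assumption.
    "Options": {"delimiter": "\\t"},
}
```

Validating the path with parse_wasb_path before submitting a job is one way to catch a missing container or account name early.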

Supported File Formats

| Name | Description |
|---|---|
| Parquet | Apache Parquet is a columnar storage format. |
| ORC | Apache ORC is also a columnar storage format. |
| Avro | Apache Avro is a data serialization format. |
| CSV | CSV is a delimited text file format. |
| JSON | JSON is a lightweight data-interchange format. |