Azure Data Lake Reader Activity

Reads a file from Azure Data Lake and exposes it as a view.

Parameters

| Name | Description | Type | Default |
| --- | --- | --- | --- |
| Title | Title of the activity. It is displayed in the designer. | text | |
| Path | File path in adl[s]://<account>.azuredatalakestore.net/<path> format. | text | |
| Format | File format. See Supported File Formats below. | enum | |
| Schema | Optional schema definition in Avro schema format. | text | |
| As | Name of the DataRow view linked to the file. | text | |
| Options | Additional options. For example, the delimiter for a tab-delimited CSV file could be specified as [delimiter,\t]. | key-value | |
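
The Path format and the key-value Options can be checked before an activity is configured. The following is a minimal Python sketch, not part of the product: the `parse_adl_path` helper, the account-name character set in the pattern, and the sample parameter values are all assumptions made for illustration, and representing Options as a dictionary is only one possible way to model key-value pairs.

```python
import re

# Pattern for the Path format described above:
#   adl[s]://<account>.azuredatalakestore.net/<path>
# The allowed characters for <account> are an assumption of this sketch.
ADL_PATH = re.compile(r"^adls?://([A-Za-z0-9-]+)\.azuredatalakestore\.net/(.+)$")


def parse_adl_path(path):
    """Return (account, file_path), or raise ValueError if the format is wrong."""
    match = ADL_PATH.match(path)
    if match is None:
        raise ValueError(
            "expected adl[s]://<account>.azuredatalakestore.net/<path>: " + path
        )
    return match.group(1), match.group(2)


# Hypothetical parameter set for one activity, keyed by the parameter names.
params = {
    "Title": "Read daily sales",
    "Path": "adl://myaccount.azuredatalakestore.net/sales/2020/daily.csv",
    "Format": "CSV",
    "As": "daily_sales",
    "Options": {"delimiter": "\t"},  # tab-delimited CSV file
}

account, file_path = parse_adl_path(params["Path"])
print(account, file_path)
```

Validating the path up front gives a clear error message for a malformed URI instead of a failure when the job runs.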

Supported File Formats

| Name | Description |
| --- | --- |
| Parquet | Apache Parquet is a columnar storage format. |
| ORC | Apache ORC is also a columnar storage format. |
| Avro | Apache Avro is a data serialization format. |
| CSV | CSV is a delimited text file format. |
| JSON | JSON is a lightweight data-interchange format. |
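
Parquet, ORC, and Avro files carry their own schema, while CSV and JSON files do not, so the optional Schema parameter is probably most useful for the last two. The sketch below shows one way to produce a schema definition in Avro schema format using Python. The record name and fields are hypothetical, and how the activity applies the schema is not specified here.

```python
import json

# Hypothetical record layout; the field names and types are illustrative only.
schema = {
    "type": "record",
    "name": "DailySale",
    "fields": [
        {"name": "order_id", "type": "long"},
        {"name": "customer", "type": "string"},
        {"name": "amount", "type": "double"},
        # A union with "null" marks the field as optional.
        {"name": "coupon", "type": ["null", "string"], "default": None},
    ],
}

# JSON text that could be supplied as the Schema parameter.
schema_text = json.dumps(schema, indent=2)
print(schema_text)
```

Building the schema as a dictionary and serializing it avoids hand-written JSON syntax errors such as missing quotes or trailing commas.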

Job Parameters

Note

The following job parameters must be set for Azure Data Lake authentication to work.

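| Key | Value |
| --- | --- |
| dfs.adls.oauth2.access.token.provider.type | Token provider type, e.g. ClientCredential |
| dfs.adls.oauth2.refresh.url | OAuth 2.0 token endpoint, listed in the Azure Portal under Azure Active Directory / App Registrations / Endpoints / OAUTH 2.0 TOKEN ENDPOINT |
| dfs.adls.oauth2.client.id | AppId of the registered application |
| dfs.adls.oauth2.credential | App Key of the registered application |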
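
The four job parameters can be assembled and checked for completeness before a job is submitted. This is a minimal Python sketch under stated assumptions: the `adl_job_parameters` helper and the environment variable names are hypothetical, and the placeholder values stand in for the real endpoint, AppId, and App Key. How the resulting key-value pairs are passed to a job depends on the tool and is not shown.

```python
import os

REQUIRED_KEYS = (
    "dfs.adls.oauth2.access.token.provider.type",
    "dfs.adls.oauth2.refresh.url",
    "dfs.adls.oauth2.client.id",
    "dfs.adls.oauth2.credential",
)


def adl_job_parameters(refresh_url, client_id, credential):
    """Build the job parameters listed above for ClientCredential authentication."""
    params = {
        "dfs.adls.oauth2.access.token.provider.type": "ClientCredential",
        "dfs.adls.oauth2.refresh.url": refresh_url,
        "dfs.adls.oauth2.client.id": client_id,
        "dfs.adls.oauth2.credential": credential,
    }
    # Reject empty values so a missing setting fails early with a clear message.
    missing = [key for key in REQUIRED_KEYS if not params[key]]
    if missing:
        raise ValueError("missing job parameters: " + ", ".join(missing))
    return params


# Hypothetical environment variable names; placeholders keep the sketch runnable.
job_params = adl_job_parameters(
    refresh_url=os.environ.get("ADL_REFRESH_URL") or "<token-endpoint-url>",
    client_id=os.environ.get("ADL_CLIENT_ID") or "<app-id>",
    credential=os.environ.get("ADL_APP_KEY") or "<app-key>",
)
```

Reading the App Key from the environment rather than hard-coding it keeps the secret out of source files and job definitions that may be shared.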