Azure Data Lake Writer Activity¶
Writes specified view to Azure Data Lake as a file.
Parameters¶
Name |
Description |
Type |
Default |
---|---|---|---|
Title |
Title of the activity. Its displayed on the designer. |
text |
|
Source View |
Name of the DataRow view that is going to be written into the file. |
text |
|
Path |
File path in adl[s]://<account>.azuredatalakestore.net/<path> format. |
text |
|
Format |
File format. Please see below for supported file formats. |
enum |
|
Partition By |
List of columns to partition the output by. |
enum |
error |
Mode |
Write mode. Please see blow for list modes and their meanings. |
enum |
error |
Options |
Additional options (e.g. delimiter for CSV file could be specified as [delimiter,\t] if file is tab delimited |
key-value |
Supported File Formats¶
Name |
Description |
---|---|
Parquet |
Apache Parquet is a columnar storage format. |
ORC |
ORC is also columnar storage format. |
Avro |
Apache Avro is a data serialization format. |
CSV |
CSV is a delimited text file format. |
JSON |
JSON is a lightweight data-interchange format. |
Supported Write Modes¶
Name |
Description |
---|---|
error |
Fail if file already exists. |
append |
Append to existing file or create new. |
overwrite |
Overwrite existing file or create new. |
ignore |
Ignore if file already exists or create new. |
Job Parameters¶
Note
Following job parameters need to be set for proper Azure Data Lake authentication.
Key |
Value |
---|---|
dfs.adls.oauth2.access.token.provider.type |
e.g. ClientCredential |
dfs.adls.oauth2.refresh.url |
under Azure Portal/Azure Active Directory/App Registrations/Endpoints/OAUTH 2.0 TOKEN ENDPOINT |
dfs.adls.oauth2.client.id |
AppId |
dfs.adls.oauth2.credential |
App Key |