Amazon S3 Reader Activity

Reads file from Amazon S3 as a view.

Note

Authentication keys are set as aws-access-key and aws-secret-key under job additional parameters.

Parameters

Name

Description

Type

Default

Title

Title of the activity. Its displayed on the designer.

text

Path

File path in s3://<bucketname>/<path> format

text

Format

File format. See below for supported file formats.

enum

Schema

Optional schema definition in Avro schema format.

text

As

Name of DataRow view linked to the file.

text

Options

Additional options (e.g. delimiter for CSV file could be specified as [delimiter,\t] if file is tab delimited

key-value

Supported File Formats

Name

Description

Parquet

Apache Parquet is a columnar storage format.

ORC

ORC is also columnar storage format.

Avro

Apache Avro is a data serialization format.

CSV

CSV is a delimited text file format.

JSON

JSON is a lightweight data-interchange format.