Skip to content

Azure Blob

Azure Blob (azure-blob)

Send data to a Microsoft Azure Storage blob (Block Storage).

Block Store Cloud binary json raw

Minimal example

output:
azure-blob:
blob-destination:
name: ~
container-name: ~
storage-account: ~
storage-master-key: ~
JSON
{
"output": {
"azure-blob": {
"blob-destination": {
"name": null
},
"container-name": null,
"storage-account": null,
"storage-master-key": null
}
}
}

Contents

Fields

FieldTypeRequiredDescription
container-name LocationstringThe storage service container for created blobs.
blob-destination LocationBlob DestinationEither a literal blob name or the name of the field to read it from.
storage-account AuthenticationstringThe Storage Account Name to be used (credential).
storage-master-key AuthenticationstringThe Storage Master Key to be used (credential).
retry ReliabilityRetryHow to retry operation if it fails.
batch ProcessingBatchBatching output events.
input-field Processingfield (string)Send only the content of the specified field to the blob.
Examples: data_field
mode BehaviorModeEither writing or deleting a blob.
Allowed values: put, delete
disable-blob-name-guid Object Propertiesboolean (bool)Do not append a GUID to the output blob name.
guid-prefix Object PropertiesstringGUID Prefix, will be prepended to the GUID, the default value is ”/”.
guid-suffix Object PropertiesstringGUID Suffix, will be appended to the GUID if specified.
content-type Object PropertiesstringSet the created blob to the following content-type.
preprocessors ProcessingPreprocessors[]Preprocessors (process data before making it available for upload) these processors will be run in the order they are specified.
Allowed values: gzip, parquet, base64
track-schema Reliabilityboolean (bool)Check the schema of the written data and update __SCHEMA_NUMBER (written data must be JSON).

Location

Show fields
FieldTypeRequiredDescription
container-namestringThe storage service container for created blobs.
blob-destinationBlob DestinationEither a literal blob name or the name of the field to read it from.

Reliability

Show fields
FieldTypeRequiredDescription
retryRetryHow to retry operation if it fails.
track-schemaboolean (bool)Check the schema of the written data and update __SCHEMA_NUMBER (written data must be JSON).

Processing

Show fields
FieldTypeRequiredDescription
batchBatchBatching output events.
input-fieldfield (string)Send only the content of the specified field to the blob.
Examples: data_field
preprocessorsPreprocessors[]Preprocessors (process data before making it available for upload) these processors will be run in the order they are specified.
Allowed values: gzip, parquet, base64

Behavior

Show fields
FieldTypeRequiredDescription
modeModeEither writing or deleting a blob.
Allowed values: put, delete

Object Properties

Show fields
FieldTypeRequiredDescription
disable-blob-name-guidboolean (bool)Do not append a GUID to the output blob name.
guid-prefixstringGUID Prefix, will be prepended to the GUID, the default value is ”/”.
guid-suffixstringGUID Suffix, will be appended to the GUID if specified.
content-typestringSet the created blob to the following content-type.

Authentication

Show fields
FieldTypeRequiredDescription
storage-accountstringThe Storage Account Name to be used (credential).
storage-master-keystringThe Storage Master Key to be used (credential).

Schema

Blob Destination Options

OptionNameTypeDescription
nameNamestringThe name for the output blob, this will be suffixed by a GUID by default.
fieldFieldstringA field containing the value for output_blob.

Retry Fields

FieldTypeRequiredDescription
timeouttime-interval (string)timeout (e.g. 500ms, 2s etc. - default is 30).
Examples: 500ms, 2h
retriesnumber (integer)number of retries.
Examples: 42, 1.2e-10

Batch Fields

FieldTypeRequiredDescription
fixed-sizenumber (integer)maximum number of events in an output batch.
Examples: 42, 1.2e-10
modeModeIf ‘document’ send on end of document generated by input. If ‘fixed’, use fixed_size.
Allowed values: fixed, document
timeouttime-interval (string)interval after which the batch is sent, to keep throughput going (default 100ms).
Examples: 500ms, 2h
headermultiline-text (string)put a header line before the batch.
footermultiline-text (string)put a header line after the last line of the batch.
use-document-markerboolean (bool)Enrich the job metadata with a document marker (for document handling in batch mode).
wrap-as-jsonboolean (bool)Format the output batch as a JSON array.

Batch - Mode Options

ValueDescription
fixedFixed
documentDocument

Mode Options

ValueDescription
putPut Objects
deleteDelete Objects

Preprocessors Options

ValueDescription
gzipGzip the output data
parquetExtract the received data as JSON rows from a parquet file
base64Decode base64 as binary