Kinesis Publisher Adapter

Introduction

The Spotfire Streaming Adapter for Kinesis Publisher allows the system to publish data to Kinesis Data Streams.

Kinesis AWS Set Up

We use the default AWSCredentialProvider chain that the KCL library provides out of the box. Before using the adapters, we need to set 2 environment variables: export AWS_ACCESS_KEY_ID="*******" export AWS_SECRET_ACCESS_KEY="******"

Adapter Properties

This section describes the properties you can set for this adapter, using the various tabs of the Properties view in StreamBase Studio.

General Tab

Name: Use this required field to specify or change the name of this instance of this component. The name must be unique within the current EventFlow module. The name can contain alphanumeric characters, underscores, and escaped special characters. Special characters can be escaped as described in Identifier Naming Rules. The first character must be alphabetic or an underscore.

Adapter: A read-only field that shows the formal name of the adapter.

Start options: This field provides a link to the Cluster Aware tab, where you configure the conditions under which this adapter starts.

Enable Error Output Port: Select this checkbox to add an Error Port to this component. In the EventFlow canvas, the Error Port shows as a red output port, always the last port for the component. See Using Error Ports to learn about Error Ports.

Description: Optionally, enter text to briefly describe the purpose and function of the component. In the EventFlow Editor canvas, you can see the description by pressing Ctrl while the component's tooltip is displayed.

Adapter Properties Tab

Property Type Description
Kinesis Stream string The Kinesis Stream where the data is published.
Region string The AWS region where the stream is present.
Partition Key string A partition key is used to group data by shard within a stream.
Explict Hash Key string The hash value used to explicitly determine the shard the data record is assigned to by overriding the partition key hash.
Publisher Name string The name of the enhanced fan out Publisher to use when fetching records from Kinesis.
Enable Buffer Publish check box Enable buffered publish for the Kinesis data stream.
Producer Buffer Count int The number of tuples to hold before doing a network write.
Producer Flush Interval (ms) long The number of millisecnds to wait before doing a network write.
Threading Model string the way threads are used in the library, default to thread per request.
Thread Pool Size int sets the maximum number of threads in the thread pool.
Log Level INFO Controls the level of verbosity the adapter uses to issue informational traces to the console. This setting is independent of the containing application's overall log level. Available values, in increasing order of verbosity, are: OFF, ERROR, WARN, INFO, DEBUG, TRACE.

Cluster Aware Tab

Use the settings in this tab to enable this operator or adapter for runtime start and stop conditions in a multi-node cluster. During initial development of the fragment that contains this operator or adapter, and for maximum compatibility with releases before 10.5.0, leave the Cluster start policy control in its default setting, Start with module.

Cluster awareness is an advanced topic that requires an understanding of StreamBase Runtime architecture features, including clusters, quorums, availability zones, and partitions. See Cluster Awareness Tab Settings on the Using Cluster Awareness page for instructions on configuring this tab.

Concurrency Tab

Use the Concurrency tab to specify parallel regions for this instance of this component, or multiplicity options, or both. The Concurrency tab settings are described in Concurrency Options, and dispatch styles are described in Dispatch Styles.

Caution

Concurrency settings are not suitable for every application, and using these settings requires a thorough analysis of your application. For details, see Execution Order and Concurrency, which includes important guidelines for using the concurrency options.

Data Input Port

Use the data port to publish data to the kinesis data stream.

The data port's schema is below:

Field Data Type Description
data string The data content tp publish.
sequenceNumberForOrdering(optional) string If you do not provide 'sequenceNumberForOrdering', we do not send anything for sequencing to AWS Kinesis.
partitionKey(optional) string A partition key is used to group data by shard within a stream. Kinesis Data Streams segregates the data records belonging to a stream into multiple shards. It uses the partition key that is associated with each data record to determine which shard a given data record belongs to. Partition keys are Unicode strings, with a maximum length limit of 256 characters for each key. An MD5 hash function is used to map partition keys to 128-bit integer values and to map associated data records to shards using the hash key ranges of the shards. When an application puts data into a stream, it must specify a partition key.
explicitHashKey(optional) string The hash value used to explicitly determine the shard the data record is assigned to by overriding the partition key hash.

Status Output Port

Description

You can optionally enable the status output port for this adapter instance by means of the Enable Status Port property in the Adapter Properties page of the Properties view. Use the status port to retrieve status output messages from the adapter.

Status Port Schema

  • status, string. The type of status information emitted. Status types are:

    • Error—This message relates to an error that occurred.

    • Warn—This message relates to a warning that the user should be aware of.

    • Info—This message relates to extra status information.

  • message, string. This is a formatted human readable message that explains the status message.

  • inputTuple, tuple. Pass through input tuple.