Using the Kafka Source Component

The Kafka Source Component is an SSIS data flow pipeline component that can be used to read/receive data from Kafka. The component includes the following two pages of configuration:

  • General
  • Columns

General Page

The General page allows you to configure various options that will help you receive the desired data from Kafka.

Kafka Source Component

Connection Manager

The source component requires an active connection to access Kafka. The Connection Manager drop-down will show a list of all Kafka Connection Managers that have been created in the current SSIS package.

Topic

The name of the Kafka Topic

Group ID

The consumer group ID.

Receive Mode

There are two different modes available for receiving messages:

  • Peek: If the Peek option is selected messages will be received from the queue without deleting them or modifying the queue in any way.
  • Receive and Delete: If the Receive and Delete option is selected messages will be retrieved and deleted from the queue. It is only possible to select a listener mode if the receive mode is Receive and Delete.
Message Column Type

The Message Column Type allows you to specify whether you want to read the binary content of the message or the text content of the message from Kafka. There are two different modes available for Message column types: Binary and Text. The default setting is Text.

Encoding

The encoding to use to decode the body of received text messages. This option is only available when the Message Column Type is Text.

Isolation Level

Set this property to control what transaction records are exposed to the consumer.

  • Read Uncommitted: to retrieve all records, independently on the transaction outcome (if any)
  • Read Committed: to get only the records from committed transactions.
Listener Mode

After the Kafka component has retrieved all available messages from the topic it can continue to listen for more messages for a period of time.

There are three listener modes available:

  • Fixed Time Mode: allows the component to listen for messages for a specified period of time.
  • Wait Until Mode: allows the component to listen for messages until a specified date and time.
  • Wait Until Variable: allows the component to listen for messages until a date and time specified in a variable.

Listener modes other than None are only compatible with the Receive and Delete receive modes.

When new messages are detected in listener mode, they may not enter the SSIS pipeline immediately; this depends on the buffer settings of the SSIS task. Internally any row that is directed to an output goes to an SSIS buffer, and SSIS will not actually direct the rows in that buffer until the buffer is considered full, or the component has finished processing all its data. While in listener mode the component is considered to be processing data until the listener time has expired, which is when the full buffer is guaranteed to be completely processed. To direct rows as fast as possible in listener mode consider lowering the DefaultBufferMaxRows property and DefaultBufferSize properties of the task to the lowest possible values.

Refresh Component Button

Clicking the Refresh Component button causes the component to retrieve the latest metadata and update each field to its most recent metadata. It will remove any custom fields that have been added to the columns page.

Expression fx Icon

Click the blue fx icon to launch SSIS Expression Editor to enable dynamic updates of the property at run time.

Generate Documentation Icon

Click the Generate Documentation icon to generate a Word document that describes the component's metadata including relevant mapping, and so on.

Columns Page

The Columns page shows you all available attributes of messages that will be retrieved. You may indicate which attributes to include in your source component by checking or unchecking the checkbox next to each attribute.

Kafka Source - Columns Page

Add Button

Clicking the Add button will bring up a dialog that will allow custom fields to be added. Custom fields can be used for values of any custom properties on the retrieved messages.

Advanced Page

The Advanced page shows you all available attributes for advanced features. 

Kafka Source - Advanced.png
Use Shared Session

This button can be used to enable the option to use the shared sessions