Using the Cassandra Source Component

The Cassandra Source Component is an SSIS data flow pipeline component that can be used to read/retrieve data from Apache Cassandra.

General Page

The General page of the Cassandra Source Component allows you to specify the general settings of the component.

Cassandra Source Editor

Connection Manager

The Cassandra Source Component requires a Cassandra connection in order to connect with Cassandra. The Connection Manager drop-down will show a list of all Cassandra connection managers that are available to your current SSIS package.

Keyspace

The Keyspace drop-down shows a list of Cassandra Keyspaces available to you. After selecting the Keyspace you wish to read from, the Source Table drop-down will be populated with Tables in the selected Keyspace.

Source Table

The Source Table property is used to generate the Command property. You can modify the command afterward however it will be overwritten with the basic command when a new Source Table is selected.

Command

The CQL command sent to Cassandra to retrieve records. Modify this to filter columns and records.

Note: To return records as JSON add the 'JSON' keyword after the 'SELECT' in the SELECT statement.

ex: SELECT JSON "first_name", "last_name", "id" FROM "test"."contacts";

Refresh Component Button

Clicking the Refresh Component button causes the component to retrieve updated metadata.

Expression fx Icon

Click the blue fx icon to launch SSIS Expression Editor to enable dynamic updates of the property at run time.

Generate Documentation Icon

Click the Generate Documentation icon to generate a Word document that describes the component's metadata including relevant mapping, and so on.

Columns Page

The Columns page of the Cassandra Source Component shows you all available columns from the command specified in the General page.

Cassandra Source Editor - Columns Page

The Cassandra Source Component contains a special feature to toggle between the nvarchar and ntext data type. By default, all string fields are assigned a data type of nvarchar with 4000 length. If the data in the column exceeds 4000, click the toggle button to switch the column type to ntext.