The Big Data components from the SSIS Productivity Pack are SSIS components that facilitate integration with Big Data platforms such as Hadoop.

The SSIS Productivity Pack currently supports integration with Hadoop File System (HDFS) and it provides a Connection Manager, Source, and Destination component to facilitate the integration. The Hadoop connection manager can also be used with the EDI Source, Premium Data File Source, Premium Data File DestinationPremium Flat File Source, Premium Flat File Destination, Premium File Transfer Task, Premium Excel Source, and Premium Excel Destination components.

The following are the SSIS Big Data components available within the SSIS Productivity Pack and their help manuals:

Hadoop Components:

  • Hadoop Connection Manager
    • Facilitates connecting to Hadoop from within SSIS.
  • HDFS Source Component
    • An SSIS data flow component used to retrieve data from HDFS. Includes the option to specify retrieval mode such as retrieving files and folder or only files or choosing whether sub items should be retrieved as well.
  • HDFS Destination Component
    • An SSIS data flow component used to facilitate writing data to HDFS. This component supports CreateFolder, Delete, and Upload actions when writing to HDFS.

Premium Data File Components:

  • Premium Data File Source Component
    • The Premium Data File Source Component is an SSIS data flow pipeline component that can be used to read / retrieve data from a an Avro, ORC or a Parquet file.
  • Premium Data File Destination Component
    • The Premium Data File Destination Component is an SSIS data flow pipeline component that can be used to write data from source into an Avro, ORC or a Parquet file.