Using the Data Anonymizer Component

The Data Anonymizer Component is a transformation component used to mask data. Select the columns to mask and choose an anonymization type for each one. The values in these columns are replaced with randomly generated data based on the anonymization type selected.

Columns Page

Select a column in the data grid view and configure its anonymization settings in the property grid to the right.

Data Anonymizer Editor - Columns Page

Anonymization Type

Displays the Anonymization Type selected under Anonymizer Properties, which determines the random data generated for the column.

Anonymizer Properties

The following Anonymization Types are available for selection. Some Anonymization Types have additional properties that can be configured:

Anonymization Type Description
<Ignore> The existing value of the column will remain.
Bool This generates a Boolean value. E.g. True or False
City This generates a city name. E.g. Toronto, New York, etc.
CompanyCatchPhrase This generates a company catch phrase.
CompanyName This generates a company name.
Country This generates a country name. E.g. Canada, Germany, etc.
CountryAbbreviated (since v8.0) This generates a country abbreviation. E.g. US, CA, etc. 
Custom

This option allows you to specify your own anonymous values. Additional parameters include:

  • List Of Values: specify the available anonymous values.
  • Delimiter: specify the delimiter for your list.
  • Spawn Order: specify the order of the anonymous values - Random or Sequential.
DateTime

This generates a date time value. Additional parameters include:

  • Minimum Year: specify the minimum year of generated date time value.
  • Maximum Year: specify the maximum year of generated date time value.
DomainName This generates a domain name. E.g. emard.com, armstrong.co.uk, etc.
Email

This generates an email address. Additional parameters include:

  • Domain (since v8.0): specify a static domain name for the spawned email address value.
FileContent

Use this anonymization type to randomly select files from the defined directory. Additional parameters include:

  • Path To Parent Directory: specify the path of the parent directory which contains sample files.
  • File Selector: specify wildcard characters to select files.
  • Include Subdirectories: specify whether to read files under subfolders.
FirstName This generates a first name. E.g. Bob, John, etc.
FullName This generates a full name. E.g. John Smith, Anne Kiehn, etc.
GUID This generates a GUID value.
IPv4Address This generates an IPv4 address. E.g. 112.2.191.50, etc.
IPv6Address This generates an IPv6 address. E.g. a84:902f:e8ab:e255:e46c:182b:7a27:2bee, etc.
IncrementalValue

This outputs numbers incrementally. Additional parameters include:

  • Starting Value: specify the starting number.
  • Incremental Value: specify the value added to the last generated number.
LastName This generates a last name. E.g. Smith, Kiehn, etc.
Number

This generates a decimal number. The number of decimal places is represented by the Decimal Places parameter. Additional parameters include:

  • Minimum Value: specify the minimum value of the spawn range.
  • Maximum Value: specify the maximum value of the spawn range.
  • Decimal Places: specify the number of decimal places. A value of 0 produces an integer; a value of 2 produces a decimal with 2 decimal places.
Paragraph This generates a paragraph.
RandomString

This generates a random string. Additional parameters include:

  • Valid Characters: specify the valid characters.
  • Minimum Length: specify the minimum length of the random string.
  • Maximum Length: specify the maximum length of the random string.
RowIndex

This generates a row index. Additional parameters include:

  • Starting Value: specify the starting row index.
  • Incremental Value: specify the value added to the last row index.
Sentence This generates a sentence.
StreetAddress This generates a street address. E.g. 3650 King St., etc.
StreetName This generates a street name. E.g. King St., etc.
URL This generates a URL. E.g. kwst.us/ssis
UkCounty This generates a UK county. E.g. Durham, Avon, etc.
UkPostalCode This generates a UK postal code. E.g. ln4 3xy, fc1 4bd, etc.
UsState This generates a US state. E.g. Georgia, Louisiana, etc.
UsStateAbbreviated This generates a US state abbreviation. E.g. GA, LA, etc.
ZipCode

This generates a zip code. Additional parameters include:

  • Zip Code Format: specify the format of the anonymous zip code value - Random, 5 digit, or 9 digit.
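
The parameterized types in the table above can be pictured with a short Python sketch. This is not the component's implementation. The function and argument names are hypothetical stand-ins for the parameters listed (List Of Values, Delimiter, Spawn Order, Starting Value, Incremental Value, Minimum/Maximum Value, Decimal Places, Valid Characters, Minimum/Maximum Length). Two behaviors are assumptions, not documented facts: a Sequential list wraps around when it is exhausted, and the minimum and maximum bounds are inclusive.

```python
import random
from itertools import count, cycle, islice


def custom_values(list_of_values, delimiter=",", spawn_order="Random", rng=None):
    """Custom: yield values from a delimited list, randomly or in order."""
    rng = rng or random.Random()
    values = list_of_values.split(delimiter)
    if spawn_order == "Sequential":
        # Assumption: restart from the first value once the list is exhausted.
        yield from cycle(values)
    else:
        while True:
            yield rng.choice(values)


def incremental_values(starting_value, incremental_value):
    """IncrementalValue: each value is the previous one plus the increment."""
    return count(starting_value, incremental_value)


def random_number(minimum_value, maximum_value, decimal_places, rng=None):
    """Number: a value in the range, rounded to the requested decimal places."""
    rng = rng or random.Random()
    value = rng.uniform(minimum_value, maximum_value)
    if decimal_places == 0:
        return int(round(value))  # 0 decimal places gives an integer
    return round(value, decimal_places)


def random_string(valid_characters, minimum_length, maximum_length, rng=None):
    """RandomString: random length in the range, built from the valid characters."""
    rng = rng or random.Random()
    length = rng.randint(minimum_length, maximum_length)  # assumption: inclusive
    return "".join(rng.choice(valid_characters) for _ in range(length))


# Usage: a Sequential custom list and an incremental counter.
colors = custom_values("red|green|blue", delimiter="|", spawn_order="Sequential")
print(list(islice(colors, 4)))                      # ['red', 'green', 'blue', 'red']
print(list(islice(incremental_values(100, 5), 3)))  # [100, 105, 110]
```

Passing a seeded `random.Random` as `rng` makes a run repeatable, which is a convenience of this sketch and not a documented feature of the component.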
Column Properties

Column properties for the field. These values are configurable.

  • Name: Specify the Column name.
  • Data Type: Specify the data type of the column. It can be changed as needed.
  • Length: If the data type specified is a string, the length specified here is the maximum size. If the data type is not a string, the length is ignored.
  • Precision: Specify the number of digits in a number.
  • Scale: Specify the number of digits to the right of the decimal point in a number.
  • CodePage: Specify the Code Page of the field.
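
As an illustration of how Precision and Scale relate, the following Python sketch derives both from a number written as text. The helper name `precision_and_scale` is hypothetical and is not part of the component. It uses the common convention that precision counts all digits and scale counts the digits to the right of the decimal point, and it assumes a finite value.

```python
from decimal import Decimal


def precision_and_scale(text):
    """Return (precision, scale) for a finite decimal number given as text."""
    _sign, digits, exponent = Decimal(text).as_tuple()
    if exponent >= 0:
        # No fractional part: every digit, plus any trailing zeros implied
        # by the exponent, sits to the left of the decimal point.
        return len(digits) + exponent, 0
    scale = -exponent
    # Values such as 0.05 store one digit but still need two places.
    return max(len(digits), scale), scale


print(precision_and_scale("12345.678"))  # (8, 3)
print(precision_and_scale("0.05"))       # (2, 2)
print(precision_and_scale("100"))        # (3, 0)
```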
Expression fx Button 

Click the fx button to launch the SSIS Expression Editor, which lets you update the property dynamically at run time.

Generate Documentation Button 

Click the Generate Documentation button to generate a Word document that describes the component's metadata, including relevant mappings.

Error Handling Page

The Error Handling page allows you to specify how errors should be handled when they occur.

Data Anonymizer Editor - Error Handling Page

Three options are available:

  1. Fail on error
  2. Redirect rows to error output
  3. Ignore error

When the Redirect rows to error output option is selected, rows that fail to be anonymized are redirected to the 'Error Output' output of the Transformation Component. As indicated in the screenshot below, the blue output connection represents rows that were successfully anonymized, and the red 'Error Output' connection represents rows that were erroneous. The 'ErrorMessage' output column found in the 'Error Output' may contain the error message that was reported by the server or by the component itself.

Data Anonymizer Editor - Error Output
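
The three error handling options can be sketched as a row-routing loop. This Python is only an illustration of the behavior described above, not the component's code. `anonymize_rows` and `mask_email` are hypothetical names. The behavior shown for Ignore error (the row continues to the main output unchanged) is an assumption, since this page does not describe it.

```python
def anonymize_rows(rows, anonymize, on_error="Fail on error"):
    """Route each row to the main output or the error output."""
    output, error_output = [], []
    for row in rows:
        try:
            output.append(anonymize(row))
        except Exception as exc:
            if on_error == "Fail on error":
                raise  # stop processing at the first failing row
            if on_error == "Redirect rows to error output":
                # The failing row goes to the error output with its message.
                error_output.append({**row, "ErrorMessage": str(exc)})
            else:
                # "Ignore error". Assumption: pass the row through unchanged.
                output.append(row)
    return output, error_output


def mask_email(row):
    """Hypothetical anonymizer that fails when the email value is missing."""
    if row["email"] is None:
        raise ValueError("email is missing")
    return {**row, "email": "user@example.com"}


rows = [{"id": 1, "email": "a@b.c"}, {"id": 2, "email": None}]
ok, failed = anonymize_rows(rows, mask_email, "Redirect rows to error output")
print(ok)      # [{'id': 1, 'email': 'user@example.com'}]
print(failed)  # [{'id': 2, 'email': None, 'ErrorMessage': 'email is missing'}]
```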