
Create a Data Classification

You can create a data classification to help you monitor content with specific characteristics. Custom data classifications can be used in real time rules, SaaS API rules and discovery scans.

The building blocks for data classifications are data identifiers. The data identifiers that you choose for a data classification determine the type of data for which rules using that data classification will scan.

The system offers two types of built-in data identifiers you can choose from:

Built-In Identifiers

These identify data using pattern matching and dictionary lookups. The descriptions shown in the GUI provide details about the type of data they match. For more information, see Built-In Data Identifiers.

Machine Learning Identifiers

These identify data based on AI analysis of example documents. For example, the identifier for Patent Files has been trained to recognize documents that are likely patent applications. For more information, see Built-In Data Identifiers.

The system also offers three types of data identifiers that you can create yourself, each using a different method of data analysis:

Custom Identifiers

You can create custom identifiers to match specific terms and pattern expressions of your choosing. See Create a Custom Identifier.

Exact Data Match Identifiers

Exact Data Match Identifiers use fingerprinting to identify data in structured documents that match criteria you define. See Create an Exact Data Match Identifier for more information.

Indexed Document Match Identifiers

Indexed Document Match Identifiers use fingerprinting to identify data in unstructured documents that match criteria you define. See Create an Indexed Document Match Identifier for more information.

To delete or edit a data classification, see Delete or Edit a Classification.

Prerequisites

Procedure

  1. Navigate to Policies > Policy Components > Data Classification and click Add.
  2. Give your classification a meaningful name and description, then select a Boolean operator for the classification.
  • OR: At least one of the selected data identifiers must match during rule evaluation.
  • AND: All of the selected data identifiers must match during rule evaluation.
  3. Select Built-in Data Identifiers.

Choose from:

  • Built-In Identifiers
  • ML (Machine Learning) Built-In Identifiers
  4. Select Custom Data Identifiers.

Choose from:

  • Custom Identifiers
  • Exact Data Match Identifiers

NOTE: Exact Data Match Identifiers that are greyed out have not yet been indexed and may not be selected. (See Create an Exact Data Match Identifier for more information.)

  • Indexed Document Match Identifiers

NOTE: Indexed Document Match Identifiers that are greyed out have not yet been indexed and may not be selected. (See Create an Indexed Document Match Identifier for more information.)

  5. You may expand any built-in data identifier and click COPY & CUSTOMIZE to create a new customized data identifier. See Copy and Customize a Data Identifier.
  6. Click Save.

Your new classification is listed on the Data Classification page, and it will be available to you when you Add a Real Time Rule to the Data Loss Prevention Policy, Add a SaaS API Rule to the Data Loss Prevention Policy, or initiate a new Discovery Scan.
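
The effect of the OR and AND operators chosen in the procedure can be pictured with a short Python sketch. This is only an illustrative model, not the product's implementation or API: the class names `DataIdentifier` and `DataClassification`, the `evaluate` method, and the two sample identifiers are all hypothetical and exist only to show how the Boolean operator combines identifier matches.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class DataIdentifier:
    """Hypothetical stand-in for a data identifier: a name plus a match test."""
    name: str
    matches: Callable[[str], bool]


@dataclass
class DataClassification:
    """Hypothetical stand-in for a data classification."""
    name: str
    operator: str  # "OR" or "AND", as selected in the procedure
    identifiers: List[DataIdentifier]

    def __post_init__(self) -> None:
        if self.operator not in ("OR", "AND"):
            raise ValueError("operator must be 'OR' or 'AND'")
        if not self.identifiers:
            raise ValueError("select at least one data identifier")

    def evaluate(self, content: str) -> bool:
        results = [identifier.matches(content) for identifier in self.identifiers]
        # OR: at least one identifier must match. AND: all identifiers must match.
        return any(results) if self.operator == "OR" else all(results)


# Two illustrative identifiers (assumptions, not built-in product identifiers).
ssn_like = DataIdentifier(
    "SSN-like pattern",
    lambda text: re.search(r"\b\d{3}-\d{2}-\d{4}\b", text) is not None,
)
confidential_term = DataIdentifier(
    "Confidential keyword",
    lambda text: "confidential" in text.lower(),
)

either = DataClassification("Sensitive (OR)", "OR", [ssn_like, confidential_term])
both = DataClassification("Sensitive (AND)", "AND", [ssn_like, confidential_term])

sample = "Employee record, SSN 123-45-6789"
print(either.evaluate(sample))  # True: the SSN-like pattern matches
print(both.evaluate(sample))    # False: the keyword identifier does not match
```

In this model, an OR classification is the broader choice because any single identifier can trigger a match, while an AND classification narrows matching to content that satisfies every selected identifier.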

