The file does not contain sensitive data, but is marked as sensitive.

Problem

The file does not contain sensitive data, but is marked as sensitive.

Cause

Some of the possible reasons for this problem to occur are:

  1. Context based classification may have been enabled for the target. Thus, files downloaded from work domains/apps/emails will automatically be marked as sensitive.
  2. File mayhave been marked sensitive due to data identified by pre-defined or custom RegEx.
  3. File may have been marked sensitive due to data identified by fingerprinting.

Resolution

Disable context based classification

To disable context based classification, on the Endpoint DLP Plus console,

  1. Go to the Policies tab
  2. Select Policy Deployment
  3. Under Action, select Modify
  4. Select Data Leak Prevention
  5. Under Settings, disable "Mark the files created from enterprise apps or downloaded from corporate web-domains/email as sensitive by default"

Remove pattern from RegEx

If the pattern to be removed is a pre-defined pattern, then report it to ManageEngine.

To Report to ManageEngine

    1. Go to the Policies tab
    2. Click on your policy
    3. Under False Positives, select Data Classification
    4. Click on Report to ManageEngine and report

If the pattern is a custom one given by the user, it can be removed by the following steps:

  1. Go to the Policies tab
  2. Under Data Classification, select your data rule
  3. Modify it by deleting the custom rule that contains the RegEx pattern

Increase the match percentage in fingerprinting

Increasing the percentage of matching in fingerprinting may help increase the accuracy of the content matching. To do this,

    1. Go to the Policies tab
    2. Under Data Classification, select Modify
    3. Select Document Matching under New Rule
    4. Increase match percentage to the required percentage

Keywords:

Context based classification, fingerprinting, false positives