Data Loss Prevention (DLP) Policy and Classification Labels: Limitations

The Data Loss Prevention (DLP) feature in WorkDrive relies on content extraction from uploaded files. DLP classification is performed based only on the extracted content.

Currently, WorkDrive supports text and Optical Character Recognition (OCR) extraction for DLP classification. Below are the limitations associated with content and OCR extraction.

Content Extraction Limitations:

  • Maximum size of extracted content: 10 MB
  • Maximum file size for text extraction:
      xls/xlsx files: 100 MB
      Other files: 250 MB
  • Maximum ZIP file size for entry name extraction: 250 MB
  • Maximum file size for metadata extraction:
      Documents: 250 MB
      Images: 250 MB
      Audios/Videos: 1 GB
  • Supported file extensions: View list
  • Blocked MIME types: View list
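
As an illustration of how the text extraction size limits above apply, here is a minimal Python sketch. It is not part of WorkDrive: the function name `within_text_extraction_limit` is hypothetical, and it assumes that "MB" means 1024 × 1024 bytes and that the limits are inclusive, neither of which is stated above.

```python
import os

# Assumption: "MB" in the limits above is treated as 1024 * 1024 bytes.
MB = 1024 * 1024

# Limits copied from the content extraction list above.
SPREADSHEET_EXTENSIONS = {".xls", ".xlsx"}
MAX_TEXT_EXTRACTION_SPREADSHEET = 100 * MB
MAX_TEXT_EXTRACTION_OTHER = 250 * MB


def within_text_extraction_limit(file_name: str, size_bytes: int) -> bool:
    """Return True if the file is small enough for text extraction.

    Hypothetical helper: treats the limits as inclusive, which is an assumption.
    """
    extension = os.path.splitext(file_name)[1].lower()
    if extension in SPREADSHEET_EXTENSIONS:
        limit = MAX_TEXT_EXTRACTION_SPREADSHEET
    else:
        limit = MAX_TEXT_EXTRACTION_OTHER
    return size_bytes <= limit
```

For example, a 150 MB .xlsx file would fall outside the text extraction limit under these assumptions, while a 150 MB .pdf file would fall within it. A file that passes this check is still subject to the 10 MB cap on extracted content.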


 

OCR Extraction Limitations:

  • Maximum image size: 20 MB
  • Maximum number of images processed by OCR in a PDF: Images in the first 2 pages
  • Maximum number of images processed by OCR in Office formats: First 2 unique images
  • Maximum number of pages scanned for images in Office formats: First 5 pages
  • Maximum document size for OCR: 100 MB
  • Maximum image height/width: 5000 pixels
  • Supported image extensions for OCR: jpeg, png, jpg, tiff, tif, bmp
  • Supported document extensions for OCR: pdf, docx, doc, pptx, ppt
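
The image limits above can be expressed as a simple pre-upload check. The following Python sketch is illustrative only: `image_ocr_issues` is a hypothetical helper, "MB" is assumed to mean 1024 × 1024 bytes, and the 5000-pixel limit is assumed to apply to width and height separately.

```python
import os

# Assumption: "MB" in the limits above is treated as 1024 * 1024 bytes.
MB = 1024 * 1024

# Values copied from the OCR extraction list above.
OCR_IMAGE_EXTENSIONS = {"jpeg", "png", "jpg", "tiff", "tif", "bmp"}
MAX_OCR_IMAGE_BYTES = 20 * MB
MAX_OCR_IMAGE_SIDE_PX = 5000  # assumed to apply to each dimension separately


def image_ocr_issues(file_name: str, size_bytes: int,
                     width_px: int, height_px: int) -> list[str]:
    """Return the reasons an image would fall outside the OCR limits.

    An empty list means the image is within every limit checked here.
    """
    issues = []
    extension = os.path.splitext(file_name)[1].lower().lstrip(".")
    if extension not in OCR_IMAGE_EXTENSIONS:
        issues.append(f"unsupported extension: {extension or '(none)'}")
    if size_bytes > MAX_OCR_IMAGE_BYTES:
        issues.append("image larger than 20 MB")
    if width_px > MAX_OCR_IMAGE_SIDE_PX or height_px > MAX_OCR_IMAGE_SIDE_PX:
        issues.append("width or height above 5000 pixels")
    return issues
```

Returning a list of reasons rather than a single boolean makes it easier to tell a user which limit a given image exceeds.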

 

The DLP feature has other limitations related to policies, rules, and classification labels:
  1. You can create up to 100 DLP policies and 200 classification labels by default.
  2. Each DLP policy can contain a maximum of 10 rules.

  3. Limitations based on Rules:
    DLP policies support three types of rules:

    1. Keyword Identifier: Searches for configured keywords within files.
      • Maximum keyword length: 100 characters.

    2. File Identifier: Identifies files based on name, extension, and file size range.
      Limits include:
      • Maximum file name length: 100 characters
      • Maximum number of extensions: 10
      • Maximum extension length: 100 characters
      • Maximum file size range: ~50 GB

    3. Sensitive Content Identifier: Detects pre-defined types of sensitive data within files. WorkDrive currently supports 149 pre-defined sensitive content identifiers across 47 countries/regions, including national ID cards, bank account details, and Social Security numbers. View the full list of supported sensitive information types.

      Note: This list is continually evolving, and additional countries and data types may be added over time.

  4. Classification Labels Limitations
    Each DLP policy can use only one classification label. However, you can assign multiple classification labels to a single file.
    1. Manual Classification Labels: Manual classification labels can be assigned to or removed from a file on the file listing page.
      • You must have EDIT or higher permissions to assign or remove these labels.

    2. Automatic Classification Labels: Automatic classification labels are assigned through a DLP policy and cannot be manually removed from the file listing page.

      To remove an automatic classification label, follow these steps:

      • Open the Admin Console and go to the Data Loss Prevention tab.
      • In the Classification Labels section, you’ll see a list of files associated with classification labels.
      • Click the More actions icon (…) next to a classification label to view its associated files.
      • In the Associated Files window, right-click the file you want to modify.
      • Select Remove Classification Label to detach the label from the file.
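
Returning to the policy and rule limits listed earlier in this section, the sketch below shows one way to check a planned policy against them before configuring it in the Admin Console. It is a hypothetical illustration in Python: `KeywordRule`, `FileRule`, and `validate_policy` are invented names and do not correspond to any WorkDrive API. The approximate ~50 GB file size range is not checked because its exact value is not given.

```python
from dataclasses import dataclass, field

# Limits copied from the policy and rule limits in this section.
MAX_RULES_PER_POLICY = 10
MAX_KEYWORD_LENGTH = 100
MAX_FILE_NAME_LENGTH = 100
MAX_EXTENSIONS = 10
MAX_EXTENSION_LENGTH = 100


@dataclass
class KeywordRule:
    """Hypothetical model of a Keyword Identifier rule."""
    keyword: str


@dataclass
class FileRule:
    """Hypothetical model of a File Identifier rule."""
    file_name: str = ""
    extensions: list[str] = field(default_factory=list)


def validate_policy(rules: list) -> list[str]:
    """Return a list of limit violations; an empty list means none were found."""
    problems = []
    if len(rules) > MAX_RULES_PER_POLICY:
        problems.append(f"policy has {len(rules)} rules (maximum {MAX_RULES_PER_POLICY})")
    for index, rule in enumerate(rules, start=1):
        if isinstance(rule, KeywordRule):
            if len(rule.keyword) > MAX_KEYWORD_LENGTH:
                problems.append(f"rule {index}: keyword exceeds {MAX_KEYWORD_LENGTH} characters")
        elif isinstance(rule, FileRule):
            if len(rule.file_name) > MAX_FILE_NAME_LENGTH:
                problems.append(f"rule {index}: file name exceeds {MAX_FILE_NAME_LENGTH} characters")
            if len(rule.extensions) > MAX_EXTENSIONS:
                problems.append(f"rule {index}: more than {MAX_EXTENSIONS} extensions")
            for extension in rule.extensions:
                if len(extension) > MAX_EXTENSION_LENGTH:
                    problems.append(f"rule {index}: extension exceeds {MAX_EXTENSION_LENGTH} characters")
    return problems
```

For instance, `validate_policy([KeywordRule("confidential"), FileRule("invoice", ["pdf", "docx"])])` returns an empty list, because both rules are within the limits.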

Note: If you need a DLP policy or classification label that exceeds the limits above, raise a request through WorkDrive Support with your requirements. Our team will review your request and create a modified policy or classification label based on your needs.

WorkDrive support info: