Kaspersky Security 9.x for SharePoint Server

Quotations from documents

May 17, 2024

ID 112683

Kaspersky Security allows you to check text in files that are uploaded to or stored on SharePoint for the presence of quotations from confidential documents. The Quotations from documents category allows you to specify a list of documents from which quotations must be detected by the application.

To detect quotations, Kaspersky Security uses Digital Fingerprints technology, which allows the application to convert text data into digital fragments.

When monitoring for leaks, the application compares fragments in files being scanned against fragments stored in the category. To detect quotations, the application must recognize Minimum number of matching fragments.

The application stores no original documents (nor any parts of them) in the category. No original documents (nor any of their parts) that have been added to a category can be restored or read on the basis of fragments.

Category settings

The Minimum number of matching fragments setting determines the number of text fragments from documents that have been added to a category, which is sufficient to register a data leak by this category.

The default value of this setting (4 fragments) ensures an optimal functioning of the category when handling most documents.

We recommend that you alter the default value of this setting in the following cases:

  • If scanned documents cause false positives (the application creates incidents when scanning documents that you do not view as containing any quotations from documents that have been added to the category). We recommend that you increase this value when configuring the category.

    False positives may occur if the original document and the one being scanned both contain large portions of unchanged text, which repeats in various documents (for example, common text in headers and footers). In this case, the specified number of matching fragments may be found in such repeated text, which results in a false positive.

  • If no quotations are found in documents being scanned (the application creates no incidents when scanning documents that you view as containing some quotations from documents that have been added to the category). We recommend that you decrease this value when configuring the category.

We recommend that you upload documents of an approximately equal size to a single category. We recommend that you create separate categories for documents if their size differs more than 2-3 times. Otherwise, search for quotations across documents in a category may be far from optimal.

If you cannot find an optimal value of the Minimum number of matching fragments setting, we recommend that you distribute the documents from this category by a few subcategories so that each of them contains documents with an approximately equal number of fragments.

Scenarios of document quoting check

  1. Add a category with quotations from documents and configure it.
  2. Use the category to check quotations using one of the following methods:

Did you find this article helpful?
What can we do better?
Thank you for your feedback! You're helping us improve.
Thank you for your feedback! You're helping us improve.