Kaspersky Security allows you to check text in outgoing and internal email messages for the presence of quotations from confidential documents. The Quotations from documents category allows you to specify a list of documents from which quotations must be detected by the application.
To detect quotations, Kaspersky Security uses Digital Fingerprints technology, which allows the application to convert text data into digital fragments.
When monitoring for leaks, the application compares fragments in email messages being scanned against fragments stored in the category. To detect quotations, the application must recognize Minimum number of matching fragments.
The application stores no original documents (nor any parts of them) in the category. No original documents (nor any of their parts) that have been added to a category can be restored or read on the basis of fragments.
Category settings
The Minimum number of matching fragments setting determines the number of text fragments from documents that have been added to a category, which is sufficient to register a data leak by this category.
The default value of this setting (4 fragments) ensures an optimal functioning of the category when handling most documents.
We recommend that you alter the default value of this setting in the following cases:
False positives may occur if the original document and the one being scanned both contain large portions of unchanged text, which repeats in various documents (for example, common text in headers and footers). In this case, the specified number of matching fragments may be found in such repeated text, which results in a false positive.
We recommend that you upload documents of an approximately equal size to a single category. We recommend that you create separate categories for documents if their size differs more than 2-3 times. Otherwise, search for quotations across documents in a category may be far from optimal.
If you cannot find an optimal value of the Minimum number of matching fragments setting, we recommend that you distribute the documents from this category by a few subcategories so that each of them contains documents with an approximately equal number of fragments.
Scenarios of document quoting check
The application will check documents sent by email for quotations from the category.