View the Configuration for an Import

Imports > Selected Data Set or Load File > View Configuration

Requires Imports - View Permissions

After you add a Data Set (or a Load File), you can view information about the configuration that was applied during indexing.

View Configuration provides a read-only version of the following:

  • Index Settings in effect at the time of import.
  • With Pattern Detection enabled, the Patterns in effect and enabled at the time of import.

Note that View Configuration does not provide information about the following:

  • the configured Matter Name and Matter Number, which are displayed only in the Project Index Settings.
  • the Batch name/value and the Legal Discovery options selected for the Data Set, which appear in the Data Set Scan History.
  • the OCR queries in effect at the time of import. See Manage Project Index Settings for descriptions of the default OCR queries.
  • any configured Custom Warnings and Errors, which appear on the Data Set Reports tab after import, populated when there is data to display.
  • information that applies only to a Load File import.

Index Settings

This section reports the Index Settings that were in effect, as follows:

  • Parsing Settings (initial import, or reprocessing of the entire Data Set based on an ALLDOCS search of the Data Set):
    • Parse Currency
    • Parse Numeric Quantities
    • Parse Numeric Terms
    • Detect Languages
    • Detect Viruses
    • Include Entire Email Header
    • Extract Embedded Images
    • Prioritize MAPI Fields over Transport Header Metadata
    • Split Bloomberg Chat
    • Split Journaled Emails
    • Message Archive Processing Timeout: <value> minutes
  • Advanced Analytic Operations: Stop Words setting (initial import only, not subject to reprocessing)
  • Copy to Document Storage Settings (initial import only, not subject to reprocessing)
  • Other Settings: Custodian Directory and Media Directory Settings (initial import only, not subject to reprocessing)
  • Automatic OCR Settings (initial import of a Data Set imported as of 5.1, or reprocessing of the entire Data Set based on an ALLDOCS search of the Data Set)
  • Excluded File Extensions for Extraction (initial import of a Data Set, or reprocessing of the entire Data Set based on an ALLDOCS search of the Data Set)

Note: For Data Sets imported prior to 5.1, the information for some settings will not be available. For example, Detect Viruses, Split Bloomberg Chat, and Split Journaled Emails display an icon with a tooltip when no information is available, and the Automatic OCR Settings section displays No information available.

See Manage Project Index Settings for descriptions of the Index Settings.

Pattern Detection and Enabled Patterns

This section displays the current list of Patterns enabled through one of the following:

  • initial import of the Data Set
  • reprocessing of the entire Data Set (a Private Data Set) based on an ALLDOCS search of the Data Set
  • running Update Patterns for the entire Data Set (in the originating Project, if this is a Shared Data Set)

Note: The Pattern Detection setting is enabled by default. If Pattern Detection has been explicitly disabled for the import, you will see Pattern Detection Disabled. If you want to enable Pattern Detection once it has been explicitly disabled for import, use Update Patterns. This will ensure that Pattern Detection is reported as enabled, and that the current list of enabled Patterns is applied.

As noted in the message shown for the table, current values are displayed for all enabled Patterns. Any enabled Pattern whose regular expression has been modified since import, reprocessing, or an Update Patterns operation is highlighted in pale yellow in its table row. The highlighting can help you decide whether to take additional action and update the Patterns for your Data Set. Note that only the originating Project for a Shared Data Set can update the Patterns for that Shared Data Set. Any sub-project using the Shared Data Set can view the most recent list of Patterns for that Data Set in this section.
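The highlighting rule described above can be pictured as a comparison between the regular expressions last applied to the Data Set and the current Project Patterns. The following Python sketch is illustrative only and does not represent how the product implements the check. The function name and the dictionary layout (Pattern name mapped to regular expression text) are assumptions made for the example. Patterns enabled after the last operation fall outside what the text describes, so the sketch leaves them unflagged.

```python
from typing import Dict, List


def modified_patterns(applied: Dict[str, str], current: Dict[str, str]) -> List[str]:
    """Return the names of Patterns whose regular expression has changed.

    applied -- hypothetical snapshot: Pattern name -> regex text as of the last
               import, reprocessing, or Update Patterns operation
    current -- hypothetical snapshot: Pattern name -> regex text as it is now
    """
    return sorted(
        name
        for name, regex in current.items()
        # Only Patterns present in both snapshots can count as "modified".
        if name in applied and applied[name] != regex
    )


if __name__ == "__main__":
    # Made-up Pattern names and expressions, for illustration only.
    applied = {"caseid": r"CASE-\d{4}", "invoice": r"INV\d+"}
    current = {"caseid": r"CASE-\d{6}", "invoice": r"INV\d+", "ticket": r"TKT\d+"}
    print(modified_patterns(applied, current))
```

A non-empty result corresponds to the case where one or more rows would be highlighted, which is the cue to consider running Update Patterns.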

Note that the email, unc, and uri System Patterns are enabled by default.

The list of Patterns for View Configuration includes the following information:

  • Name – The Pattern name.
  • Searchable Token Name (optional column, not shown by default) – For data processed prior to 4.3.11.0, displays how to search for the token using the 'token-<token_name>' format. This format requires you to enclose the search in single quotes and to specify the Token name in lowercase, because the software normalizes Token names to lowercase. Example: 'token-email'. For data newly processed, updated, or reprocessed as of Release 4.3.11.0, do not use this format; instead, search for an enabled Pattern using the pattern metadata field and the Pattern name (pattern::<pattern_name>, such as pattern::email).
  • Description – A description of the Pattern, if applicable.
  • Store Value – Indicates whether this Pattern stores individual Pattern values in the patternvalue field. When enabled, as indicated by a check mark, you can search for an enabled Pattern as well as for individual Pattern values. For data processed prior to 4.3.11.0, both the tokens and the individual values are added to the system dictionary and are available for search and clustering operations. For documents processed as of 4.3.11.0, you can search for an enabled Pattern using the pattern field and for a Pattern value using the patternvalue field. When values are not stored for an enabled Pattern, documents processed prior to 4.3.11.0 have the token applied, so you can identify the documents that contain matching data but cannot search for the specific values that triggered the match. For documents processed as of 4.3.11.0, you can search for an enabled Pattern using the pattern metadata field but cannot search for specific Pattern values using the patternvalue field.
  • Pattern – The contents of the Pattern.
  • Created By – The login name of the user who created the Pattern. System indicates a System Pattern.
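
The two search formats described for Searchable Token Name depend on the release that processed the data. The following Python sketch shows one way to choose between them. It is not part of the product: the function names are hypothetical, and it assumes that release numbers are dotted numeric strings such as 4.3.11.0. Only the 'token-<token_name>' and pattern::<pattern_name> formats come from the description above.

```python
from typing import Tuple


def parse_release(release: str) -> Tuple[int, ...]:
    """Convert a dotted release string into a tuple that compares numerically.

    Short strings are padded with zeros so that '4.3.11' and '4.3.11.0'
    compare as equal.
    """
    parts = tuple(int(part) for part in release.split("."))
    return parts + (0,) * (4 - len(parts))


# Release as of which the pattern metadata field is used (from the text above).
PATTERN_FIELD_RELEASE = parse_release("4.3.11.0")


def pattern_search_query(pattern_name: str, processed_release: str) -> str:
    """Build the search string for an enabled Pattern (hypothetical helper)."""
    if parse_release(processed_release) < PATTERN_FIELD_RELEASE:
        # Older data: single-quoted token format with a lowercase Token name.
        return f"'token-{pattern_name.lower()}'"
    # Data processed, updated, or reprocessed as of 4.3.11.0:
    # pattern metadata field plus the Pattern name, passed through as given.
    return f"pattern::{pattern_name}"


if __name__ == "__main__":
    print(pattern_search_query("email", "4.3.10.0"))
    print(pattern_search_query("email", "4.3.11.0"))
```

The sketch covers only searches for an enabled Pattern. Searches for individual Pattern values also depend on the Store Value setting and are not modeled here.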

Note: If you disable Pattern Detection at import, you can later enable it to observe the current Project Patterns by using the Update Patterns option for the Data Set.

See Manage Project Patterns (Regular Expressions) for more information about Project Patterns.