Data Mapping

Quite often technology staff and legal teams do not have a clear picture of the data contained on corporate servers beyond gross volumes and which staff or teams have access to the environment. Technicians from Global EDD Group are able to analyze custodian computers and network servers while they are online, capturing key data points with minimal impact on business operations.

This analysis enables Global EDD Group to compose detailed statistics and reports that provide valuable knowledge regarding the contents of each computer or server. In turn, this knowledge enables legal teams to prioritize specific data points, identify potential issues and exclude specific data types very early in the discovery process. This service is also quite useful for large volumes of unstructured data stored offline on hard drives and NAS devices.

The following outlines the statistics, reports and charts/graphs that Global EDD Group is able to generate during the Data Mapping process:


Statistics

  • Number of Directories
  • Name of Each Directory
  • Size of Each Directory
  • Last Access Date
  • Directory Owner
  • Path Length
  • Number of Files
  • Largest Files
  • Age of Files
  • File Extensions
  • File Extension Type Groupings
  • MD5 Checksums
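
As a rough illustration of how statistics like these can be gathered, the following Python sketch walks a directory tree and tallies directories, files, total size, file extensions and MD5 checksums. It is not Global EDD Group's actual tooling; the function names `md5_of_file` and `map_directory` and the shape of the returned dictionary are assumptions made for this example.

```python
import hashlib
import os
from collections import Counter


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 checksum of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def map_directory(root):
    """Walk `root` and collect simple statistics (hypothetical report shape)."""
    stats = {
        "directories": 0,         # includes `root` itself
        "files": 0,
        "total_bytes": 0,
        "extensions": Counter(),  # lower-cased extension -> file count
        "checksums": {},          # file path -> MD5 hex digest
    }
    for current_dir, _subdirs, filenames in os.walk(root):
        stats["directories"] += 1
        for name in filenames:
            path = os.path.join(current_dir, name)
            stats["files"] += 1
            stats["total_bytes"] += os.path.getsize(path)
            ext = os.path.splitext(name)[1].lower() or "(none)"
            stats["extensions"][ext] += 1
            stats["checksums"][path] = md5_of_file(path)
    return stats
```

Calling `map_directory` on a share's root path returns one dictionary per tree. Identical checksums across different paths point to exact duplicate files.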


Reports

  • Dynamic Directory Structure (XML)
  • Detailed Content Summary (Database, XML, Spreadsheet, Text File, HTML)
  • File Extension Summary (Spreadsheet, Text)
  • User Summary (Spreadsheet, Text)
  • Largest Files (Spreadsheet, Text)

Charts / Graphs

  • Contents
  • Size
  • File Extension
  • Distribution
  • User
  • Age


Key Term Analysis

Often legal teams are faced with terabytes of client data and little knowledge of what information may be contained inside the millions of files. Whether it be at the point of collection, a production from opposing counsel or a shipment from a trusted vendor, it can be a daunting task to garner enough knowledge about the data set to make intelligent decisions about how to proceed.

Global EDD Group provides Key Term Analysis services to clients seeking to refine their knowledge about data collections of all sizes and contents. Our technicians will research data at your direction or provide an environment for you to undertake iterative searching of key terms across emails, spreadsheets, documents, presentations, music, videos and other similar unstructured data.

Key Term Analysis Features:

  • Boolean, Proximity and Keyword search types
  • Search across collection or within file types
  • Inline File Preview of native files
  • View File Properties
  • Save Search functionality
  • Export Search Results
  • Copy files to ZIP archive or staging directory
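
To make the proximity search type listed above concrete, here is a minimal Python sketch that checks whether two terms occur within a given number of words of each other. The tokenizer and the `within_proximity` function are assumptions for illustration only and do not describe the search engine used in the service.

```python
import re


def tokenize(text):
    """Split text into lower-cased word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def within_proximity(text, term_a, term_b, max_distance):
    """Return True if term_a and term_b appear within max_distance words."""
    tokens = tokenize(text)
    positions_a = [i for i, tok in enumerate(tokens) if tok == term_a.lower()]
    positions_b = [i for i, tok in enumerate(tokens) if tok == term_b.lower()]
    return any(
        abs(a - b) <= max_distance
        for a in positions_a
        for b in positions_b
    )
```

For example, in the sentence "The contract was signed before the merger closed", the terms "contract" and "merger" are five words apart, so a proximity of 5 matches and a proximity of 3 does not.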


Advanced Text Analytics

Modern organizations have access to abundant sources of data that contain jewels of useful knowledge. Emails, web pages, memos, call center transcripts, survey responses, claims notes, legal cases, patent descriptions, research articles, and incident reports – all hold valuable pieces of knowledge that are hard to discover because they are hidden within large volumes of raw data or structured datasets not typically indexed for key term searching. Overwhelmed by the glut and variety of data, law firms and corporations are seeking new methods to analyze large volumes of structured and unstructured data to not only discover key knowledge, but to create case or review strategy as well.

Automated Knowledge Discovery & E-Discovery Text Analytics

Automated Knowledge Discovery ("AKD") is our service that derives knowledge from text and structured data, including XML files, databases, spreadsheets, web sites, documents, and emails using Natural Language Processing, Machine Learning, Interactive Visualization and Report Generation. AKD enables you to gain valuable insight into large diverse data sets by leveraging intelligent data mining and analysis tools to undertake a range of knowledge discovery tasks, including:

Data Source Integration

  • XML Files
  • Spreadsheets
  • Email / PST Files / Exchange Datastores
  • File Shares
  • Databases
  • Delimited Files
  • RSS Feeds
  • Web Sites

Distinct Text Identification

Compares records to determine whether they are duplicates. The comparison is fuzzy in the sense that two values do not have to be 100% identical in content and form to be treated as a match. This is similar to "near-duplicate identification".
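
One simple way to picture fuzzy comparison is a similarity ratio with a threshold. The Python sketch below uses the standard library's `difflib.SequenceMatcher`; the normalization step, the 0.9 threshold and the function names are assumptions chosen for this example, not a description of the AKD implementation.

```python
from difflib import SequenceMatcher


def normalize(text):
    """Lower-case the text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def similarity(a, b):
    """Return a similarity ratio between 0.0 and 1.0 for two strings."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def is_near_duplicate(a, b, threshold=0.9):
    """Treat two records as duplicates when similarity meets the threshold."""
    return similarity(a, b) >= threshold
```

With this approach, two records that differ only in case, spacing or a trailing period still count as duplicates, while unrelated records fall well below the threshold.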

Key Term Linking

Generates a visualization of the associations between keywords so that the relationships between words in a set of documents or a column of text values are clearly represented.

Text Clustering

Utilizes phrases to find and identify groups of similar documents or records based on the contents of their text.

Entity Extraction

  • Dates
  • Human names
  • Organizations
  • Addresses
  • Phone Numbers
  • Monetary Amounts
  • Geographic Names

Keyword Extraction

Determines the important concepts discussed in a text by identifying the most frequent keywords, with the ability to drill down on each keyword to see how it is being used.
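
A frequency-based approach of this kind can be sketched in a few lines of Python. The stop-word list, the function names and the snippet width below are assumptions for illustration; a production system would use a far larger stop-word list and more careful tokenization.

```python
import re
from collections import Counter

# Small illustrative stop-word list (an assumption, not a standard list).
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "was", "for", "on"}


def top_keywords(text, limit=5):
    """Return the most frequent non-stop-words with their counts."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(word for word in words if word not in STOP_WORDS)
    return counts.most_common(limit)


def keyword_in_context(text, keyword, width=30):
    """Return a snippet of surrounding text for each keyword occurrence."""
    snippets = []
    for match in re.finditer(re.escape(keyword), text, flags=re.IGNORECASE):
        start = max(match.start() - width, 0)
        end = min(match.end() + width, len(text))
        snippets.append(text[start:end])
    return snippets
```

Here `top_keywords` covers the identification step and `keyword_in_context` covers the drill-down, returning each mention with a little surrounding text.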

Phrase Extraction

Statistical determination of phrases by examining the co-occurrences of consecutive words within the text. Stated more simply, if two words occur next to each other repeatedly in several sentences across several documents, it can be statistically assumed that these words constitute a phrase.
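
The idea can be sketched in Python by counting adjacent word pairs within sentences and keeping the pairs that recur. This is a deliberately simplified version: it uses raw counts with a hypothetical `min_count` cutoff, whereas real phrase extraction typically applies a statistical association measure and filters out common function words.

```python
import re
from collections import Counter


def candidate_phrases(documents, min_count=3):
    """Return (phrase, count) pairs for adjacent word pairs seen repeatedly."""
    counts = Counter()
    for doc in documents:
        # Split on sentence punctuation so pairs never span two sentences.
        for sentence in re.split(r"[.!?]+", doc):
            words = re.findall(r"[a-z']+", sentence.lower())
            counts.update(zip(words, words[1:]))
    return [
        (" ".join(pair), n)
        for pair, n in counts.most_common()
        if n >= min_count
    ]
```

Given the documents "The hard drive failed. Replace the hard drive." and "A hard drive was imaged.", the pair "hard drive" occurs three times and is the only candidate that reaches the default cutoff.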

Link Analysis

Displays correlations and clusters of terms extracted from a set of documents or text values with the ability to drill down to review the specific relationships.

Complex Searching

Searches by phrase, by proximity within a sentence or paragraph, and by synonyms and alternate term forms. Results are presented as a ranked list of matching records. Users can search within the results of previous searches, refine a query by providing positive and negative document-based relevance feedback, and navigate documents easily through highlighted occurrences of the search terms.

Our Automated Knowledge Discovery Services provide valuable insight into any collection of discovery data, far beyond the key term browsing typically associated with litigation support systems, many of which simply do not have the ability to process structured data such as SQL databases or "as found" versions of email stores. Additional benefits of AKD Services include:

  • Mobile platform deployable worldwide
  • Fee structure based on time, not volume
  • No long term commitments, user fees or storage fees
  • Complements, rather than replaces, current review workflows and systems
  • Ideal platform for data sampling

