Ad Litem Consulting, Inc.

3.12 OCR - Technical Standards


The vendor should enable auto-rotate and voting when generating OCR. Most OCR software offers an auto-rotate option. When it is enabled, the software OCRs each image four times, rotating the image 90 degrees each time, determines which orientation gives the best result, and publishes that text to the load file. Most documents share the same orientation, portrait, and yield good results even without auto-rotate. Others are designed for a landscape layout, such as an HR chart. Still others may have been scanned upside-down, which results in garbage OCR unless auto-rotate is used.

OCR voting is a process in which the results of multiple OCR programs are compared to determine the best result.
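The two ideas above can be sketched in a few lines. This is only an illustration under stated assumptions, not how any particular OCR product works: it assumes each OCR pass returns a text plus a confidence score between 0 and 1, and that the engines being voted on return the same number of words. The function names `pick_best_rotation` and `vote` are hypothetical.

```python
from collections import Counter


def pick_best_rotation(results):
    """Return (angle, text) for the rotation with the highest confidence.

    `results` maps a rotation in degrees (0, 90, 180, 270) to a
    (text, confidence) pair. The confidence score is an assumption;
    real OCR engines expose quality measures in different ways.
    """
    best_angle = max(results, key=lambda angle: results[angle][1])
    return best_angle, results[best_angle][0]


def vote(outputs):
    """Majority vote, word by word, across several engines' outputs.

    Simplification: words are compared by position, so the engines are
    assumed to agree on the word count. Extra trailing words are dropped.
    """
    token_lists = [text.split() for text in outputs]
    voted = []
    for candidates in zip(*token_lists):
        word, _count = Counter(candidates).most_common(1)[0]
        voted.append(word)
    return " ".join(voted)
```

A production voting engine would align words that differ in number or position before comparing them. The positional comparison here is only meant to show the principle.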

Quality Check
The OCR text should approximate the formatting of the original image as closely as possible. The OCR field should never be just the words in one long string.

No text at the top, bottom, or either side should be clipped.
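A quick automated check can flag OCR that has lost its formatting and collapsed into one long string. The sketch below is a heuristic only. The function name and the 200-character threshold are assumptions chosen for illustration and are not part of this standard. Clipped text cannot be detected this way and still needs a visual comparison against the image.

```python
def looks_like_one_long_string(text, max_line_length=200):
    """Return True if any line is implausibly long for a printed page.

    The threshold is an assumed value; tune it to the documents at hand.
    """
    return any(len(line) > max_line_length for line in text.splitlines())
```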

Multi-Page Text Files
There should be exactly one OCR text file per document. The OCR filename must match the document's image key. For example, a 10-page document with the image key AA001 should have a corresponding file AA001.TXT that contains the OCR for pages AA001 through AA010.

Each page of OCR should begin with a line identifying its page number or Bates number. This way, anyone can search for any Bates number and find the correct document. Please include a blank line between the page marker and the OCR text.

The following shows sample OCR:

<< AA001 >>

Text for first page

<< AA002 >>

Text for second page
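The sample layout above could be produced and checked with something like the following. It is a minimal sketch that assumes the marker format is exactly `<< BATES >>` on its own line, as in the sample. The names `build_ocr_text` and `page_markers` are hypothetical.

```python
import re

# Matches a page marker line such as "<< AA001 >>".
PAGE_MARKER = re.compile(r"^<< (\S+) >>$", re.MULTILINE)


def build_ocr_text(pages):
    """Join per-page OCR into one document-level text.

    `pages` is a list of (bates_number, page_text) pairs in page order.
    Each page gets a marker line, a blank line, then its text.
    """
    blocks = [f"<< {bates} >>\n\n{text.strip()}\n" for bates, text in pages]
    return "\n".join(blocks)


def page_markers(ocr_text):
    """Return the Bates numbers found in the page marker lines."""
    return PAGE_MARKER.findall(ocr_text)
```

Comparing the list returned by `page_markers` against the expected Bates range is a simple way to confirm that no page of a document is missing from its text file.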

The following chart shows a sample database and corresponding OCR files:

IMAGE KEY     BEGBATES      ENDBATES   PATH                     FILENAME
AA001         AA001         AA010      D:\[VOLUME NAME]\OCR\    AA001.TXT
AA011         AA011         AA011      D:\[VOLUME NAME]\OCR\    AA011.TXT
AA012         AA012         AA038      D:\[VOLUME NAME]\OCR\    AA012.TXT
AA039.0001*   AA039.0001    AA0100     D:\[VOLUME NAME]\OCR\    AA039.0001.TXT

* Please refer to Bates prefix and suffix conventions.
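The rule that the OCR filename must match the image key can be verified against the database before delivery. The sketch below assumes each database row is available as a dictionary keyed by the column names shown in the chart. The function name is hypothetical, and the comparison ignores case on the assumption that the files live on a Windows volume.

```python
def mismatched_ocr_filenames(rows):
    """Return the image keys whose FILENAME is not '<IMAGE KEY>.TXT'."""
    bad = []
    for row in rows:
        expected = f'{row["IMAGE KEY"]}.TXT'
        if row["FILENAME"].upper() != expected.upper():
            bad.append(row["IMAGE KEY"])
    return bad
```

An empty list means every row follows the naming rule. Any keys returned point to rows to correct before the test load.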

©2006 Ad Litem Consulting, Inc. - Litigation Support Services