
OCR SDK, Optical Text Recognition, and OCR for Developers

OmniPage Capture SDK

 

Established as the core technology behind all Nuance imaging products, OmniPage Capture SDK is widely recognized as the #1 imaging and OCR SDK toolkit on the market today.

To learn more about the benefits of OCR integration and OCR for your developers, request your free OmniPage Capture SDK 19 evaluation today!

OmniPage Capture SDK gives you everything you need to add robust imaging, OCR, and PDF capabilities to your most critical applications, including optical text recognition, intelligent character recognition (ICR), zonal recognition, and more.

Used by commercial vendors who are serious about high OCR accuracy and quality document imaging in their applications, the OmniPage Capture SDK provides scalable recognition, extraordinary PDF support, and a simple API that lets you create high-value, competitive products. 

 

OCR for Developers: Benefits of OmniPage

  • The world’s most accurate solution increases productivity, lowers costs, and maximizes ROI
  • Delivers everything you need for scanning, OCR, ICR, OMR, barcode, PDF, and document conversion
  • Enables developers to provide higher value to customers with new and enhanced functionality
  • Provides an easy-to-use API to shorten development cycles and accelerate time to market
  • Helps your organization differentiate itself from the competition with the most advanced scanning, OCR, and PDF technologies

 

What’s New in Version 19?


The OmniPage Capture Software Development Kit (SDK) has always been the gold standard for developers wanting to add sophisticated optical character recognition (OCR), imaging, and PDF creation and conversion capabilities to their own applications quickly and easily. 

And with the release of the OmniPage Capture SDK 19, the best just got even better. 

 

Recognition: Forms Processing Made Easy

  • Major enhancements to form-processing technologies, including the new Form Template Editor
  • New template matching and data-extraction functions accelerate development efforts
  • A newly integrated Thai OCR engine
  • Significant Asian accuracy improvements*:

- Character accuracy increased by up to 40%
- Layout accuracy increased by 45% for all Asian languages

  • Western language layout retention and document-formatting improvements: Significant enhancements to vertical text recognition
  • Support for new barcodes: Code11, Italian Post 25, MSI, Bookland, ITF 14, EAN 14, SSCC18, Databar Limited, Databar Expanded  

 

Image Processing: Convert All the Data -- Even What You Can Hardly See

  • Camera and smartphone image-handling improvements

- Obtaining camera flag from EXIF information
- Automatic resolution (DPI) estimation
- Shading correction
- New binarization method
- Modified workflow for camera images for better OCR

  • JBIG2 and MRC compression improvements
  • Support for 32-bit bitmap input images
  • More robust PDF input

 

Output: Work with the Format You Prefer

  • Support for PDF/A-1a, PDF/A-2a, PDF/A-3a, PDF/A-3b, and PDF/A-3u (in addition to existing PDF/A-2b and PDF/A-2u support) to improve compliance with government and industry standards
  • Support for Google Docs and Apple Pages
  • PDF file splitter to split OCR output files by maximum file size or page count
  • Support for ePub output format to enhance the reading experience on eBook readers
  • Support for WIA2 to make your application more user friendly on Windows
  • Support for MP3 audio output format with natural-sounding speech
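
The splitting rule behind a file splitter of this kind can be sketched in a few lines of Python. This is only an illustration of splitting by page count or by size; the function names `split_by_page_count` and `split_by_size` are hypothetical and are not part of the OmniPage Capture SDK API.

```python
from typing import List, Sequence


def split_by_page_count(pages: Sequence[str], max_pages: int) -> List[List[str]]:
    """Group pages into consecutive parts of at most max_pages pages each."""
    if max_pages < 1:
        raise ValueError("max_pages must be at least 1")
    return [list(pages[i:i + max_pages]) for i in range(0, len(pages), max_pages)]


def split_by_size(page_sizes: Sequence[int], max_bytes: int) -> List[List[int]]:
    """Group page indices so that each part's total size stays within max_bytes.

    A single page larger than max_bytes is placed in a part of its own.
    """
    parts: List[List[int]] = []
    current: List[int] = []
    current_size = 0
    for index, size in enumerate(page_sizes):
        # Start a new part when adding this page would exceed the limit.
        if current and current_size + size > max_bytes:
            parts.append(current)
            current, current_size = [], 0
        current.append(index)
        current_size += size
    if current:
        parts.append(current)
    return parts


print(split_by_page_count(["p1", "p2", "p3", "p4", "p5"], 2))
print(split_by_size([400, 500, 300, 900, 100], 1000))
```

The size-based variant is greedy: it never reorders pages, so the output parts keep the original page sequence.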

 

Improved Developer Experience: Faster, Easier, and More Intuitive

  • Many powerful user and developer experience enhancements:

- An intuitive and easy-to-use Distribution Wizard
- More convenient and reliable license control
- Streamlined installation
- Consolidated and clearer documentation
- More convenient and productive API and settings
- Updated and improved licensing document
- Stability improvements
 

  • Native 64-bit binaries*
  • Updated system requirements:

- Supports the latest OS and environments

 

* Also available in OmniPage Capture SDK v18.5 and higher.

Features

Key Features for the OmniPage Capture SDK 19

The OmniPage Capture SDK offers a robust feature set to support all your imaging needs. You get the power and accuracy of OmniPage - the most popular OCR program in the world - integrated into your applications, along with top-of-the-line OCR engines and extensive PDF capabilities.

The strength of OmniPage Capture SDK extends beyond unrivaled accuracy, with additional features to streamline application development and provide added value to your product.

The most accurate and robust OCR available
OmniPage provides a scalable voting interface and significant throughput management capabilities. Combined with highly accurate machine-print OCR (OCR, OCR-A, OCR-B and MICR), handprint (ICR), checkmark (OMR) and barcode (1D and 2D) recognition engines, the OmniPage Capture SDK delivers unmatched flexibility and accuracy.
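
To give a feel for what voting between OCR engines means, here is a minimal Python sketch. It is not the SDK's voting algorithm, which this page does not describe; it is a plain character-level majority vote, and it assumes the engine outputs have already been aligned to the same length. The function name `vote` is hypothetical.

```python
from collections import Counter
from typing import Sequence


def vote(candidates: Sequence[str]) -> str:
    """Character-level majority vote across equal-length engine outputs.

    Ties are resolved in favor of the earliest engine in the list.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    length = len(candidates[0])
    if any(len(text) != length for text in candidates):
        raise ValueError("this sketch assumes pre-aligned, equal-length outputs")
    result = []
    for column in zip(*candidates):
        counts = Counter(column)
        best = max(counts.values())
        # Pick the first character, in engine order, that reaches the top count.
        result.append(next(ch for ch in column if counts[ch] == best))
    return "".join(result)


# Three hypothetical engine outputs, each with a different single-character error.
print(vote(["lnvoice 2019", "Invoice 2O19", "Invoice 2019"]))  # prints "Invoice 2019"
```

Each engine makes a different mistake here, so the majority in every character position recovers the correct text. A production voter would also need to align outputs of different lengths and weigh engine confidence.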

Asian OCR support
The OmniPage Capture SDK Asian OCR module supports Simplified and Traditional Chinese, Japanese, and Korean. It can be used either as a standalone module or with the Western language kit.

Support for the .NET managed environment
OmniPage Capture SDK 19 supports .NET. C# and VB.NET sample recognition programs and sample viewers are included.

Multi-core and multi-thread processor support
Better multi-threading and parallel processing on multi-page documents in the OmniPage Capture SDK let you exploit the full potential of your processing environment. In multi-page mode, OmniPage Capture SDK 19 runs faster than previous versions on a quad-core machine.
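
The general pattern of processing the pages of a document in parallel can be sketched as follows. This is an illustration in Python using the standard library only; `recognize_pages` and the placeholder recognizer are hypothetical and do not represent the SDK's own threading model.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence


def recognize_pages(
    pages: Sequence[str],
    recognize: Callable[[str], str],
    workers: int = 4,
) -> List[str]:
    """Run a per-page recognizer on a pool of worker threads.

    Results come back in the same order as the input pages.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(recognize, pages))


# Placeholder recognizer: a real one would call into an OCR engine.
print(recognize_pages(["page one", "page two", "page three"], str.upper))
```

In CPython, threads speed up CPU-bound work only when the per-page call releases the interpreter lock, as calls into native libraries often do; otherwise a process pool is the usual substitute.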

Pre-made user interfaces
The OmniPage Capture SDK’s Professional Visual Toolbox gives you pre-made interfaces for creating and executing workflows, controlling scanning devices, and document processing. It includes visual controls for advanced OCR and image enhancement tools. Use this module to create OmniPage-compatible workflows and monitor their execution.

Workflow development and execution
You can easily create complex image processing and OCR tasks and manage all parameters and settings. Then, adding OCR to your application can be just one workflow execution call. Workflow features also help balance the load on dual-core and hyper-threaded systems to boost performance.

Logical Form Recognition technology and Form Template Editor
Our advanced logical form recognition (LFR) automates form template creation and streamlines form processing, providing significant savings in development time. The standalone Form Template Editor helps both developers and end users to easily create, modify, test, and manage form templates.

Throughput management
Updated throughput capabilities provide significant advantages over other SDKs, allowing you to deploy optimal document imaging throughput for your application.

Integrated PDF toolkit
Extensive PDF capabilities, including unique PDF overlay matching that achieves near-100% accuracy in PDF conversion, allow you to significantly reduce development costs and time-to-market. The OmniPage Capture SDK also supports output to the PDF/archive (PDF/A) format and generates multi-raster-content PDFs optimized for file size and quality.

Format support
The OmniPage Capture SDK supports a wide range of image and application formats, including BMP, GIF, TIF, PDF, HTML, Microsoft Office formats, XML, ePub and more. These provide significant advantages over other SDKs, allowing you to achieve optimal image throughput for your applications.

Text-to-speech (TTS)
The OmniPage Capture SDK is also the only OCR SDK that includes text-to-speech technologies. Your applications can turn paper and digital documents into human-sounding audio files. Not only is this an important way to provide document support for disabled users, it allows everyone to save documents to files that can be played on personal computers and mobile devices, including Apple iPod.

These advanced features, along with breakthrough PDF capabilities that achieve up to 100% word accuracy in converting text-based PDF documents, enable you to significantly reduce the cost of development and time-to-market. That’s why the OmniPage Capture SDK is the most powerful and complete document-imaging SDK in the world.

Tech Specs

The OmniPage Capture SDK can be accessed through a C/C++ API or ActiveX interface. Support for application development on Windows XP SP3 and above enables you to easily create applications with a wide variety of recognition technologies using a single set of developer tools.

 

Developer System Requirements

  • Windows XP SP3, Vista SP2 x86-x64, Windows 7 SP1 x86-x64, and Windows 8 x86-x64
  • Windows Server 2003 x86-x64, Windows Server 2008 x86-x64/R2, Windows Server 2012 x86-x64/R2
  • Intel and AMD 32-bit and 64-bit CPUs
  • Microsoft Visual C/C++ version .NET 2003/7.1, .NET 2005/8.0, 2008, 2010, Visual Studio 2012
  • Microsoft Visual Basic .NET   

 

Runtime System Requirements

  • Windows XP SP3, Vista SP2 x86-x64, Windows 7 SP1 x86-x64, Windows 8 x86-x64
  • Windows Server 2003 x86-x64, Windows Server 2008 x86-x64/R2, Windows Server 2012 x86-x64/R2
  • Intel and AMD 32-bit and 64-bit CPUs

 

Architecture

Product Architecture

The OmniPage Capture SDK architecture accommodates multiple image-processing technologies through four main subsystems:

  • An image input subsystem for scanning or importing images.
  • An image preprocessing subsystem for improving image quality prior to recognition.
  • A recognition subsystem that provides multiple recognition technologies for image processing.
  • An export subsystem to format the output from multiple recognition modules into a common format for conversion to popular word processing formats or text.
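
The four-subsystem flow described above (input, preprocessing, recognition, export) can be pictured as a simple pipeline. The Python sketch below is an illustration only: the `Pipeline` class, its field names, and the toy stages are hypothetical and are not taken from the SDK's C/C++ or ActiveX interfaces.

```python
from dataclasses import dataclass
from typing import Callable, List

Bitmap = List[List[int]]  # 1 = ink, 0 = background


@dataclass
class Pipeline:
    load: Callable[[str], Bitmap]                 # image input
    preprocess: List[Callable[[Bitmap], Bitmap]]  # image preprocessing steps
    recognize: Callable[[Bitmap], str]            # recognition
    export: Callable[[str], str]                  # output formatting

    def run(self, source: str) -> str:
        """Pass one image through all four stages in order."""
        image = self.load(source)
        for step in self.preprocess:
            image = step(image)
        return self.export(self.recognize(image))


# Toy stages standing in for real subsystems.
demo = Pipeline(
    load=lambda source: [[0, 1], [1, 0]],
    preprocess=[lambda img: [[1 - p for p in row] for row in img]],  # invert
    recognize=lambda img: f"{sum(map(sum, img))} ink pixels",
    export=lambda text: text + "\n",
)
print(demo.run("scan.tif"), end="")  # prints "2 ink pixels"
```

Keeping each stage behind a plain callable mirrors the idea that any subsystem can be swapped without touching the others.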

 

Interfaces
Two programming interfaces are available with the OmniPage Capture SDK:

C/C++ API
The C/C++ API provides control over image input, image preprocessing, recognition, and output and supports image processing on a page basis.

ActiveX
An ActiveX interface is provided for Visual C++ programmers. This interface includes all functionality of the C interface and offers document-processing capabilities so you can create solutions that manage documents more efficiently. This interface also expands the support of modern development environments, including managed environments like VB.NET or C#.

Professional Visual Toolbox
In conjunction with the ActiveX interface, a set of controls, collectively called the Professional Visual Toolbox, is available as an add-on module. Pre-made controls allow you to reduce development time and speed time-to-market by allowing plug-in interfaces for your application.


Pre-made Controls

  • Image viewing
  • Zone content validation
  • Image thumbnail viewing
  • Text verification and editing
  • Display statistical information and a draft of the document
  • Provide details and progress about the workflow being executed on the system
  • Create OmniPage-compatible workflows
  • Access and change output converter settings
  • Display and edit form fields and attributes

 

Image Input
The image input subsystem provides TWAIN, WIA, and ISIS scanner and image-conversion interfaces. Both color and grayscale images can be handled by the OmniPage Capture SDK, and you can send images from memory to the preprocessing and recognition processes. Input conversion is available for TIFF, TIFF/JPEG, TIFF-FX, PCX, DCX, BMP, ADF, JPEG, PNG, PaperPort MAX and PDF image formats.

Image Pre-Processing
Image correction and pre-processing greatly enhance image quality and recognition accuracy. These capabilities include:

  • Rotate (90, 180, 270 degrees)
  • De-skew (auto and programmed)
  • Invert (auto and programmed)
  • De-speckle
  • Resolution enhancement

An interface for integrating additional image pre-processing technologies is also available and extends the system's functionality by permitting customization of the system's image processing capabilities.
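
Three of the operations listed above (rotation in 90-degree steps, inversion, and de-speckling) are simple enough to sketch on a tiny in-memory bitmap. The Python code below is a generic illustration, not the SDK's implementation; the function names are hypothetical, the image is assumed to be a non-empty list of equal-length rows, and the de-speckle rule (clear any ink pixel with no ink neighbor) is deliberately simplistic.

```python
from typing import List

Bitmap = List[List[int]]  # 1 = ink, 0 = background


def rotate(image: Bitmap, degrees: int) -> Bitmap:
    """Rotate clockwise by 90, 180, or 270 degrees."""
    if degrees not in (90, 180, 270):
        raise ValueError("degrees must be 90, 180, or 270")
    for _ in range(degrees // 90):
        # One clockwise quarter turn: reverse the rows, then transpose.
        image = [list(row) for row in zip(*image[::-1])]
    return image


def invert(image: Bitmap) -> Bitmap:
    """Swap ink and background."""
    return [[1 - pixel for pixel in row] for row in image]


def despeckle(image: Bitmap) -> Bitmap:
    """Clear ink pixels that have no ink among their eight neighbors."""
    height, width = len(image), len(image[0])

    def has_ink_neighbor(y: int, x: int) -> bool:
        return any(
            image[ny][nx]
            for ny in range(max(0, y - 1), min(height, y + 2))
            for nx in range(max(0, x - 1), min(width, x + 2))
            if (ny, nx) != (y, x)
        )

    return [
        [1 if image[y][x] and has_ink_neighbor(y, x) else 0 for x in range(width)]
        for y in range(height)
    ]


page = [[1, 0, 0],
        [0, 0, 0],
        [0, 1, 1]]
print(despeckle(page))  # the isolated top-left pixel is cleared
```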

Recognition Module Management
The OmniPage Capture SDK's component manager supports the integration of 12 individual recognition modules into your application. Modules for machine print OCR, ICR (handprint OCR), barcode, OMR (checkbox), OCR-A, OCR-B and E-13B (MICR) are provided. An interface is also provided to incorporate additional recognition technologies into your application. This interface lets you pass images, receive recognition output, and pass configuration commands to the desired recognition module.
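
The plug-in pattern described here (pass an image, receive recognition output, pass configuration commands) might look roughly like the following Python sketch. Every name in it (`RecognitionModule`, `ModuleRegistry`, `FixedTextModule`) is hypothetical and chosen for illustration; the SDK's actual interface is defined in its own documentation.

```python
from typing import Dict, Protocol


class RecognitionModule(Protocol):
    """What a pluggable recognizer is assumed to expose in this sketch."""

    def configure(self, key: str, value: str) -> None: ...
    def recognize(self, image: bytes) -> str: ...


class ModuleRegistry:
    """Routes images and configuration commands to named modules."""

    def __init__(self) -> None:
        self._modules: Dict[str, RecognitionModule] = {}

    def register(self, name: str, module: RecognitionModule) -> None:
        self._modules[name] = module

    def configure(self, name: str, key: str, value: str) -> None:
        self._modules[name].configure(key, value)

    def recognize(self, name: str, image: bytes) -> str:
        return self._modules[name].recognize(image)


class FixedTextModule:
    """Toy module: ignores the image and returns a configurable string."""

    def __init__(self) -> None:
        self.settings: Dict[str, str] = {"text": ""}

    def configure(self, key: str, value: str) -> None:
        self.settings[key] = value

    def recognize(self, image: bytes) -> str:
        return self.settings["text"]


registry = ModuleRegistry()
registry.register("barcode", FixedTextModule())
registry.configure("barcode", "text", "0123456789")
print(registry.recognize("barcode", b"\x00"))  # prints "0123456789"
```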

Asian OCR software is supported in the OmniPage Capture SDK, including Simplified and Traditional Chinese, Japanese, and Korean with full layout retention. See Asian OCR Support for more information.

Output Processing
The OmniPage Capture SDK's output processing subsystem takes output from the recognition modules and converts it into a desired format, such as BMP, ePub, and more. PDF output is supported in formats including:

  • PDF normal (text only)
  • Image only
  • Searchable PDF (image on text)
  • Normal with image substitutes

See Integrated PDF Toolkit for more information.

 

Product Configurations

The OmniPage Capture SDK is available in three configurations with three optional add-ons:

 

The Professional Recognition Kit

  • C/C++ Libraries
  • Two premade voting OCR (machine print) recognition modules
  • Access to three individual OCR engines for application optimization
  • OCR-A, OCR-B, E-13B (MICR)
  • Two ICR (handprint) recognition modules
  • OMR (checkbox)
  • Barcode recognition

 

The Professional OCR Kit

  • C/C++ Libraries
  • Two premade voting OCR (machine print) recognition modules
  • Access to three individual OCR engines for application optimization
  • OCR-A, OCR-B, E-13B (MICR)
     

Asian OCR Kit - This kit provides support for Traditional and Simplified Chinese, Japanese, and Korean OCR with full layout retention and searchable PDF output.

Included Application Tool
Form template editor - Improves form template creation, modification, testing and management.

Add-On Options

  • PDF Output Module - Adds support for PDF 1.7, PDF/A, export to PDF Normal, PDF Image-only, PDF Image on Text formats, and high-compression rate PDF-MRC.
  • Professional Toolbox - Provides a collection of visual controls to create and customize UI elements for Windows-based applications, including image display, manual zoning, and OCR proofreading tools.
  • Thai OCR Module - An add-on to the Professional OCR or Professional Recognition kit that adds the Thai OCR engine to your application.
     

 
