Navigation Content

Add robust imaging, recognition, OCR and PDF capabilities to your applications

OmniPage Capture SDK and OmniPage Server

 

Established as the core technology behind all Nuance imaging products, OmniPage Capture is widely recognized as the #1 imaging and OCR technology on the market today.

If you are a software engineer creating any kind of application that must capture, recognize, classify, extract information or convert images and PDF files into editable documents then OmniPage Technology is for you. OmniPage toolkits, pre-built modules and APIs provide the absolute fastest way to incorporate imaging capabilities into your solutions on any platform or embedded device. OmniPage Capture SDK and OmniPage Server gives you the options and flexibility you need to add robust imaging, OCR recognition, document classification, and PDF capabilities into your most critical applications as well as barcode recognition technology, intelligent character recognition, zonal recognition, and more. Used by commercial vendors who are serious about high OCR accuracy and quality document imaging in their applications, the OmniPage Capture SDK provides scalable recognition, extraordinary PDF support, and a simple API that lets you create high-value, competitive products.

The OmniPage Capture SDK is available for Windows version 20, Linux and Mac platforms version 19.

The OmniPage Server is available for Windows server environments.

To learn more about the benefits of OCR integration and OCR for your developers, request your free OmniPage Capture SDK or OmniPage Server evaluation today!

Request Free Evaluation

 

 

OCR for Developers: Benefits of OmniPage

  • The world’s most accurate OCR solution increases productivity, lowers costs, and maximizes ROI
  • Can be implemented through a comprehensive developer toolkit or from a highly scalable server
  • Delivers everything you need for scanning, OCR, ICR, OMR, barcode, PDF, forms recognition, document conversion and document classification
  • Enables developers to provide higher value to customers with new and enhanced functionality
  • Provides an easy-to-use API to shorten development cycles and accelerates time to market
  • Helps you differentiate your application from the competition with the most advanced imaging technology available

 

What’s New?


The OmniPage Capture Software Development Kit (SDK) has always been the gold standard for developers who want to add sophisticated optical character recognition (OCR), imaging, and PDF creation and conversion capabilities to their own applications quickly and easily. 

And with the release of the OmniPage Capture SDK 20 for Windows and the OmniPage Server for Windows, the best just got even better. 

OmniPage Capture SDK 20 for Windows

  • Document Classifier empowers developers to create applications that separate documents consisting of multiple types and apply different processing technologies to each document type. This helps businesses create automated document pre-processes, such as invoice sorting and document routing, within an organization.
  •  Intelligent Workflow Runner features a new XML descriptor that defines the conversion processes with as many settings as required to produce the best possible format and OCR results from the original documents. It also includes a utility that converts OmniPage Ultimate Workflows into XML, and an API that connects into a COM Server to queue up and manage conversion jobs. The unmatched level of automation in this feature ensures that developers don’t have to dedicate time to creating these processes on their own.
  • The Mixed Raster Content compression method to create small-sized PDF files with perfectly legible textual content has been enhanced with the latest JPEG2000 compression technology and now delivers faster, more efficient PDF creation.
  • Improved Logical Form Recognition technology with support for radio buttons and enhanced recognition of checkboxes. This enables more accurate conversion of scanned forms into fillable PDF forms.

 
OmniPage Server for Windows

OmniPage Server is a highly scalable, 24/7, industrial strength standalone server product for document conversions, or for linking business applications requiring high-volume document processing and conversion. It is the ideal solution for organizations requiring the ability to quickly and easily create automated, watched folder conversion processes with minimal effort from their local IT. The server also allows more sophisticated business application developers to programmatically connect to other applications further enhancing and advancing business infrastructure and processes. Product capabilities include:

  •  A simple-to-use API enabling, developers to integrate OCR functions into client applications which can support virtually all operating system platforms for desktop computers and mobile devices
  • Web browser integration allowing end users to initiate conversion processes and access the results using their preferred browser
  • Watched folder support providing users the ability to quickly process scanned documents in the network folders that automatically place the converted, editable and searchable files into output subfolders for archival or additional use in the document workflow

Features

Key Features for the OmniPage Capture SDK 

The OmniPage Capture SDK offers a robust feature set to support all your document imaging needs. You get the power and accuracy of OmniPage - the most popular OCR software in the world - integrated into your applications, along with top-of-the line OCR engines and extensive PDF capabilities.

The strength of OmniPage Capture SDK extends beyond unrivaled accuracy, with additional features to streamline application development and provide added value to your product.

The most accurate and robust OCR available
OmniPage provides a scalable voting interface and significant throughput management capabilities. Combined with highly accurate machine-print OCR (OCR, OCR-A*, OCR-B* and MICR*), handprint (ICR), checkmark (OMR) and barcode (1D and 2D) recognition engines, the OmniPage Capture SDK delivers unmatched flexibility and accuracy.

Document classifier assistant
Is an application that allows you to assign your documents to classes, like letters, invoices, contracts, etc, and then you train the system to recognize these documents and their characteristics, test your scheme and then apply the technology and scheme to your application to  take action on the documents based on their class or type.  For example you can decided to route all invoices to finance and archive all letters to a DMS system.

Intelligent workflow runner
Features a new XML descriptor that defines the conversion processes with as many settings as required to produce the best possible format and OCR results from the original documents. It also includes a utility that converts OmniPage Ultimate Workflows into XML, and an API that connects into a COM Server to queue up and manage conversion jobs. The unmatched level of automation in this feature ensures that developers don’t have to dedicate time to creating these processes on their own.

Asian, Thai and Arabic OCR
The OmniPage Capture SDK Asian OCR module supports Simplified and Traditional Chinese, Japanese, and Korean. It can be used either as a standalone module or with the Western language kit. Thai and Arabic OCR modules are available as add-ons.

Support for the .NET Object Oriented Programming*
OmniPage Capture SDK 19 fully supports object oriented programming in .NET, C# and VB.NET. Sample recognition programs and sample viewers are included.

Multi-core and multi-thread processor support*
Better multi-threading and parallel processing on multi-page documents in the OmniPage Capture SDK let you exploit the full potential of your processing environment. In multi-page mode, OmniPage Capture SDK runs faster than previous versions on a quad-core machine.

Pre-made user interfaces*
The OmniPage Capture SDK’s Professional Visual Toolbox gives you pre-made interfaces for creating and executing workflows, controlling scanning devices, and document processing. It includes visual controls for advanced OCR and image enhancement tools. Use this module to create OmniPage-compatible workflows and monitor their execution.

Workflow development and execution*
You can easily create complex image processing and OCR tasks and manage all parameters and settings. Then, adding OCR to your application can be just one workflow execution call. Workflow features also help balance the load on dual core and hyper-threaded systems to boost performance.

Logical Form Recognition technology and Form Template Editor
Our advanced logical form recognition (LFR) automates form template creation and streamlines form processing, providing significant savings in development time. The standalone Form Template Editor* helps both developers and end users to easily create, modify, test, and manage form templates.

Throughput management
Updated throughput capabilities provide significant advantages over other SDKs, allowing you to deploy optimal document imaging throughput for your application.

Integrated PDF toolkit
Extensive PDF capabilities - including unique PDF overlay matching that achieves near-100% accuracy in PDF conversion- allow you to significantly reduce development costs and time-to-market. The OmniPage Capture SDK also supports output to the PDF/archive (PDF/A) format and generates multi-raster-content PDFs optimized for file size and quality.

Format support
The OmniPage Capture SDK supports a wide range of image and application format, including BMP, GIF, TIF, PDF, HTML, Microsoft Office formats, XML, ePub* and more. These provide significant advantages over other SDKs, allowing you to achieve optimal image throughput for your applications.

Text-to-speech (TTS)*
The OmniPage Capture SDK is also the only OCR SDK that includes text-to-speech technologies. You applications can turn paper and digital documents into human-sounding audio files. Not only is this an important way to provide document support for disabled users, it allows everyone to save documents to files that can be played on personal computers and mobile devices, including Apple iPod.

These advanced features, along with breakthrough PDF capabilities that achieve up to 100% word accuracy in converting text-based PDF documents, enable you to significantly reduce the cost of development and time-to-market. That’s why the OmniPage Capture SDK is the most powerful and complete document-imaging SDK in the world.

 

* Available in Windows version only.

Tech Specs

For Windows

The functionalities of OmniPage Capture SDK can be accessed through C/C++ API, .NET API, or ActiveX interface. Support for  application development on Windows XP SP3 and above enables you to easily create applications with a wide variety of recognition technologies using a single set of developer tools.

Development environment

  • Windows 7, 8, 8.1, 10 x86-x64
  • Windows Server 2008 R2, 2012 R2, 2016 
  • Intel Pentium 4 1.6 GHz or higher processor (Intel Core or higher CPU is recommended) 
  • 4 GB RAM (6 GB recommended)
  • 4 GB free disk space
  • Microsoft Visual Studio 2010 SP1, 2012, 2013.4, 2015.2

Runtime environment

  • Windows Vista SP1, 7, 8, 8.1, 10 x86-x64
  • Windows Server 2008 R2, 2012 R2, 2016
  • Intel Pentium 4 1.6 GHz or higher processor (Intel Core or higher CPU is recommended) 
  • 2 GB RAM (4 GB recommended)
  • 4 GB free disk space
  • 300 MB free disk space (less if not all recognition modules are distributed) 
     

For Linux

System Requirements

  • Intel or AMD 64-bit CPUs
  • Tested operating systems:
    • Fedora 20, 21
    • Debian 7.5, 7.7 and 8.1
    • Oracle Linux 6.5, 7.0
    • CentOS 6.3

 

For Mac

System Requirements

  • Intel 32-bit or 64-bit CPUs
  • OS X 10.8 or higher

 

Architecture

Product Architecture

  • The OmniPage Capture SDK architecture accommodates multiple image-processing technologies through four main subsystems:
    • An image input subsystem for scanning* or importing images.
    • An image preprocessing subsystem for improving image quality prior to recognition.
    • A recognition subsystem that provides multiple recognition technologies for image processing.
    • An export subsystem to format the output from multiple recognition modules into a common format for conversion to popular word processing formats or text. 

 

Interfaces
Two programming interfaces are available with the OmniPage Capture SDK:

C/C++ API
The C/C++ API provides control over image input, image preprocessing, recognition, and output and supports image processing on a page basis.

ActiveX*
An ActiveX interface is provided for Visual C++ programmers. This interface includes all functionality of the C interface and offers document-processing capabilities so you can create solutions that manage documents more efficiently. This interface also expands the support of modern development environments, including managed environments like VB.NET or C#.

Professional Visual Toolbox*
In conjunction with the ActiveX interface, a set of controls, collectively called the Professional Visual Toolbox, is available as an add-on module. Pre-made controls allow you to reduce development time and speed time-to-market by allowing plug-in interfaces for your application.

    Pre-made Controls

    • Image viewing
    • Zone content validation
    • Image thumbnail viewing
    • Text verification and editing
    • Display statistical information and a draft of the document
    • Provide details and progress about the workflow being executed on the system
    • Create OmniPage-compatible workflows
    • Access and change output converter settings
    • Display and edit form fields and attributes

 

Image Input
The image input subsystem provides TWAIN, WIA, and ISIS scanner* and image-conversion interfaces. Both color and grayscale images can be handled by the OmniPage Capture SDK and you can send images from memory to the preprocessing and recognition processes. Input conversion for TIFF, TIFF/JPEG, TIFF-FX, PCX, DCX, BMP, ADF, JPEG, PNG and PDF image formats are available.

Image Pre-Processing
Image correction and pre-processing greatly enhances image quality and recognition accuracy. These capabilities include:

  • Rotate (90, 180, 270 degrees)
  • De-skew (auto and programmed)
  • Invert (auto and programmed)
  • De-speckle
  • Resolution enhancement

An interface for integrating additional image pre-processing technologies is also available and extends the system's functionality by permitting customization of the system's image processing capabilities.

Recognition Module Management
The OmniPage Capture SDK's component manager supports the integration of 12 individual recognition modules into your application. Modules for machine print OCR, ICR (handprint OCR), barcode, OMR (checkbox), OCR-A*, OCR-B* and E-13B (MICR)* are provided.

Asian OCR software is supported in the OmniPage Capture SDK, including Simplified and Traditional Chinese, Japanese, and Korean with full layout retention. Thai and Arabic languages are supported with Direct TXT output.

Output Processing
The OmniPage Capture SDK's output processing subsystem takes output from the recognition modules and converts it into a desired format, including TXT, XML, PDF, DOCX, XLSX, PPTX*, HTML, and many more. PDF output is supported in formats including:

  • PDF normal (text, image, and graphics)*
  • Image only
  • Searchable PDF (image on text)
  • Normal with image substitutes*
  • PDF 1.4 - 1.7
  • All conformations of PDF/A
 

* Available in Windows version only.

 

Product Configurations

Product Configurations

The OmniPage Capture SDK is available in three configurations with optional add-ons:

 

The Professional Recognition Kit

  • C/C++ Libraries
  • Two premade voting OCR (machine print) recognition modules
  • Access to individual OCR engines for application optimization
  • OCR-A, OCR-B, E-13B (MICR)
  • ICR (hand-printed character recognition)
  • OMR (checkbox recognition)
  • Barcode recognition

 

The Professional OCR Kit

  • C/C++ Libraries
  • Two premade voting OCR (machine print) recognition modules
  • Access to individual OCR engines for application optimization
  • OCR-A, OCR-B, E-12B (MICR)
     

Asian OCR Kit - This kit provides support for Japanese, Traditional and Simplified Chinese, Japanese and Korean OCR software with full layout retention and searchable PDF output.

 

Included Application Tool

Form Template Editor* - Improves form template creation, modification, testing and management.

 

Add-On Options

  • PDF Output Module - Adds support for PDF 1.7, PDF/A, export to PDF Normal*, PDF Image-only, PDF Image on Text formats, and high-compression rate PDF-MRC.
  • Professional Toolbox* - Provides a collection of visual controls to create and customize UI elements for Windows-based applications, including image display, manual zoning, and OCR proofreading tools.
  • Thai OCR Module - An add-on to Professional OCR or Professional Recognition kit for including Thai OCR engine in the application.
  • Arabic OCR Module - An add-on to Professional OCR or Professional Recognition kit for including Arabic OCR in the application.
     
* Available in Windows version only

 

Product Evaluation
Product Videos

Form Template Editor

User Documentation

Licensing Tool

   United States & Canada
OmniPage CSDK

Choose your country.