Saving recognition results

You can save recognized pages to disk in a wide variety of file types.

  1. Choose Export Results... in the File menu, or click the Export Results button in the OmniPage Toolbox with Save to File selected in the drop-down list.
  2. The Save to File dialog box appears. Select Text under Save as.
  3. Select a folder location and a file type for your document. Select a page range, file options, naming options and a formatting level for the document. See “Selecting a formatting level” on this page.
  4. Type in a file name. Click Converter Options... if you want to specify precise settings for the export. See “Selecting converter options” on this page.
  5. Click OK. The document is saved to disk as specified. If View Result is selected, the exported file will appear in its target application; that is the one associated with the selected file type in your Windows system or in the advanced saving options for your selected file type converter.

Selecting a formatting level

The formatting level for export is defined at export time, in the saving dialog box (Save to File, Copy to Clipboard, Send in Mail or other dialog box). Three of the levels correspond to the format views of the same name in the Text Editor. However, the level to be applied for saving is independent of the formatting view displayed in the Text Editor. When exporting to file or mail, first specify a file type. This determines which formatting levels are available.

The formatting levels are:

Plain TextPlain Text

This exports plain decolumnized left-aligned text in a single font and font size. When exporting to Text or Unicode file types, graphics and tables are not supported. You can export plain text to nearly all file types and target applications; in these cases graphics, tables and bullets can be retained.

Formatted textFormatted Text

This exports decolumnized text with font and paragraph styling, along with graphics and tables. This is available for nearly all file types.

Flowing TextFlowing Page

This keeps the original layout of the pages, including columns. This is done wherever possible with column and indent settings, not with text boxes or frames. Text will then flow from one column to the other, which does not happen when text boxes are used.

True PageTrue Page

This keeps the original layout of the pages, including columns. This is done with text, picture and table boxes and frames. This is offered only for target applications capable of handling these. True Page formatting is the only choice for XML export and for all PDF export, except to the file type ‘PDF Edited’.

SpreadsheetSpreadsheet

This exports recognition results in tabular form, suitable for use in spreadsheet applications. This places each document page onto a separate worksheet.

When exporting to Microsoft Excel, 'Spreadsheet' is good for saving whole-page tables. Prefer 'Formatted Text' if your document contains smaller tables: each table will be placed on a separate worksheet with non-table parts placed in an index worksheet with hyperlinks to each relevant worksheet.

Selecting converter options

Click the Converter Options... button in a saving dialog box to have precise control over the export. This brings up a dialog box with the name of the converter associated with the current file type. It presents a series of options tailored to this file type. First, confirm or change the formatting level, because this influences which other options are presented. Select options as desired. Online Help details how to do this.

Using multiple converters

Multiple converters allow you to export to two or more file types in one export step. Choose Multiple in the saving dialog box:

Radio Group Multiple Selected

To make your own multiple converter, open the Save Preferences dialog box from the Tools menu. Choose the heading Multiple converters. Select a converter and click Create from... . This will make a copy of the selected converter that you can freely modify without overwriting the original one.

The new converter appears in the list. Select it and click Options... to specify its settings. You receive a list of all text converters, followed by all image converters. Checkmark the desired ones. Optionally specify sub-folder paths for each file type.

You can save pages with different formatting levels or file options to the different file types, as defined in their simple converters. A few saving operations cannot be done with multiple converters. These are:

Saving OmniPage Documents

Use a workflow with two saving steps, or perform two separate saves.

Saving to two targets

For instance, you cannot use a multiple converter to save a document to file and also send it in mail. Use a workflow with two saving steps, or perform two separate saves.

Saving different page ranges

You cannot save different page ranges to different file types, because only one set of selected pages can exist at saving time. For the same reason, a single workflow cannot be used either. Perform two separate saves or use two workflows.

Saving to PDF

You have five choices when saving to Portable Document Format (PDF) files. The first four are presented as Text converters, the last one is listed among the Image converters.

PDF (Normal):

Pages are exported as they appeared in the Text Editor in True Page view. The PDF file can be viewed and searched in a PDF viewer and edited in a PDF editor.

PDF Edited:

Use this if you have made significant editing changes in the recognition results. You have three formatting level choices, including True Page. The PDF file can be viewed, searched and edited.

PDF Searchable Image (formerly PDF Image on Text):

The PDF file is viewable only and cannot be modified in a PDF editor. The original images are exported, but there is a linked text file behind each image, so the text can be searched. A found word is highlighted in the image.

PDF with image substitutes:

As for PDF (Normal), but words containing reject and suspect characters have image overlays, so these uncertain words display as they were in the original document. The PDF file can be viewed, searched and edited.

PDF Image (formerly PDF, image only):

The original images are exported. The PDF file is viewable only and cannot be modified in a PDF editor and text cannot be searched.

Besides the above flavors, you can use other parameters in defining your PDF output:

PDF 1.6

Save to PDF version 1.6 for enhanced security, markup and attachment embedding functionality.

PDF-A

Choose to create a PDF-A compliant file to make sure that your PDF displays exactly identically, regardless of the computer environment.

Tagged PDF

Create a tagged PDF file to preserve its structure. This will ensure logical reading order, correct table structure and more.

PDF MRC

Use this high compression technology for good quality and smaller file size. Available for color and grayscale PDF Images or PDF Searchable Images.

Converting from PDF

To extract text content from a PDF file, load it into OmniPage, recognize it, and save the results to a text format.

A variety of outputs is also available from a PDF file shortcut menu: Word, Excel, RTF, WordPerfect or text. For more options, use the Convert Now Wizard.