|
OCR Engine System Software and Data
The SDK includes all OCR engine system software and data that will be required to use the OCR engine. This includes, but is not limited to:
Dictionaries
Shape Recognition Tools and Data
Supported Languages
The OCR engine supports the following languages and character sets:
Japanese (Shift-JIS)
Simplified Chinese (GB-2312 character set)
Traditional Chinese (BIG5 character set)
Korean (KSC)
Image Modes
Black and white, Grayscale and Color
Image Input
Scanner, Image file, and Memory, in strips at a time for both gray-scale and color
Output file formats
Single page and multi-page Text, XML, RTF, Excel, Searchable PDF.
Font Information
Simplified Chinese: Hei, Song, Kai, SimSun, SimHei
Traditional Chinese: MingLiu, Gothic
Korean: Batang, Myeongjo, Gothic
Japanese: Mincho, Gothic
Text Detection
Horizontal and vertical text layout
Full and half width spacing
Japanese Ruby (Hiragana/Katakana (8pt), Kanji (9pt), Latin (7pt))
|