Products
Dragon AudioMining
Enable Text Search of Audio Files
The ability to use text keywords and phrases to automatically search audio files is now within reach for organizations of any size. Dragon AudioMining eliminates the time and cost associated with manually indexing rich media, enables the indexing of 100% of the speech information within audio files.
By using an advanced speaker-independent dictation engine, Dragon AudioMining creates XML speech index data for every word spoken within an audio file. The index data includes word, time stamp, confidence levels and metadata associated with the speech information, and can be created from broadcast and telephony-quality sources. Ideal for application and Web developers, system integrators and OEM customers, the Dragon AudioMining SDK enables XML speech indexing and search capabilities to be added to commercial, Web and custom applications.
- New Support for Audio Streaming. Recognition can now be performed from a streaming file, eliminating the need to produce an audio file.
- Telephony and Broadcast Acoustic Models. The AudioMining SDK offers developers a choice of acoustic models, which work in conjunction with complex language analysis to produce unsurpassed results. The models are speaker-independent, meaning they’re built to support recognition for an unlimited number of speakers with different voices and accents.
- Recognition Confidence Score. Accuracy levels depend upon the quality of the recording. Studio-based content will provide higher accuracy levels, but the system also provides a reasonable level of accuracy for telephone, public presentation and broadcast content.
- Increased Accuracy. The system recognizes "all words," not just keywords. The accuracy of preconfigured vocabularies can be further fine-tuned using the Vocabulary Tool to include organization-specific terms and proper names. This tool automatically customizes vocabularies with unique terms, such as industry-specific terminology or topics, resulting in outstanding recognition.
Installation requirements
- CPU: Intel® Pentium4® or later or AMD Athlon 64 1 GHz or later. (SSE2 instruction set required).
- Memory: 512 MB RAM (1 GB RAM for Windows Vista™)
- Free hard disk space: 1 GB (2 GB for localized non-English versions)
- L2 Cache: 512 KB
- Supported Operating Systems:
- Windows Server 2000
- Windows Server 2003
- Windows XP Service Pack 2 or higher, 32-bit
- Windows 2000 Service Pack 4 or higher
- Windows Vista™ or Windows Vista™ Service Pack 1, 32-bit; 64-bit OS not currently supported
- DVD-ROM drive (required for installation)
Recommended specifications
- CPU: Intel® Pentium4® / 2.4 GHz (1.6 GHz dual core) or equivalent AMD processor. (SSE2 instruction set required).
- Memory: 1 GB RAM
- L2 Cache: 1 MB
Supported Audio Formats
Dragon Audiomining SDK supports the following audio file types in both mono and stereo (8 kHz to 99 kHz):
- WAVE PCM
- MS ADPCM
- IMA ADPCM
- a-law
- mu-law
- VOX
- MP3
- WMA
Language Versions Available
- Dutch
- French
- German
- Italian
- Spanish
- UK English
- US English