VoCon® Hybrid is a feature-rich SDK that provides superior functionality, unmatched accuracy and high performance for a variety of applications that utilise speech control.
The VoCon® product family has delivered the power of speech to consumer and industrial products for over 10 years.
VoCon Hybrid delivers a new level of speaker-independent and continuous speech recognition as well as multi-lingual language understanding.
Full range of embedded and connected speech recognition services from embedded digit recognition to connected dictation and complex search functionality. Embedded failsafe for many use cases if network connectivity is lost.
VoCon Hybrid can be combined with other solutions in the VoCon product family or Vocalizer's text-to-speech for a more robust speech dialogue solution and improved accuracy.
The VoCon® product family has delivered the power of speech to consumer and industrial products for over 10 years.
VoCon Hybrid enables voice control in a range of products including TVs, games and toys, language learning software and mobile phones.
VoCon Hybrid is used in a wide variety of industrial applications in warehousing and distribution equipment as well as medical equipment.
VoCon Hybrid is developed for mobile applications with high accuracy operation to SNR levels as low as 5dB.
Recognises natural speech, eliminating restriction to predefined commands for all VoCon Hybrid languages.
Enables all commands to be spoken in a single utterance on the main menu.
Worldwide support for over 30 languages provides universal functionality.
Enables embedded recognition for large lists with connected dictation and search capabilities.
Always listening mode allows the user to “wake up” or activate their system with a keyword, such as “Hello Dragon”. Removes the need for a press-to-talk button.
Talk over barge-in allows user to speak over spoken dialogue prompts and be recognised.
Enables recognition from combined large lists recognising only valid combinations, such as street + city + state for all USA, or part + warehouse combinations.
Recognises partial contact names for multi-lingual phonebooks.
Spelling module available as backup to whole word recognition.
Whenever 8kHz audio input is the only available format, such as Bluetooth, it is now possible to recognise 8kHz audio input with standard 16kHz acoustic models.
Supports Spansion and Neon Expansions to improve the operation point of the speech recognition both in terms of speed and accuracy by shifting a part of the algorithmic processing to dedicated hardware.
Recognises every possible word in a database in every possible order and permutation, including partial utterances. This search algorithm is especially useful when the user does not know the exact wording of the content he or she is searching for.
Gain exceptional recognition accuracy tuned for noisy automotive environments. VoCon Hybrid Engine delivers a superior user experience and high acceptance rates across a wide variety of in-vehicle systems and applications. Continuous improvements of the speech algorithms ensure the highest accuracy standards, even for large and complex tasks.
Adds speech recognition functionality to any application. Includes VoCon® Hybrid speech recognition engine, a robust set of development tools, guides and sample code that allow developers to build a high-quality speech-enabled application with optimum speed and efficiency.
Enables embedded recognition for large lists with connected dictation and search capabilities.
Provides Nuance's state-of-the-art VoCon technology for embedded speech recognition and voice barge-in. This is recommended for embedded-only applications with no requirement for connected speech recognition while offering the flexibility to add connected speech recognition later.
Provides access to Nuance's Dragon connected speech services in the network in addition to state-of-the-art VoCon technology for embedded speech recognition and voice barge-in. Connected services are available through the NDEV community powered by Dragon.
Supports a range of operating points so that smaller applications are not affected by higher CPU and RAM required for larger applications.
The software interface in the Windows SDK is identical to the interface once ported to an embedded platform allowing code built on the PC to be reused on the platform.
Nuance, in partnership with Gracenote, provides the capability to retrieve precompiled and editorially checked phonetics for artist and album names, alternative names and/or artist nicknames from regularly updated dictionaries. VoCon Music Premium requires VoCon Hybrid (Base) on platform and can be used with ASR and TTS solutions.
Text-to-speech solution that enriches the user experience with enhanced expressivity by generating high-quality and natural-sounding speech.
A suite of technologies that work together to remove noise from microphone input and send out a cleaner signal.