Highly Accurate Automated Transcription Unlocks the Data Trapped in Multi-speaker Audio; Enables Big Data Analytics, Data Mining of Audio and More
BURLINGTON, Mass., – May 5, 2016 – Nuance Communications, Inc. (NASDAQ: NUAN) today introduced Nuance Transcription Engine (NTE), a powerful, fast and accurate engine that can quickly transform massive amounts of recorded audio into actionable assets, across a wide range of industries. By applying highly accurate (88 percent) transcription capabilities, NTE quickly and reliably produces search-optimized text output from a multi-speaker audio file, allowing organizations to turn enormous volumes of audio into a meaningful asset by mining the wealth of data it contains. NTE supports 15 languages and 30 dialects, with additional languages rolling out regularly, to increase efficiency and streamline operations around the world
With decades of experience in voice and language solutions, Nuance is extending its portfolio to support the transcription needs of a wide range of organizations and applications including freeform audio in Enterprises, broadcast media, and analytics among others. For example, with NTE an organization can: transcribe large volumes of audio content within an organization’s call center to provide rich customer insights and improve service, produce rapid transcription of broadcast media in order to understand and analyze what is being said in near-real time worldwide, or fully transcribe corporate audio and video assets for rapid searching and indexing. Additionally, NTE can capture and document what is being said in a conference room or interview to create accurate automated archives of critical conversations. NTE can also act as a key enabling technology utilized by organizations for speech analytics or for insights pertaining to compliance.
“In a number of real-world situations, the ability to generate fast and accurate transcription can have an immeasurable impact,” said Robert Weideman, executive vice president and general manager, Nuance Enterprise. “Whether it’s to provide deeper voice of the customer insights, to understand discussions in health and human services interviews, or to provide insights for law enforcement, we’re proud to introduce Nuance Transcription Engine which leverages our vast experience in speech and natural language understanding to unlock, capture, and mine data, providing the valuable insights that organizations desire.”
Nuance Transcription Engine transcribes speech content with the highest accuracy in the industry at a remarkable speed, making previously inaccessible unstructured data available for analysis, searching and indexing. Pre-recorded audio can be processed from 1x Real Time Factor (RTF), making it possible to process 60s of audio in 60s, with maximum accuracy, and up to 10x RTF enabling processing of 60s of audio in just 6s. NTE can also transcribe live conversations, streaming results after only approximately ten seconds, making transcription available almost immediately after the words are spoken.
To provide organizations with the most meaningful insights, and to appeal to a wide range of applications, NTE has two output formats designed to power analytics/data mining as well as those that require formatted text. The first output format, optimized for search/analytics use-cases, produces word level time-stamps and n-best lists. In pre-recorded audio, NTE can identify and assign transcript elements to specific speakers – even if the audio file was recorded in mono – and non-speech audio like noise and silence is flagged. The alternative output style generates a human-readable format. In both formats, customers can easily customize the vocabulary for their application (e.g. adding product names or person names at runtime), or create a set of specialized language models offline, and activate them selectively at runtime.
Accurately capturing massive amounts of unstructured data that was previously inaccessible is rapidly creating new disruptive business opportunities to analyze and act on transient content. For example, Nuance partner, Veritone, is already using NTE to enable unsurpassed efficiency and accuracy in transcribing global media. “We are thrilled to work with Nuance to incorporate the powerful NTE engine into our Cognitive Media Platform,” said Chad Steelberg, CEO, Veritone. “Through this partnership, we are able to transcribe global media in near-real time to unlock actionable intelligence. As a result, Veritone’s customers and development partners are finding that streaming and recorded media are increasingly valuable assets for cognitive analysis and decision making.”
To learn more about Nuance Transcription Engine, click here.
Nuance Communications, Inc. (NASDAQ: NUAN) is a leading provider of voice and language solutions for businesses and consumers around the world. Its technologies, applications and services make the user experience more compelling by transforming the way people interact with devices and systems. Every day, millions of users and thousands of businesses experience Nuance’s proven applications. For more information, please visit www.nuance.com.
Nuance and the Nuance logo are trademarks or registered trademarks of Nuance Communications, Inc. or its subsidiaries in the United States of America and/or other countries. All other company names or product names may be the trademarks of their respective owners.