Administering Speaker Profiles for More Accurate Speech Recognition
Last reviewed: 8/15/2022
HOW Article ID: H072227
The information in this article applies to:
- Chant Developer Workbench 2022
- ProfileKit 8
Administering speaker profiles is key to ensuring the most accurate speech recognition possible.
Windows desktop speech recognizers from Microsoft and Nuance rely on speaker profiles to improve recognition quality when transcribing audio to text.
One element of a profile is speaker training, in which the recognizer builds a model from the user's voice samples and applies it when detecting speech.
Other elements include adjusting audio levels so that the recognizer can better distinguish speech utterances from noise, and building reference models from documents and phrases.
With ProfileKit, developers can build apps that provide the profile administration capabilities supported by the recognizer.
Chant Developer Workbench provides a Profile Browser that illustrates how profiles can be administered and then used when transcribing speech with the Recognizer Browser.
When a speaker profile is selected, the dialogs supported by its speech recognizer are enumerated. The button for each dialog is enabled only if that service is available for the selected speaker profile.
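The enable/disable behavior described above can be sketched in a few lines. This is only an illustrative model, not the ProfileKit API: the `SpeakerProfile` class, the `button_states` function, the dialog names, and the sample profile are all hypothetical stand-ins for whatever the installed recognizer actually reports.

```python
from dataclasses import dataclass, field

# Hypothetical dialog names for illustration; real recognizers define their own.
ALL_DIALOGS = ["UserTraining", "MicTraining", "AddRemoveWord", "ProfileProperties"]


@dataclass
class SpeakerProfile:
    """Hypothetical stand-in for a speaker profile and the dialogs it supports."""
    name: str
    recognizer: str
    supported_dialogs: set = field(default_factory=set)


def button_states(profile):
    """Map each known dialog to True (button enabled) or False (button disabled)."""
    return {dialog: dialog in profile.supported_dialogs for dialog in ALL_DIALOGS}


# Assumed example: a profile whose recognizer offers only the two training dialogs.
profile = SpeakerProfile(
    name="Example Profile",
    recognizer="Example Recognizer",
    supported_dialogs={"UserTraining", "MicTraining"},
)

states = button_states(profile)
for dialog, enabled in states.items():
    print(f"{dialog}: {'enabled' if enabled else 'disabled'}")
```

In a real application the set of supported dialogs would come from querying the recognizer for the selected profile rather than from a hard-coded set.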
Chant ProfileKit includes a sample application that illustrates profile management features. Look for the Training folder in the ProfileKit sample projects for the desired programming language.
Build and launch the speaker training sample application. You can create, delete, train, and administer profiles for the recognizers that are installed on your system.