How do I administer speaker profiles for more accurate speech recognition?

Last reviewed: 8/15/2022

HOW Article ID: H072227

The information in this article applies to:

  • Chant Developer Workbench 2022
  • ProfileKit 8

Summary

Administrating speaker profiles is key to ensure the most accurate speech recognition possible.

More Information

Windows desktop speech recognizers from Microsoft and Nuance rely on speaker profiles for improving the recognition quality when transcribing audio to text.

One element used is speaker training where the recognizer develops a model from user voice samples used when detecting speech.

Other elements include adjusting the audio levels to optimize detecting speech utterances from noise and building reference models from documents and phrases.

Developers can construct apps that provide profile administration capabilities supported by the recognizer using ProfileKit.

Chant Developer Workbench provides a Profile Browser that illustrates how profile administration can be performed and used when transcribing speech with the Recognizer Browser.

The supported dialogs for each speech recognizer are enumerated when a speaker profile is selected. The buttons are enabled if the service is available for the speaker profile.

Built-in Speaker Training
Built-in Speaker Training: Enumerate and train Microsoft speech recognition profiles with recognizer built-in speaker training.
Built-in Speaker Training
Built-in Speaker Training: Enumerate and train Nuance speech recognition profiles with recognizer built-in speaker training.

Chant ProfileKit includes a sample application that illustrates profile management features. Look for the Training folder in the ProfileKit sample projects for the desired programming language.

Build and launch the sample speaker training application. You can create, delete, train, and administrate profile for the recognizers that are installed on your system.

Profile Training Sample
Profile Training Sample: Select a speech recognizer engine, speaker profile, and the desired dialog. Press the show dialog button to launch the service.