Chant SpeechKit

You don't have to sit in front of a computer with a mouse and keyboard to use information technology. Your applications can be enhanced to speak and listen to you wherever you need them.

Speech recognition is the process of converting an acoustic signal (i.e., audio data), captured by a microphone or a telephone, to a set of words. These words can be used for controlling computer functions, data entry, and application processing.

Speech synthesis is the process of converting words to phonetic and prosodic symbols and generating synthetic speech audio data. Synthesized speech can be used for answering questions, event notification, and reading documents aloud.

What is Speech Management?

Speech management enables you to:

  • control application functions by speaking rather than having to use a mouse or keyboard,
  • capture data by speaking rather than typing, and
  • prompt and confirm data capture with spoken or audio acknowledgement.
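
To make the first point concrete, here is a minimal Java sketch of routing recognized text to application functions. It does not use the SpeechKit API: `VoiceCommandRouter` and its methods are hypothetical names chosen for illustration, and the recognized phrase is passed in as a plain string, standing in for whatever text a recognizer delivers.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not part of SpeechKit: maps spoken phrases to actions.
final class VoiceCommandRouter {
    private final Map<String, Runnable> commands = new HashMap<>();

    // Register an action under a phrase; matching ignores case and outer whitespace.
    void register(String phrase, Runnable action) {
        commands.put(normalize(phrase), action);
    }

    // Run the action for the recognized text. Returns false if no command matches.
    boolean dispatch(String recognizedText) {
        Runnable action = commands.get(normalize(recognizedText));
        if (action == null) {
            return false;
        }
        action.run();
        return true;
    }

    private static String normalize(String text) {
        return text.trim().toLowerCase(Locale.ROOT);
    }
}
```

An application would register phrases such as "save file" once at startup and pass each recognized phrase to `dispatch`. A `false` return gives the application a natural place to prompt the user to repeat the command.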

Application benefits include:

  • enhanced speed and accuracy of data capture,
  • added flexibility of running applications in a variety of environments, and
  • expanded operating scenarios for hands-free computing.

What is SpeechKit?

Chant SpeechKit handles the complexities of speech recognition and speech synthesis to minimize the programming necessary to develop software that speaks and listens.

It simplifies the process of managing Microsoft SAPI 5, Microsoft Speech Platform, Nuance Dragon NaturallySpeaking, and Nuance VoCon 3200 recognizers, and managing Acapela, Cepstral, CereProc CereVoice, Microsoft SAPI 5, Microsoft Speech Platform, and Nuance Vocalizer synthesizers.

SpeechKit includes C++, C++Builder, Delphi, Java, .NET Framework, and Silverlight class libraries, along with sample projects for popular IDEs such as the latest Visual Studio from Microsoft and RAD Studio from Embarcadero.

The class libraries can be integrated with 32-bit and 64-bit applications.

Speech Recognition and Synthesis Component Architecture

The SpeechKit class library includes a speech recognition management class that provides you a productive way to develop software that listens. Your application sets properties and invokes methods through the speech recognition management class. This class handles the low-level functions with speech recognition engines (i.e., recognizers).

The speech recognition management class, ChantSR, enables you to establish a session with a recognizer, through which spoken language captured live via a microphone or from recorded audio can be processed and converted to text. Your application uses the ChantSR class to manage the activities for speech recognition on behalf of your application. The ChantSR class manages the resources and interacts directly with a recognizer application program interface (API). The speech APIs that the ChantSR class supports are listed in the Feature Summary.

Your application receives recognized speech as text and notification of other processing states through event callbacks.

SpeechKit Architecture for Speech Recognition

The ChantSR class encapsulates all of the technologies necessary to make the process of recognizing speech simple and efficient for your application. Optionally, it can save the session properties for your application to ensure they persist across application invocations.
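
As a rough illustration of what persisting session properties across invocations involves, the sketch below saves and reloads a set of settings with the standard `java.util.Properties` class. It is not SpeechKit's own persistence mechanism: `SessionSettingsStore`, the file location, and any property keys are assumptions made for this example.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Properties;

// Hypothetical helper, not part of SpeechKit: saves and restores session settings.
final class SessionSettingsStore {
    private final Path file;

    SessionSettingsStore(Path file) {
        this.file = file;
    }

    // Write the settings to disk so the next run can restore them.
    void save(Properties settings) throws IOException {
        try (OutputStream out = Files.newOutputStream(file)) {
            settings.store(out, "speech session settings");
        }
    }

    // Load saved settings; returns an empty set if nothing has been saved yet.
    Properties load() throws IOException {
        Properties settings = new Properties();
        if (Files.exists(file)) {
            try (InputStream in = Files.newInputStream(file)) {
                settings.load(in);
            }
        }
        return settings;
    }
}
```

An application written this way would call `load` at startup, apply the values to its session, and call `save` before exiting.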

The ChantSR class simplifies the process of recognizing speech by handling the low-level activities directly with a recognizer.

You instantiate a ChantSR class object before you want to recognize speech within your application. You destroy the ChantSR class object and release its resources when you no longer want to recognize speech within your application.
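
The lifecycle and callback pattern described above can be sketched as follows. This is a stand-in, not the ChantSR class: `RecognitionListener`, `SketchRecognitionSession`, and all of their methods are hypothetical, the real event names and signatures may differ, and `simulateUtterance` replaces actual audio input so the example runs without a recognizer installed.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical callback interface; the real SpeechKit events may look different.
interface RecognitionListener {
    void onRecognized(String text);
    void onStateChanged(String state);
}

// Hypothetical stand-in for a recognition session. It has no audio input:
// simulateUtterance() plays the role of the engine recognizing a phrase.
final class SketchRecognitionSession {
    private final List<RecognitionListener> listeners = new ArrayList<>();
    private boolean listening;

    void addListener(RecognitionListener listener) {
        listeners.add(listener);
    }

    void start() {
        listening = true;
        notifyState("listening");
    }

    void stop() {
        listening = false;
        notifyState("stopped");
    }

    // Deliver text to listeners only while the session is listening.
    void simulateUtterance(String text) {
        if (!listening) {
            return;
        }
        for (RecognitionListener listener : listeners) {
            listener.onRecognized(text);
        }
    }

    private void notifyState(String state) {
        for (RecognitionListener listener : listeners) {
            listener.onStateChanged(state);
        }
    }
}
```

The application never polls the session. It registers a listener once, starts the session, and reacts to text and state changes as they arrive, which mirrors the event-callback model described for ChantSR.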

Speech Synthesis Management

The SpeechKit class library also includes a speech synthesis (i.e., text-to-speech) management class that provides you a productive way to develop software that speaks. Your application sets properties and invokes methods through the speech synthesis management class. This class handles the low-level functions with text-to-speech engines (i.e., synthesizers or voices).

The speech synthesis management class, ChantTTS, enables you to establish a session with a synthesizer, through which speech is synthesized from text. Your application uses the ChantTTS class to manage the synthesizer resources on behalf of the application. The ChantTTS class manages the resources and interacts directly with a synthesizer application program interface (API). The speech APIs that the ChantTTS class supports are listed in the Feature Summary.

Your application receives notification of synthesis processing states through event callbacks.

SpeechKit Architecture for Speech Synthesis

The ChantTTS class encapsulates all of the technologies necessary to make the process of synthesizing speech simple for your application. Optionally, it can save the session properties for your application to ensure they persist across application invocations.

The ChantTTS class simplifies the process of synthesizing speech by handling the low-level activities directly with a synthesizer.

You instantiate a ChantTTS class object before you want to synthesize speech within your application. You destroy the ChantTTS class object and release its resources when you no longer want to synthesize speech within your application.
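
One way to make the create, use, and release sequence explicit in Java is a try-with-resources block. The sketch below assumes a hypothetical `SketchSynthesisSession` that implements `AutoCloseable`. It is not the ChantTTS class, it produces no audio, and whether the SpeechKit Java library offers an equivalent is not stated here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a synthesis session; it produces no audio.
// It only records requests so the create/use/release order is visible.
final class SketchSynthesisSession implements AutoCloseable {
    private final List<String> spoken = new ArrayList<>();
    private boolean open = true;

    // Queue text for synthesis; rejected once the session has been released.
    void speak(String text) {
        if (!open) {
            throw new IllegalStateException("session already released");
        }
        spoken.add(text);
    }

    // Return a copy of the text requested so far.
    List<String> spokenSoFar() {
        return new ArrayList<>(spoken);
    }

    boolean isOpen() {
        return open;
    }

    // Release resources; called automatically at the end of try-with-resources.
    @Override
    public void close() {
        open = false;
    }
}
```

Wrapping the session in `try (SketchSynthesisSession tts = new SketchSynthesisSession()) { ... }` ensures `close` runs even if the body throws, so the session's resources are not held longer than the application needs them.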

Feature Summary

Chant SpeechKit handles the complexities of speech recognition and speech synthesis. The classes minimize the programming efforts necessary to construct software that speaks and listens.

A SpeechKit application can:

  • Control application functions by speaking rather than having to use a mouse or keyboard;
  • Prompt users for applicable data capture;
  • Capture data by speaking rather than typing;
  • Confirm data capture with spoken or audio acknowledgement;
  • Transcribe audio buffers, files, and streams to text; and
  • Synthesize speech to audio buffers, files, and streams.

Recognizers provide proprietary programming interfaces (i.e., APIs). SpeechKit supports the following recognizers and their APIs:

Recognizer                                       Speech API      Platforms
Microsoft SAPI 5 (all languages)                 SAPI 5          Win64, Win32
Microsoft Speech Platform (all languages)        MSP             Win64, Win32
Nuance Dragon NaturallySpeaking (all languages)  Dragon COM API  Win64, Win32
Nuance VoCon 3200 V2 (all languages)             VoCon 3200 V2   Win32
Nuance VoCon 3200 V3 (all languages)             VoCon 3200 V3   Win32
Nuance VoCon 3200 V4 (all languages)             VoCon 3200 V4   Win32

Synthesizers provide proprietary programming interfaces (i.e., APIs). SpeechKit supports the following synthesizers and their APIs:

Synthesizer                                  Speech API            Platforms
Acapela (all languages)                      BabTTS                Win64, Win32
Acapela (all languages)                      NSCAPI                Win64, Win32
Cepstral (all languages)                     Cepstral Swift        Win64, Win32
CereProc (all languages)                     CereVoice             Win64, Win32
Microsoft SAPI 5 (all languages)             SAPI 5                Win64, Win32
Microsoft Speech Platform (all languages)    MSP                   Win64, Win32
Nuance Vocalizer Automotive (all languages)  Vocalizer Automotive  Win32
Nuance Vocalizer Expressive (all languages)  Vocalizer Expressive  Win64, Win32
Nuance Vocalizer Mobile (all languages)      Vocalizer Mobile      Win32
Nuance Vocalizer Network (all languages)     Vocalizer Network     Win64, Win32

Within Chant Developer Workbench, you can:

  • Enumerate audio devices and speech engines for selection and testing of audio-, recognizer-, and synthesizer-specific features;
  • Trace audio, recognition, and synthesis events;
  • Support grammar activation and testing (requires GrammarKit); and
  • Support TTS markup playback (requires VoiceMarkupKit).

Audio Device Management: Enumerate audio devices and inspect device properties.

Recognizer Management: Enumerate and test recognizers. Use the Speech Recognition window to recognize speech from a microphone or prerecorded audio, or to simulate recognition from text. Trace recognition events in the Events window.

Synthesizer Management: Enumerate and test synthesizers. Use the Speech Synthesis window to synthesize text. Trace synthesis events in the Events window.

SpeechKit License

You may explore the capabilities of Chant SpeechKit for 30 days. To continue to use the product after 30 days, you must purchase a license for the software or stop using the software and remove it from your system.

A valid purchased license gives you the right to construct executable applications that use the applicable class library and distribute it with executable applications without royalty obligations to Chant.

The Chant SpeechKit license is a single end-user license. Each developer who installs and uses SpeechKit to develop applications must have their own license.

SpeechKit class library names vary by platform: Windows 32-bit and 64-bit. This helps ensure the correct library is deployed with your application.

You may purchase a license for Chant SpeechKit online at the Chant store or through your preferred software reseller.

SpeechKit System Requirements

Development Environment

  • Intel processor or equivalent,
  • Microsoft Windows 10, 8, or 7,
  • 120 MB of hard drive space,
  • CD-ROM drive,
  • VGA or higher-resolution monitor,
  • Microsoft SAPI 5, Microsoft Speech Platform, Nuance Dragon NaturallySpeaking, or Nuance VoCon 3200 recognizer,
  • Acapela, Cepstral, CereProc, Microsoft SAPI 5, Microsoft Speech Platform, or Nuance Vocalizer synthesizer,
  • C++, C++Builder, Delphi, Java, .NET Framework, or Silverlight development environment, and
  • Close-talk microphone.

Where to Buy

Order Chant SpeechKit online at the Chant store or through your preferred software reseller.
