

SpeechEvalPro is an advanced pronunciation evaluation and scoring API designed to deliver precise and multi-faceted assessments for both Chinese and English pronunciation. Leveraging cutting-edge voice analysis, speech recognition, and other core technologies, it ensures accurate and dependable pronunciation evaluations for educational applications.
SpeechEvalPro is a pronunciation evaluation and scoring API that provides high-quality, multi-dimensional assessments for Chinese and English pronunciation. It utilizes advanced voice evaluation and speech recognition technologies to deliver accurate and reliable assessments for educational purposes.
To use SpeechEvalPro, sign up for a free trial or select a pricing plan. Once you have access, integrate the API into your educational product or application via HTTP or WebSocket requests. The API accepts audio files in recommended formats and supports various evaluation modes, including phoneme, word, sentence, and chapter. Refer to the documentation for detailed usage instructions.
Currently, an SDK is not available. You can directly use the WebAPI, which offers streaming capabilities and is lightweight and cross-platform.
It is recommended to use audio files with a 16-bit sample size, 16K sample rate, 1 channel in opus_raw, pcm, wav, or mp3 formats. Other formats may affect the scoring results.
SpeechEvalPro supports phoneme, word, sentence, and chapter (paragraph) modes. The duration and text length restrictions vary by mode. For phoneme & word mode, the duration is up to 20 seconds. For sentence mode, the duration is up to 40 seconds with a text length of fewer than 300 characters. For chapter mode, the duration is up to 300 seconds with a text length of fewer than 10,000 characters. Refer to the documentation for specific details.