Introduction
Overview
The SpeechToText SDK provides a voice recognition interface that lets infotainment apps running on Pleos Connect integrate with the Pleos Voice Recognition Platform (Speech Service). The SDK sends the user's voice to the platform and delivers the recognized text back to the app. As a result, the app can use the 42dot voice recognition service without managing microphone control or input/output settings itself.
Development Environment Setup
To use the SpeechToText SDK, install the Pleos Connect Emulator in Android Studio, then set up an Android Virtual Device (AVD).
SpeechToText API
The SpeechToText SDK provides the following APIs.
| Function | Summary |
|---|---|
| initialize API | Initializes the SpeechToText SDK. |
| release API | Releases the SpeechToText SDK. |
| registerApp API | Registers an app to use server-based speech recognition features. |
| addListener API | Adds an event listener for voice recognition results. |
| removeListener API | Removes a registered event listener. |
| request API | Requests real-time speech recognition. |
| stop API | Stops the voice recognition currently in progress. |
| sendAudio API | Extracts text from stored PCM audio data. |
| completeAudioSend API | Completes sending PCM audio data. |
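
To make the relationship between these APIs more concrete, the following is a minimal sketch of one plausible call order, written against a hypothetical stand-in rather than the real SDK. Only the method names are taken from the table above. The `SpeechToText` interface, every parameter and return type, the `FakeSpeechToText` class, the app identifier, the canned result strings, and the call ordering itself are assumptions made for illustration; consult each API's reference for the actual signatures and requirements.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Consumer;

public class SttLifecycleSketch {

    /** Hypothetical stand-in: only the method names come from the API table. */
    interface SpeechToText {
        void initialize();
        void release();
        void registerApp(String appId);
        void addListener(Consumer<String> listener);
        void removeListener(Consumer<String> listener);
        void request();
        void stop();
        void sendAudio(byte[] pcmChunk);
        void completeAudioSend();
    }

    /** In-memory fake that records each call and emits canned results; not the real SDK. */
    static class FakeSpeechToText implements SpeechToText {
        final List<String> calls = new ArrayList<>();
        private final List<Consumer<String>> listeners = new ArrayList<>();
        private int bytesReceived = 0;

        public void initialize() { calls.add("initialize"); }
        public void release() { calls.add("release"); }
        public void registerApp(String appId) { calls.add("registerApp"); }
        public void addListener(Consumer<String> l) { calls.add("addListener"); listeners.add(l); }
        public void removeListener(Consumer<String> l) { calls.add("removeListener"); listeners.remove(l); }
        public void request() { calls.add("request"); emit("hello (live)"); }
        public void stop() { calls.add("stop"); }
        public void sendAudio(byte[] pcm) { calls.add("sendAudio"); bytesReceived += pcm.length; }
        public void completeAudioSend() {
            calls.add("completeAudioSend");
            emit("recognized " + bytesReceived + " bytes");
        }

        private void emit(String text) {
            for (Consumer<String> l : new ArrayList<>(listeners)) {
                l.accept(text);
            }
        }
    }

    /** Runs one real-time request and one stored-audio request; returns the texts received. */
    static List<String> runSession(SpeechToText stt, byte[] storedPcm, int chunkSize) {
        List<String> results = new ArrayList<>();
        Consumer<String> listener = results::add;

        stt.initialize();
        stt.registerApp("com.example.demo"); // hypothetical app identifier
        stt.addListener(listener);
        try {
            // Real-time recognition: start, then stop (assumed ordering).
            stt.request();
            stt.stop();

            // Stored PCM audio: send in chunks, then signal completion.
            for (int off = 0; off < storedPcm.length; off += chunkSize) {
                int end = Math.min(off + chunkSize, storedPcm.length);
                stt.sendAudio(Arrays.copyOfRange(storedPcm, off, end));
            }
            stt.completeAudioSend();
        } finally {
            // Undo setup in reverse order even if a call above throws.
            stt.removeListener(listener);
            stt.release();
        }
        return results;
    }

    public static void main(String[] args) {
        FakeSpeechToText fake = new FakeSpeechToText();
        List<String> results = runSession(fake, new byte[10], 4);
        System.out.println(fake.calls);
        System.out.println(results);
    }
}
```

The sketch pairs each setup call with its counterpart (`initialize` with `release`, `addListener` with `removeListener`) inside a `try`/`finally`, so the listener is removed and the SDK is released even when recognition fails partway through. Whether the real SDK requires this exact order, or allows `request` and `sendAudio` in the same session, is not stated here and should be confirmed against the individual API pages.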