In the current implementation of the Speech-to-Text (STT) feature on iOS using MicrosoftCognitiveServicesSpeech-iOS (1.41.1), the addRecognizedEventHandler provides complete and detailed information, including properties such as Lexical, Nbest, and confidence scores. However, the addRecognizingEventHandler, which is designed for real-time processing, only provides partial results with limited information.
This makes it difficult to process intermediate results effectively in real time, because the important metadata only becomes available once the final result is delivered through addRecognizedEventHandler.
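To make the gap concrete, here is a minimal sketch of how the detailed metadata can be decoded from a final result's JSON payload, and what the same decoder finds in a partial payload. It uses only Foundation and does not call the SDK. Everything in it is an assumption for illustration: the two sample JSON strings and their field names (`NBest`, `Lexical`, `Confidence`, `Display`, `Text`), and the `NBestEntry`, `DetailedResult`, and `parseDetailedResult` names. In a real app the JSON string would presumably come from the result's property collection (for example via `SPXPropertyId.speechServiceResponseJsonResult`, with the detailed output format enabled).

```swift
import Foundation

// One alternative hypothesis in the detailed-format JSON (field names assumed).
struct NBestEntry: Decodable {
    let confidence: Double
    let lexical: String
    let display: String

    enum CodingKeys: String, CodingKey {
        case confidence = "Confidence"
        case lexical = "Lexical"
        case display = "Display"
    }
}

// Both fields are optional so that a partial payload still decodes.
struct DetailedResult: Decodable {
    let displayText: String?
    let nBest: [NBestEntry]?

    enum CodingKeys: String, CodingKey {
        case displayText = "DisplayText"
        case nBest = "NBest"
    }
}

// Hypothetical helper: returns nil if the string is not valid JSON.
func parseDetailedResult(_ json: String) -> DetailedResult? {
    guard let data = json.data(using: .utf8) else { return nil }
    return try? JSONDecoder().decode(DetailedResult.self, from: data)
}

// Illustrative payloads, not captured from the service.
let sampleFinalJSON = """
{"RecognitionStatus":"Success","DisplayText":"Hello world.","NBest":[{"Confidence":0.93,"Lexical":"hello world","ITN":"hello world","MaskedITN":"hello world","Display":"Hello world."}]}
"""

let samplePartialJSON = """
{"Text":"hello wor","Offset":100000,"Duration":5000000}
"""

// Final result: Lexical and Confidence are available.
if let finalResult = parseDetailedResult(sampleFinalJSON),
   let best = finalResult.nBest?.first {
    print("final lexical: \(best.lexical), confidence: \(best.confidence)")
}

// Partial result: the decoder finds no NBest array to work with.
if let partial = parseDetailedResult(samplePartialJSON) {
    print("partial has NBest: \(partial.nBest != nil)")
}
```

If partial results carried the same `NBest` array, the same decoding path could serve both handlers without special-casing intermediate events.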
I am requesting an enhancement to the addRecognizingEventHandler to include the following properties:
• Lexical
• Nbest
Use Case:
For real-time speech recognition on iOS, having access to these properties in addRecognizingEventHandler would:
1. Allow developers to display more meaningful and accurate partial results to users.
2. Enable advanced real-time processing, such as handling alternative results (Nbest).
3. Reduce dependency on addRecognizedEventHandler for information that could enhance real-time user experiences.
By including these enhancements, addRecognizingEventHandler would align better with the capabilities of addRecognizedEventHandler and significantly improve real-time STT applications.
This improvement will enhance the usability of the STT service for iOS developers by providing consistent and richer data for both intermediate and final results.
Framework Version:
MicrosoftCognitiveServicesSpeech-iOS (1.41.1)