Logo
Sign in
Product Logo
Intelligent Speech InteractionAlibaba Cloud

Intelligent Speech Interaction is developed based on state-of-the-art technologies such as speech recognition, speech synthesis, and natural language understanding. Enterprises can integrate Intelligent Speech Interaction into their products to enable them to listen, understand, and converse with users, providing users with an immersive human-computer interaction experience.

Vendor

Vendor

Alibaba Cloud

Company Website

Company Website

TB1BlLOkgHqK1RjSZFPXXcwapXa-580-350.jpg
Product details

Overview

Intelligent Speech Interaction is developed based on state-of-the-art technologies such as speech recognition, speech synthesis, and natural language understanding. Enterprises can integrate Intelligent Speech Interaction into their products to enable them to listen, understand, and converse with users, providing users with an immersive human-computer interaction experience. Intelligent Speech Interaction is currently available in Mandarin Chinese, Cantonese Chinese, English, Japanese, Korean, French and Indonesian, and please stay tuned for other languages. Intelligent Speech Interaction is suitable for various scenarios, including intelligent Q&A, intelligent quality inspection, real-time subtitling for speeches, and transcription of audio recordings. Intelligent Speech Interaction has been successfully applied in many industries such as finance, insurance, e-commerce and smart home. Intelligent Speech Interaction allows you to use self-learning platform to improve speech recognition accuracy, and provides a comprehensive management console and easy-to-use SDKs. You are welcome to activate Intelligent Speech Interaction.

Benefits

  • High Recognition Accuracy Alibaba Cloud is the first cloud service provider in China to use word-level LC-BLSTM and DFSMN-CTC models. Compared with the traditional CTC method in the industry, these models reduce the error rate by 20%, greatly improving the accuracy of speech recognition.
  • Ultra-high Decoding Speed Alibaba Cloud is the first cloud service provider in China to use the low frame rate (LFR) decoding technology. This technology increases the decoding speed by more than three times without compromising recognition accuracy, greatly shortening response time and improving user experience
  • Novel Self-learning Platform Intelligent Speech Interaction is the first system in the industry that provides a self-learning platform. It allows you to specify hotwords, and upload business-related data to build specific models for better recognition accuracy.
  • Extensive Industry Coverage Currently, Intelligent Speech Interaction has customers in a wide variety of industries, such as finance, insurance, e-commerce, and smart home. It is ideal for various scenarios, including intelligent Q&A, intelligent quality inspection, real-time subtitling for speeches, and voice assistants.