Internet giant AI field scuffle, why is speech technology a major event?

In the second half of the Internet, many giant companies have set their sights on artificial intelligence (AI). More accurately, it is the speech recognition technology in the field of artificial intelligence.

At the launch of the hammer M1 mobile phone, the University of Science and Technology has a fast and accurate voice-converted text function, which makes the general public intuitively impressed with the speech recognition technology.

This year's Yunqi Conference Hangzhou main venue and other sub-meeting venues, Alibaba Cloud's "small AI" robot instant text interpretation function implies that the robot is coming to grab the simultaneous interpretation of the rice bowl.

Coincidentally, at the 3rd World Internet Conference, Sogou also launched a real-time machine translation product. This product can not only quickly convert the speech of Sogou CEO Wang Xiaochuan into text, but also make an English translation. Perhaps in the future, speech recognition technology will really make the simultaneous translation of the scene unemployed.

â–² Sogou CEO Wang Xiaochuan shows voice real-time translation technology (Source: Sogou mobile phone input method Weibo)

On November 22nd, Baidu announced the opening of four new voice technology interfaces, namely emotional synthesis, far-field solution, wake-up phase II technology and long-voice solution. Baidu pointed out that these technologies have great potential to solve the problems that people are generally troubled with when using speech recognition technology.

For example, the far-field solution can increase the range of speech recognition to 3 to 5 meters. The “small robot” of the Shanghai KFC flagship store can use this technology to answer at any time. Another example is emotional synthesis, which adds emotion to synthetic speech to achieve the effect of real human voice.

The above-mentioned Internet giants, despite their different focuses, rely heavily on speech recognition technology because speech recognition is the most convenient way of human-computer interaction and an important entry point for artificial intelligence. Wu Enda, the chief scientist of Baidu, made a new breakthrough in speech recognition technology and confidently told the media, "We are already at the dawn of artificial intelligence."

The speech recognition technology consists of two levels, one is to interpret the speech by text; the other is to convert the speech signal into a command to control the operation of the robot. At present, the speech text translation has achieved good results. Some companies have achieved a voice input accuracy rate of 97%. Sogou's voice translation has an accuracy rate of 90%.

Next, Internet companies need to improve voice commands, such as increasing the speed at which the machine recognizes speech, and making accurate actions.

Smartwatch

Smartwatch

Smartwatch,Smart Watch,Screen Touch Watch,G Shock Smart Watch

everyone enjoys luck , http://www.eeluckwatch.com

This entry was posted in on