Meta’s New AI Models Can Recognize and Produce Speech For More Than 1,000 Languages

Meta has built AI models that can recognize and produce speech for more than 1,000 languages — a tenfold increase on what’s currently available. It’s a significant step toward preserving languages that are at risk of disappearing, the company says.

Meta is releasing its models to the public via the code hosting service GitHub. It claims that making them open source will help developers working in different languages to build new speech applications — like messaging services that understand everyone, or virtual-reality systems that can be used in any language. There are around 7,000 languages in the world, but existing speech recognition models cover only about 100 of them comprehensively. This is because these kinds of models tend to require huge amounts of labeled training data, which is available for only a small number of languages, including English, Spanish, and Chinese. Meta researchers got around this problem by retraining an existing AI model developed by the company in 2020 that is able to learn speech patterns from audio without requiring large amounts of labeled data, such as transcripts.

Meta’s New AI Models Can Recognize and Produce Speech For More Than 1,000 Languages

Published by admin on May 22, 2023

Pan Am Plane Crash That Inspired Modern Safety Briefings Found After 74 Years

The US Army Is Burning Through Its AI Tokens

😎

Meta’s New AI Models Can Recognize and Produce Speech For More Than 1,000 Languages

Published by admin on May 22, 2023

Related Posts

Pan Am Plane Crash That Inspired Modern Safety Briefings Found After 74 Years

The US Army Is Burning Through Its AI Tokens

😎