10th Indian Delegation to Dubai, Gitex & Expand North Star – World’s Largest Startup Investor Connect
Social Media

Meta AI unveils Voicebox: A revolutionary text-to-speech (TTS) generator with unprecedented speed and generalization abilities

Meta AI has unveiled a groundbreaking text-to-speech (TTS) generator called Voicebox. This new system claims to be up to 20 times faster than existing AI models while delivering comparable performance. Unlike traditional TTS architecture, Voicebox adopts a model similar to OpenAI’s ChatGPT and Google’s Bard.

One of the key distinctions of Voicebox from other TTS models like ElevenLabs Prime Voice AI is its ability to generalize through in-context learning. While previous attempts to use large audio datasets resulted in degraded audio outputs, Voicebox overcomes this challenge with a unique training scheme. It abandons labels and curation in favor of an architecture capable of “in-filling” audio information.

Voicebox stands out as the first model capable of accomplishing speech-generation tasks it wasn’t specifically trained for, achieving state-of-the-art performance. It can translate text to speech, remove unwanted noise, synthesize replacement speech, and even apply a speaker’s voice to different language outputs using just the desired output text and a three-second audio clip.

The release of powerful speech generation technology comes at a crucial time when social media companies grapple with moderation challenges, and the United States faces an upcoming presidential election that could strain online misinformation detection.

To address concerns of potential misuse, Meta has developed a tool to detect speech generated by Voicebox, claiming it can easily differentiate between real and fake audio. The company acknowledges the potential risks associated with such powerful AI technology and has implemented measures to mitigate them.

In the world of cryptocurrencies, AI has become an integral part of daily operations for many businesses. Major exchanges rely on AI chatbots for customer interactions and sentiment analysis, while trading bots have become commonplace.

Meta’s Voicebox represents a significant advancement in text-to-speech technology, offering faster performance and the ability to generalize in various speech-generation tasks. However, as with any powerful AI innovation, the responsible and ethical use of this technology remains crucial.

by 9to5mac

Apple just launched AirPods Pro 3 this fall, but rumors for the next model of AirPods Pro are already coming, highlighting three unique changes Apple may have on the horizon. Rumors indicate Apple is breaking from its norm with next AirPods Pro launch All signs indicate Apple is about to get experimental with AirPods Pro. There’s a new model coming with IR cameras, which Bloomberg says will enable Apple Intelligence capabilities. But outside of the actual features of… Source link

by 9to5mac

Apple is reportedly delaying the launch of the iPhone Air 2. The Information reports that Apple recently “notified engineers and suppliers that they were taking the next iPhone Air off the schedule without providing a new release date.” The report cites “three people involved in the project.” iPhone Air 2 release delayed The second-generation iPhone Air was initially set to launch next fall alongside the iPhone 18 Pro and iPhone Fold. According to The Information, the… Source link

by CNBCTV

The 245th Report of the Parliamentary Standing Committee calls for a review of the IT Act 2000 since many of the serious offences are bailable; it has recommended amending the Act to make the offences severely punishable and to make intermediaries responsible for compensating victims, notes former Central Board of Indirect Tax & Customs chairman Najib Shah. Source link