10th Indian Delegation to Dubai, Gitex & Expand North Star – World’s Largest Startup Investor Connect
All News

OpenAI Silently Unveils Whisper 3, A New Generation Open Source ASR Model

During its inaugural Developer Day, AI startup OpenAI released a series of open-source models. The slew of products included an upgraded version of its open-source automatic speech recognition model, Whisper large-v3. The company’s future plans involve making the model’s API accessible to users.

The models for English-only applications tend to perform better, especially for the `tiny.en` and `base.en` models as per the official page. The model’s  performance varies widely depending on the language.

(Source: OpenAI)

Initially focused on English, the neural net model was released in September last year. Then it got an upgraded version 2 in December which was enhanced to support multiple languages, although specific languages were not explicitly mentioned. 

Accessible on GitHub under a permissive license, Whisper large-v3 effortlessly transcribes various content for users and has been called the best transcription tool out there. The model features a unique timestamp section that facilitates its application as subtitles on platforms such as YouTube.

The tool initiates the process by segmenting audio into 30-second clips, converting them, and subsequently passing them through an encoder and decoder, which predict the corresponding text caption. Technical intricacies also involve language identification, facilitating multilingual speech transcription, and translation to English.

The model was initially expected to be integrated with ChatGPT, to let the users converse directly with the chatbot through speech. But OpenAI then decided to release the model to the public directly. Interestingly, Whisper is not aimed at the end users as of now but rather at researchers. 

The reason for open-sourcing as per OpenAI was to “serve as a foundation for building useful applications and for further research on robust speech processing“. OpenAI’s AI tool was honed using an extensive dataset of 680,000 hours of meticulously supervised data sourced from the internet, with one third portion originating from non-English sources. 

The post OpenAI Silently Unveils Whisper 3, A New Generation Open Source ASR Model appeared first on Analytics India Magazine.

by Siliconluxembourg

Would-be entrepreneurs have an extra helping hand from Luxembourg’s Chamber of Commerce, which has published a new practical guide. ‘Developing your business: actions to take and mistakes to avoid’, was written to respond to  the needs and answer the common questions of entrepreneurs.  “Testimonials, practical tools, expert insights and presentations from key players in our ecosystem have been brought together to create a comprehensive toolkit that you can consult at any stage of your journey,” the introduction… Source link

by WIRED

B&H Photo is one of our favorite places to shop for camera gear. If you’re ever in New York, head to the store to check out the giant overhead conveyor belt system that brings your purchase from the upper floors to the registers downstairs (yes, seriously, here’s a video). Fortunately B&H Photo’s website is here for the rest of us with some good deals on photo gear we love. Save on the Latest Gear at B&H Photo B&H Photo has plenty of great deals, including Nikon’s brand-new Z6III full-frame… Source link

by Gizmodo

Long before Edgar Wright’s The Running Man hits theaters this week, the director of Shaun of the Dead and Hot Fuzz had been thinking about making it. He read the original 1982 novel by Stephen King (under his pseudonym Richard Bachman) as a boy and excitedly went to theaters in 1987 to see the film version, starring Arnold Schwarzenegger. Wright enjoyed the adaptation but was a little let down by just how different it was from the novel. Years later, after he’d become a successful… Source link