10th Indian Delegation to Dubai, Gitex & Expand North Star – World’s Largest Startup Investor Connect
All News

EleutherAI Launches Open-Source English-Hindi Bilingual Model, Hi-NOLIN

EleutherAI in collaboration with INCITE Project, AAI CERC lab at the Université de Montréal, have introduced Hi-NOLIN, an open-source English-Hindi bilingual model. 

Hi-NOLIN’s journey began with the goal of creating the first open-source English-Hindi bilingual model. Researchers expanded the 7B Pythia architecture to a 9B model, enhancing efficiency on their hardware while training on the 300B token Pile text corpus, encompassing both English and code data. Hi-NOLIN stands out for its ability to transition seamlessly between languages, mastering both Hindi and English, while processing code.

As researchers continue training Hi-NOLIN, leveraging the Summit supercomputer with its unique 6 GPUs per node configuration, preliminary results demonstrate remarkable potential. Despite being far from convergence, the 9B model shows a steady reduction in training loss and promises substantial improvements. 

Employing advanced techniques from GPT-NeoX, Megatron-LM, and DeepSpeed, Hi-NOLIN utilizes 3D parallelism and ZeRO redundancy optimizer, maximizing its training resources and computational prowess.

Hi-NOLIN shines through in various standard LLM benchmarks, including HellaSwag, TruthfulQA, Arc, and Human Eval. Remarkably, even in its preliminary stage with 600B tokens, Hi-NOLIN outperforms Pythia 12B and multilingual Bloom models across most evaluation benchmarks, narrowing the gap with LLaMa 2 models.

In a landscape dominated by English language models, Hi-NOLIN is a significant stride towards linguistic inclusivity, addressing the gap in state-of-the-art language models for non-English languages. 

EleutherAI  is a non-profit research group dedicated to the development of open-source LLMs. The group was founded by a group of hackers—namely, Connor Leahy, Sid Black, and Leo Gao in 2020  who wanted to create a more accessible and transparent alternative to commercial LLMs.
Meanwhile, Indian IT firm Tech Mahindra intends to launch Project Indus, its LLM designed for Hindi and its 37 dialects, by the end of December or early January.

The post EleutherAI Launches Open-Source English-Hindi Bilingual Model, Hi-NOLIN appeared first on Analytics India Magazine.

by Siliconluxembourg

Would-be entrepreneurs have an extra helping hand from Luxembourg’s Chamber of Commerce, which has published a new practical guide. ‘Developing your business: actions to take and mistakes to avoid’, was written to respond to  the needs and answer the common questions of entrepreneurs.  “Testimonials, practical tools, expert insights and presentations from key players in our ecosystem have been brought together to create a comprehensive toolkit that you can consult at any stage of your journey,” the introduction… Source link

by WIRED

B&H Photo is one of our favorite places to shop for camera gear. If you’re ever in New York, head to the store to check out the giant overhead conveyor belt system that brings your purchase from the upper floors to the registers downstairs (yes, seriously, here’s a video). Fortunately B&H Photo’s website is here for the rest of us with some good deals on photo gear we love. Save on the Latest Gear at B&H Photo B&H Photo has plenty of great deals, including Nikon’s brand-new Z6III full-frame… Source link

by Gizmodo

Long before Edgar Wright’s The Running Man hits theaters this week, the director of Shaun of the Dead and Hot Fuzz had been thinking about making it. He read the original 1982 novel by Stephen King (under his pseudonym Richard Bachman) as a boy and excitedly went to theaters in 1987 to see the film version, starring Arnold Schwarzenegger. Wright enjoyed the adaptation but was a little let down by just how different it was from the novel. Years later, after he’d become a successful… Source link