AI4Bharat aims to gather 15,000 hours of transcribed data from more than 400 districts, covering all 22 scheduled languages of India. Alongside this effort, its in-house team of more than 100 translators is building a parallel corpus of 2.2 million translation pairs across 22 languages.
“This…








