BharatGen AI to Support All 22 Indian Languages by June 2026, Covering Text, Speech & Vision Models

 

India’s ambition to develop a sovereign artificial intelligence framework tailored for its linguistic and cultural diversity is gaining momentum. BharatGen, the government-backed national AI initiative, is at the heart of this vision, aiming to create foundational AI models for all 22 scheduled Indian languages by June 2026. This strategic move will not only bridge the linguistic digital divide but also enhance India’s technological independence.

 

Current Language Coverage and Expansion Plan 

At present, BharatGen covers nine major Indian languages: Hindi, Marathi, Tamil, Malayalam, Bengali, Punjabi, Gujarati, Telugu, and Kannada. By December 2025, this list is set to expand to 15 languages, with Assamese, Maithili, Nepali, Odia, Sanskrit, and Sindhi joining the roster. The complete rollout to include all 22 scheduled languages is targeted for mid-2026.

 

Technological Scope and Applications 

BharatGen’s AI capabilities span multiple modalities, including:

 

Large Language Models (LLMs) for text

Text-to-Speech (TTS) systems

Automatic Speech Recognition (ASR)

Vision-Language systems

 

The initiative has already developed pilot applications for agriculture, governance, and defence. These will be implemented nationwide after the full deployment of the platform.

 

Organisational Structure and Leadership 

BharatGen operates under the Department of Science and Technology’s National Mission on Interdisciplinary Cyber-Physical Systems (NM-ICPS). The TIH Foundation for IoT and IoE at IIT Bombay acts as the central hub, managing execution, academic collaboration, and ecosystem partnerships for compute resources, data, and talent.

 

IITM Pravartak Technologies Foundation at IIT Madras plays a critical role as an implementation partner, focusing on governance, security, and media-oriented applications.

 

Key Consortium Members and Contributions:

 

IIT Bombay: Lead institution overseeing research and integration

IIIT Hyderabad: Vision-language document modelling

IIT Madras: Speech model development and evaluation

IIT Kanpur: Legal AI research and multilingual tokenisation strategies

IIT Hyderabad: Vocabulary optimisation for multilingual LLMs

IIT Mandi: Inclusive multilingual model development and efficient training methods

IIM Indore: Bharat-centric benchmarking and multilingual data collection

 

Government Vision and Future Deployment 

Union Minister Dr Jitendra Singh confirmed that BharatGen is still in its pilot phase and not yet accessible to the public. However, plans are in place to deploy the system across all states and districts once fully operational. The government is also exploring potential collaborations with additional research institutions in Karnataka.

 

Conclusion 

BharatGen represents a milestone in India’s AI journey, aiming to empower millions of citizens by enabling advanced AI capabilities in their native languages. With a clear roadmap and strong institutional backing, the initiative promises to transform how AI serves India’s diverse population, reinforcing technological sovereignty and inclusivity.

 

Follow Before You Take on

Latest Technology News | Updates | Latest Electric Vehicle News | Updates | Electronics News | Mobile News | Updates | Software Updates

Facebook | Twitter | WhatsApp Channel | Instagram | Telegram | Threads | LinkedIn | YouTube

 

Stay informed, Stay Connected!

The post BharatGen AI to Support All 22 Indian Languages by June 2026, Covering Text, Speech & Vision Models appeared first on Before You Take.

Leave a Reply

Your email address will not be published. Required fields are marked *

*