The focus of AI4Bhārat, an initiative of IIT-Madras, is on building open-source language AI for Indian languages, including datasets, models, and applications.
We are pleased to announce the launch of the Nilekani Center at AI4Bharat, IIT Madras on 28th July. The Center's mission is to innovate on open-source Indian language technology with the aim of creating societal impact.
🚀 Announcing the Indic LLM-Arena 🇮🇳
At AI4Bharat (IIT Madras), our mission has always been clear - build open, inclusive, and world-class AI for Indian languages.
To further this goal, today, we’re introducing the Indic LLM-Arena, a crowd-sourced, human-in-the-loop leaderboard
🚀 AI4Bharat: Advancing Indian Language AI - Open & Scalable! 🇮🇳✨
Over the past 4 years, we at AI4Bharat have been on a mission to accelerate Indian language AI 🚀, building large-scale datasets, models, and tools and releasing everything open-source for the community. Now, all
AI4Bharat began with a handful of students at IIT Madras exploring deep learning: building small models, testing computer vision, and translating text. What started as experiments quickly became a focused mission: making AI work for India’s many languages.
Today, AI4Bharat’s
🚨🚨 Paper Alert!! 🚨🚨
New #LLMs supporting Indian Languages come out each day, yet no reliable benchmarks exist to evaluate them. Introducing our work: MILU - A Multi-task Indic Language Understanding Benchmark - a comprehensive benchmark for evaluating LLMs for 11 Indian
A course on LLMs will be offered by Prof Mitesh Khapra. If you are a beginner, or have some experience and are looking to deepen your knowledge, this course is for you. Everything will be covered, from theory and fundamentals to LLMs in practice.
Introducing Indic Parler-TTS: Open-Source Text-to-Speech for Over a Billion Indic Speakers! 🌏
In collaboration with @huggingface, we are excited to release Indic Parler-TTS, a state-of-the-art open-source text-to-speech system designed to bring accessible and high-quality
We are pleased to announce that we will begin recruiting AI residents (and associates) for 2024-25. The AI resident program is a year-long pre-doctoral program that allows you to work intensively on NLP, Speech and Vision projects.
Apply below:
Happy to have been a part of this effort with @SarvamAI.
This is a significant leap from our past sentence-level work — now supporting multi-format, document-level translation. Big step forward for Indian languages!
Now on HuggingFace to download + deploy. Also fast API access
New model drop - Sarvam-Translate is here. Can translate between 22 Indian languages & English. Significantly better than much larger models. Improves on nuance, long-form, structured text. Available as super-fast APIs. Try it here: dashboard.sarvam.ai/translate
📢 Presenting IndicSeamless: A Speech Translation Model for Indian Languages 🎙️🌍
IndicSeamless is a speech translation model fine-tuned from SeamlessM4Tv2-large on 13 Indian languages. Trained on a curated subset of BhasaAnuvaad, the largest open-source Speech Translation