CSA Talk: Indian Language Processing

Delivered a talk titled "Indian Language Processing: Challenges and Opportunities" for the Computer Science Association (CSA), BITS Pilani, Hyderabad Campus.

India’s remarkable linguistic diversity (22 constitutionally recognized languages, multiple scripts, and hundreds of dialects) makes it one of the most challenging yet rewarding settings for NLP research. The talk surveyed the core challenges: severe data scarcity for most Indian languages, morphological richness, script heterogeneity, and the prevalence of code-switching in everyday text and speech. It also covered recent progress in multilingual and cross-lingual models, and highlighted concrete ways in which undergraduate students can contribute to this research.