The authors present the creation of INDICVOICES, a dataset of natural and spontaneous speech covering 22 Indian languages, aiming to capture cultural and linguistic diversity in India.
Creating a diverse and representative speech dataset for Indian languages through inclusive data collection efforts.
Creating a diverse and representative speech dataset for Indian languages to support speech technology development.