The Hub invites African researchers, linguists, startups, technologists and community organizations to build high-quality, inclusive and community-driven datasets for 50 African languages.
The Masakhane African Languages Hub (the Hub) has announced a major Request for Proposals (RFP) to address the severe underrepresentation of African languages in Artificial Intelligence (AI). The Hub invites African researchers, linguists, startups, technologists and community organizations to build high-quality, inclusive and community-driven datasets for 50 African languages.
While Africa is home to over 2,000 languages, they are almost absent from the global digital landscape; none of the top 34 languages used globally on the internet are African. This imbalance risks not only excluding more than one billion speakers from emerging technologies but also perpetuating harmful biases and inaccuracies that can undermine equitable use of new technologies.
Supported by Google.org, and FCDO, IDRC and the Gates Foundation, the new project aims to counter this trend by ensuring AI technologies are built on accurate, inclusive, culturally relevant, and ethically sourced data.
The Hub is calling for Expressions of Interest (EOI) across three core pillars:
- Automatic Speech Recognition (ASR): Large-scale, culturally grounded voice data for 18 African languages, emphasizing gender balance and contextual authenticity.
- Benchmarking “In the Wild”: Studies that test how AI models actually perform in authentic, practical, real-world African contexts.
- Culturally Relevant Multimodal Datasets: High-quality image, text, and speech datasets for 40 African languages to power the next generation of translation and education tools.
Successful applicants will receive funding, visibility, and institutional support to help them achieve sustainable growth. Partnering with the Masakhane African Languages Hub will also provide a platform to develop African-led AI technology and pioneer inclusivity for languages historically marginalized in tech.
The ultimate goal is to empower one billion Africans by 2029 with locally relevant AI tools and resources, unlocking opportunities for economic development, local innovation, and the preservation of Africa’s linguistic heritage.
“We are committed to championing African-led innovation that ensures AI reflects the continent’s rich linguistic and cultural diversity. This call for proposals goes beyond building models; it is a movement toward a more equitable digital future. It’s an opportunity to center marginalized groups such as women, rural communities, and the elderly, and take forward the wisdom of our elders and embody the spirit of Ubuntu.”
– Chenai Chair, Director, Masakhane African Languages Hub

