Access over 1 million hours of transcribed recordings from across Africa

Enhance Your AI Models with Genuine Human Voices and Diverse Accents for Unmatched Linguistic and Cultural Precision

Available Recordings

CountryLanguageTotal hours
AlgeriaArabic1340.5
English50
AngolaEnglish50
Fiote50
Portuguese27330
Quicongo50
Umbundu50
BeninFon50
French814
BotswanaEnglish581.5
Setswana582
Burkina-FasoDioula50
Dyula4125
English50
French13005
Moré19777.5
BurundiEnglish50
French115
Kirundi141.5
CameroonEnglish8300
Foulbe50
French27988
Fula94.5
Central African RepublicFrench7200.5
Sango31279.5
ChadArabic406.5
Chadian Arabic21414.5
French12908
Kanembou50
Sara3918.5
ComorosComorian1013.5
French487
Shikomoro323.5
CongoFrench8436.5
Kituba1098
Lingala764
Democratic Republic of the CongoEnglish50
French36639
Kikongo50
Kiswahili3937
Lingala19042.5
Tshiluba50
DjiboutiAfar146.5
Arabic287.5
French50
Somali173
EgyptArabic1515.5
English50
EritreaEnglish50
Tigrinya690.5
EthiopiaAfar483
Amharic36323.5
Anuak50
Arabic50
English105.5
Nuer50
Oromo4792
Somali6188.5
Tigrinya1600.5
GambiaEnglish536.5
Mandinka131
Wolof98.5
GhanaAkan90.5
Dagbani50
English7151.5
Ewe78
Ga50
Twi1074
GuineaFrench9795
Fula5270.5
Guerze50
Kissi412
Kpelle610
Malinke5441
Susu3414.5
Toma83
Ivory Coast (Cote D’Ivoire)English50
French3268.5
Malinke50
KenyaEnglish63051.5
French50
Kiswahili83659.5
Somali652.5
LesothoEnglish426.5
Sotho337
MadagascarAtanosy50
Mahafaly50
Malagasy1040
Tandory50
MalawiChichewa54256.5
English456.5
Tumbuka50
MaliBambara18681.5
Dogon50
French3152
Tamachek50
MauritaniaArabic423
French428
Hassaniya987
Soninke50
Wolof54.5
MoroccoArabic258.5
English50
Moroccan Arabic518.5
MozambiqueChangana83
Emakhuwa64
English50
Maconde50
Macua50
Portuguese8698.5
Xichangana113.5
NamibiaAfrikaans272.5
English8782.5
Oshikwanyama8049.5
Oshiwambo64.5
Oshondonga1598.5
Rukwangali50
Setswana50
Silozi62
NigerDjerma6393.5
French3065.5
Hausa29498.5
Kanuri50
Zarma1067
NigeriaEnglish133464.5
Hausa36294
Igbo233
Kanuri2979.5
Pidgin1353.5
Yoruba1223.5
RwandaEnglish64
Kinyarwanda3524.5
SenegalEnglish50
French2510
Wolof666
Sierra LeoneEnglish2118
Krio41534.5
Limba50
Mende183.5
Temne101
SomaliaArabic52
English758.5
Somali37348.5
South AfricaAfrikaans61.5
English10577.5
Sepedi182.5
Setswana50
Xhosa688
Zulu1591.5
South SudanEastern Dinka88
English547.5
Juba Arabic270.5
Rek Dinka64
SudanArabic693.5
English50
SwazilandEnglish1009.5
Siswati2324.5
TanzaniaEnglish145.5
Kiswahili55143.5
UgandaAcholi50
Aringa50
Ateso50
English25066
Kiswahili1631
Langi50
Luganda8825.5
Lugbara58.5
Luo50
Lutooro50
Maadi50
Ngakarimojong1112
Runyankole50
Runyankore72
ZambiaBemba12133
English10926.5
Lozi103.5
Nyanja13130.5
Tonga733
ZimbabweEnglish2719.5
Ndebele3294.5
Shona36407.5
Grand Total1,096,422

The Value

Human-Written Transcripts

Our recordings come with meticulously human-written transcripts, ensuring the highest level of accuracy and authenticity. This attention to detail allows your AI models to learn from precise and contextually relevant data, improving their natural language processing capabilities.

Representative

With over 1 million hours of recordings, our dataset captures a wide range of dialects, accents, and languages from across Africa. This representative data enables your AI to understand and interact with diverse linguistic and cultural nuances, making your applications more inclusive and effective.

High-Quality Voice and Video

Our recordings are of exceptional quality, featuring clear audio and high-definition video. This ensures that your AI models are trained on the best possible data, leading to more accurate speech recognition, sentiment analysis, and other advanced AI functionalities.

Privacy and Compliance

We prioritize user privacy and adhere to stringent data protection standards. Our data collection methods are ethically sound and compliant with international regulations, ensuring that your use of our datasets respects user rights and maintains trust.

Get the data

Let us know what data you need and we will make it happen