aimerou / online_wolof_data

Curate online wolof text resources that can be used to build models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Wolof Data

This repository curates online wolof resources

Text Data

Machine Translation Named Entity Recognition Part-of-speech tagging Question Answering Sentiment Analysis
OPUS MasakhaNER MasakhaPOS AfriQA WOLOF IA
FLORES-200 UD_Wolof-WTB
Microsoft NTREX
LOREILEI (payant)
MAFAND-MT
Wolof books (non exhaustive list)
Bataaxal bu gudde nii (Une si longue lettre), Mariyaama Ba
Doomi golo (Le fils de la guenon), Bubakar Bόris Jόob
Goneg nit ku nuul gi (L'enfant noir), Camara Laye
Ndoomu Buur Si (Le petit prince), Antoine de Saint Exupery
Bàmmeelu Kocc Barma, Buubakar Bóris Jóob
Puukare, Ceerno Séydu Sàll
Doxandéem, Ibraayima Saaxo Caam)

Wolof web sites

Online news & Docs Religious content Medical content Learning Wolof Twitter accounts
Defu Waxu Nouveau Testament InfoVIHTal Jàng Wolof Saabal
Wolof-online Bible WaxWolof
Wikipedia The words
Yoonu njub
Jeovah Witnesses

Audio Data

Automatic Speach Recognition (ASR) Text To Speech (TTS) Keyword Spotting
AI4D Baamtu URBAN Dataset AI4D-BAAMTU Dataset Keyword spotting dataset (Wolof, Pulaar, Serere, Mandinka, Diola, Soninke)
ALFFA_PUBLIC
Waxal Multilingual
News
TFM Youtube Playlist (Senegal)
Elmourabitoune Multilingual (Wolof, Pulaar, Arabic) News TV (Mauritania)
Learning resources
UCLA Wolof Audio-Video Course

About

Curate online wolof text resources that can be used to build models