WolofProcessing / online_wolof_data

Curate online wolof text resources that can be used to build models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Wolof Data

This repository curates online wolof resources

Text Data

Machine Translation Named Entity Recognition Part-of-speech tagging Question Answering
OPUS MasakhaNER MasakhaPOS AfriQA
FLORES-200 UD_Wolof-WTB
Microsoft NTREX
LOREILEI (payant)
MAFAND-MT
Wolof books (non exhaustive list)
Bataaxal bu gudde nii (Une si longue lettre), Mariyaama Ba
Doomi golo (Le fils de la guenon), Bubakar Bόris Jόob
Goneg nit ku nuul gi (L'enfant noir), Camara Laye
Ndoomu Buur Si (Le petit prince), Antoine de Saint Exupery
Bàmmeelu Kocc Barma, Buubakar Bóris Jóob
Puukare, Ceerno Séydu Sàll
Doxandéem, Ibraayima Saaxo Caam)

Wolof web sites

Online news & Docs Religious content Medical content Learning Wolof Twitter accounts
Defu Waxu Nouveau Testament InfoVIHTal Jàng Wolof Saabal
Wolof-online Bible WaxWolof
Wikipedia The words
Yoonu njub
Jeovah Witnesses

Audio Data

Automatic Speach Recognition (ASR) Text To Speech (TTS) Keyword Spotting
AI4D URBAN Dataset (Baamtu) AI4D-BAAMTU Dataset Keyword spotting dataset (Wolof, Pulaar, Serere, Mandinka, Diola, Soninke)
ALFFA_PUBLIC
Waxal Multilingual
Kallaama (Jokalante, Orange, EPT)
Google Fleurs
News
TFM Youtube Playlist (Senegal)
Elmourabitoune Multilingual (Wolof, Pulaar, Arabic) News TV (Mauritania)
Learning resources
UCLA Wolof Audio-Video Course

About

Curate online wolof text resources that can be used to build models