Bangla Language Processing Research's repositories
Katha-Bangla-TTS
The first Bangla text-to-speech system for Bangladeshi Bangla (Katha)
Bangla-Speech-Corpora
Cleaned Bangla speech corpus, developed specifically for Bangla text-to-speech
Arabic_speech_code_switching
The first Dialectal Arabic Code-Switching (DACS) corpus from broadcast speech, annotated at the token level using both linguistic and acoustic cues. The dataset is a potential benchmark for DCS in spontaneous speech.
Bangla-Content-Annotation-Bank
Content annotation bank for Bangla. It includes named entity, temporal expression, relation, and event annotations.
Bangla-pronunciation
Development of a lexicon- and machine-learning-based Bangla pronunciation system
Language-Features-for-News
Language features used in the NELA Toolkit and other news studies