Hezam Gawbah's repositories
text-categorization
We used sub data from a large Single-labeled Arabic News Articles Dataset (SANAD) of textual data collected from three news portals. The dataset is a large one consisting of almost 200k articles distributed into seven categories that we offer to the research community on Arabic computational linguistics.
Arabic-Healthcare-Dataset-AHD-
To address shortcomings of Arabic natural language generation models, we introduce a large Arabic Healthcare Dataset (AHD) of textual data. For this motivation, we named our dataset ‘AHD’. The largest Arabic Healthcare Dataset (AHD) as we know was collected from medical website.
Language:Jupyter Notebook000
Exploring-factors-contributing-to-student-dropout-A-case-study-of-IBB-University
Dataset for student dropout: A case study of IBB University