Data Science for Social Impact Research Group @ University of Pretoria (dsfsi)

Data Science for Social Impact Research Group @ University of Pretoria

dsfsi

Geek Repo

We are the Data Science for Social Impact research group at the Computer Science Department, University of Pretoria.

Location:University of Pretoria, South Africa

Home Page:https://dsfsi.github.io

Twitter:@dsfsi_research

Github PK Tool:Github PK Tool

Data Science for Social Impact Research Group @ University of Pretoria's repositories

textaugment

TextAugment: Text Augmentation Library

Language:PythonLicense:MITStargazers:402Issues:8Issues:23

covid19za

Coronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa

Language:Jupyter NotebookLicense:MITStargazers:255Issues:32Issues:186

deadlines

:alarm_clock: AI/ML/DS conference/workshop/event deadlines on the African continent

Language:HTMLStargazers:18Issues:5Issues:0

vukuzenzele-nlp

The dataset contains editions from the South African government magazine Vuk'uzenzele. Data was scraped from PDFs that have been placed in the data/raw folder. The PDFS were obtained from the Vuk'uzenzele website.

Language:Jupyter NotebookLicense:MITStargazers:6Issues:0Issues:22

gov-za-multilingual

The data set contains cabinet statements from the South African government. Data was scraped from the governments website: https://www.gov.za/cabinet-statements

Language:Jupyter NotebookLicense:MITStargazers:4Issues:3Issues:8

PuoBERTa

A Roberta-based language model specially designed for Setswana, using the new PuoData dataset.

Language:MakefileLicense:NOASSERTIONStargazers:4Issues:1Issues:0

Higher_Education_EDA

This is an EDA Git for education researchers and practitioners

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3Issues:2Issues:1

dsfsi-datasets

Datasets made available for different small projects

Language:Jupyter NotebookLicense:MITStargazers:2Issues:2Issues:0

za-mavito

DSFSI South African Terminlogy Lists and Lexicon Project

Language:HTMLLicense:NOASSERTIONStargazers:2Issues:1Issues:1

embedding-eval-data

Embedding Evaluation Data for South African Languages

izindaba-zesizulu

Categorised isiZulu News. Source data is the isiZulu news from the SABC social media posts.

zabantu-beta

ZaBantu is a fleet of light-weight Masked Language Models for Southern Bantu Languages

Language:PythonLicense:NOASSERTIONStargazers:1Issues:2Issues:0

healthfacilitymap

South African Health Facility map. Created to aid in covid19za responses

Language:JavaScriptLicense:MITStargazers:0Issues:4Issues:1
Stargazers:0Issues:2Issues:0

StatsSA-Language

StatsSA statistical language glossary in machine-readable format

Language:Jupyter NotebookLicense:MITStargazers:0Issues:3Issues:0

za-fake-news-2020

Dataset of South African Disinformation [Fake News] Website Data collected in 2020

License:MITStargazers:0Issues:2Issues:0
Stargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0

academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

Language:JavaScriptStargazers:0Issues:0Issues:0

bibtextomd

Convert BibTeX entries to formatted Markdown

Language:PythonLicense:MITStargazers:0Issues:0Issues:1

cos802

Defense against the dark text arts

Language:SCSSLicense:MITStargazers:0Issues:2Issues:0
Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

dlindaba-2019-uber

UBER Rider Rating Data from the DLIndaba 2019

License:MITStargazers:0Issues:3Issues:0

dsfsi-lid

Language Identification For South African languages

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:1Issues:1

edu-assessment-llm-prompt

Educational Assesement using LLMs

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

thapelo-sindane-msc-public

Public Repository containing msc code

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

za-lid

This repository contains datasets extracted from Vuk'zenzele prepared to train N-gram models, and traditional ML models (Naive Bases, SVM, and Logistic Regression), and Large pretrained multilingual models for language identification

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0