Data Science for Social Impact Research Group @ University of Pretoria (dsfsi)

Data Science for Social Impact Research Group @ University of Pretoria

dsfsi

Geek Repo

We are the Data Science for Social Impact research group at the Computer Science Department, University of Pretoria.

Location:University of Pretoria, South Africa

Home Page:https://dsfsi.github.io

Twitter:@dsfsi_research

Github PK Tool:Github PK Tool

Data Science for Social Impact Research Group @ University of Pretoria's repositories

deadlines

:alarm_clock: AI/ML/DS conference/workshop/event deadlines on the African continent

Language:HTMLStargazers:17Issues:0Issues:0

dsfsi-lid

Language Identification For South African languages

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

gov-za-multilingual

The data set contains cabinet statements from the South African government. Data was scraped from the governments website: https://www.gov.za/cabinet-statements

Language:Jupyter NotebookLicense:MITStargazers:4Issues:0Issues:0

dsfsi-datasets

Datasets made available for different small projects

Language:Jupyter NotebookLicense:MITStargazers:2Issues:0Issues:0
Stargazers:0Issues:0Issues:0

academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

textaugment

TextAugment: Text Augmentation Library

Language:PythonLicense:MITStargazers:382Issues:0Issues:0

bibtextomd

Convert BibTeX entries to formatted Markdown

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vukuzenzele-nlp

The dataset contains editions from the South African government magazine Vuk'uzenzele. Data was scraped from PDFs that have been placed in the data/raw folder. The PDFS were obtained from the Vuk'uzenzele website.

Language:Jupyter NotebookLicense:MITStargazers:6Issues:0Issues:0

covid19za

Coronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa

Language:Jupyter NotebookLicense:MITStargazers:255Issues:0Issues:0

PuoBERTa

A Roberta-based language model specially designed for Setswana, using the new PuoData dataset.

Language:MakefileLicense:NOASSERTIONStargazers:4Issues:0Issues:0

cos802

Defense against the dark text arts

Language:SCSSLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

edu-assessment-llm-prompt

Educational Assesement using LLMs

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

za-fake-news-2020

Dataset of South African Disinformation [Fake News] Website Data collected in 2020

License:MITStargazers:0Issues:0Issues:0

healthfacilitymap

South African Health Facility map. Created to aid in covid19za responses

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

dlindaba-2019-uber

UBER Rider Rating Data from the DLIndaba 2019

License:MITStargazers:0Issues:0Issues:0

izindaba-zesizulu

Categorised isiZulu News. Source data is the isiZulu news from the SABC social media posts.

Stargazers:1Issues:0Issues:0

StatsSA-Language

StatsSA statistical language glossary in machine-readable format

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

za-terminology

DSFSI South African Terminlogy Lists and Lexicon Project

Language:MakefileLicense:NOASSERTIONStargazers:2Issues:0Issues:0

embedding-eval-data

Embedding Evaluation Data for South African Languages

Stargazers:1Issues:0Issues:0

za-bank-risk

This repository is an initial pipeline for reading, processing, labelling and classifying unstructured annual reports of South African (SA) banks with the aim of identifying financial risk. It leveraged work by the Corporate Financial Information Environment-Final Report Structure Extractor (CFIE–FRSE) of El-Haj et al. which created a corpus of annual reports of United Kingdom (UK) companies.

Language:Jupyter NotebookLicense:CC-BY-SA-4.0Stargazers:1Issues:0Issues:0

za-isizulu-siswati-news-2022

IsiZulu News (articles and headlines) and Siswati News (headlines) Corpora - za-isizulu-siswati-news-2022

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

Higher_Education_EDA

This is an EDA Git for education researchers and practitioners

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3Issues:0Issues:0

sa-parliament

South African Member Of Parliament Data

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

PuoData

Curated corpora for Setswana. Used to train PuoBERTa.

License:CC-BY-SA-4.0Stargazers:2Issues:0Issues:0

project-state-capture

Zondo Commission or State Capture Commission Transcripts

License:CC-BY-SA-4.0Stargazers:2Issues:0Issues:0