almugabo's repositories
openalex_qa
Assessing and Improving data quality in OpenAlex
open_metadata
an overview of open scientometric resources
reference_processing
a repo with scripts to finetune an LLM to process bibliographic references of scholarly documents
africanlp-public-datasets
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
examgen
A Python class that can automatically generate mathematics exams, with solution keys, using Sympy and LaTeX.
google-gemma-finetuning-n2sql
Finetuning Google's Gemma Model for Translating Natural Language into SQL
grants_dataset
an open dataset of data on research funding of selected agencies/programs .
maths_S1S3
a repo with Maths topics
NLP-Projects-NHV
NLP Projects playlist
opensearch-gitpod-test
A Docker Compose template, configured for Gitpod (www.gitpod.io) to give you pre-built, ephemeral development environments in the cloud.
ref_scholarly_docs
this repo containts some work on curating lists of references in non traditional scholarly documents. work in progress
test_dev
repository to quickly test things. will be periodically deleted
TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on small scale model (any generation model in hugging face's transformers)
trl
Train transformer language models with reinforcement learning.