Ndamulelo Nemakhavhani's repositories
dp203-study-guide
A curated list of crucial topics to grasp in preparation for the Azure DP-203 certification exam
zabantu-beta
ZaBantu is a fleet of lightweight Masked Language Models for Southern Bantu Languages
afriberta
AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages
amazon-ion-to-json-cli
Python scripts for converting Ion data to JSON format (and vice versa)
astro-ghostcms-dot-xyz
Main website for Astro-GhostCMS
azure-dp600-fabrics-analytics-engineer-study-guide
Azure DP600 (Fabric Analytics Engineer Associate) Exam topics and tips
cloud-gpu-handbook
A guide to help users quickly navigate and compare GPU offerings across major cloud platforms
cookbook
Chainlit's cookbook repo
cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
datahub
The Metadata Platform for the Modern Data Stack
Flowise
Drag & drop UI to build your customized LLM flow
hundzula-2024-reproducible-nlp
Code for my talk on Reproducible NLP experiments with DVC and CometML given at the Hundzula Retreat 2024
ignore-cli
A convenience command line tool for adding .ignore files to various projects
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
lstm-siamese-text-similarity
⚛️ A Keras-based implementation of a Siamese architecture using LSTM encoders to compute text similarity
mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services
ndamulelonemakh
Config files for my GitHub profile.
nodejs.org
The Node.js website.
notion-cms-astro-blog
A rudimentary implementation of a CMS using Notion as the backend and Astro Content Collection API
optimization-algorithms
Implementation of optimization algorithms in Python
our-stopwords
Auto-generated stopwords for South African Bantu Languages
pyfranc
Text language detection based on trigrams.
pyidw
A standalone Python library for inverse distance weighted (IDW) interpolation
scrapy-redis
Redis-based components for Scrapy.
shared-notebooks
This repo contains miscellaneous quickstart notebooks on a variety of topics, including NLP, language modelling, fine-tuning, etc.
training-data-analyst
Labs and demos for courses for GCP Training (http://cloud.google.com/training).