There are 3 repositories under the snorkel topic.
A system for quickly generating training data with weak supervision
Extracting biomedical relationships from literature with Snorkel 🏊
Jupyter book showing how to build an ML-powered book genre classifier
Unsupervised tableQA and databaseQA on Chinese finance questions and tabular data
Code for the KDD-2023 paper: Neural-Hidden-CRF: A Robust Weakly-Supervised Sequence Labeler
Big Old Heuristic Repository
[SIGIR 2022] ORCAS-I: Queries Annotated with Intent using Weak Supervision
Mongolian polarity detection in a weakly supervised manner
Implementing a wide variety of transformers, fine-tuning them, and trying architectural variants from various research papers and blogs.
Code accompanying the TOP paper "Predicting the Demographics of Twitter Users with Programmatic Weak Supervision".
Semi-supervised labelling of news snippets to extract cleantech news
Process flow to generate labels on text data using Snorkel and maintain a DB to repurpose unlabelled data
"Snorkel with Turtle" is a realtime labeling system based on Snorkel
Convert a snorkel mask into a CPAP mask and a personal protective equipment (PPE) mask
A curated list of awesome Weak-Supervision-Sequence-Labeling (WSSL) papers, methods & resources.
Data labeling using weak supervision
This project uses the Snorkel Python library to apply ML algorithms to an unlabeled text dataset.
Snorkel MeTaL: A framework for training models with multi-task weak supervision
Personal project that aims to predict sexual content in reggaeton song lyrics using natural language processing
Sentiment analysis web app for non-technical users.
Using Snorkel to label biomimicry papers. Snorkel uses weak supervision to label large amounts of training data with programmatic labeling functions based on keyword rules.
This is a POC project of a system that aims to label unlabeled streaming data using Snorkel
Fine-grained semantic indexing of biomedical literature (a Weakly-Supervised approach)
AbusiveLanguage2020 is an open source dataset with over 1.5M tweets labeled for abusive language.
Weak Labeling of Fake News Articles with Snorkel and Snuba
Interactive Label Generation Tool for Time Series Data
Labelling a dataset with Snorkel and TextBlob, building a model with scikit-learn (SVM), and wiring up a web app using Flask.
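
Several of the descriptions above mention programmatic labeling functions based on keyword rules. As a rough illustration of that idea, here is a minimal sketch in plain Python. It does not use the Snorkel library or its API: the keyword lists, the helper names (`make_keyword_lf`, `weak_label`), and the simple majority vote are all assumptions made for this example, and Snorkel itself combines votes with a learned label model rather than a plain majority.

```python
import re
from collections import Counter

# Label constants (hypothetical values chosen for this sketch).
ABSTAIN, NEGATIVE, POSITIVE = -1, 0, 1


def make_keyword_lf(keywords, label):
    """Return a labeling function that votes `label` when any keyword occurs as a whole word."""
    wanted = {k.lower() for k in keywords}

    def lf(text):
        tokens = set(re.findall(r"[a-z']+", text.lower()))
        return label if tokens & wanted else ABSTAIN

    return lf


# Hypothetical keyword rules, for illustration only.
LABELING_FUNCTIONS = [
    make_keyword_lf(["great", "love"], POSITIVE),
    make_keyword_lf(["excellent"], POSITIVE),
    make_keyword_lf(["terrible", "hate"], NEGATIVE),
]


def weak_label(text, lfs=LABELING_FUNCTIONS):
    """Combine labeling-function votes by majority; abstain on ties or when no function fires."""
    votes = [v for v in (lf(text) for lf in lfs) if v != ABSTAIN]
    if not votes:
        return ABSTAIN
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return ABSTAIN
    return ranked[0][0]
```

Calling `weak_label` on each unlabeled text yields a noisy label or an abstention. The non-abstained rows can then serve as training data for a downstream classifier.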