GoooIce / projects

πŸ’ Example projects for various NLP tasks with datasets, scripts and results

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Example projects

This repo contains example projects for various NLP tasks, including scripts, benchmarks, results and datasets created with Prodigy.

πŸ’ Projects

Name Description Best result
ner-fashion-brands Use sense2vec to boostrap an NER model to detect fashion brands in Reddit comments. Includes 1735 annotated examples, a data visualizer, training and evaluation scripts for spaCy and pretrained tok2vec weights. 82.1 (F)
ner-drugs Use word vectors to boostrap an NER model to detect drug names in Reddit comments. Includes 1977 annotated examples, a data visualizer, training and evaluation scripts for spaCy and pretrained tok2vec weights. 80.6 (F)
textcat-docs-issues Train a binary text classifier with exclusive classes to predict whether a GitHub issue title is about documentation. Includes 1161 annotated examples, a live demo and downloadable model and training and evaluation scripts for spaCy. 91.9 (F)

About

πŸ’ Example projects for various NLP tasks with datasets, scripts and results

License:MIT License


Languages

Language:Python 100.0%