GoooIce / projects

💝 Example projects for various NLP tasks with datasets, scripts and results

Example projects

This repo contains example projects for various NLP tasks, including scripts, benchmarks, results and datasets created with Prodigy.

💝 Projects

Name	Description	Best result
`ner-fashion-brands`	Use `sense2vec` to boostrap an NER model to detect fashion brands in Reddit comments. Includes 1735 annotated examples, a data visualizer, training and evaluation scripts for spaCy and pretrained tok2vec weights.	82.1 (F)
`ner-drugs`	Use word vectors to boostrap an NER model to detect drug names in Reddit comments. Includes 1977 annotated examples, a data visualizer, training and evaluation scripts for spaCy and pretrained tok2vec weights.	80.6 (F)
`textcat-docs-issues`	Train a binary text classifier with exclusive classes to predict whether a GitHub issue title is about documentation. Includes 1161 annotated examples, a live demo and downloadable model and training and evaluation scripts for spaCy.	91.9 (F)

About

💝 Example projects for various NLP tasks with datasets, scripts and results

MIT License

Languages

Language:Python 100.0%