big-c-note / parsley

Unsupervised text clustering

Parsley

Parsley is an unsupervised semantic classifier. Why group a bunch of questions by hand when a computer can?

Current Research and Roadmap

Currently, I've implemented a simple unsupervised clustering method built on pretrained word embedding models from Facebook and Google.

Cluster Your Own Text

Start off by cleaning the data. This currently only supports Ahrefs keyword data.

parsley clean --help

Then cluster the text. You can choose which model you want to use.

parsley cluster --help

Finally, you can create a simple plot.

parsley plot --help

Sentence (or Keyword) Embeddings
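
One common way to turn a short keyword phrase into a single vector is to average the pretrained vectors of its words. The snippet below is a minimal sketch of that idea, not Parsley's actual code: the function name `embed_keyword` and the tiny `TOY_VECTORS` table are assumptions made for illustration, and a real run would load pretrained vectors instead of hand-written ones.

```python
# Hypothetical toy word vectors (3 dimensions) standing in for a pretrained model.
TOY_VECTORS = {
    "buy": [1.0, 0.0, 0.0],
    "cheap": [0.8, 0.2, 0.0],
    "shoes": [0.0, 1.0, 0.0],
    "boots": [0.0, 0.9, 0.1],
}


def embed_keyword(text, vectors, dim):
    """Average the vectors of the known words in `text`.

    Words missing from `vectors` are skipped. If no word is known,
    a zero vector of length `dim` is returned.
    """
    tokens = text.lower().split()
    known = [vectors[token] for token in tokens if token in vectors]
    if not known:
        return [0.0] * dim
    # zip(*known) groups the i-th component of every word vector together.
    return [sum(component) / len(known) for component in zip(*known)]


if __name__ == "__main__":
    print(embed_keyword("buy shoes", TOY_VECTORS, dim=3))
    print(embed_keyword("unknown words", TOY_VECTORS, dim=3))
```

Averaging ignores word order, which is usually acceptable for short keyword phrases but loses information on longer sentences.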

Agglomerative Clustering
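
Agglomerative clustering starts with every item in its own cluster and repeatedly merges the two closest clusters until the desired number remains. The following is a small from-scratch sketch using cosine distance and average linkage. It is illustrative only: the function names, the choice of linkage, and the sample points are assumptions, not Parsley's implementation. In practice a library class such as scikit-learn's `AgglomerativeClustering` would likely be a better fit.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity; all-zero vectors are treated as distance 1."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def average_linkage(cluster_a, cluster_b, points):
    """Mean pairwise distance between the members of two clusters."""
    total = sum(
        cosine_distance(points[i], points[j])
        for i in cluster_a
        for j in cluster_b
    )
    return total / (len(cluster_a) * len(cluster_b))


def agglomerative_cluster(points, n_clusters):
    """Merge the closest clusters until `n_clusters` remain; return one label per point."""
    if n_clusters < 1:
        raise ValueError("n_clusters must be at least 1")
    # Each cluster is a list of indices into `points`.
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        best = None  # (distance, index_a, index_b) of the closest pair so far
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = average_linkage(clusters[a], clusters[b], points)
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # b > a, so index a is unaffected
    labels = [0] * len(points)
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels


if __name__ == "__main__":
    # Hypothetical 2-D embeddings: two points near each axis.
    sample = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
    print(agglomerative_cluster(sample, n_clusters=2))
```

This naive version recomputes every pairwise cluster distance on each merge, so it only suits small inputs. Library implementations cache distances and scale much better.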

Helpful Links

License: Other


Languages

Python 94.7%, Dockerfile 5.3%