zhaozhe987 / short-text-clustering-enhancement

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

--at first change the parameters (input file, number of clusters) in the main.py
--to run: python main.py
--final output file format:clusterlabel	truelabel	text 
--output file location: data/stackoverflow/traintest
--You can run generate_initial_labels.py to generate initial labels for a dataset (not mandatory)

About


Languages

Language:Python 100.0%