inspirehep / magpie

Deep neural network framework for multi-label text classification

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Can we train Magpie models with just one word?

kaundinya5 opened this issue · comments

I'm trying to use Magpie to classify different alphanumeric numbers into either policy number, account number. I changed the minimum number of words in word2vec to 1 to support this. After I trained the model, it always produces the same probability. Am I doing something wrong or do Magpie models always be trained with sentences and paragraphs?

Thank you for your response, I'm not trying to implement binary classification, I just started out with two labels, will be adding more labels once I get this working, I just wanted to try and see if Magpie could work on just two labels first.

Here are some example inputs:
Policy numbers: 594034, 02499357-3, 04187428-7, 04202703-0, 5572732, 06080677-0, 6498924502, 100014713, 100023672
Account numbers: AAAMOTO-01, ALDEINC-03 ,ALLAFOR-01, AMSMARK-01, AMERMOD-03

I have created the appropriate txt and lab files as well.

@kaudinya5 Magpie is designed to classify free text documents that consist of many words. It's power relies on the fact that it can learn implicit relationships between those words and use them for classification. If your input is a unique alphanumeric character, it's unlikely the model will learn much.