hse-aml / natural-language-processing

Resources for "Natural Language Processing" Coursera course.

Home Page: https://www.coursera.org/learn/language-processing

Week 1 MultiLabelBinarizer

creeston opened this issue

In the Week 1 module, during multiclass training, scikit-learn raises the following exception:
"Scikit Learn Multilabel Classification: ValueError: You appear to be using a legacy multi-label data representation..."

I found that the labels need to be preprocessed with MultiLabelBinarizer, and doing so fixes the error.
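For anyone hitting the same error, here is a minimal sketch of that preprocessing step. The tag lists are made-up placeholders, not the course data, and the variable names are my own assumption rather than what the assignment uses:

```python
from sklearn.preprocessing import MultiLabelBinarizer

# Hypothetical tag lists standing in for the assignment's training labels.
y_train = [["python", "pandas"], ["java"], ["python"]]

# Convert lists of tags into a binary indicator matrix,
# one column per tag (columns are sorted alphabetically).
mlb = MultiLabelBinarizer()
y_train_bin = mlb.fit_transform(y_train)

print(mlb.classes_)   # column order of the indicator matrix
print(y_train_bin)    # shape: (n_samples, n_tags)
```

The resulting indicator matrix is the label format that scikit-learn's multilabel estimators accept in place of the legacy list-of-lists representation.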

However, the code that evaluates the trained classifiers on the "val" dataset references an "mlb" variable that is never instantiated. I assume it is meant to be a MultiLabelBinarizer instance. This is an inconsistency in the module that currently has to be fixed by hand.
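A rough sketch of the manual fix, assuming "mlb" is indeed meant to be a MultiLabelBinarizer: create it once before training, then reuse the same fitted instance for the validation labels. The features, tags, and classifier choice below are hypothetical placeholders, not the assignment's actual code:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier
from sklearn.preprocessing import MultiLabelBinarizer

# Hypothetical toy features and tag lists; the real assignment data differs.
X_train = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]])
y_train = [["python"], ["java"], ["python", "java"], []]
X_val = np.array([[1.0, 0.0], [0.0, 1.0]])
y_val = [["python"], ["java"]]

# Instantiate and fit the binarizer once, before training,
# so that later evaluation code can refer to the same "mlb".
mlb = MultiLabelBinarizer()
y_train_bin = mlb.fit_transform(y_train)
y_val_bin = mlb.transform(y_val)  # same column order as the training labels

# One binary classifier per tag (an assumed setup for illustration).
clf = OneVsRestClassifier(LogisticRegression())
clf.fit(X_train, y_train_bin)

# Predict indicator rows for "val" and map them back to tag tuples.
y_val_pred = clf.predict(X_val)
pred_tags = mlb.inverse_transform(y_val_pred)
```

Reusing the fitted instance matters because calling fit_transform again on the "val" labels could produce a different column order, which would make the predictions and the ground truth incomparable.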