Port CountVectorizer

Question

nicolalandro opened this issue 6 years ago

For text mining, it is also important to fit a CountVectorizer (or a TfidfTransformer), so it should be possible to export it to the target language.

Jiahao Zhao · Answer 1 · Tue Apr 02 2019 22:04:59 GMT+0800 (China Standard Time)

Is there currently any alternative way to do this?

nicolalandro · Answer 2 · Sun Apr 14 2019 14:21:49 GMT+0800 (China Standard Time)

@Opdoop One way is to extract the vocabulary from the fitted CountVectorizer (get_feature_names()) and reimplement the counting logic manually in the target language.
You can do the same thing for TF-IDF.
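
A minimal sketch of that idea, written in plain Python so the logic is easy to translate by hand. It assumes the vectorizer was fitted with CountVectorizer's default settings (lowercasing, the default token pattern, unigrams only); any other configuration would need matching changes. The `vocabulary` and `idf` values below are made-up placeholders, and `count_vectorize` and `tfidf_transform` are hypothetical helper names, not part of any library. In recent scikit-learn releases get_feature_names() has been replaced by get_feature_names_out(); the fitted `vocabulary_` and `idf_` attributes hold the same information.

```python
import math
import re
from collections import Counter

# Assumption: mirrors CountVectorizer defaults only
# (lowercase=True, token_pattern=r"(?u)\b\w\w+\b", ngram_range=(1, 1)).
TOKEN_PATTERN = re.compile(r"(?u)\b\w\w+\b")

# Placeholder vocabulary (term -> column index). With scikit-learn it could be
# exported from a fitted vectorizer, for example:
#   vocabulary = {t: int(i) for t, i in vectorizer.vocabulary_.items()}
vocabulary = {"cat": 0, "dog": 1, "sat": 2, "the": 3}

# Placeholder IDF weights, one per column. With scikit-learn they could be
# exported from a fitted TfidfTransformer via its idf_ attribute.
idf = [1.0, 1.0, 1.0, 1.0]


def count_vectorize(text, vocabulary):
    """Return a dense term-count vector for a single document."""
    tokens = TOKEN_PATTERN.findall(text.lower())
    counts = Counter(tokens)
    vector = [0] * len(vocabulary)
    for term, count in counts.items():
        index = vocabulary.get(term)
        if index is not None:  # terms not seen during fitting are ignored
            vector[index] = count
    return vector


def tfidf_transform(counts, idf):
    """Weight counts by IDF and apply L2 normalization (the default norm)."""
    weighted = [c * w for c, w in zip(counts, idf)]
    norm = math.sqrt(sum(v * v for v in weighted))
    return [v / norm for v in weighted] if norm > 0 else weighted


example_counts = count_vectorize("The cat sat. The DOG!", vocabulary)
example_tfidf = tfidf_transform(example_counts, idf)
```

Only the vocabulary mapping and the IDF weights need to be exported; the tokenizer and the two functions above are small enough to rewrite directly in the target language.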

Darius Morawiec · Answer 3 · Tue May 17 2022 05:55:20 GMT+0800 (China Standard Time)

Hello @nicolalandro,

Can you provide some snippets? With them I will have a better start.

Kind regards,
Darius