RubixML / ML

A high-level machine learning and deep learning library for the PHP language.

Home Page:https://rubixml.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to use TF-IDF when it is not categorical?

sr13579 opened this issue · comments

Maybe I am asking a very 'noob' question. If the tfidf doesn't work with categorical data then how can I use it on a document?
Example tutorials are to find. Anyone here to help?

Hey @sr13579, TF-IDF works on token frequency vectors. You can transform blobs of text into token frequency vectors using either Word Count Vectorizer or Token Hashing Vectorizer.