shuyo / iir

Machine Learning / Natural Language Processing / Information Retrieval

Home Page:http://shuyo.wordpress.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

lda

CherryP opened this issue · comments

I am new to python or R. I would like to implement LDA by using either python or R. I am looking at your code but I don't quite understand what is the format of the input (filename). I tried to see what does re r'\w+(?:'\w+)?' do.
Can I input several documents?

Hi,
On my lda.py , -f option can specify a filename of corpus whose each line is regarded as one document.
lda.py -f sometext.txt

Oh do you mean that when I run, I just use
python lda.py -f text1.txt -f text2.txt ....
I am sorry if my question seems nonsense.

As I mentioned, -f option can specify A filename of corpus whose each line is regarded as one document.
If you want to pass more than 2 files, it is necessary to modify the script.