Question: tweets_predict_* vs tweets_*

Question

Question: tweets_predict_* vs tweets_*

Nowi5 opened this issue 4 years ago · comments

Dear ugis22,

thank you very much for your work. I try to follow but I have a question in mind.
So far I understood, you listen to Twitter API and every new Tweet as from interaction or from keyword goes into table predict. In Database clean etc. you use both tables. So I would assume that you have filled tweets_* beforehand. This script is missing in here, right?
Why did you separate the two tables?

Thanks

Eugenia Inzaugarat · Answer 1 · Tue Apr 07 2020 04:34:34 GMT+0800 (China Standard Time)

What I did is stream one time for the training part, and store it under the table tweet_avengers. Then, stream a second time and store it under the table tweet_predict_avengers. The cleaning DB is because I do not want any duplicate tweet to be in my tweet_predict_avengers, as I need only tweets that haven't been seen by the model before.