Identify features
kalaidin opened this issue · comments
PGK commented
[
"PostId",
"PostCreationDate",
"OwnerUserId",
"OwnerCreationDate",
"ReputationAtPostCreation",
"OwnerUndeletedAnswerCountAtPostTime",
"Title",
"BodyMarkdown",
"Tag1",
"Tag2",
"Tag3",
"Tag4",
"Tag5",
"PostClosedDate",
"OpenStatus",
]
PGK commented
Current:
"BodyMarkdown" tfidf table
Marat Zaynutdinov commented
merge title with BodyMarkdown and then create TF table
Marat Zaynutdinov commented
TF table on tags
PGK commented
Suggested features:
"ReputationAtPostCreation"
"OwnerUndeletedAnswerCountAtPostTime"
"PostCreationDate" - "OwnerCreationDate"
Marat Zaynutdinov commented
number of keywords supplied
PGK commented
len("BodyMarkdown")
Marat Zaynutdinov commented
- number of words in title
- number of words in bodymarkdown
- is code supplied in bodymarkdown
- propotion of body to code
Marat Zaynutdinov commented
- time (day or night for example)
Marat Zaynutdinov commented
- number of code blocks
PGK commented
All done except "time (day or night for example)" with does not seem interesting.