oooo26 / CS4248_project

4-way document classification on Labeled Unreliable News (LUN) dataset.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CS4248 Project

4-way document classification on Labeled Unreliable News (LUN) dataset. All preprocessing (including feature extraction) and models can be checked in models\*.

Time Schedule

  • First Meeting (February 12th)
  • Second Meeting (February 28th)
  • Intermediate (March 2rd)
  • Preprocessing (March 19th)
    • Zhehao - Data Preprocessing, data augmentation, etc
  • Model implementation (March 31st)
    • Trushti - tf-idf based naive bayes and logistic regression
    • Zhehao - Transformer & Performer
    • Rongrong - BiLSTM & small BERT
    • Junhao - RoBERTa
    • Sagar - XLNet
  • Report (April 6th)

Team Member

  • Zhehao Gu
  • Rongrong Zhan
  • Sagar Sureka
  • Modi Trushti Sanjaybhai
  • Junhao Huang

About

4-way document classification on Labeled Unreliable News (LUN) dataset.

License:Apache License 2.0


Languages

Language:Jupyter Notebook 100.0%