NaturalStupidlty / FakeNewsDetection

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ai_house_fake_news_detection

Setup Working Environment

Pre-requirements

Setup environment

If you have CUDA

conda env create -f environment_gpu.yaml

Otherwise

conda env create -f environment.yaml

Start Jupyter

You may use any port

jupyter lab --port 7766

Content

Use Kaggle or Colab for computations

Kaggle

  1. Create Kaggle account
  2. Create Notebook
  3. Explore docs and find out how
    • Add Kaggle dataset to notebook
    • Turn on GPU

Colab

  1. Create Notebook in Colab
  2. Enable GPU
  3. Add Kaggle dataset to Colab - https://www.geeksforgeeks.org/how-to-import-kaggle-datasets-directly-into-google-colab/

Data

  • For Week 2 and 3 we will use Kaggle datasets. Prepare in advance
    • Fake and real news dataset API command: kaggle datasets download -d clmentbisaillon/fake-and-real-news-dataset
    • Toxic Comment Classification Challenge API command: kaggle competitions download -c jigsaw-toxic-comment-classification-challenge
    • Our propaganda detection dataset - kaggle datasets download -d vladimirsydor/propaganda-detection-our-data
    • Unsupervised ru propaganda dataset 01-02-2022 till 30-01-2023 - kaggle datasets download -d vladimirsydor/ru-propaganda-2022-year

How to use Kaggle datasets

  1. Create Kaggle account
  2. Create Kaggle account
  3. Proceed with Installation & Authentication
  4. Download dataset with API command

About


Languages

Language:Jupyter Notebook 100.0%Language:Python 0.0%Language:HTML 0.0%Language:Dockerfile 0.0%