Anthonyive / DSCI-550-Assignment-3

πŸ“Š Building Visual Apps to Explore Fake Scientific People and Literature using Data Science: Creating Data Insights

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Building Visual Apps to Explore Fake Scientific People and Literature using Data Science: Creating Data Insights

Quick Access to Instructions | Quick Access to Report

Prerequisite

  1. Python virtual environment has been set up using pipenv. You need pipenv installed (learn more about installation and usage).
  2. There are several other packages/tools you may want to use along the way. You should check out the instruction about this assignment.

Usage

  1. [Task 1] Take your TSV dataset and convert the data to JSON to use in D3.
tsvtojson -t additional_features.tsv -c cols.txt -o "Email" -s 1.0 -v -j assignment-2.json
  1. [Task 2]
  • Data preparation: Run notebooks in notebooks directory except for visualization 4. For visualization 4, there's a Python script called vis4conversion.py in the src directory and run it in the root directory.

  • We are using Flask to build our website. To run locally:

    1. Set Flask app in your virtual environment:
    export FLASK_APP=app.py
    1. (Optional) Set Flask environment as development:
    export FLASK_ENV=development
    1. Run Flask
    flask run
    1. Click the localhost link it provides.
  1. [Task 6]
  • We have already parsed the fradulent emails data from assignment 1 and assignment 2 in Apache Solr and created the location graph in GeoParser. The generated graphs are included in the GeoParser Directory.
  • If you would love to recreate the graphs, follow the steps in GeoParser Repo, and switch the example of Covid19 to one of the folders we created here.

About

This is the assignment 3 from DSCI 550 Spring 2021 at USC Viterbi School of Engineering. This repo is collaborated by a group of six.

Team members: Zixi Jiang, Peizhen Li, Xiaoyu Wang, Xiuwen Zhang, Yuchen Zhang, Nat Zheng

About

πŸ“Š Building Visual Apps to Explore Fake Scientific People and Literature using Data Science: Creating Data Insights

License:MIT License


Languages

Language:Jupyter Notebook 99.5%Language:HTML 0.3%Language:Python 0.1%Language:Shell 0.0%Language:CSS 0.0%