DamascenoRafael / identify-bots-reddit-comment-network

Characterization and classification of bots using only structural characteristics of the network. Python development of network construction, component analysis and Neural Network for classification.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Characterization and Classification of Bots using Reddit’s Comment Network

About

This repository refers to a study to characterize and identify bots in a social network using only the network structure. This work considers a directed network of users constructed from comments in Reddit. The network characterization highlights the significant structural differences of bots, allowing them to be classified using only network features.

This work was guided by Professor Daniel R. Figueiredo (PESC / COPPE / UFRJ).

Publications

Using this repository

The code elaborated for this work is compatible with Python 3.7.8.

To install all dependencies and use Jupyter Notebook with a virtual environment:

python3 -m venv venv
source venv/bin/activate
pip3 install -r requirements.txt
python3 -m ipykernel install --user --name=venv
jupyter notebook

In the notebooks folder are the Jupyter Notebook files that can be used to evaluate other datasets.

Reddit Comments datasets can be found in pushshift, the files should be placed uncompressed in data/raw/.

About

Characterization and classification of bots using only structural characteristics of the network. Python development of network construction, component analysis and Neural Network for classification.

License:GNU General Public License v3.0


Languages

Language:Jupyter Notebook 92.2%Language:Python 7.8%