SandeepKundalwal / Automated-Plagiarism-Detector

An automated plagiarism detector that handles unzipping, generates plagiarism report and scraps the reports for threshold plagiarism.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Automated Plagiarism Detector

An automated plagiarism detector that handles unzipping, generates plagiarism report and scraps the reports for minimum threshold plagiarism.

The project consists of three modules:

  1. Staging Files: All the zip files are unzipped and the files that are present in unzipped folder are segreggated based on a particular format which is {rollno.}_{questionno.}. Makes multiple directories based on the number of questions.
  2. Plagiarism Script: Checks for plagiarism and provides the percentage of plagiarism for each file against all the other files that are present in a directory. Generates a HTML file containing plagiarism report.
    - run script: python3 plag.py {Assignment file location} {Report generation location}
  3. Scrapping Plagiarism Report: Scrapes the percentage of plagiarism from each report generated by Plagiarism Script and returns a .txt file that contains the Roll No. of all the students who have plagiarism above the minimum allowed threshold.

About

An automated plagiarism detector that handles unzipping, generates plagiarism report and scraps the reports for threshold plagiarism.

License:MIT License


Languages

Language:Java 94.7%Language:Python 5.3%