josecruzado21 / plagiarism_detection

On this repository I use the dataset created by Clough and Stevenson to train a plagiarism detection model. The dataset contains around 100 data points and includes 4 types of plagiarism, ranging from near-copy to heavy revision. The algorithm used to classify a text as plagiarised or not was Supoort Vector Machines.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

josecruzado21/plagiarism_detection Stargazers