shivamsharma00 / Avengers_ensemble

Author Obfuscation Implementation of paper "Avengers Ensemble".

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Avengers Ensemble: Advanced Authorship Obfuscation

Welcome to the Avengers Ensemble repository, home to cutting-edge authorship obfuscation techniques using MUTANT-X. Elevate your text security with our implementation that surpasses the capabilities of existing methods.

Project Overview

Dive into the realm of Mutant-X authorship obfuscation, inspired by the groundbreaking research paper, "Mutant-X: Efficient Authorship Obfuscation Techniques." Our approach refines the original concept by introducing the Ensemble Vote classifier and harnessing the power of writeprint features to enhance SVM classifiers' training.

Key Features

  • Robust ensemble attribution classifier with a 10-SVM classifier pipeline.
  • Ensemble Vote classifier dynamically optimizes the ensemble by excluding low-accuracy classifiers.
  • Writeprint features, including:
  • Average character per word
  • Letter frequency
  • Key letter bigrams and trigrams
  • Digits and characters percentages
  • Frequency of digits and word lengths
  • Integration of Word2vec model (gensim) for efficient word neighbor computations.
  • MUTANT-X classifier with SVC, trained on the Amazon review dataset as an internal attribution classifier.
  • Local dictionary for storing computed neighbors, enhancing computation efficiency.
  • Achieved an outstanding average METEOR score of 0.5 on the Amazon review dataset, surpassing the implemented paper's results.

Usage Guide

Empower your projects with Mutant-X authorship obfuscation:

  1. Clone Repository: Copy this GitHub repository to your local machine.
  2. Install Dependencies: Install the necessary dependencies listed in the requirements.txt file.
  3. Prepare Input: Create the input text file you wish to obfuscate.
  4. Run Script: Execute the main.py script with the input text file as a command-line argument.
  5. Obfuscated Output: Retrieve the obfuscated text, saved in a file with the original name appended by "_obfuscated."

Conclusion

Revolutionize your text security with Mutant-X authorship obfuscation. Our project excels by combining Ensemble Vote classification, advanced writeprint features, and a meticulously trained MUTANT-X classifier. Notably, the local dictionary optimizes computation time, making the obfuscation process swift and effective. Experience superior results on the Amazon review dataset, marking a significant improvement over the original paper's implementation. Join us in shaping the future of authorship obfuscation!

About

Author Obfuscation Implementation of paper "Avengers Ensemble".


Languages

Language:Python 100.0%