ramtiin / Detecting-Machine-Generated-Text

The findings of this research reveal several intriguing disparities between human and AI text generation. I demonstrated that these differences could be successfully utilized by classifiers to distinguish between human and AI-generated text.

Features Distribution (figure)

Abstract

The remarkable advances in natural language processing, embodied by the advent of large language models (LLMs) like GPT-3, have brought about both promise and peril. These LLMs, with their capacity to generate impressively coherent and fluent text, power a spectrum of applications ranging from conversational agents to creative writing aids and productivity-boosting auto-completion programs. However, these same capabilities risk enabling malicious activities, such as the mass production of synthetic disinformation, spam, and phishing content (Solaiman et al., 2019; Spitale, 2023; Gradon, 2023).

Given this duality, the critical need to develop robust strategies for detecting machine-generated text is evident. This dissertation addresses this pressing challenge by focusing on linguistic techniques for distinguishing between human-written and machine-generated texts. Our investigation seeks to deepen understanding of the subtle yet distinct differences between human and LLM writing styles, thereby enabling the identification of AI-generated content.

Research Methods

For this research, two primary methods were developed:

  1. A hand-crafted feature-based approach that applies classical machine learning algorithms to identify signals and patterns suggestive of artificially constructed text. Features include perplexity, lexical diversity, average sentence length, and semantic inconsistencies, among others (a minimal sketch of this pipeline appears after this list).

  2. A deep learning-based approach that fine-tunes a RoBERTa model on labelled examples of human-written and machine-generated text (a fine-tuning sketch appears after the Evaluation section below).
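
The following is a minimal sketch of how such a feature-based detector could be assembled in Python, assuming the `transformers`, `torch`, `nltk`, and `scikit-learn` packages; the specific features, model choice (GPT-2 for perplexity), and helper names are illustrative rather than the exact pipeline used in this repository.

```python
# Sketch: hand-crafted features + a classical classifier (illustrative, not the
# dissertation's exact feature set or hyperparameters).
import math

import nltk
import torch
from nltk.tokenize import sent_tokenize, word_tokenize
from sklearn.linear_model import LogisticRegression
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

nltk.download("punkt", quiet=True)

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
lm = GPT2LMHeadModel.from_pretrained("gpt2").eval()


def perplexity(text: str) -> float:
    """Perplexity of `text` under GPT-2 (machine text often scores lower)."""
    enc = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
    with torch.no_grad():
        loss = lm(**enc, labels=enc["input_ids"]).loss
    return math.exp(loss.item())


def extract_features(text: str) -> list[float]:
    """Perplexity, lexical diversity (type-token ratio), average sentence length."""
    words = word_tokenize(text)
    sents = sent_tokenize(text)
    lexical_diversity = len({w.lower() for w in words}) / max(len(words), 1)
    avg_sentence_length = len(words) / max(len(sents), 1)
    return [perplexity(text), lexical_diversity, avg_sentence_length]


def train_classifier(texts: list[str], labels: list[int]) -> LogisticRegression:
    """labels: 1 = machine-generated, 0 = human-written."""
    X = [extract_features(t) for t in texts]
    return LogisticRegression(max_iter=1000).fit(X, labels)
```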

Evaluation

The techniques were evaluated using the Human ChatGPT Comparison Corpus (HC3) introduced by Guo et al. (2023). This corpus consists of almost 40,000 questions and their corresponding responses from both human experts and ChatGPT (GPT-3.5). Additionally, the GPT-2 Output Dataset [1] provided by OpenAI was used. This dataset includes 250,000 documents from the WebText test set, as well as 250,000 random samples generated by each GPT-2 model.
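
As a rough illustration of the RoBERTa-based approach described above, the sketch below fine-tunes `roberta-base` as a binary human-vs-machine classifier on HC3-style (text, label) pairs with the Hugging Face `Trainer`; the toy data, column names, and hyperparameters are assumptions, not the exact configuration used in this repository.

```python
# Sketch: fine-tuning RoBERTa as a human (0) vs. machine (1) text classifier.
from datasets import Dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForSequenceClassification.from_pretrained("roberta-base", num_labels=2)

# Hypothetical lists built from an HC3-style corpus:
# human answers get label 0, ChatGPT answers get label 1.
texts = ["example human answer", "example ChatGPT answer"]
labels = [0, 1]
ds = Dataset.from_dict({"text": texts, "label": labels})


def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=256)


ds = ds.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="roberta-detector",          # hypothetical output path
        num_train_epochs=3,
        per_device_train_batch_size=16,
    ),
    train_dataset=ds,
)
trainer.train()
```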

Findings

The findings of this research reveal several intriguing disparities between human and AI text generation. I demonstrated that these differences could be successfully utilized by classifiers to distinguish between human and AI-generated text. Both the hand-crafted feature-based approach and the RoBERTa-based deep learning approach achieved high precision and recall scores in identifying AI-generated content. These findings serve to underline the effectiveness of the developed techniques in this burgeoning field of study.

Footnotes

  1. GPT-2 Output Dataset: https://github.com/openai/gpt-2-output-dataset

About


License: GNU General Public License v3.0


Languages

Language: Jupyter Notebook 100.0%