jesselawson / chatgpt-generated-text-detection-corpus

ChatGPT Generated Text Detection Corpus


Hello! And thank you for being here!

In this repository you may find the following:

Human-written and ChatGPT-written essays on 126 different topics. The essays are located in their respective folders inside the main folder of the repository.

Human essays were extracted automatically from this PDF, which is a collection of TOEFL essays.

ChatGPT essays were retrieved automatically from the ChatGPT API using this client.

As a metadata note, each line in each file corresponds to one paragraph of that essay, and the file name of each essay matches its question number.

The 126 questions themselves are listed in this file.
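The layout described above (one paragraph per line, file name matching the question number) can be read with a few lines of Python. This is only a minimal sketch under stated assumptions: the function names `load_essays` and `pair_essays` are hypothetical helpers, not part of this repository, and the code assumes that the part of each file name before the extension is the bare question number and that the files are UTF-8 encoded. Check the actual folder and file names before relying on it.

```python
from pathlib import Path


def load_essays(folder):
    """Return {question_number: [paragraph, ...]} for the essay files in `folder`."""
    essays = {}
    for path in sorted(Path(folder).iterdir()):
        if not path.is_file():
            continue
        # Assumption: the file stem is the question number (e.g. "12.txt" -> 12).
        try:
            question = int(path.stem)
        except ValueError:
            continue  # skip files that are not named after a question number
        text = path.read_text(encoding="utf-8")
        # Each non-empty line is treated as one paragraph of the essay.
        essays[question] = [line.strip() for line in text.splitlines() if line.strip()]
    return essays


def pair_essays(human_dir, chatgpt_dir):
    """Pair the human and ChatGPT essays that share a question number."""
    human = load_essays(human_dir)
    chatgpt = load_essays(chatgpt_dir)
    shared = sorted(human.keys() & chatgpt.keys())
    return {q: (human[q], chatgpt[q]) for q in shared}
```

Calling `pair_essays` with the paths of the human and ChatGPT essay folders would give, for each question number present in both, a tuple of two paragraph lists, which is a convenient starting point for building labeled training examples.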

This dataset was used in our work, "ChatGPT Generated Text Detection", which is published here.


License: MIT License