monperrus / ODSExperiment

Automated Classification of Overfitting Patches with Statically Extracted Code Features

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Automated Classification of Overfitting Patches with Statically Extracted Code Features

In this repository, we do overfitting decttion system (ODS) for program repair patches.

Folder Structure

├── Experiment: csv feature data and script for reproducing our experiment
│ 
├── Features: ODS code features
│   └── Code: ODS code description features in JSON format
│   └── Patterns: ODS repair pattern features in JSON format
│   └── Context: ODS context features in JSON format 
├── Source: The source program files that can be taken input for Coming to generate ODS features
│
├── Tests: Evosuite tests generated for Bugs.jar and Bears for labeling the correctness of RepairThemAll patches
│
└── RawRepairThemAllPatches: raw patches from the experiment of RepairThemAll

ODS Feature Extraction

We have integrated ODS feature extraction with an open source tool Coming. To extract code features, you can parse a pair of source and target files in Source folder. Use the feature mode of Coming to obtain ODS features.

parameters

We use the default parameters of XGBoost (i.e., learning_rate sets to 0.3 and max_depth sets to 6), only turning the gamma to 0.5. All parameters can be found in our notebooks.

About

Automated Classification of Overfitting Patches with Statically Extracted Code Features