Install the packages by using pip install -r requirements.txt
ref
file contains the refs to the keywords suggested on the comments
main.py
has an usage example, using the Pull Request file. The file has a list of PRs with all its info.
output_<project_name>.json
contains the features
In that output we are extracting the following features:
number
: PR number
title_n_words
: PR title number of wordsdescription_n_words
: PR description number of wordscomments_n_words
: PR comments number of wordscomments_review_n_words
PR review comments number of words
n_commits
Number of commits associated with the PRmedian_words_commits
: Median of words in commits associated with the PR
core_all
: Number of keywords (in general) mentioned by core developers on comments and review commentscontributor_all
: Number of keywords (in general) mentioned by contributors developers on comments and review commentsnewcomer_all
: Number of keywords (in general) mentioned by newcomers developers on comments and review commentscore_maint
: Number of maintainability keywords mentioned by core developers on comments and review commentscontributor_maint
: Number of maintainability keywords mentioned by contributor developers on comments and review commentsnewcomer_maint
: Number of maintainability keywords mentioned by newcomer developers on comments and review commentscore_sec
: Number of security keywords mentioned by core developers on comments and review commentscontributor_sec
: Number of security keywords mentioned by contributor developers on comments and review commentsnewcomer_sec
: Number of security keywords mentioned by newcomers developers on comments and review commentscore_perf
: Number of performance keywords mentioned by core developers on comments and review commentscontributor_perf
: Number of performance keywords mentioned by contributor developers on comments and review commentsnewcomer_perf
: Number of performance keywords mentioned by newcomer developers on comments and review commentscore_robu
: Number of robustness keywords mentioned by core developers on comments and review commentscontributor_robu
: Number of robustness keywords mentioned by contributor developers on comments and review commentsnewcomer_robu
: Number of robustness keywords mentioned by newcomer developers on comments and review comments
title_all
: Number of keywords (in general) in PR titledescription_all
: Number of keywords (in general) in PR descriptioncomments_all
: Number of keywords (in general) in PR commentsreview_comments_all
: Number of keywords (in general) in PR review commentscommits_all
: Number of keywords (in general) in PR associated commitsgeneral_all
: Number of keywords (in general) in all artifactstitle_maint
: Number of maintainability keywords in PR titledescription_maint
: Number of maintainability keywords in PR descriptioncomments_maint
: Number of maintainability keywords in PR commentsreview_comments_maint
: Number of maintainability keywords in PR review commentscommits_maint
: Number of maintainability keywords in PR associated commitsgeneral_maint
: Number of maintainability keywords in all artifactstitle_sec
: Number of security keywords in PR titledescription_sec
: Number of security keywords in PR descriptioncomments_sec
: Number of security keywords in PR commentsreview_comments_sec
: Number of security keywords in PR review commentscommits_sec
: Number of security keywords in PR associated commitsgeneral_sec
: Number of security keywords in all artifactstitle_perf
: Number of performance keywords in PR titledescription_perf
: Number of performance keywords in PR descriptioncomments_perf
: Number of performance keywords in PR commentsreview_comments_perf
: Number of performance keywords in PR review commentscommits_perf
: Number of performance keywords in PR associated commitsgeneral_perf
: Number of performance keywords in all artifactstitle_robu
: Number of robustness keywords in PR titledescription_robu
: Number of robustness keywords in PR descriptioncomments_robu
: Number of robustness keywords in PR robustness commentsreview_comments_robu
: Number of robustness keywords in PR review commentscommits_robu
: Number of robustness keywords in PR associated commitsgeneral_robu
: Number of robustness keywords in all artifacts