asgaardlab / 21-markos-test_case_similarity_technique-code

Repository with the source code of our technique to analyze a test suite and find similar test cases written in natural language

Identifying Similar Test Cases That Are Specified in Natural Language

This repository contains the source code of our technique and related experiments to identify similar test cases written in natural language. The technique first clusters test steps which are semantically similar and then uses those clusters to identify similar test cases.


To cluster similar test steps, we performed several experiments with the following text embedding techniques, text similarity metrics, and clustering algorithms:

Text embedding techniques

Text similarity metrics

Clustering algorithms

  • Hierarchical Agglomerative Clustering
  • K-means
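As a rough illustration of the clustering step, the sketch below groups test steps with average-linkage hierarchical agglomerative clustering. It is not the code from the notebooks: the bag-of-words `embed` function is a toy stand-in for a real text embedding, and the cosine distance, the `threshold` parameter, and all function names are assumptions made for this example.

```python
import math
from collections import Counter


def embed(step):
    """Toy embedding: lowercase bag-of-words counts (stand-in for a real embedding)."""
    return Counter(step.lower().split())


def cosine_distance(a, b):
    """1 - cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def agglomerative_clusters(steps, threshold=0.5):
    """Average-linkage agglomerative clustering.

    Repeatedly merges the two closest clusters and stops once the closest
    pair is farther apart than `threshold` (an assumed stopping rule).
    Returns one cluster label per input step.
    """
    vectors = [embed(s) for s in steps]
    clusters = [[i] for i in range(len(steps))]

    def linkage(c1, c2):
        # Average pairwise distance between the members of two clusters.
        dists = [cosine_distance(vectors[i], vectors[j]) for i in c1 for j in c2]
        return sum(dists) / len(dists)

    while len(clusters) > 1:
        dist, i, j = min(
            ((linkage(clusters[i], clusters[j]), i, j)
             for i in range(len(clusters))
             for j in range(i + 1, len(clusters))),
            key=lambda t: t[0],
        )
        if dist > threshold:
            break
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]

    labels = [0] * len(steps)
    for label, members in enumerate(clusters):
        for idx in members:
            labels[idx] = label
    return labels


# Made-up test steps, for illustration only.
steps = [
    "Open the settings menu",
    "Open the settings screen",
    "Tap the login button",
    "Press the login button",
]
labels = agglomerative_clusters(steps)
```

With these example steps, the two "settings" steps end up in one cluster and the two "login" steps in another. A real run would swap `embed` for one of the text embedding techniques under evaluation.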

To find similar test cases, we used the identified clusters of similar test steps to build and evaluate four different techniques.
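One plausible way to use step clusters for test case similarity is sketched below: each test case is reduced to the set of cluster labels its steps fall into, and pairs of test cases are scored by the Jaccard overlap of those sets. This is an assumption for illustration and is not claimed to be one of the four techniques in this repository; the `step_to_cluster` mapping, the test case names, and the `threshold` value are all hypothetical.

```python
from itertools import combinations


def cluster_signature(steps, step_to_cluster):
    """Set of cluster labels covered by a test case's steps."""
    return {step_to_cluster[step] for step in steps}


def jaccard(a, b):
    """Jaccard overlap of two sets (defined as 1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def similar_test_cases(test_cases, step_to_cluster, threshold=0.5):
    """Return (name_a, name_b, score) for every pair scoring at or above `threshold`."""
    signatures = {
        name: cluster_signature(steps, step_to_cluster)
        for name, steps in test_cases.items()
    }
    pairs = []
    for a, b in combinations(sorted(signatures), 2):
        score = jaccard(signatures[a], signatures[b])
        if score >= threshold:
            pairs.append((a, b, score))
    return pairs


# Hypothetical cluster labels, as if produced by a test step clustering run.
step_to_cluster = {
    "Open the settings menu": 0,
    "Open the settings screen": 0,
    "Tap the login button": 1,
    "Press the login button": 1,
    "Close the app": 2,
}

# Made-up test cases, each a list of natural-language steps.
test_cases = {
    "TC1": ["Open the settings menu", "Tap the login button"],
    "TC2": ["Open the settings screen", "Press the login button"],
    "TC3": ["Close the app"],
}

pairs = similar_test_cases(test_cases, step_to_cluster)
```

Here TC1 and TC2 are reported as similar because their steps, although worded differently, map to the same clusters. Note that a set-based score ignores step order and repetition, which may or may not be acceptable for a given test suite.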

Structure of directories

The following directories contain the source code of all the approaches that were part of our experiments:

  • test-step-clustering: contains the notebooks with the source code for our test step clustering experiments.

  • test-case-similarity: contains the notebooks with the source code for our test case similarity experiments.

  • evaluations: contains the notebooks with the source code to evaluate all the approaches for test step clustering and techniques for test case similarity.


Dependencies

The following dependencies are required to run the notebooks on your local machine:

Languages

Jupyter Notebook: 100.0%