Lars-H / tpf_profiler

Framework to measure and record the performance of Triple Pattern Fragments.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DOI tee pee

Triple Pattern Fragment Profiler

The Triple Pattern Fragment Profiler is used to study the performance w.r.t. response time of Triple Pattern Fragments (TPFs). The profiler samples as set of triples from a given TPF and derives a set of triple patterns from those triples by replacing RDF terms with variables. Thereafter, these triple patterns are used to measure the response time of the TPF and record additional metadata.

Setup

Prerequisites:

  • Unix-based OS (Linux / Mac OS)
  • Python 2.7
  • pip

Follow these steps to setup and run the profiler:

  1. Download or clone this git repository
  2. cd tpf_profiler
  3. Install virtual environment package: [sudo] pip install virtualenv
  4. Activate the virtual environment: . venv/bin/activate
  5. Optional: Edit the sources.json to specify the mappings for TPF server pairs (local and remote)
  6. See the command line tool options via `python run_study.py -h``

Setting up the Experimental Settings

Setting up the controlled Environment:

Examples

Use command line to run the profiler and set the options to specify the profiler settings.

Example:
`

  • DBLP TPF with 10 samples and 1 run:
python run_study --url http://data.linkeddatafragments.org/dblp -s 10 -r -1
  • DBpedia TPF with 100 samples, 2 runs and write the results to a CSV file:
python run_study --url http://data.linkeddatafragments.org/dbpedia -s 100 -r -2 -w 1

How to Cite

Lars-H. (2018, April 6). 
Lars-H/tpf_profiler: Release v0.2 (Version 0.2). 
Zenodo. http://doi.org/10.5281/zenodo.1213694

Study Results and Analysis

  • The raw data create for the evaluation of TPF servers in our study is provided freely available here as a raw CSV file.
  • The statistical analysis providing the basis for our evaluation is available in the notebooks directory as a Jupyter Notebook.
  • The visulaizations for the publication is also provided in the notebooks directory

License

This work is licensed under BSD-3-Clause.

About

Framework to measure and record the performance of Triple Pattern Fragments.

License:BSD 3-Clause "New" or "Revised" License


Languages

Language:Python 94.9%Language:Jupyter Notebook 1.8%Language:HTML 1.3%Language:C++ 0.9%Language:C 0.9%Language:TeX 0.1%Language:JavaScript 0.1%Language:Fortran 0.0%Language:CSS 0.0%Language:MATLAB 0.0%Language:Shell 0.0%Language:Smarty 0.0%Language:Makefile 0.0%