ssmehta / container-pwiz

The ProteoWizard Library and Tools are a set of modular and extensible open-source, cross-platform tools and software libraries that facilitate proteomics data analysis, developed by the Proteowizard Team at http://proteowizard.sourceforge.net/. This repository contains the Docker image to convert from many vendor raw data formats to mzML via Linux WINE.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Proteowizard msconvert Docker Buildfile

The first step in a metabolomics data processing workflow with Open Source tools is the conversion to an open raw data format like [mzML]:https://github.com/HUPO-PSI/mzML/ .

One of the main routes to mzML-formatted data is using Open Source converter msconvert developed by the Proteowizard team (Chambers et al. 2012), which is one of the reference implementations for mzML. It can convert to mzML from Sciex, Bruker, Thermo, Agilent, Shimadzu, Waters and also the earlier file formats like mzData or mzXML.

Although Proteowizard was initially targeting LC/MS data, it can also readily convert GC/MS data for example from the Waters GCT Premier or Agilent instruments.

Building the Docker image

Please note that for licensing reasons we can not include all required files in this repository. Please head over to http://proteowizard.sourceforge.net/downloads.shtml and place the installer pwiz-setup-3.0.9098-x86.msi in this directory. Also note that the build is known to fail with Docker-1.9, make sure to use Docker-1.10 or above.

docker build --tag="phnmnl/pwiz:latest" .

Using the Docker image

After building the image, the conversion can be started with e.g.

docker run -v $PWD:/data:rw phnmnl/pwiz:latest /data/neg-MM8_1-A,1_01_376.d -o /data/ --mzML

The currently tested vendor formats are:

  • mzXML: docker run -it -v $PWD:/data phnmnl/pwiz:3.0.9098-0.1 threonine_i2_e35_pH_tree.mzXML
  • Bruker *.d: docker run -it -v $PWD:/data phnmnl/pwiz:3.0.9098-0.1 neg-MM8_1-A,1_01_376.d

Galaxy usage

A rudimentary Galaxy node description is included as msconvert.xml, it was obtained from the msconvert.ctd using python CTD2Galaxy/generator.py -i /vol/phenomenal/vmis/docker-pwiz/msconvert.ctd -m sample_files/macros.xml -o /vol/phenomenal/vmis/docker-pwiz/msconvert.xml

About

The ProteoWizard Library and Tools are a set of modular and extensible open-source, cross-platform tools and software libraries that facilitate proteomics data analysis, developed by the Proteowizard Team at http://proteowizard.sourceforge.net/. This repository contains the Docker image to convert from many vendor raw data formats to mzML via Linux WINE.

License:Apache License 2.0


Languages

Language:Python 76.2%Language:Shell 21.0%Language:Ruby 2.8%