run_cellranger.pl

Create and run a snakemake pipeline that launches cellranger commands to biowulf.

View the entire readme with run_cellranger.pl -h

Parts of the cellranger workflow

This script produces a Snakemake file that will invoke a series of Cellranger commands to analyze single-cell RNA-seq data. The Cellranger steps invoked are:

mkfastq
count
mat2csv

Additionally, Seurat objects will be made for each sample.

Preparing for the run

Input consists of a tab-delimited file with the following column order, with each row representing a sample:

Library ID
Sample ID [must be unique]
Raw Data Directory
Lane
Sample Barcode Index
Group

'Sample ID' must be unique. 'Group' determines what samples will be merged together during the Seurat merge step. All samples with a given Group ID will be merged for that group. Samples can be in more than one group (separate Group ID values with commas), or can be ignored during merging by setting this field to 'NA'.

TODO:

1. [ ]Cleanup intermediate files (could be done using native snakemake nomenclature)

Author

Jason Inman inmanjm@nih.gov

About

Snakemake workflow and supporting files for running typical cellranger commands

ncats-dpi-ifx rna-seq ncats-multi-omics

Languages

Language:Perl 93.6%Language:R 6.4%