icbi-lab / luca

Single-cell Lung Cancer Atlas with 1.2M cells

Home Page:https://luca.icbi.at

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RAM usage in SCISSOR_TCGA

Mengflz opened this issue · comments

Hi, thank you for your work. When I was performing SCISSOR_TCGA step, I encountered an issue requiring over three thousand GB RAM. I wanted to inquire about how much memory resources you used when performing SCISSOR_TCGA?

I was working on a system where memory wasn't that much of an issue, so I can't say exactly, but I think you should be good with ~30 GB per process. Since there will be several hundred scissor processes be lauched at the same time, 3000GB doesn't sound too unrealistic.

If you have memory issues, I recommend adding

memory = 30.GB

to the process configuration here:

withName: ".*scissor:SCISSOR_TCGA" {
// errorStrategy = 'ignore'
ext.ignore_error = true
container = "${baseDir}/containers/2020-pircher-scissor2_2022-04-13.sif"
publishDir = [
path: { "${params.outdir}/scissor_tcga/scissor_by_sample/${id}" },
mode: params.publish_dir_mode
]
}

Then nextflow should take care of not launching more processes simultaneously than there's memory available.

Thank you for your response. I found that the problem might be due to my incorrect usage of the data.