Stochastic output

Question

Stochastic output

mwalker174 opened this issue 5 years ago · comments

We are observing non-deterministic behavior of whamg when run multiple times across cloud VMs on the same BAM file, where about 5-10% of calls are different. I see from a quick search of the code base that random sampling is used to estimate metrics (this seems to be seeded to the system clock with srand in wham and not in whamg), but from the logs it looks like these estimates converge consistently.

Can anyone reproduce this behavior?

Either way, it would also be very useful to set the random seed manually from the command line.

Zev Kronenberg · Answer 1 · Wed Sep 25 2019 05:27:48 GMT+0800 (China Standard Time)

Hi @mwalker174,

I've marked this as a question, but it's possible it is a bug. Let me think about this for a bit.