Regarding Oxford Nanopore data analysis

ps120195 opened this issue 5 years ago

Duncan MacCannell · Answer 1 · Tue Mar 31 2020 04:35:10 GMT+0800 (China Standard Time)

Was there a specific issue, or is this more of a philosophical conjecture?

ps120195 · Answer 2 · Tue Mar 31 2020 04:47:44 GMT+0800 (China Standard Time)

Hi, please find the attachments. I ran the SARS-CoV-2 sequencing pipeline on nanopore data and I am getting two different kinds of results. All the commands were the same, and even the samples were the same, but the runs were on different systems. Can you tell me why this is happening?

Duncan MacCannell · Answer 3 · Tue Mar 31 2020 04:50:47 GMT+0800 (China Standard Time)

Happy to help. Which pipeline? Attachments were missing.

ps120195 · Answer 4 · Tue Mar 31 2020 04:52:22 GMT+0800 (China Standard Time)

[Attachment not preserved]

ps120195 · Answer 5 · Tue Mar 31 2020 05:00:40 GMT+0800 (China Standard Time)

I ran it three times, but I am still not getting the VCF details shown in image 1. Instead it says the fasta sequence does not match the REF allele ... and so on.

Duncan MacCannell · Answer 6 · Tue Mar 31 2020 05:01:31 GMT+0800 (China Standard Time)

If these are two different systems, are you sure that the Perl environment and all dependencies are the same version?

ps120195 · Answer 7 · Tue Mar 31 2020 05:04:56 GMT+0800 (China Standard Time)

Would that make a difference like this?

ps120195 · Answer 8 · Tue Mar 31 2020 05:08:50 GMT+0800 (China Standard Time)

All dependencies and the Perl environment are the same, for sure.

ps120195 · Answer 9 · Tue Mar 31 2020 13:44:24 GMT+0800 (China Standard Time)

Also, the dependencies were installed with pip, so their versions are the same on both systems. Please suggest why this is happening. What output should we expect from vcf_mask_lowcoverage.pl in the terminal?

Clint · Answer 10 · Tue Mar 31 2020 23:11:07 GMT+0800 (China Standard Time)

I'm not clear on the difference between screenshots 1 and 2.

In screenshot 1, it looks like it finished correctly. Did you get a reasonable consensus in 'consensus.fasta'?

In screenshot 2, something went wrong. Are you using the same reference fasta that was used for read mapping? Bcftools is very picky about the vcf and the reference to which it applies variants. It may be possible that the reference was getting masked incorrectly, but I can't work out why that would be. I wonder if you could check the samtools depth at position 8782 and potentially let me have a look at your vcf? Interestingly, position 8782 is one where we have observed a lot of variation.
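
One way to cross-check the "does not match the REF allele" complaint by hand is to print the reference base and the VCF REF allele at the same position and compare them. The sketch below is only an illustration and is not part of the pipeline: the function names `ref_base_at` and `vcf_ref_at` are made up for this example, and it assumes a single-record reference and an uncompressed VCF.

```shell
# Hypothetical helpers (not part of the pipeline) for cross-checking a VCF
# record against the reference by hand. POSIX sh and awk only.

# Print the base at 1-based position $2 of the first record in FASTA file $1.
ref_base_at() {
  awk -v pos="$2" '
    /^>/ { if (seen++) exit; next }   # skip the header; stop at a second record
    { sub(/\r$/, ""); seq = seq $0 }  # drop a trailing CR, then join the lines
    END { print substr(seq, pos, 1) }
  ' "$1"
}

# Print the REF column of every record at position $2 in uncompressed VCF $1.
vcf_ref_at() {
  awk -F '\t' -v pos="$2" '!/^#/ && $2 == pos { print $4 }' "$1"
}

# Example, using file names from this thread:
#   ref_base_at MN908947.3.fasta 8782
#   vcf_ref_at VIC07_ONT.vcf 8782
```

If the two printed values disagree, the VCF was probably called against a different reference than the one being passed to bcftools.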

ps120195 · Answer 11 · Wed Apr 01 2020 00:15:53 GMT+0800 (China Standard Time)

Yes, I am using the same reference that I used for mapping.

ps120195 · Answer 12 · Wed Apr 01 2020 00:24:01 GMT+0800 (China Standard Time)

consensus2.fasta is my consensus fasta and MN908947.3.fasta is my reference file, which I also used for mapping. I am also getting that C to T variant at the same position, 8782.

ps120195 · Answer 13 · Wed Apr 01 2020 00:42:03 GMT+0800 (China Standard Time)

samtools depth at position 8782 is 1871

ps120195 · Answer 14 · Wed Apr 01 2020 00:46:49 GMT+0800 (China Standard Time)

Here I ran everything from start to finish and the result is still the same. Please see the screenshot.

Clint · Answer 15 · Wed Apr 01 2020 01:31:10 GMT+0800 (China Standard Time)

Hmm. I'd like to get to the bottom of this, but I need a little more info. Can you show me the output of the following:

bcftools view VIC07_ONT.vcf | grep -EC3 "\s8782\s"
bcftools view VIC07_ONT.vcf.masked.vcf.gz | grep -EC3 "\s8782\s"

ps120195 · Answer 16 · Wed Apr 01 2020 01:39:47 GMT+0800 (China Standard Time)

[Attachment not preserved]

ps120195 · Answer 17 · Wed Apr 01 2020 01:43:36 GMT+0800 (China Standard Time)

It was -EC3, sorry.

Clint · Answer 18 · Wed Apr 01 2020 02:26:24 GMT+0800 (China Standard Time)

Those look OK to me. The only other thing I can think of is that there is something funky going on with the reference. Can you try running dos2unix MN908947.3.fasta and then running the script again? If that is the issue, I can make a change to fix this (I will add it in any case).
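
Where dos2unix is not installed, the same check and conversion can be approximated with standard tools. This is a rough sketch rather than anything the pipeline itself does: `has_cr` and `strip_cr` are names invented for this example, and the output file name in the usage comment is a placeholder.

```shell
# Hypothetical helpers; dos2unix does the same job when it is installed.

# Succeed (exit status 0) if file $1 contains any carriage return.
has_cr() {
  cr=$(printf '\r')
  grep -q "$cr" "$1"
}

# Write a copy of $1 to $2 with every carriage return removed.
strip_cr() {
  tr -d '\r' < "$1" > "$2"
}

# Example, using the reference name from this thread
# (the output name is a placeholder):
#   has_cr MN908947.3.fasta && strip_cr MN908947.3.fasta MN908947.3.unix.fasta
```

Writing to a new file rather than converting in place keeps the original available for comparison.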

ps120195 · Answer 19 · Wed Apr 01 2020 02:28:59 GMT+0800 (China Standard Time)

Yes, sure.

ps120195 · Answer 20 · Wed Apr 01 2020 02:51:58 GMT+0800 (China Standard Time)

I tried the dos2unix command and ran the full script again. Still no change in the output.

ps120195 · Answer 21 · Wed Apr 01 2020 03:26:35 GMT+0800 (China Standard Time)

I have now tried using MN908947.fna instead of MN908947.fasta, and it worked.
See the output.

Clint · Answer 22 · Wed Apr 01 2020 03:42:38 GMT+0800 (China Standard Time)

OK, so it looks like you converted the line endings for "MN908947.fasta" and it worked. Using "MN908947.fna" (which is identical except that its line endings were not converted to Unix line endings) throws the error. I think these are all consistent, unless I misunderstand you. I will make the change to take into consideration fasta files with Windows line endings.

ps120195 · Answer 23 · Wed Apr 01 2020 03:53:39 GMT+0800 (China Standard Time)

Thank you for helping me out. I learnt a lot during this error hunt. Now that the error is resolved, I want to know whether I have to use only file.fna for this pipeline.

Clint · Answer 24 · Wed Apr 01 2020 04:39:46 GMT+0800 (China Standard Time)

No worries! Glad you caught this, as it's an easy fix but annoying for users.
The filename doesn't matter, as long as the fasta header is the same and (for now) the Windows line endings of your reference file are converted to Unix line endings.
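
A quick way to confirm that the header condition holds is to list the sequence name in the reference and the CHROM values in the VCF and compare them by eye. The sketch below is an illustration only: `fasta_ids` and `vcf_chroms` are hypothetical helper names, and it assumes an uncompressed VCF.

```shell
# Hypothetical helpers for checking that the sequence names line up.

# Print the sequence ID (first word of each header line) in FASTA file $1.
fasta_ids() {
  awk '/^>/ { sub(/\r$/, ""); print substr($1, 2) }' "$1"
}

# Print the distinct CHROM values used by records in uncompressed VCF $1.
vcf_chroms() {
  awk -F '\t' '!/^#/ { print $1 }' "$1" | sort -u
}

# Example, using file names from this thread:
#   fasta_ids MN908947.3.fasta
#   vcf_chroms VIC07_ONT.vcf
```

Note that `fasta_ids` drops a trailing carriage return before printing, so it checks the name only and says nothing about the line endings.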

Clint · Answer 25 · Fri Apr 03 2020 04:43:42 GMT+0800 (China Standard Time)

@dmaccannell I think this can be closed