[E::bwa_idx_load_from_disk] fail to locate the index files

Question

xxYaaoo opened this issue 8 months ago

Hello, Dr. Chen! I ran into a problem when I tried to run the test data. The .err log showed the following details:

And here is my command line:

How can I solve this problem?
thank you for your help!

Xun Chen · Answer 1 · Fri Oct 27 2023 12:55:10 GMT+0800 (China Standard Time)

Hi,

If you have correctly indexed the human genome and the TE consensus sequences, this error from the very early bwa alignment step may occur because no chimeric/split reads were identified.

May I get a list of the intermediate output files, from running "ls -lh" in your output folder?

Best,
Xun
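
A listing like the one requested here is mostly useful for spotting intermediate files that came out empty. As a rough sketch, the helper below prints every zero-byte file under a folder; the function name `list_empty_files` and the folder name `output_dir` are made up for illustration and are not part of ERVcaller.

```shell
#!/usr/bin/env bash
# list_empty_files DIR
# Print every zero-byte regular file under DIR, one per line, sorted.
# (Hypothetical helper, not part of ERVcaller.)
list_empty_files() {
    find "$1" -type f -size 0 | sort
}

# Example (output_dir is an assumed folder name; use your own):
#   ls -lh output_dir
#   list_empty_files output_dir
```

An empty result only means that no file is zero bytes; it does not by itself show that the run succeeded.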

xxYaaoo · Answer 2 · Fri Oct 27 2023 13:19:13 GMT+0800 (China Standard Time)

Thank you for your reply!
My output folder:

Besides, I am wondering what you mean by "correctly indexed the human genome and TE consensus sequences".
Is there any step that needs to be done on the human genome and TE consensus sequences before running ERVcaller_v1.4.pl?

Xun Chen · Answer 3 · Fri Oct 27 2023 13:23:17 GMT+0800 (China Standard Time)

Yes, when you prepare the reference files before running ERVcaller.pl, you need to run the "bwa index" command on both the human genome and the TE consensus sequences. You could try that first and then let me know if you still have the same issue.

Best,
Xun
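
The indexing step described above can be checked before launching ERVcaller. The sketch below relies on the fact that `bwa index` writes five files next to the FASTA (`.amb`, `.ann`, `.bwt`, `.pac`, `.sa`). The function name `check_bwa_index` and the file name `TE_consensus.fa` are assumptions for illustration; substitute your own reference paths.

```shell
#!/usr/bin/env bash
# check_bwa_index FASTA
# Report which of the five files written by `bwa index` are missing or empty
# next to FASTA. Returns 0 if all are present, 1 otherwise.
# (Hypothetical helper, not part of ERVcaller or BWA.)
check_bwa_index() {
    local fasta="$1" ext missing=0
    for ext in amb ann bwt pac sa; do
        if [ ! -s "${fasta}.${ext}" ]; then
            echo "missing or empty: ${fasta}.${ext}"
            missing=1
        fi
    done
    return "$missing"
}

# Example (TE_consensus.fa is an assumed file name):
#   check_bwa_index hg38.fa         || bwa index hg38.fa
#   check_bwa_index TE_consensus.fa || bwa index TE_consensus.fa
```

Running the check on both references guards against the case where only one of the two was indexed.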

xxYaaoo · Answer 4 · Fri Oct 27 2023 16:09:45 GMT+0800 (China Standard Time)

Hi, Dr. Chen! I think I forgot to run "bwa index hg38.fa" previously. After running this command, I ran the ERVcaller command line again.
This is my output folder (the .vcf file is still empty). I feel there might still be some problems.

Thank you for your help!

Xun Chen · Answer 5 · Fri Oct 27 2023 16:14:28 GMT+0800 (China Standard Time)

Hi,

Have you also indexed your TE consensus sequences? Can you also share the log file and the file sizes under the temp folder?

Xun

xxYaaoo · Answer 6 · Fri Oct 27 2023 16:23:07 GMT+0800 (China Standard Time)

Yes, I checked my notes and confirmed that I had indexed the TE consensus sequences before.

This is my temp folder:

My slurm.out file:

The head of my slurm.err file:

Thank you so much!!!

Xun Chen · Answer 7 · Fri Oct 27 2023 16:34:27 GMT+0800 (China Standard Time)

I can't find any problem with your log and temp files.

Could you try using the BAM file or the TE_seq FASTQ files as input? (Not TE_seq2, which may not contain simulated insertions and is used for testing separate FASTQ inputs.)

Best,
Xun

xxYaaoo · Answer 8 · Fri Oct 27 2023 16:58:24 GMT+0800 (China Standard Time)

Yeah, Dr. Chen! I used the BAM file and it seems to have succeeded!

I am also curious: is it normal for the slurm.err file to be non-empty even when the run is successful?
(I will try to figure out the problems with the .fq.gz files and then run my own data.)

Xun Chen · Answer 9 · Mon Oct 30 2023 10:43:01 GMT+0800 (China Standard Time)

Hi,

I am glad that it works!

I don't know what is included in your slurm.err file, but sometimes it is just the log output from BWA or samtools, which should be fine.

Sure, let me know if you have other questions. As I suggested, the TE_seq files contain the simulated integration sites but TE_seq2 may not, which could be the issue.

Best,
Xun
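
One rough way to separate ordinary BWA/samtools log output from real failures in a file such as slurm.err is to count the lines carrying the `[E::` prefix, as in the `[E::bwa_idx_load_from_disk]` message that started this thread. This is only a sketch: it assumes errors keep that prefix (informational lines usually start with `[M::`), and the function name `count_error_lines` is made up for illustration.

```shell
#!/usr/bin/env bash
# count_error_lines LOG
# Print the number of lines in LOG that contain the "[E::" error prefix.
# (Hypothetical helper; the prefix convention is an assumption about the tools.)
count_error_lines() {
    # grep -c exits non-zero when the count is 0, so mask that status.
    grep -cF '[E::' "$1" || true
}

# Example:
#   count_error_lines slurm.err
```

A count of 0 suggests the file holds only progress messages, though errors from other tools may not follow this prefix.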

xxYaaoo · Answer 10 · Wed Nov 01 2023 13:35:44 GMT+0800 (China Standard Time)

Dear Dr. Chen,

I have run my own data successfully using ERVcaller! You really made my day! Thank you so much, it is much appreciated!

Best wishes,
Yaaoo