Code for the analysis of sequence and OTU occurance data to derive optimal taxonomic structures for bacterial communities.
See original if you want to do anything with this.
I am just messing it up right now, as I struggle to figure out what is happening!
Paper describing technique:
FunFrame Pipeline used to get files to the right starting point:
- http://faculty.www.umb.edu/jennifer.bowen/software/FunFrame.zip
- See http://bioinformatics.oxfordjournals.org/content/29/9/1212.long
Puts together multiple .fasta files from different directories??
-
INPUT: Multiple directories each of which contain a
fasta
formatted file; specify common file name in variableFILE_NAME
. THat is, for the example above, you would setFILE_NAME=data.fasta
-
OUTPUT:
OTU_MAT.csv
This file is then used by the other files
? Don't really understand which file here
-
INPUT:
./clusterDist_0.12/concordance.fasta
-
INPUT:
OTU_MAT.csv
-
OUTPUT:
seq_otu_abundance.csv
-
INPUT:
./seq_otu_abundance.csv
-
OUTPUT:
-
INPUT:
/seq_otu_abundance.csv
-
OUTPUT: a plot
-
INPUT:
./seq_otu_abundance.csv
-
OUTPUT: a plot
Helper file; used by analyze_otu_networks.R
and sampleAICNet.R
Confused.