CH391L/UTpond

From Marcotte Lab
Revision as of 16:42, 28 March 2011

Databases

  • Prokaryotes: http://www.marcottelab.org/users/CH391L/UTpond/NCBI.bacteria/NCBI_bacteria_rep.tgz (835 MB, 870 genomes)
      □ Source: ftp://ftp.ncbi.nih.gov/genomes/Bacteria/ (2011-Feb-28 version).
      □ One chromosome per species, selected at random.
  • Eukaryotes (Organelle): http://www.marcottelab.org/users/CH391L/UTpond/ENA.organelle/ENA_organelle.fna.gz (2,982 genomes)
      □ Source: http://www.ebi.ac.uk/genomes/organelle.html
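
After downloading the organelle archive listed under Databases, the stated genome count could be sanity-checked by counting FASTA header lines. This is only a sketch: it assumes the file holds one FASTA record per genome, which the page does not state, and the function name count_fasta_records is made up for illustration.

```shell
#!/bin/sh
# Count FASTA records (lines starting with ">") in a gzip-compressed file.
# Assumption: one record per genome; multi-record genomes would inflate the count.
count_fasta_records() {
    gzip -dc < "$1" | grep -c '^>'
}

# Run only if the archive has already been downloaded to the current directory.
if [ -f ENA_organelle.fna.gz ]; then
    count_fasta_records ENA_organelle.fna.gz
fi
```

If the assumption holds, the printed number should be close to the 2,982 genomes quoted for this database.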

Number of reads

  • V3BC21: 213,419 (F3), 199,158 (F5), 223,100 (raw)
  • V3BC22: 634,431 (F3), 595,251 (F5), 661,120 (raw)
  • V3BC23: 1,015,471 (F3), 951,357 (F5), 1,060,308 (raw)
  • V3BC24: 562,803 (F3), 525,892 (F5), 588,304 (raw)
  • V3BC25: 569,125 (F3), 531,015 (F5), 595,064 (raw)
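
To compare barcodes of very different depth, it can help to express the F3 and F5 counts as a percentage of the raw count. The sketch below does that with the numbers copied from the list above. It assumes that "raw" is the total before filtering and that F3 and F5 are the per-tag counts that remain; the function names are illustrative only.

```shell
#!/bin/sh
# Read counts copied from the list above: barcode, F3, F5, raw.
read_counts() {
cat <<'EOF'
V3BC21 213419 199158 223100
V3BC22 634431 595251 661120
V3BC23 1015471 951357 1060308
V3BC24 562803 525892 588304
V3BC25 569125 531015 595064
EOF
}

# Print F3 and F5 as a percentage of the raw count, one line per barcode.
retention_table() {
    read_counts | awk '{ printf "%s\tF3=%.1f%%\tF5=%.1f%%\n", $1, 100*$2/$4, 100*$3/$4 }'
}

retention_table
```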

You can download CSFASTA files from here.

Analysis tool

$ gsmapper-cs foobar.csfasta db.fasta -E -N 4 -o 1000 -h 80% >foobar.gsmapper_sam 2>foobar.gsmapper_log
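
Once the command above has finished, a quick way to inspect the result is to tally alignment records in the output. The sketch below assumes foobar.gsmapper_sam is a standard tab-separated SAM file, where header lines start with "@" and bit 4 of the FLAG column marks an unmapped read; count_mapped is an illustrative name, not part of the mapper.

```shell
#!/bin/sh
# Tally SAM alignment records by the "unmapped" bit (value 4) of the FLAG column.
# Note: this counts records, not distinct reads; a read reported at several
# locations contributes one record per reported hit.
count_mapped() {
    awk -F '\t' '
        /^@/ { next }    # skip SAM header lines
        { if (int($2 / 4) % 2 == 1) unmapped++; else mapped++ }
        END { printf "mapped=%d unmapped=%d\n", mapped, unmapped }
    ' "$1"
}

# Run only if the mapper output is present in the current directory.
if [ -f foobar.gsmapper_sam ]; then
    count_mapped foobar.gsmapper_sam
fi
```

Testing the bit with integer arithmetic rather than a bitwise function keeps the script portable across awk implementations.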