Xenopus reference
From Marcotte Lab
All of these files are derived from XenBase (downloaded on May, 01, 2011).
Version 1. RefSeq of cDNA & protein
- Used XenBase files: NcbiMrnaXenbaseGene_laevis.txt, xlaevisMRNA.fasta
- Used XenBase files: NcbiProteinXenbaseGene_laevis.txt, xlaevisProtein.fasta
- Read gene name for each NCBI id from 'Ncbi...' file. Filter out genes with 'unnamed' in gene name field.
- Read all sequences from '.fasta' file. Convert all sequence character to upper case.
- If I find a sequence with '>gi|<gi number>|ref|<genbank accession>' header (means it is RefSeq entity), write it down.