* This format contains a one line header followed by lines of sequence data.
* Sequences in fasta formatted files are preceded by a line starting with a " >" symbol.
* The first word on this line is the name of the sequence. The rest of the line is a description of the sequence.
* The remaining lines contain the sequence itself.
* Blank lines in a FASTA file are ignored, and so are spaces or other gap symbols (dashes, underscores, periods) in a sequence.
* Fasta files containing multiple sequences are just the same, with one sequence listed right after another.