Fasta file format

Fasta file format Fasta file format Fasta file format

FASTA (pronounced "fast-ay") is a commonly used format for representing nucleotide or protein sequences in bioinformatics. The format was developed by David J. Lipman and William R. Pearson in the early 1980s, and has since become a standard format for sequence data.

Fasta file format Fasta file format Fasta file format

A FASTA file typically consists of a header line followed by a sequence of nucleotides or amino acids. The header line starts with a ">" symbol and is followed by a description of the sequence, which can include any relevant information about the source organism or experiment. The sequence data follows the header line and can be composed of any combination of nucleotide or amino acid characters, depending on the type of sequence being represented.

Fasta file format Fasta file format Fasta file format

Here is an example of a FASTA file format for a protein sequence:

Ex.

shell

Copy code

>gi|142022655|gb|EQ086233.1|114

MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPK

VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG

KEFTPPVQAAYQKVVAGVANALAHKYH

Fasta file format Fasta file format

In this example, the header line contains information about the sequence identifier and source, while the sequence data consists of amino acid characters.

FASTA is a simple and flexible format that can be used to represent a wide range of sequence data, including genomic and transcriptomic data, protein sequences, and even alignment data. Many bioinformatics software tools and databases support FASTA format for input and output of sequence data. Fasta file format Fasta file format Fasta file format

Fasta file format

Fasta file format

Difference between local and global alignment

Contact Form