Biomolecular sequences can be thought of as a long series of letters, four for DNA and twenty for amino acids. Aligning sequences to detect similarities is useful for finding patterns. Common patterns might give insights to common functionality of the sequences, assuming that the important functions were preserved in the genes. We introduce the alignment problem in the context of two sequences and extend to multiple sequence alignment. Three standard approaches are reviewed, comprising clustering techniques, multinomial models using Gibbs sampling, and Hidden Markov Models (HMM).
More details are available.