Given a family $S = \{S_1, \ldots, S_k\}$ of $k$ sequences that are
``similar'' to each other, we would like to find the common
characteristics of this family. Aligning each pair of sequences from
$S$ separately often does not reveal this common information.
A multiple alignment of $S$ is a new set of sequences
$S' = \{S'_1, \ldots, S'_k\}$ such that:
- All the strings in $S'$ are of equal length. We denote this length by $l$.
- Each $S'_i$ was generated from $S_i$ by inserting spaces.
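The two conditions above can be checked mechanically. The following is a minimal sketch, not part of the original notes: the function name `is_multiple_alignment` and the choice of `-` as the space symbol are assumptions made for illustration, and the input sequences are assumed not to contain `-` themselves.

```python
GAP = "-"  # assumed symbol for an inserted space


def is_multiple_alignment(seqs, aligned):
    """Return True if `aligned` is a multiple alignment of `seqs`.

    Checks the two conditions of the definition:
    1. all aligned strings have the same length l;
    2. each aligned string equals its original once spaces are removed.
    """
    if len(seqs) != len(aligned):
        return False
    # Condition 1: a single common length.
    if len({len(row) for row in aligned}) > 1:
        return False
    # Condition 2: removing the spaces recovers each original sequence.
    return all(row.replace(GAP, "") == s for s, row in zip(seqs, aligned))
```

For example, `["ACGT", "A-GT"]` passes as an alignment of `["ACGT", "AGT"]`, while leaving the second string as `"AGT"` fails the equal-length condition.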
When performing multiple alignment, as in the case of pairwise
alignment, one wishes to evaluate the quality of the alignment
by giving it a numeric score (see also lecture 3).
Definition 5.1
- The sum of pairs (SP) score of a multiple alignment $S'$
is the sum of the scores of the pairwise global alignments
induced by $S'$.
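Let $\sigma(x,y)$ be our scoring function, i.e., the price of aligning
the character $x$ with the character $y$, for
$x, y \in \Sigma \cup \{-\}$, where $\Sigma$ is the alphabet and $-$ denotes a space.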
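We assume that the scoring function $\sigma$ satisfies
$\sigma(-,-) = 0$,
$\sigma(x,y) = \sigma(y,x)$,
and that the triangle inequality
$\sigma(x,y) \le \sigma(x,z) + \sigma(z,y)$ holds.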
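As an illustration, the SP score can be computed column by column as $\sum_{i<j} \sum_{c=1}^{l} \sigma(S'_i[c], S'_j[c])$. The sketch below is an assumption-laden example rather than the notes' own code: the names `sigma` and `sp_score` are hypothetical, and the unit cost used here (0 for identical symbols, 1 otherwise) is just one simple function that is symmetric, gives two aligned spaces a price of 0, and obeys the triangle inequality.

```python
from itertools import combinations

GAP = "-"  # assumed symbol for a space


def sigma(x, y):
    """Hypothetical unit cost: 0 if the symbols are equal, 1 otherwise.

    Two aligned spaces cost 0, the cost is symmetric, and it satisfies
    the triangle inequality.
    """
    return 0 if x == y else 1


def sp_score(aligned, score=sigma):
    """Sum-of-pairs score of a multiple alignment given as equal-length strings."""
    if len({len(row) for row in aligned}) > 1:
        raise ValueError("all rows of the alignment must have equal length")
    total = 0
    # Every unordered pair of rows contributes its induced pairwise score.
    for a, b in combinations(aligned, 2):
        total += sum(score(x, y) for x, y in zip(a, b))
    return total


print(sp_score(["ACGT", "A-GT", "AC-T"]))  # prints 4
```

Because two aligned spaces cost nothing under this `sigma`, a column in which both rows of a pair hold a space adds 0 to the total, so summing over all columns gives the same value as scoring the induced pairwise alignment with such columns removed.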
Itshack Pe'er 1999-03-16