
April 2, 2007 14:42 World Scientific Review Volume - 9in x 6in Main˙WorldSc˙IPR˙SAB
38 Synthesis and Analysis in Biometrics
(i-1, j-1)
(i, j-1)
(i-1, j)
(i, j)
V
(
--
b
)
V
(
a
i
-
)
V
(
a
i
b
j
)
More formally, there are three possibilities:
a
b
,
−
b
,and
a
−
. The figure on the left
depicts the cell (i, j) and possible scores. For
each aligned pair
A
B
,whereA and B are
either normal sequence entries or gaps, there
is an assigned score σ
A
B
. The total score of
a pairwise alignment is defined to be the sum
of the σ values of all aligned pairs.
Objective function. The central problem in sequence matching is to find
a mathematical function, so called objective function, which will be able
to measure the quality of an alignment. Such an objective function should
incorporate everything that is known about the sequences including their
structure, functionality and a priory data. These data are rarely known
and usually being replaced with sequence similarity based on metrics and
scoring.
Metrics and scoring. The main idea of metrics is to get a universal
mechanism of comparing two sequences. Metrics can vary depending on the
objective function, they can embrace distance measures and/or correspond
to statistical computations. The scoring is determined by the amount of
credit an alignment receives for each aligned pair of identical entities (the
match score), the penalty for aligned pairs of non-identical entities (the
mismatch score), and the gap penalty for the alignment with gaps. A
simple alignment score can be computed as follows:
K
k=1
gap penalty; if a
i
=
−
or b
j
=
−
,
match score; if no gaps and a
i
= b
j
,
mismatch score; if no gaps and a
i
= b
j
.
Data alphabet. The alphabet for sequence processing in case of
curvature, velocity and pressure are represented originally by continuous
values and cab be encoded by integers forming a multiple-valued
representation. This encoding procedure can be based on learning rules
and/or discretization principles for continuous sequences
[
13
]
.
Example 2.2. Let us apply the alignment technique described above for
two sequences {1, 2, 1, 4, 2, 1, 4} and {1, 3, 2, 3, 4} assuming that the
gap penalty is −1, the match score is +1, and the mismatch score is 0.
The global alignment of the two sequences is equivalent to a path from the
upper left corner of the matrix to the lower right. A horizontal move in
the matrix represents a gap in the sequence along the left axis. A vertical