Next: The Construction Algorithm
Up: Constructing Physical Maps from
Previous: Problem Statement
Let Ca and Cb be two clones viewed as intervals of
the same length l. Define
=
and
.
The relative position of Ca,Cb and
is
shown in figure 9.10. The
overlap score uses the hybridization vectors
to produce a
vector probabilities for each length
of the overlap.
Figure 9.10:
Clone pair overlap score
|
We first calculate the probability
.
Let
,
,
and recall that
Ai,j is the number of occurrences of probe j in Ci. We
can thus write the following equation:
The calculation of the probabilities inside the summation is
straightforward using the statistical model. Since hybridization
is a virtual certainty if a probe occurs many times inside a
clone, we can limit the summation to small values of Ki (say
), thereby making the score computation feasible
while introducing only a negligible error. Considering each probe
as an independent source of information, the conditional
probability of the vector pair
is:
|
(8) |
Assuming uniform parameters for the probes, the expression
inside the product is
independent of j. Therefore, we can define
Px,y[t] by
Px,y[t] = Pr(Ba,j = x, Bb,j = y | t). In practice, instead of computing
Px,y[t] for each t in the interavl [0,l], we use score quantization of this interval, and perform the computation only for representative values of t.
Denoting by
Sx,y(a,b) the set of probes
,
such that
Ba,j = x and
Bb,j = y, we can write:
|
(9) |
Having computed
we can use Bayes Theorem:
|
(10) |
Next: The Construction Algorithm
Up: Constructing Physical Maps from
Previous: Problem Statement
Peer Itsik
2001-01-09