next up previous
Next: Problem Statement Up: Constructing Physical Maps from Previous: The Statistical Model

   
Clone Pair Overlap Score

Let Ca and Cb be two clones viewed as intervals of lengths la and lb respectively. Without loss of generality, assume $l_{a} \leq l_{b}$. Define $C'_{\gamma}$ = $C_{a} \bigcap C_{b}$ and $l_{\gamma} = \vert C_{\gamma} \vert$. The overlap score uses the hybridization vectors $\overrightarrow{B_{a}} , \overrightarrow{B_{b}} $ to produce a vector probabilities for the overlap length $l_{\gamma}$.
  
Figure 10.10: Clone pair overlap score

\fbox{\epsfig{figure=lec10_fig/bio10-8.eps,width=13cm}}




The relative position of Ca,Cb and $C'_{\gamma}$ is shown in figure 10.10. We first calculate the probability $Pr(\overrightarrow{B_{a}} ,
\overrightarrow{B_{b}} \vert l_{\gamma} = t)$. Let $C'_{a} = C_{a}
\backslash C_{b}$, $C'_{b} = C_{b} \backslash C_{a}$, and let Ai,j be the number of occurrences of probe j in Ci. We can thus write the following equation:

\begin{eqnarray*}Pr(B_{a,j},B_{b,j} \vert l_{\gamma} = t) &= &
\sum_{K'_{a}}\s...
...
& & \cdot Pr(A'_{\gamma,j} = K'_{\gamma}\vert l_{\gamma} = t)
\end{eqnarray*}


The calculation of the probabilities inside the summation is straightforward using the statistical model. Since hybridization is a virtual certainty if a probe occurs many times inside a clone, we can limit the summation to small values of k (say $0
\leq k \leq 5$), thereby making feasible the score's computation while introducing only a negligible error. Considering each probe as an independent source of information, the conditional probability of the vector pair $(\overrightarrow{B_{a}} ,
\overrightarrow{B_{b}} )$ is:

\begin{displaymath}Pr(\overrightarrow{B_{a}} ,\overrightarrow{B_{b}} \vert l_{\g...
...t)
= \prod_{j=1}^{n} Pr(B_{aj},B_{bj} \vert l_{\gamma} = t)
\end{displaymath} (10.15)

Assuming uniform parameters for the probes, the expression $Pr(B_{a,j},B_{b,j} \vert l_{\gamma} = t)$ inside the product is independent of j. Therefore, we can define Px,yt by Px,yt = Pr(Ba,j = x, Bb,j = y | t). Denoting by Sx,y(a,b) the set of probes $1 \leq j \leq n$, such that Ba,j = x and Bb,j = y, we can write:

\begin{displaymath}Pr(\overrightarrow{B_{a}} ,\overrightarrow{B_{b}} \vert t)
...
...rod_{x=0}^{1}\prod_{y=0}^{1}P_{x,y}^{\vert S_{x,y}(a,b)\vert}
\end{displaymath} (10.16)

Having computed $Pr(\overrightarrow{B_{a}} ,\overrightarrow{B_{b}}
\vert t)$ we can use Bayes Theorem:

\begin{displaymath}Pr(l_{\gamma} = t_{0} \vert
\overrightarrow{B_{a}} ,\overri...
...errightarrow{B_{b}} \vert l_{\gamma} = t)Pr(l_{\gamma}
=t)}
\end{displaymath} (10.17)


next up previous
Next: Problem Statement Up: Constructing Physical Maps from Previous: The Statistical Model
Itshack Pe`er
1999-03-21