A non decreasing function
where
N(t) = number of events until time t
N(0) = 0
The number of events in disjoint intervals are independent
(10.2)
Distribution of the number of events in an interval is stationary, i.e. depends
only on the length of the interval. The expected number of events
in an interval of length t is given by
.
Denote:
Tn = time between event n-1 and event n
S0 = 0
Recall that inter-event times in a Poisson process are i.i.d.
random variables, exponentially distributed with parameter
,
i.e.,
(10.3)
If it is known that
events occurred in a Poisson
process until time t, then the inter-arrival times
are distributed uniformly and
independently in [0,t].
Assume clone length L, genome length G, and choose N clones
at random. What is the expected fraction of the genome covered by
clones?
For a random point b, and an arbitrary clone C the probability
of the point b being included in the clone c is given by:
(10.4)
and therefore, the probability of b being out of all the clones
is given by:
(10.5)
with the last approximation being valid when
and .
is said
to be the redundancy of the clone set.
is given by
(10.6)
where R is the redundancy of the clone set
Table 10.2 shows that using
redundancy factor of 2 to 5 gives a good coverage of the genome
segment considered.
Table 10.1:
Coverage of genome segment depending on redundancy factor
R
Coverage
1
0.63
2
0.865
3
0.95
4
0.98
5
0.993
Define the length by setting clone length = 1, and denote N =
number of clones, R = redundancy factor, and assume that the
clone starting positions follow a Poisson process with rate
.
We define a minimal overlap factor
between
clones to identify overlap.
A set of clones covering a continuous segment of the genome,
together with their physical distances is called a contig.
Contigs are sometimes referred to as islands.
The expected number of apparent islands is given by
(10.7)
2.
The expected number of apparent islands with exactly
clones is given by
(10.8)
3.
The expected number of clones in an apparent island is given
by
(10.9)
4.
The expected length of an apparent island is
(10.10)
Proof:We will prove the first item of the theorem. In order to prove the
formula for expected number of apparent islands, we define J(x)
as follows:
Since [b-1,a] is of length 1-x, J(x) can be computed from
the redundancy factor R as follows:
(10.11)
The number of islands is the number of times leaving a clone
without detecting an overlap. Let Ec denote the event of a
clone c being the right-hand clone of an island. If the
right-hand side of the island is at a point t, we require that
t and
are not covered by a common clone (other than
c) .
The probability of such an event Ec is given by:
(10.12)
and the expected number of apparent islands is therefore given by: