next up previous
Next: The Chooser Up: Interactive Inference and Experimental Previous: Interactive Inference and Experimental

The Predictor

The predictor is a method for inferring Boolean networks using the expression data given by the matrix E. We seek a Boolean function fn independently for each node an. To this end, we first pick the input variables to fn: we determine a minimum set sn of nodes, whose levels must be input to fn, in order for sn to explain the observed data E. Then, we construct a truth table using these nodes as inputs. Specifically, the function for node an is determined according to the following procedure:
1.
Build sets Sij of nodes with different values in rows i and j
Consider all pairs of rows (i, j) in E in which the expression level of an differs, excluding rows in which an was itself forced to a high or low value. For each such pair, find the set Sij of all other nodes whose expression level also differs between the two rows (i, j). Because the network is self-contained, a change in at least one of these genes or stimuli must have caused the corresponding difference in an. Therefore, at least one node in this set must be included as a variable in fn.
2.
Find a minimum cover set Smin of $\{S_{ij}\}$
Identify the smallest set of nodes Smin required to explain the observed differences over all pairs of rows (i, j), i.e., Smin is such that at least one of its nodes is present in each set Sij. This task is a classic combinatorial problem called minimum set covering which can be solved by the branch and bound technique. More than one smallest set Smin may be found, in which case a distinct function fn is inferred and reported for each such set.
3.
Determine truth table of an from Smin and E
Once Smin has been determined for the node an, a truth table is determined for fn in terms of the levels of genes and/or stimuli in Smin by taking relevant levels directly from E. If all combinations of input levels are not present in E, the corresponding output level for gene an cannot be determined and is represented by the symbol "*" in the truth table.
If a node has more than one minimum cover set, several networks are inferred, each with a distinct function corresponding to each set. If several such nodes exist, a separate network hypothesis is returned for each combination of functions at each node. The minimum set cover ensures that only the most parsimonious networks will be returned.
next up previous
Next: The Chooser Up: Interactive Inference and Experimental Previous: Interactive Inference and Experimental
Peer Itsik
2001-03-04