Next: The CLICK Algorithm
Up: Analyzing Gene Expression Data
Previous: Analyzing Gene Expression Data
In any living cell undergoing any biological process, different subsets of its
genes are expressed in different stages of the process. The particular genes
expressed at a given stage and their relative abundance are crucial to the
cell's proper function. Measuring gene expression levels in different stages,
different body tissues, and different organisms is instrumental in
understanding biological processes. Such information can help
characterize gene/function relationships, determine effects
of experimental treatments for diseases, and increase
understanding of many other molecular biological processes.
One of the approaches to measuring gene expression profiles is hybridization
based arrays. According to this approach, a set of oligos is immobilized on a
surface to form the hybridization array. When a labeled target RNA mixture,
which was sampled in a specific condition (stage, tissue, organism, etc.), is
introduced to the array, target sequences hybridize to complementary
immobilized molecules. The resulting hybridization intensity (detected, for
example, by fluorescence) is indicative of the mixture's content and of the
relative genes expression measures in the tested condition. Different
conditions are tested, and eventually, every gene has its own profile,
i.e., vector of expression intensities, corresponding to the different
conditions.
Clustering techniques are used to identify subsets of genes that behave
similarly under the set of tested conditions. Analyzing multi-conditional gene
expresion patterns with clustering algorithms involves the following steps:
- 1.
- Measuring gene expression levels, reported as a vector of real numbers.
- 2.
- Computing a similarity matrix for the genes (e.g., correlations).
- 3.
- Clustering the genes based on their similarity to each other.
- 4.
- Visual representation of the clusters.
- 5.
- Analysis of the results.
Next: The CLICK Algorithm
Up: Analyzing Gene Expression Data
Previous: Analyzing Gene Expression Data
Peer Itsik
2001-01-31