We will discuss the following problem:
The process of solving this problem is called inferring the phylogeny. The input is usually given as an matrix M, where Mi,j represents the value of the jth character of the ith species. The state (value), of each character is taken from a known set Aj.
The input may also include other relevant parameters - e.g., the distribution of changes (mutations) in each character, weights representing relative importance of characters, etc. The goal will be to maximize some score over the possible phylogenetic trees and produce the best one.
We will make the following assumptions in attempting to infer phylogenies: