Luckily, in agreement with evolutionary principles, scientific research to date has shown that all genes share common elements. For many genetic elements, it has been possible to construct consensus sequences, those sequences best representing the norm for a given class of organisms (e.g, bacteria, eukaroytes). Common genetic elements include promoters, enhancers, polyadenylation signal sequences and protein binding sites. These elements have also been further characterized into further subelements.
Genetic elements share common sequences, and it is this fact that allows mathematical algorithms to be applied to the analysis of sequence data. A computer program for finding genes will contain at least the following elements.
Elements of a Gene-seeking Computer Program | |
| Algorithms for pattern recognition | Probability formulae are used to determine if two sequences are statistically similar. |
| Data Tables | These tables contain information on consensus sequences for various genetic elements. More information enables a better analysis. |
| Taxonomic Differences | Consensus sequences vary between different taxonomic classes of organisms. Inclusion of these differences in an analysis speeds processing and minimizes error. |
| Analysis rules | These programming instructions define how algorithms are applied. They define the degree of similarity accepted and whether entire sequences and/or fragments thereof will be considered in the analysis. A good program design enables users to adjust these variables. |
[About BioTech | Home | BioTech Dictionary | BioTech Resources | BioTech Search | BioTech Feedback | Tour ]