|
|
What is Genome analysis ?
Segments of genome called genes determine the sequence of amino
acids in proteins. The mechanism is simple for the prokaryotic
cell where all the genes are converted into the corresponding
mRNA (messenger ribonucleic acid) and then into proteins.
The process is more complex for eukaryotic cells where rather than
full DNA sequence, some parts of genes called exons are expressed
in the form of mRNA interrupted at places by random DNA sequences
called introns. Of the several questions posed here, one is that how
some parts of the genome are expressed as proteins and yet other parts
(introns as well as intergenic regions) are not expressed.
Genome analysis problem entails the prediction of genes in uncharacterized genomic sequences.
The 21st century has seen the announcement of the draft version of the human genome sequence.
Model organisms have been sequenced in both the plant and animal kingdoms. As we begin the new millennium,
the major goal of molecular biology is to obtain the complete sequences of as many genomes as possible.
A comparison of the genome sizes of different organisms (Table 1) raises questions like what types of
genetic modifications are responsible for the four times large genome size of wheat plant and seven
times small size of the rice plant as compared to that of humans. Mice and humans contain roughly
the same number of genes . about 28K protein coding regions. The chimp and human genomes vary by
an average of just 2% i.e. just about 160 enzymes.
Table 1: Genome size of some organisms
Organism
| Genome Size (Mb)
Mb=Mega base
|
Eschericia coli
| 4.64 |
M tuberculosis
| 4.4 |
H.Influenza
| 1.83 |
Homo sapiens
| 3300 |
Mouse
| 3000 |
Rice
| 430 |
Wheat
| 13500 |
|
|
|