Optimal features in the genetic code
http://reasonandscience.heavenforum.org/t1501-optimal-features-in-the-genetic-code
Information and function in a biological system
Literature from those who posture in favor of creation abounds with examples of the tremendous odds against chance producing a meaningful code. For instance, the estimated number of elementary particles in the universe is 10^80. The most rapid events occur at an amazing 10^45 per second. Thirty billion years contains only 10^18 seconds. By totaling those, we find that the maximum elementary particle events in 30 billion years could only be 10^143. Yet, the simplest known free-living organism, Mycoplasma genitalium, has 470 genes that code for 470 proteins that average 347 amino acids in length. The odds against just one specified protein of that length are 1:10^451.
http://www.doesgodexist.org/NovDec09/Information-Function.html
If amino acids were randomly assigned to triplet codons, then there would be 1.5 x 10^84 possible genetic codes to choose from
http://en.wikipedia.org/wiki/Genetic_code
Origin and evolution of the genetic code: the universal enigma
In our opinion, despite extensive and, in many cases, elaborate attempts to model code optimization, ingenious theorizing along the lines of the coevolution theory, and considerable experimentation, very little definitive progress has been made. Summarizing the state of the art in the study of the code evolution, we cannot escape considerable skepticism. It seems that the two-pronged fundamental question: “why is the genetic code the way it is and how did it come to be?”, that was asked over 50 years ago, at the dawn of molecular biology, might remain pertinent even in another 50 years. Our consolation is that we cannot think of a more fundamental problem in biology.
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3293468/
The genetic code is one in a million
if we employ weightings to allow for biases in translation, then only 1 in every million random alternative codes generated is more efficient than the natural code. We thus conclude not only that the natural genetic code is extremely efficient at minimizing the effects of errors, but also that its structure reflects biases in these errors, as might be expected were the code the product of selection.
http://www.ncbi.nlm.nih.gov/pubmed/9732450
The genetic code is nearly optimal for allowing additional information within protein-coding sequences
DNA sequences that code for proteins need to convey, in addition to the protein-coding information, several different signals at the same time. These “parallel codes” include binding sequences for regulatory and structural proteins, signals for splicing, and RNA secondary structure. Here, we show that the universal genetic code can efficiently carry arbitrary parallel codes much better than the vast majority of other possible genetic codes. This property is related to the identity of the stop codons. We find that the ability to support parallel codes is strongly tied to another useful property of the genetic code—minimization of the effects of frame-shift translation errors. Whereas many of the known regulatory codes reside in nontranslated regions of the genome, the present findings suggest that protein-coding regions can readily carry abundant additional information.
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1832087/?report=classic
Determination of the Core of a Minimal Bacterial Gene Set
Based on the conjoint analysis of several computational and experimental strategies designed to define the minimal set of protein-coding genes that are necessary to maintain a functional bacterial cell, we propose a minimal gene set composed of 206 genes ( which code for 13 protein complexes ) Such a gene set will be able to sustain the main vital functions of a hypothetical simplest bacterial cell with the following features. These protein complexes could not emerge through evolution ( muations and natural selection ) , because evolution depends on the dna replication, which requires precisely these original genes and proteins ( chicken and egg prolem ). So the only mechanism left is chance, and physical necessity.
http://mmbr.asm.org/content/68/3/518.full.pdf
On the origin of the translation system and the genetic code in the RNA world by means of natural selection, exaptation, and subfunctionalization
The origin of the translation system is, arguably, the central and the hardest problem in the study of the origin of life, and one of the hardest in all evolutionary biology. The problem has a clear catch-22 aspect: high translation fidelity hardly can be achieved without a complex, highly evolved set of RNAs and proteins but an elaborate protein machinery could not evolve without an accurate translation system. The origin of the genetic code and whether it evolved on the basis of a stereochemical correspondence between amino acids and their cognate codons (or anticodons), through selectional optimization of the code vocabulary, as a "frozen accident" or via a combination of all these routes is another wide open problem despite extensive theoretical and experimental studies.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1894784/
However, the genetic code used by all known forms of life is nearly universal with few minor variations.
http://www.arn.org/blogs/index.php/literature/2007/03/01/optimality_features_in_the_genetic_code
"The genetic code is the mapping of 64 three-letter codons to 20 amino-acids and a stop signal." It stands out among possible competing codes in several ways. "First, the assignment of amino acids to codons appears to be optimal for minimizing the effect of translational misread errors." Errors in misreading a codon tend to have minimal effects on the translated protein. "Second, amino acids with simple chemical structure tend to have more codons assigned to them", as "they are required more often in protein assembly". But the researchers main interest in the paper below was the ability of the genetic code to carry parallel messages. The list is already impressive (and there is no reason why it should not be extended with research) - binding sequences of regulatory proteins that bind within coding regions, splicing signals that include specific 6-8 bp sequences within coding regions and mRNA secondary structure signals - all higher-order codes that ride over the protein forming code. "They found that the real genetic code could accommodate more arbitrary motifs in coding sequence than almost any of the other possibilities - it has a higher information content. One reason for the real genetic code's superiority is the fact that its stop codons, when frame-shifted, tend to form common codons, whereas in other codes frame-shifted stop codons form rarer codons or even other stop codons."
The authors appeal to selection to explain why the genetic code is optimal. The implication of this approach is that the selection had to take place before the Last Common Ancestor emerged on Earth. All this complexity had to be fine tuned in a single celled organism that predated all subsequent diversity. An information-based approach linked to Intelligent Agency deserves a fair hearing when seeking an explanation for optimal design.
The genetic code is nearly optimal for allowing additional information within protein-coding sequences
Shalev Itzkovitz and Uri Alon
Genome Research, 2007 17: 405-412. doi 10.1101/gr.5987307(Open Access)
DNA sequences that code for proteins need to convey, in addition to the protein-coding information, several different signals at the same time. These "parallel codes" include binding sequences for regulatory and structural proteins, signals for splicing, and RNA secondary structure. Here, we show that the universal genetic code can efficiently carry arbitrary parallel codes much better than the vast majority of other possible genetic codes. This property is related to the identity of the stop codons. We find that the ability to support parallel codes is strongly tied to another useful property of the genetic codeâ€â€minimization of the effects of frame-shift translation errors. Whereas many of the known regulatory codes reside in nontranslated regions of the genome, the present findings suggest that protein-coding regions can readily carry abundant additional information.
See also:
Goymer, P., Evolution: The genetic code sees off rivals, Nature Reviews Genetics 8, 168-169 (March 2007) | doi:10.1038/nrg2076
There are many possible three-letter genetic codes that could adequately encode protein sequences, but what about the need to encode higher-order information on binding and splicing sites? New research shows that the actual genetic code is better than potential alternatives at encoding such information at the same time as encoding protein.
Evolution and multilevel optimization of the genetic code
Tobias Bollenbach, Kalin Vetsigian, and Roy Kishony
Genome Research, 2007 17: 401-404. doi 10.1101/gr.6144007
Abstract: The discovery of the genetic code was one of the most important advances of modern biology. But there is more to a DNA code than protein sequence; DNA carries signals for splicing, localization, folding, and regulation that are often embedded within the protein-coding sequence. In this issue, Itzkovitz and Alon show that the specific 64-to-20 mapping found in the genetic code may have been optimized for permitting protein-coding regions to carry this extra information and suggest that this property may have evolved as a side benefit of selection to minimize the negative effects of frameshift errors.
Last paragraph: "As we learn more about the functions of the genetic code, it becomes ever clearer that the degeneracy in the genetic code is not exploited in such a way as to optimize one function, but rather to optimize a combination of several different functions simultaneously. Looking deeper into the structure of the code, we wonder what other remarkable properties it may bear. While our understanding of the genetic code has increased substantially over the last decades, it seems that exciting discoveries are waiting to be made."
http://reasonandscience.heavenforum.org/t1501-optimal-features-in-the-genetic-code
Information and function in a biological system
Literature from those who posture in favor of creation abounds with examples of the tremendous odds against chance producing a meaningful code. For instance, the estimated number of elementary particles in the universe is 10^80. The most rapid events occur at an amazing 10^45 per second. Thirty billion years contains only 10^18 seconds. By totaling those, we find that the maximum elementary particle events in 30 billion years could only be 10^143. Yet, the simplest known free-living organism, Mycoplasma genitalium, has 470 genes that code for 470 proteins that average 347 amino acids in length. The odds against just one specified protein of that length are 1:10^451.
http://www.doesgodexist.org/NovDec09/Information-Function.html
If amino acids were randomly assigned to triplet codons, then there would be 1.5 x 10^84 possible genetic codes to choose from
http://en.wikipedia.org/wiki/Genetic_code
Origin and evolution of the genetic code: the universal enigma
In our opinion, despite extensive and, in many cases, elaborate attempts to model code optimization, ingenious theorizing along the lines of the coevolution theory, and considerable experimentation, very little definitive progress has been made. Summarizing the state of the art in the study of the code evolution, we cannot escape considerable skepticism. It seems that the two-pronged fundamental question: “why is the genetic code the way it is and how did it come to be?”, that was asked over 50 years ago, at the dawn of molecular biology, might remain pertinent even in another 50 years. Our consolation is that we cannot think of a more fundamental problem in biology.
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3293468/
The genetic code is one in a million
if we employ weightings to allow for biases in translation, then only 1 in every million random alternative codes generated is more efficient than the natural code. We thus conclude not only that the natural genetic code is extremely efficient at minimizing the effects of errors, but also that its structure reflects biases in these errors, as might be expected were the code the product of selection.
http://www.ncbi.nlm.nih.gov/pubmed/9732450
The genetic code is nearly optimal for allowing additional information within protein-coding sequences
DNA sequences that code for proteins need to convey, in addition to the protein-coding information, several different signals at the same time. These “parallel codes” include binding sequences for regulatory and structural proteins, signals for splicing, and RNA secondary structure. Here, we show that the universal genetic code can efficiently carry arbitrary parallel codes much better than the vast majority of other possible genetic codes. This property is related to the identity of the stop codons. We find that the ability to support parallel codes is strongly tied to another useful property of the genetic code—minimization of the effects of frame-shift translation errors. Whereas many of the known regulatory codes reside in nontranslated regions of the genome, the present findings suggest that protein-coding regions can readily carry abundant additional information.
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1832087/?report=classic
Determination of the Core of a Minimal Bacterial Gene Set
Based on the conjoint analysis of several computational and experimental strategies designed to define the minimal set of protein-coding genes that are necessary to maintain a functional bacterial cell, we propose a minimal gene set composed of 206 genes ( which code for 13 protein complexes ) Such a gene set will be able to sustain the main vital functions of a hypothetical simplest bacterial cell with the following features. These protein complexes could not emerge through evolution ( muations and natural selection ) , because evolution depends on the dna replication, which requires precisely these original genes and proteins ( chicken and egg prolem ). So the only mechanism left is chance, and physical necessity.
http://mmbr.asm.org/content/68/3/518.full.pdf
On the origin of the translation system and the genetic code in the RNA world by means of natural selection, exaptation, and subfunctionalization
The origin of the translation system is, arguably, the central and the hardest problem in the study of the origin of life, and one of the hardest in all evolutionary biology. The problem has a clear catch-22 aspect: high translation fidelity hardly can be achieved without a complex, highly evolved set of RNAs and proteins but an elaborate protein machinery could not evolve without an accurate translation system. The origin of the genetic code and whether it evolved on the basis of a stereochemical correspondence between amino acids and their cognate codons (or anticodons), through selectional optimization of the code vocabulary, as a "frozen accident" or via a combination of all these routes is another wide open problem despite extensive theoretical and experimental studies.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1894784/
However, the genetic code used by all known forms of life is nearly universal with few minor variations.
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1832087/?report=classic
Abstract
DNA sequences that code for proteins need to convey, in addition to the protein-coding information, several different signals at the same time. These “parallel codes” include binding sequences for regulatory and structural proteins, signals for splicing, and RNA secondary structure. Here, we show that the universal genetic code can efficiently carry arbitrary parallel codes much better than the vast majority of other possible genetic codes. This property is related to the identity of the stop codons. We find that the ability to support parallel codes is strongly tied to another useful property of the genetic code—minimization of the effects of frame-shift translation errors. Whereas many of the known regulatory codes reside in nontranslated regions of the genome, the present findings suggest that protein-coding regions can readily carry abundant additional information.
http://www.arn.org/blogs/index.php/literature/2007/03/01/optimality_features_in_the_genetic_code
"The genetic code is the mapping of 64 three-letter codons to 20 amino-acids and a stop signal." It stands out among possible competing codes in several ways. "First, the assignment of amino acids to codons appears to be optimal for minimizing the effect of translational misread errors." Errors in misreading a codon tend to have minimal effects on the translated protein. "Second, amino acids with simple chemical structure tend to have more codons assigned to them", as "they are required more often in protein assembly". But the researchers main interest in the paper below was the ability of the genetic code to carry parallel messages. The list is already impressive (and there is no reason why it should not be extended with research) - binding sequences of regulatory proteins that bind within coding regions, splicing signals that include specific 6-8 bp sequences within coding regions and mRNA secondary structure signals - all higher-order codes that ride over the protein forming code. "They found that the real genetic code could accommodate more arbitrary motifs in coding sequence than almost any of the other possibilities - it has a higher information content. One reason for the real genetic code's superiority is the fact that its stop codons, when frame-shifted, tend to form common codons, whereas in other codes frame-shifted stop codons form rarer codons or even other stop codons."
The authors appeal to selection to explain why the genetic code is optimal. The implication of this approach is that the selection had to take place before the Last Common Ancestor emerged on Earth. All this complexity had to be fine tuned in a single celled organism that predated all subsequent diversity. An information-based approach linked to Intelligent Agency deserves a fair hearing when seeking an explanation for optimal design.
The genetic code is nearly optimal for allowing additional information within protein-coding sequences
Shalev Itzkovitz and Uri Alon
Genome Research, 2007 17: 405-412. doi 10.1101/gr.5987307(Open Access)
DNA sequences that code for proteins need to convey, in addition to the protein-coding information, several different signals at the same time. These "parallel codes" include binding sequences for regulatory and structural proteins, signals for splicing, and RNA secondary structure. Here, we show that the universal genetic code can efficiently carry arbitrary parallel codes much better than the vast majority of other possible genetic codes. This property is related to the identity of the stop codons. We find that the ability to support parallel codes is strongly tied to another useful property of the genetic codeâ€â€minimization of the effects of frame-shift translation errors. Whereas many of the known regulatory codes reside in nontranslated regions of the genome, the present findings suggest that protein-coding regions can readily carry abundant additional information.
See also:
Goymer, P., Evolution: The genetic code sees off rivals, Nature Reviews Genetics 8, 168-169 (March 2007) | doi:10.1038/nrg2076
There are many possible three-letter genetic codes that could adequately encode protein sequences, but what about the need to encode higher-order information on binding and splicing sites? New research shows that the actual genetic code is better than potential alternatives at encoding such information at the same time as encoding protein.
Evolution and multilevel optimization of the genetic code
Tobias Bollenbach, Kalin Vetsigian, and Roy Kishony
Genome Research, 2007 17: 401-404. doi 10.1101/gr.6144007
Abstract: The discovery of the genetic code was one of the most important advances of modern biology. But there is more to a DNA code than protein sequence; DNA carries signals for splicing, localization, folding, and regulation that are often embedded within the protein-coding sequence. In this issue, Itzkovitz and Alon show that the specific 64-to-20 mapping found in the genetic code may have been optimized for permitting protein-coding regions to carry this extra information and suggest that this property may have evolved as a side benefit of selection to minimize the negative effects of frameshift errors.
Last paragraph: "As we learn more about the functions of the genetic code, it becomes ever clearer that the degeneracy in the genetic code is not exploited in such a way as to optimize one function, but rather to optimize a combination of several different functions simultaneously. Looking deeper into the structure of the code, we wonder what other remarkable properties it may bear. While our understanding of the genetic code has increased substantially over the last decades, it seems that exciting discoveries are waiting to be made."
Last edited by Admin on Wed Jan 25, 2017 1:56 pm; edited 8 times in total