Thermophiles as Extraterrestrial Models
Introduction
The National Center for Biotechnology Inforation (NCBI) Microbial Genome Project Database uses five terms to categorize the temperature range an organism grows at, where cryophilic refers to –30° to –2°C, psychrophilic refers to –1° to +10°C, mesophilic refers to +11° to +45°C, thermophilic refers to +46° to 75°C, and hyperthermophilic refers to above +75°C.
High temperatures can often denature enzymes and proteins that are vital to an organisms survival. In terms of proteins, heating affects the secondary structure of proteins, causing changes in the shape of the molecule. Specifically, heat disrupts hydrogen bonds and non-polar hydrophobic interactions. This occurs because heat increases the kinetic energy and cause the molecules to vibrate so rapidly and violently that the bonds are disrupted. Changing a proteins shape can alter its natural function which can disrupt vital processes like metabolism. In terms of DNA, the denaturation of nucleic acids is the separation of a double strand into two single strangs, which occurs when the hydrogen bonds between the strands are broken. Denaturation of an organisms genetic material renders it unfit and highly susceptible to malicious mutations. Unlike most organisms, thermophiles can survive and thrive at very high temperatures. They found in geothermally heated regions of the Earth like deep-sea hydrothermal vents and the hot springs of Yellowstone National Park(Figure 1). The investigation of thermophilic structure and chemistry poses very promising and intriguing contributions to the scientific community. For one, some of the enzymes used in molecular biology, like DNA polymerases, have derived from investigating heat-stable enzymes. In addition, astrobiologists look to understand the structural and genomic correlates of hyoerthermostability in order to give indication to what life may look like on planets hotter than ours.
Research has recently begun to investigate physical adaptations that allow thermophiles to remain functional and alive at high temperatures. Over the last 20 years, researcher have begun to discover key structural components of DNA and proteins that allow a thermophilic lifestyle. First, increasing the number of salt bridges is a driving force for enhancement of the thermotolerance of proteins from hyperthermophilic microorganisms. Second, research suggests that the replacement of polar noncharged resides by charged ones constitutes a major stabilization mechanisms in the proteins of hyperthermophilic organisms. Third, thermophilic protein sequences are more likely than their mesophilic homologs to have deletions in exposed loop regions. Lastly, the guanine-cytosine (GC) content levels of the coding/non-coding regions of certain genes are highly likely to be correlated with the temperature range conditions of prokaryotic organisms.
Salt Bridges
The optimization of electrostatic interactions by increasing the number of salt bridges is a driving force for enhancement of the thermotolerance of proteins from hyperthermophilic microorganisms. 2
This trend is less evident in thermophilic organisms and absent from mesophile-derived proteins. A salt bridge is a combination of two noncovalent interactions, hydrogen bonding and electrostatic interaction. Salt bridges often occur between groups distant in the protein sequence and form cross-links that stabilize tertiary structure. This interaction can increase the kinetic barrier towards thermal inactivation or thermal unfolding and, thus, prevents proteins from denaturing at high temperatures.
Table 1 shows the number of salt bridges in select thermo- and hyperthermophilic organisms. Ns indicates the number of salt bridges, Nr represents the number of salt bridges statistically expected for that protein structure, and Topt represents the temperature of optimal growth for the protein. Proteins from hyperthermophilic organisms are characterized by an increased number of ion pairs with respect to the statistical expectance and/or the number of ion-pairs in their mesophilic counterparts. This finding suggests that electrostatic interactions are a principal factor responsible for the elevation of the melting temperature of proteins from hyperthermophilic organisms.
The figure and table above show that charged residues in the enzyme from Aquifex aeolicus replace most of the polar residues in the Bacillus subtilis enzyme.
Specifically, the number of ion pairs in the protein from Aquifex is increased by >90%. Melting point assessments are often employed in thermostability studies in order to examine the effect of structural changes. In this case, the corresponding change in melting temperature when B. subtilis to A. aeolicus was about 27°C. 2
Molecular dynamics calculations on a prototypical ion-pair model system have suggested that a sizeable energy barrier exists for the solvation of a salt bridge and that the height of this barrier increases with temperature. Interestingly, a similar barrier is not seen with isosteric hydrophobic groups. Thus, the heightened kinetic barrier towards protein unfolding is one of the mechanisms thermophiles can employ to stabilize their proteins and prevent them from denaturing.
Polar Charged Residues
The replacement of polar noncharged residues by charged ones constitutes a major stabilization mechanism in the proteins of hyperthermophilic organisms. Residue changes allow the stabilization of proteins through ion bonds. A stronger intramolecular interaction between the protein and itself decreases the effect that an intermolecular force such as heat. In other word, more heat is required to denature the protein if there are stronger intramolecular interactions taking place. The proteome analysis in amino acid classes indicated that hyperthermophilicity is characterized by a sharp increase of charged residues, Lys and Glu, at the expense of polar noncharged residues, mainly Gln (Figure 3). Figure 3A is a plot of the sum of the percentages of charged amino acids (Lys, Arg, Asp, Glu; CHA, blue), polar noncharged amino acids (Asn, Gln, Ser, Thr; POL, green), and of the difference of the two values (CH-POL, red). The mesophiles and hyperthermophiles are identified by MESO and HYPER, respectively. Figure 3B is a plot of the percentages of the various amino acids in mesophiles (blue) and hyperthermophiles (red). Figure 3C is a plot of the sum of percentages of the various amino acid classes in mesophiles (blue) and hyperthermophiles (red). A threshold is observed around 10% for extremeophiles Ch-Po values, while a higher limit of 5% characterizes mesophiles. 1Cambillau and Claverie suggest from this data that the difference between charged and polar noncharged amino acids (Ch-Po) is the best indicator of an organism’s lifestyle.1
The charged polar amino acid content is a genomic signature that can give strong indications to an organisms growth environment. The advantage of this characteristic comes from the increased stability of coulombic interactions with temperature. In adition to this, in most structures of hyperthermophilic proteins, the existence of long chains of ion pairs provide cooperative stabilization. 3 Excitingly, this finding implies the existence of global structural features associated with hyperthermostability common to thermophilic Bacteria and Achaea.
Loop Deletions
Sequence alignments to proteins of known structure indicate that thermophilic sequences are more likely than their mesophilic homologs to have deletions in exposed loop regions. Loops were identified as the intervening regions between transmembrane segments.3 This finding indicates a general evolutionary strategy for increasing thermostability and is thought to be a mechanism for reducing unfolded state entropy. By employing loop deletions as a mechanism for protein stability, an organisms should be able to withstand higher temperatures without protein denaturation4
Figure 4 demonstrates that thermophilic sequences have an increased propensity for deletions in exposed loops.3 In the study conducted by Mandrich et al., three types of secondary structure, helix, strand, and loop, were considered. The figure depicts the structural propensities for gaps found in alignments between proteins of known structure and their mesophilic or thermophilic homologs. Propensities less than 0 for a given structure type indicate that fewer gaps are associate with that structure type than random expectation while higher propensities indicate that more gaps are associated. Here, the propensities were averaged over homologs for each protein of known structure. To add to the argument that thermophiles employ loop deletions for protein stabilization, the opposite effect is observed with organisms that live in cold temperatures. Cold-adapted organisms have been observed to have insertions in exposed loop regions in psychrophilic proteins Davail et al 1994. This figure suggests that exposed loops of thermophiles are the only structura elements having significantly greater gaps compared to mesophiles.3
Deletion of the exposed loop residues would decrease the unfolding entropy while having minimal impact on the enthalpy of unfolding and will cause protein stabilization as a result. A full thermodynamic argument for this conclusion can be found in Thomson and Eisenberg 1999.
GC Content
Since the GC pair is bound by three hydrogen bonds while the adenine-thymine (AT) pair is bound by two hydrogen bonds, it is expected that organisms growing at higher temperature would have a higher proportion of GC than AT pairs. Hao and Wu found the GC content levels of the coding/non-coding regions of certain genes are highly likely to be correlated with the temperature range conditions of prokaryotic organisms. Four genes were consistently identified as correlated with the temperature range condition: K01251 (adenosylhomocysteinase), K03724 (DNA repair and recombination proteins), K07588 (LAO/AO transport system kinase), and K09122 (hypothetical protein). When these four genomic regions were used to predict the temperature range condition of an organism, the prediction accuracy was 84.52% for complete genomes, 84.09% for the in-progress genomes, and 82.70% for the metagenomes. Considering that these four genes only account for less than 1% of all the 413 genomic regions potentially correlated with the temperature range condition but can to a great extent retain the prediction accuracy, we may interpret these four genomic regions as the core of the temperature range-correlated.
Conclusion
The employment of structural salt bridges, loop deletions, polar charged residues, and heightened GC content have all been observed to heighten protein or DNA stability under high temperatures. The increased intramolecular forces, like ion and hydrogen bonds, of thermophilic structures are keystone to their thermostability. Astrobiologist can use these structural motifs to hypothesize and investigate possible extraterrestrial life forms. Perhaps planets that have a similar hot environment, like those of deep-sea hydrothermal vents or hot springs, can harbor organisms similar to the thermophiles we see on Earth. Although it is important to take into account that extraterrestrial life may not employ the same mechanisms of genetic material, metabolic processes, or protein structure, the study of thermophilic organisms opens a window into the amazing mechanisms employed in order to live in extreme environments.
References
Edited by (Paloma Medina), a student of Nora Sullivan in BIOL187S (Microbial Life) in The Keck Science Department of the Claremont Colleges Spring 2013.