Journal of Molecular Biology
Regular ArticleSatisfying Hydrogen Bonding Potential in Proteins
Abstract
We have analysed the frequency with which potential hydrogen bond donors and acceptors are satisfied in protein molecules. There are a small percentage of nitrogen or oxygen atoms that do not form hydrogen bonds with either solvent or protein atoms, when standard criteria are used. For high resolution structures 9·5% and 5·1% of buried main-chain nitrogen and oxygen atoms, respectively, fail to hydrogen bond under our standard criteria, representing 5·8% and 2·1% of all main-chain nitrogen and oxygen atoms. We find that as the resolution of the data improves, the percentages fall. If the hydrogen bond criteria are relaxed many of these unsatisfied atoms form weak hydrogen bonds. However, there remain some buried atoms (1·3% NH and 1·8% CO) that fail to hydrogen bond without any immediately obvious compensating interactions.
References (0)
Cited by (1924)
Cysteine hyperoxidation rewires communication pathways in the nucleosome and destabilizes the dyad
2024, Computational and Structural Biotechnology JournalGene activity is tightly controlled by reversible chemical modifications called epigenetic marks, which are of various types and modulate gene accessibility without affecting the DNA sequence. Despite an increasing body of evidence demonstrating the role of oxidative-type modifications of histones in gene expression regulation, there remains a complete absence of structural data at the atomistic level to understand the molecular mechanisms behind their regulatory action. Owing to μs time-scale MD simulations and protein communication networks analysis, we describe the impact of histone H3 hyperoxidation (i.e., S-sulfonylation) on the nucleosome core particle dynamics. Our results reveal the atomic-scale details of the intrinsic structural networks within the canonical histone core and their perturbation by hyperoxidation of the histone H3 C110. We show that this modification involves local rearrangements of the communication networks and destabilizes the dyad, and that one modification is enough to induce a maximal structural signature. Our results suggest that cysteine hyperoxidation in the nucleosome core particle might favor its disassembly.
Statistical analysis of sequential motifs at biologically relevant protein-protein interfaces
2024, Computational and Structural Biotechnology JournalUnderstanding protein-protein interactions (PPIs) at the molecular level may lead to innovations in medicine and biochemistry. The assumption that there are certain “hot spots” on protein surfaces that mediate their interactions with other proteins has led to a search for specific sequences involved in protein-protein contacts. In this work, we analyze sequential amino acid motifs, both at the single motif and at the motif-motif level, across a large and diverse dataset of biologically relevant protein-protein interfaces retrieved from the PDB, comparing their presence at interfaces and surfaces in a statistically rigorous manner. At the single motif level, our results indicate statistically significant over-presence of hydrophobic and in particular aromatic residues and under-presence of charged residues at protein-protein interfaces. Certain PPI-mediating motifs reported in the literature (e.g., the Tyrosine-based Motif YxxΦ and the PDZ-Binding Motif X-S/T-X-V/I) were confirmed to have a significant presence at interfaces. In addition, multiple PPI-mediating motifs were reported in the ELM database and from those present in our dataset, half were confirmed to have a statistically significant presence at interfaces whereas others were not. At the single residue, motif-motif level, Cysteine-Cysteine contacts were found to be the most abundant ones followed by interactions involving aromatic/hydrophobic residues. Top ranking, longer motif-motif pairs show predominance of Leucine and aromatic residues. Finally, preliminary energy calculations (using the MM/GBSA procedure) indicate a partial correlation between the probability of motifs-pair to be a part of a protein-protein interface and the strength of the interactions between the motifs. In conclusion, this study points to specific characteristics of motifs that have a higher probability to mediate protein-protein interactions. Prominent motifs identified in this study may be used in the future as possible components in protein engineering.
Investigating the effect of chemical structures on water sorption and diffusion in amine-cured epoxy resins by molecular dynamics simulations
2024, Computational Materials ScienceMechanical degradation of polymer materials by moisture ingress is inevitable due to their chemical and physical properties. In this regard, it is necessary to better understand the mechanisms governing interactions between water and polymer. In this research, the sorption and diffusion of moisture in epoxy systems with different chemical structures of amines (DGEBA/DDS and DGEBA/DDM) has been investigated by molecular dynamics simulations. For the validation, the physical properties of the simulation models and the calculated values of the diffusion coefficient and equilibrium water content have been compared with the existing data and acceptable results have been obtained. The study results of polymer systems showed that both the polarity and the average radius of nanopores played an effective role in diffusivity and equilibrium water content values. It was also found that the interactions between water and polymer molecules at the water-polymer interface played an important role in the initiation of the moisture sorption process. The results of the simulation during the hydrothermal aging process showed that moisture first entered the pores in the epoxy network, then diffused deeper into the polymer, and finally, with a change in the polymer structure, caused water-polymer hydrogen bonds with stronger strength.
Computational approaches for identifying disease-causing mutations in proteins
2024, Advances in Protein Chemistry and Structural BiologyAdvancements in genome sequencing have expanded the scope of investigating mutations in proteins across different diseases. Amino acid mutations in a protein alter its structure, stability and function and some of them lead to diseases. Identification of disease-causing mutations is a challenging task and it will be helpful for designing therapeutic strategies. Hence, mutation data available in the literature have been curated and stored in several databases, which have been effectively utilized for developing computational methods to identify deleterious mutations (drivers), using sequence and structure-based properties of proteins. In this chapter, we describe the contents of specific databases that have information on disease-causing and neutral mutations followed by sequence and structure-based properties. Further, characteristic features of disease-causing mutations will be discussed along with computational methods for identifying cancer hotspot residues and disease-causing mutations in proteins.
Adenosine dialdehyde, a methyltransferase inhibitor, induces colorectal cancer cells apoptosis by regulating PIMT:p53 interaction
2023, Biochemical and Biophysical Research CommunicationsPost-translational modification (PTM) is important in controlling many biological processes by changing the structure and function of a protein. Protein methylation is an important PTM, and the role of methyltransferases has been implicated in numerous cellular functions. Protein L-isoaspartyl methyltransferase (PIMT) is ubiquitously expressed in almost all organisms and govern important cellular processes including apoptosis. Among other functions, PIMT has also been identified as a potent oncogene because it destabilizes the structure of the tumor suppressor p53 via methylation at the transactivation domain. In the present study we identified that out of the three methyltransferase inhibitors tested, namely, S-adenosyl-l-homocysteine (AdoHcy), adenosine and adenosine dialdehyde (AdOx), only AdOx augments p53 expression by destabilizing PIMT structure, as evident from far-UV CD. The effect of the inhibitors, AdOx in particular, to the structure of PIMT, and the binding of PIMT to the p53 transactivation domain have been investigated by docking and molecular dynamics simulations. AdOx significantly increases p53 accumulation and nuclear translocation in colon cancer cells, triggering the p53-mediated apoptotic pathway. To better understand the molecular mechanisms underlying p53 accumulation in colon cancer cells, we observed that the level of PIMT is considerably lower in AdOx-treated cells, reducing its association with p53, which stabilized p53. p53 then transactivated BAX, increasing the BAX: BCL-2 ratio and causing colon cancer cell death.
Protein-DNA interface hotspots prediction based on fusion features of embeddings of protein language model and handcrafted features
2023, Computational Biology and ChemistryThe identification of hotspot residues at the protein-DNA binding interfaces plays a crucial role in various aspects such as drug discovery and disease treatment. Although experimental methods such as alanine scanning mutagenesis have been developed to determine the hotspot residues on protein-DNA interfaces, they are both inefficient and costly. Therefore, it is highly necessary to develop efficient and accurate computational methods for predicting hotspot residues. Several computational methods have been developed, however, they are mainly based on hand-crafted features which may not be able to represent all the information of proteins. In this regard, we propose a model called PDH-EH, which utilizes fused features of embeddings extracted from a protein language model (PLM) and handcrafted features. After we extracted the total 1141 dimensional features, we used mRMR to select the optimal feature subset. Based on the optimal feature subset, several different learning algorithms such as Random Forest, Support Vector Machine, and XGBoost were used to build the models. The cross-validation results on the training dataset show that the model built by using Random Forest achieves the highest AUROC. Further evaluation on the independent test set shows that our model outperforms the existing state-of-the-art models. Moreover, the effectiveness and interpretability of embeddings extracted from PLM were demonstrated in our analysis. The codes and datasets used in this study are available at: https://github.com/lixiangli01/PDH-EH.