收藏切换
Highly Accurate and Efficient Deep Learning Paradigm for Full-Atom Protein Loop Modeling with KarmaLoop
收藏切换
PDF
Tianyue Wang1, , Xujun Zhang1, , Odin Zhang1, , Guangyong Chen2, Peichen Pan1, Ercheng Wang1, 2, Jike Wang1, Jialu Wu1, Donghao Zhou3, Langcheng Wang4, Ruofan Jin1, 5, Shicheng Chen1, Chao Shen1, Yu Kang1, *, Chang-Yu Hsieh1, *, Tingjun Hou1, *
Research. Vol 7 Article ID 0408
Less
收藏切换
Research. Vol 7 Article ID 0408
Research Article
Highly Accurate and Efficient Deep Learning Paradigm for Full-Atom Protein Loop Modeling with KarmaLoop
Full
Tianyue Wang1, , Xujun Zhang1, , Odin Zhang1, , Guangyong Chen2, Peichen Pan1, Ercheng Wang1, 2, Jike Wang1, Jialu Wu1, Donghao Zhou3, Langcheng Wang4, Ruofan Jin1, 5, Shicheng Chen1, Chao Shen1, Yu Kang1, *, Chang-Yu Hsieh1, *, Tingjun Hou1, *
Affiliations
  • 1Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • 2 Zhejiang Laboratory, Hangzhou 311100, Zhejiang, China.
  • 3Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, Guangdong, China.
  • 4Department of Pathology, New York University Medical Center, New York, NY 10016, USA.
  • 5College of Life Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
Published: 2024-07-25 doi: 10.34133/research.0408
Outline
收藏切换

Protein loop modeling is a challenging yet highly nontrivial task in protein structure prediction. Despite recent progress, existing methods including knowledge-based, ab initio, hybrid, and deep learning (DL) methods fall substantially short of either atomic accuracy or computational efficiency. To overcome these limitations, we present KarmaLoop, a novel paradigm that distinguishes itself as the first DL method centered on full-atom (encompassing both backbone and side-chain heavy atoms) protein loop modeling. Our results demonstrate that KarmaLoop considerably outperforms conventional and DL-based methods of loop modeling in terms of both accuracy and efficiency, with the average RMSDs of 1.77 and 1.95 Å for the CASP13+14 and CASP15 benchmark datasets, respectively, and manifests at least 2 orders of magnitude speedup in general compared with other methods. Consequently, our comprehensive evaluations indicate that KarmaLoop provides a state-of-the-art DL solution for protein loop modeling, with the potential to hasten the advancement of protein engineering, antibody–antigen recognition, and drug design.

Tianyue Wang, Xujun Zhang, Odin Zhang, Guangyong Chen, Peichen Pan, Ercheng Wang, Jike Wang, Jialu Wu, Donghao Zhou, Langcheng Wang, Ruofan Jin, Shicheng Chen, Chao Shen, Yu Kang, Chang-Yu Hsieh, Tingjun Hou. Highly Accurate and Efficient Deep Learning Paradigm for Full-Atom Protein Loop Modeling with KarmaLoop[J]. Research, 2024 , 7 (7) : 0408 . DOI: 10.34133/research.0408
Loops are irregular segments in protein structures that often link 2 regular secondary structures such as α helices or β sheets [1,2]. These structures are typically situated near the surface of proteins [3], positioned in a way to allow them to facilitate numerous essential biological functions [4], including regulation of enzyme activity [58], protein–protein interaction (PPI) [9], and protein-ligand recognition [1012]. Given their high flexibility, loops heavily influence the overall protein dynamics [13], rendering loop prediction one of the most challenging yet indispensable tasks in protein modeling [14]. Furthermore, since over half of the experimentally determined structures in the Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) [15] contain missing fragments, and these missing regions predominantly correspond to loops [4], accurate modeling of loop structures is of high importance. For example, the prediction of antibody complementarity determining region (CDR) H3 loops is the most challenging task in antibody modeling [16], mainly due to the much higher flexibility of H3 compared with other CDR loops. To address this issue, various computational methods including traditional methods (knowledge-based, ab initio, and hybrid) and deep learning (DL) methods have been proposed. Knowledge-based methods, such as FREAD [17], LoopIng [18], SuperLooper [19], and DaReUS-Loop [20], rely on template repositories collected from pre-existing protein structures and perform modeling based on certain standards, such as structural similarity. For instance, FREAD performs database searching using the 4 rule-based filters based on anchor Cα separations and the environmentally constrained substitution score. LoopIng selects templates from the database by taking advantage of both sequence- and geometry-related features. SuperLooper provides the first online server for modeling globular and membrane protein loops, where the loop candidates are chosen by the standard of the root mean square deviation (RMSD) of the stem atoms. DaReUS-Loop uses fragments from remote or unrelated proteins to complete loop modeling and filters by sequence similarity and conformational profile. These methods are widely used in loop modeling, particularly in cases where a comparable fragment is available in the template database. However, the modeling performance of these methods for novel structures (especially for long loops) would drop sharply without proper references, suggesting that the accuracy of these methods is severely limited by the available templates [21,22].
Ab initio methods, mainly consisting of 2 components, conformational sampling and scoring, including DISGRO [1], LEAP [23], cyclic coordinate descent (CCD) [24,25], robotics-based kinematic closure (KIC) [26], and next-generation KIC (NGK) [27], offer advantages over knowledge-based methods due to their independence from templates and the ability to explore a broader conformational space. However, they need more computational resources than knowledge-based methods, and their computational cost increases exponentially with the loop length [21]. DISGRO, for instance, employs a chain growth sequential Monte Carlo sampling strategy for loop modeling and an atom-based distance-dependent empirical potential scoring function specifically designed for loops, and it has been described as an efficient computational method and particularly effective for modeling loops with 10 to 17 residues. CCD minimizes the distances between the backbone atoms of the C-terminal moving anchor and the corresponding atoms in the fixed anchor by adjusting one dihedral angle at a time. KIC utilizes a robotics-inspired local loop reconstruction method for loop modeling, achieving sub-angstrom accuracy. Among ab initio methods, NGK has been commonly regarded to be the state of the art [20,21], as it combines intensification and annealing strategies and yields a 4-fold improvement over standard KIC.
Hybrid methods, such as CODA [28] and Sphinx [22], integrate both ab initio and knowledge-based methods to improve performance [20]. CODA utilizes a combination of the knowledge-based approach FREAD and the ab initio method PETRA [29] for generating a consensus prediction that should satisfy a set of rule-based filters. Sphinx combines FREAD and an in-house ab initio method that first selects potential fragments shorter than the length of loop to be modeled. Then, the ab initio method adjusts the length of the fragments to generate accurate decoys of the desired length.
With the continual development of computing resources, DL has become increasingly popular and has demonstrated its potential in predicting protein structures. For instance, AlphaFold [30] incorporates biological and physical information and utilizes multi-sequence alignments (MSAs) to design a DL algorithm, achieves atomic-level accuracy, and ranks first in the 14th Critical Assessment of Structure Prediction (CASP) competition. Likewise, RoseTTAFold [31] utilizes a 3-track neural network that simultaneously considers protein sequences, amino acid interactions, and structures to attain accuracy close to AlphaFold [32]. However, both methods often face great challenges in accurately modeling loop regions [32,33], which is related to the fact that these methods typically do not incorporate factors such as ligand presence, the complexities of PPIs, and multiple conformations [34]. AlphaFold-Multimer [35] is developed as a useful tool in protein complex prediction and exhibits outstanding performance in antibody structure prediction [36]. More recently, AlphaFold3 [37] has shown remarkable capabilities in extending more accurate structural predictions to a broader scope that includes interactions with ligands, proteins, nucleic acids, and posttranslational modifications. Despite the need for further validation from the scientific community, these advancements in AlphaFold3 have brought great hope to this field.
The first DL method particularly for loop modeling was proposed in 2019 [38], which employs convolutional neural networks (CNNs) [39] on the distance matrix to predict the pairwise distances of loop atoms for conformation reconstruction. Recently, 2 DL architectures for loop modeling have been proposed [40], including one using a combined CNN–recursive neural network (RNN) structure (DeepMUSICS) and the other based on a refinement of histograms using a two-dimensional (2D) CNN architecture (DeepHisto). However, the source code of the aforementioned methods is not publicly available, which limits their broader applications. In the case of CDR H3 loop modeling, DeepH3 [41] was introduced using CNN to predict the interresidue distances and orientations from antibody sequences. Furthermore, DeepAb [42], the updated version of DeepH3, has demonstrated improvements with the inclusion of a pre-trained antibody sequence model and an interpretable attention mechanism to accurately predict the structures of antibodies.
So far, both traditional and DL-based methods have been unable to achieve high accuracy and efficiency simultaneously in loop prediction. Moreover, most proposed methods have focused solely on the prediction accuracy of heavy atoms in the backbone within the loop region [1,24,4347]. Nonetheless, it is crucial and challenging to predict the conformation of side chains accurately [4850], which highlights the importance of executing meticulous full-atom modeling of loops.
For the reasons above, we present a novel end-to-end DL paradigm named KarmaLoop, which is designed for accurate and efficient full-atom prediction of protein loops. Figure 1A depicts an example of a loop modeled by KarmaLoop, which starts with a random initialization of the loop coordinates and then predicts an accurate loop structure. KarmaLoop (Fig. 1B) employs graph neural network (GNN) architectures, which comprise 2 encoders [i.e., Graph Transformer (GT) [5153] and Geometric Vector Perceptrons (GVPs) [54]] for intramolecular interaction encoding, a mixture density network (MDN) block is employed to assess confidence levels in the predictions, and an E(n) equivariant GNN (EGNN) block is used to generate the loop conformation. While MDN is adept at distinguishing between high-RMSD and low-RMSD conformations, it does not display a satisfactory linear correlation with the RMSD results. The innovations are as follows: (a) better representation: by integrating atom features within their respective residues, we achieve a more holistic representation of proteins (Fig. 1C); (b) enhanced accuracy: the MDN block is trained prior to the introduction of the EGNN to better learn the distribution of distances between loop atoms and nonloop residues, thereby refining the learning process for pose generation; (c) higher efficiency: instead of relying on searching algorithms, loop conformations are updated using pseudo-forces as predicted by EGNN, thereby accelerating the modeling process.
KarmaLoop has been evaluated on multiple benchmark datasets, including the general protein loop benchmark CASP13+14 (the combination of CASP13 [55] and CASP14 [56]), the CASP15 dataset, and the antibody H3 loop benchmark (RosettaAntibody and therapeutic benchmark used in DeepAb), and filter the samples with high sequence similarity with the training dataset. In terms of the general protein loop modeling capability, KarmaLoop outperforms all the tested traditional and DL-based methods on both accuracy and efficiency, with the medium full-atom RMSDs of 1.41 and 1.31 Å, the average RMSDs of 1.77 and 1.95 Å, the success rates of 75.09% and 71.71% for single-conformation generation and 77.51% and 76.21% for generating 5 conformations, and the speeds of 0.047 s and 0.049 s per task on the CASP13+14 and CASP15 benchmark datasets, respectively. Several successful predictions of antibody CDR H3 loops are shown in Fig. 1E, where KarmaLoop achieves the average full-atom and medium RMSDs of 3.25 and 2.97 Å, respectively, compared with the baseline methods with the best performance (average and medium RMSDs of 3.49 and 3.34 Å, respectively). In protein engineering and structure-based drug design, accurately modeling protein loops under an imprecise structural environment holds tremendous potential, but precisely reconstructing loop structures within an accurate framework structure is also quite important, a focus that has been extensively explored in most previous studies. To provide a comprehensive assessment, we also conducted the task of antibody H3 modeling based on inaccurate structural frameworks predicted by other methods. When using the AlphaFold-Multimer, AlphaFold, and DeepAb modeled antibody structures as the input, KarmaLoop could further refine the conformations of H3 loops and increase the accuracy by an average of 9%. Hence, we proposed a universal protocol for precise antibody modeling by employing DeepAb to first predict an overall antibody structure, followed by KarmaLoop to refine the H3 loops. In summary, KarmaLoop has shown a remarkable superiority (over prior arts) in predicting both general loops and antibody H3 loops across most loop lengths.
As shown in Fig. 1B, KarmaLoop consists of 3 components: (a) 2 encoders, designed to comprehend representations of both residues and atoms, thereby implementing a hierarchical representation; (b) E(n) EGNNs enhanced with self-attention, aiming at updating the loop conformation by considering both intermolecular and intramolecular interactions; (c) an MDN designed for confidence evaluation and introducing the prior knowledge to facilitate prediction. To assess the ability of KarmaLoop, it is critical to make comparisons with other widely used methods in terms of prediction accuracy and efficiency. Consequently, in the subsequent sections, we conducted some experiments to evaluate the performance including the accuracy of generating conformations, the effect of loop length on accuracy, computational efficiency, the capacity for antibody H3 loop modeling, and 2 cases to prove that KarmaLoop is capable of reproducing diverse conformations according to varying ligand-binding conditions.
The accuracy of loop modeling is of utmost importance for many practical applications. To validate the capability of our model, 7 widely used methods including 3 DL-based methods (AlphaFold, RoseTTAFold, and ColabFold [57], and we used a customized template database that contains all the nonloop structures in the CASP13+14 and CASP15 benchmarks), one knowledge-based method (FREAD), and 3 ab initio methods [DISGRO, NGK, and Rosetta model_missing_loop (RML) [58]] were employed for comparison with KarmaLoop in a comprehensive benchmark. The full-atom RMSDs were calculated for each method, with the AlphaFold, ColabFold, and RoseTTAFold predicted structures being aligned before computation.
To perform quantitative comparisons with the other methods, empirical cumulative distribution function (eCDF) was selected to describe the proportion of samples at given RMSD values. We defined a task as successful if the RMSD of a given sample falls below a specified threshold. Figure 2A and B demonstrates that when the RMSD threshold was set to 2 Å, KarmaLoop exhibited the highest success rates of 75.09% and 71.71%, respectively, on the CASP13+14 and CASP15 benchmark datasets, yielding impressive improvements by 22.92% and 14.30% over the next best method (i.e., AlphaFold, 52.17%; NGK, 57.41%). Even adopting a stricter RMSD standard of 1 Å, KarmaLoop (23.87% and 29.93%) still outperformed the other methods, ranging from RoseTTAFold (1.74% and 1.07%) to NGK (19.23% and 22.26%).
As some baseline methods may generate multiple conformations for a given loop sample, KarmaLoop was further evaluated in the context of generating multiple conformations (all methods produced 5 conformations). It is important to note that, regardless of the number of conformations generated for a given task, a successful prediction is acknowledged when the lowest RMSD is no more than 2 Å. As shown in Fig. 2C, KarmaLoop achieved the best performance and improved the success rates by 11.38% and 4.26%, respectively, on the 2 benchmark datasets, compared with the best model NGK with the success rates of 66.13% and 71.95%, respectively. ColabFold and RoseTTAFold exhibited the lowest success rates of only 34.40% and 30.58%, respectively.
To directly compare the performance, the distribution of each method was plotted in Fig. 2D. KarmaLoop showed the lowest medium RMSDs on the CASP13+14 and CASP15 datasets, with 1.41 and 1.31 Å, respectively, as compared to the other methods, and showed much lower RMSDs than the second-best method NGK (1.60 Å) and AlphaFold (1.90 Å). Besides, KarmaLoop achieved the average RMSD of 1.77 Å (±1.43 Å) and 1.95 Å (±2.33 Å), holding a marked leading margin of 42.34% and 34.56% over the next best baseline method NGK on the CASP13+14 and CASP15 datasets (Fig. 2E).
We further investigated the influence of loop length on prediction accuracy. As shown in Fig. 1E, KarmaLoop exhibits the possibility to generate satisfactory conformations of loops with varying lengths. Figure 3A and B displays the RMSDs of loop length with more than one sample in the CASP13+14 and CASP15 datasets to avoid statistically inconclusive results. Our results indicated that KarmaLoop outperformed the baseline methods for the majority of length cases (9/11 and 8/12 on the CASP13+14 and CASP15 datasets) and maintained a relatively consistent performance across different loop lengths. In our analysis of over 20,000 proteins from AlphaFold pdb_mmcif dataset, we found that regions comprising no more than 10 residues, predominantly short loops, account for close to 80% of the total missing regions. This finding underscores the critical importance of accurate short loop prediction in protein modeling. Interestingly, AlphaFold and RoseTTAFold fall short in accurately predicting short loops compared with more traditional methods (Fig. 3). In contrast, Karmaloop emerges as a superior method, far surpassing nearly all other methods in short loop precision, with the RMSDs for most loops shorter than 10 residues around 2.0 Å. Besides, the observed increase in RMSD with loop length may be attributed to the fact that the training dataset predominantly consisted of short loops (residues ≤ 10), and long loops (particularly those over 20 residues) were only represented by 0.058% of all the samples (Fig. S1).
Computational efficiency is also a key factor in loop modeling. In this section, the time consumption of each method was evaluated. KarmaLoop was tested on a Tesla V100S GPU, traditional methods (FREAD, DISGRO, NGK, and RML) were run on a single core Intel Xeon Gold 6240R CPU @ 2.40 GHz, and AlphaFold, ColabFold, and RoseTTAFold were evaluated in parallel with 20 cores on the Intel Xeon Gold 6240R CPU @ 2.40 GHz and a Tesla V100S GPU. It is important to emphasize that the results presented by KarmaLoop do not include any postprocessing step, which typically requires approximately 5 s to complete a sample. Figure 4A shows the distribution of time spent for each method, illustrating the remarkable efficiency of KarmaLoop in terms of time cost on each task ranging from 0.015 to 0.14 s. Contrastingly, the time cost of DISGRO ranges from 0.773 to 51.07 s, and the other methods are considerably slower. As shown in Fig. 4B, the average time costs of KarmaLoop on the CASP13+14 and CASP15 datasets are 0.047 and 0.049 s, respectively, displaying a considerable speed advantage over traditional methods. Specifically, the minimum speed advantages of KarmaLoop over DISGRO (17.006 and 21.579 s) are 362× and 440× for CASP13+14 and CASP15, respectively, and the maximum speed advantages of KarmaLoop over RML (1172.342 s) and NGK (978.427 s) are 24,943× and 19,968×, respectively. However, it should be noted that, unlike other methods that only model loop region, AlphaFold, ColabFold, and RoseTTAFold require predicting the entire protein conformation. Although the modeling speed for these methods cannot be directly compared, ColabFold with a customized template database performs faster than AlphaFold and RoseTTAFold. While the reported speedup may vary slightly depending on the hardware used, the 2 to 4 orders of magnitude in speedup should still hold unless these baseline tools have been completely re-engineered to boost their efficiency.
The CDR H3 of antibody is widely acknowledged as the most flexible region and contributes the most to the structural diversity and binding site topography [16]. Consequently, accurate modeling of the H3 conformation is urgently needed. Sphinx, proposed by Marks et al. [22], is a hybrid method that models both general loops and antibody H3 loops with the same architecture but using different fragment databases. In contrast, KarmaLoop consistently employs the same process for H3 modeling.
In this section, we present a comparative analysis of the H3 modeling performance of KarmaLoop against 4 widely used methods (i.e., DeepAb [42], RosettaAntibody-G [59], RepertoireBuilder [60], AbodyBuilder [61], AlphaFold, and AlphaFold-Multimer) for predicting antibody structures on the antibody H3 benchmark. To evaluate the quality of the modeling results, we used the experimentally determined structures from SAbDab as the input. Figure 5A depicts the distribution of the RMSDs between the predicted samples from the baseline methods and the experimentally determined structures. The results demonstrate that KarmaLoop (2.97 and 3.25 Å) outperforms DeepAb (3.35 and 3.49 Å) in terms of the medium and average RMSDs by 0.38 and 0.24 Å, respectively. Figure 5B displays the RMSDs of samples with varying loop lengths, and KarmaLoop exhibits the lowest average RMSD for most lengths of the H3 loop samples.
Next, we assessed the ability of refinement by using the antibody structures predicted by AlphaFold-Multimer, AlphaFold, and DeepAb as the input for KarmaLoop. Notably, different from the version of the models used before, as for the models of the loops in imprecise protein structures, a refined version of the models was used (more details in the “Loop initialization and pocket selection” section). Figure 5C and D illustrates the predictions based on the structures modeled by DeepAb, which can be used to test the capability of KarmaLoop to improve structures under a suboptimal structural situation. The results depicted in Fig. 5 (C and D) demonstrate that KarmaLoop is capable of refining the loop structures with an average RMSD reduction from 3.55 to 3.32 Å, 3.73 to 3.42 Å, and 3.49 to 3.21 Å, and a medium RMSD reduction from 3.45 to 3.23 Å, 3.70 to 3.38 Å, and 3.34 to 2.95 Å for AlphaFold-Multimer, AlphaFold, and DeepAb, respectively. It is worth noting that this improvement is observed across most lengths of the H3 loops, highlighting the consistent ability of KarmaLoop to improve the H3 loop structures regardless of their length. Thus, a universal protocol can be applied for antibody prediction, whereby DeepAb models the entire antibody structures and KarmaLoop reconstructs the H3 structures.
In studying the conformational dynamics of protein loops, it is observed that these structures exhibit diverse conformations when binding with different small-molecule ligands, which is very relevant to drug design scenarios. To elucidate the predictive capabilities of KarmaLoop in capturing these multi-conformational states, we investigated the ground-truth structure of 2 well-documented protein families from PDB: the androgen receptor (AR) and Renin, each under distinct conditions and exhibiting unique loop conformations. Specifically, we examined AR in scenarios of small-molecule binding (PDB ID: 2PIX) and homodimer formation (PDB ID: 5JJM); and 2 holo forms of Renin in interactions with different small-molecule inhibitors (PDB IDs: 1BIL, 2I4Q). Figure 6 presents a comparative analysis between the conformational predictions made by KarmaLoop and the experimentally determined ground-truth conformations, along with an examination of the discrepancies between 2 sets of ground-truth conformations. In both cases, the loop regions exhibit distinct preferential conformations in complex with different guest molecules, and KarmaLoop demonstrates the possibility of reproducing diverse conformational responses across varying ligand-binding conditions.
To verify the effectiveness of KarmaLoop, several evaluations and comparisons have been conducted, including general loop modeling accuracy, loop-length impact and compute time, as well as potential applications in antibody H3 prediction. Of note, KarmaLoop achieved a substantial improvement in the average RMSD by more than 2-fold and the medium RMSD by nearly 2-fold. Regarding the success rate, the minimum enhancement of 22.92% and 14.30% in the CASP13+14 and CASP15 benchmark datasets was observed on one conformation generated by each method. Even when compared to other methods that generate multiple conformations, KarmaLoop still achieved a minimum improvement of 11.38% and 4.26%. KarmaLoop is also capable of accurately modeling loops of different lengths and obtained a majority length preponderance against the other methods. Surprisingly, KarmaLoop achieved at least a 362× speed advantage over other traditional methods. In terms of antibody CDR H3 loop modeling, which is considered the most challenging task in antibody structure prediction, KarmaLoop also showed the best performance and maintained its superiority across various lengths. Notably, based on the AlphaFold-Multimer, AlphaFold, and DeepAb modeled antibody structures, KarmaLoop has successfully promoted its accuracy and achieved improvement upon those methods in almost all lengths of H3 loop.
In summary, KarmaLoop has demonstrated the potential to accurately model loops, refine loop conformations based on inaccurate structure environments, and markedly accelerate batch data processing. Besides, a universal protocol was proposed for antibody structure prediction by utilizing DeepAb to model the entire structure of the antibody and KarmaLoop to refine the H3 conformation. Except for an accurate loop modeling paradigm proposed, a large training dataset (0.27 million) and several benchmark datasets have been made available for researchers to further evaluate the prediction accuracy of the general loop and antibody H3. We anticipate that KarmaLoop will accelerate the advancement of protein design, antibody–antigen recognition, and drug discovery and will become an essential tool in the field of biology moving forward.
Inspired by Jin et al. [62], we consider the loop modeling problem as the docking task and selected pockets for reducing computational resources, which is a common and frequently used protocol for ligand docking [51]. As shown in Fig. 1B, the protein pocket is modeled as a 2D molecular graph, Gp, where nodes represent atoms and edges denote covalent bonds. For the nonloop region, a 3D residue graph (Gnl) is employed, where nodes represent residues. This graph connects each node to its 30 closest neighbors, enhancing the ability to capture long-range interactions more effectively and with less computational demand than an atom-based graph.
Rather than serving as direct inputs for subsequent tasks, the molecular graphs and residue graphs undergo encoding through GT [5153] and GVP [54] to learn intra-molecular interactions and update node embeddings. Then, for nonloop residues, the node embeddings for atom nodes in the same residue are summed and concatenated to the corresponding residue node embeddings, followed by a feature merging block (FMB) to generate hierarchical node embeddings.
Hp,s1=GTHp,s0,Ep,s0
Hnl,s0.5=GVPSnl,s0,Hnl,s0,Hnl,v0,Enl,s0,Enl,v0
Hnl,s,i1=FMBconcatHnl,s,i0.5,j=kkHp,s,j1
Hl,s1=Hp,s1loop_mask
where H denotes the features of nodes and E is the characteristics of edges; S indicates the type of residue involved; loop mask is a bool array for selecting loop atoms from the whole protein atoms; and p, nl, l, s, v, i, j, k, and k' on the lower right (i.e., subscripts) denote protein pocket atoms, nonloop residues, loop residues, scalar, vector, the index of residues, the index of atoms, the start index of the atom corresponding to ith residue, and the end index of the atom corresponding to ith residue, respectively.
Embeddings of nodes Hl,s1,Hnl,s1 generated by the encoders, along with the node coordinates Xl,s1,Xnl,s1 of loop atoms and nonloop residues, are integrated to create an interaction graph [Gl,nl = (Hl,nl,  El,nl,  Xl,nl)]. This graph is designed to encapsulate intermolecular interactions between molecules at both the levels of atoms and residues.
In the procedure for generating poses, the loop conformation, depicted as the coordinates of the loop atom nodes, is initialized by sampling from the normal distribution with means and SDs varying for various tasks (loop reconstruction and loop refinement, details in the “Loop initialization and pocket selection” section). The embeddings for nodes and edges begin from a baseline established by graph normalization and are further processed through an MLP layer (Eq. 5). Subsequently, an EGNN consisting of 8 layers with self-attention is employed to update the embeddings of nodes and edges as well as the positions of the loop atoms. This network architecture is designed to effectively handle interactions both within loop atoms and between loop residue and nonloop atoms (Eq. 6). Inspired by AlphaFold, a recycling approach is adopted to ensure that the EGNN block progressively enhances its ability to refine the structural poses. At the beginning of each recycling iteration, a gating mechanism combines the updated embeddings with the original raw embeddings (Eq. 7).
Hl,nl0,0=GraphNormHl,s,Hnl,s
Hl,nlr,i,El,nlr,i,Xl,nlr,i=EGNN_LayeriHl,nlr,i1,El,nlr,i1,Xl,nlr,i1
Fl,nlr+1,0=Gate_BlockFl,nlr,8,Fl,nl0,0
In this framework H and E denote the embeddings of node and edge, respectively; F is used to represent a generic embedding; and l, nl, r, and i represent loop atoms, nonloop residues, the index of recycling, and the index of EGNN layer, respectively.
Upon obtaining node embeddings, the MDN block is used to predict essential statistical parameters such as the mean, SD, and coefficient of variation. These parameters collectively provide a comprehensive representation of the internodal distance distribution, as outlined in Eq. 8. Following this step, the node-pair embeddings and internodal distances are integrated with the predicted distribution to ascertain the likelihood that an internodal distance mirrors that observed in the crystal conformation (Eq. 9). This resultant probability estimation can then be utilized as a scoring metric to evaluate the appropriateness of the current conformation in subsequent analyses.
μl,nl,σl,nl,πl,nl=MDN_BlockHl,nl1
Ux=nl=1NLl=1LlogPdl,nl|hl,hnl=score
where H represents the node embeddings; l and nl denote loop atoms and nonloop residues, respectively; and μl,nl, σl,nl, and πl,nl represent the means, SDs, and mixing coefficients, respectively.
Selecting the pocket instead of the entire protein can simplify the input of the network and guide KarmaLoop to focus on the relevant structural features that are essential for understanding the interaction between loop and nonloop regions. In our study, we introduce 2 methods for loop initialization and pocket selection.
In cases of loop reconstruction where loop conformations remain undetermined, we calculate the center of mass for each loop residue by uniformly dividing the distance between its anchor residues (those immediately preceding and following the loop region). The initial conformation of each residue is then sampled from a Gaussian distribution, centered on its individual center of mass with an SD of 1.75 Å. The pocket selection similarly relies on the anchor residues, retaining the surface residues within a sphere whose center is the midpoint between the 2 anchor residues (see the “Surface pocket selection” section).
Conversely, for loop conformation refinement, where KarmaLoop is tasked with optimizing the loop conformations that are inaccurately predicted by other algorithms, the conformations are generated from a normal distribution. This distribution has its mean set around the original loop's center of mass with an SD of 4 Å. Pockets, in this scenario, comprise the residues located within 12 Å of any original loop atom.
In this research, each protein is depicted as an undirected graph, symbolized by [Gp = (Hp,  Ep,  lp)], with nodes corresponding to atoms and edges to covalent bonds. Hp and Ep represent the node features and edge features, while the coordinates of the atoms are represented by the node positions Xp.
To facilitate the prediction of loop conformations through nonbonded interactions, it is essential to establish connections between protein and loop nodes within the model. However, creating full connections between all nonloop and loop nodes is highly resource-intensive, particularly when dealing with a vast array of protein atoms. For obtaining both atom and residue information, we propose to use residues as nodes by merging atom information with the corresponding residues in the nonloop graph, which can also greatly reduce the number of nodes and edges in the graph. Furthermore, residue graphs are capable of encoding the geometric details of each residue while effectively capturing interactions over long distances between loop and nonloop residues, which enhances the prediction of loop conformations. To enhance the prediction of loop conformations, we devise a K-nearest neighbor (KNN) graph, denoted as [Gnl = (Hnl,  Enl,  Xnl)], setting K to 30. In this graph, residues serve as nodes and their coordinates are represented by the coordinates of CA atoms. As detailed in Table S2, the features of both nodes and edges are not limited to scalar features (Hnl_s,  Enl_s) but also vector features (Hnl_v,  Enl_v), and the GVP is employed to learn the molecular topology and geometric characteristics effectively.
In the demonstrated approach for effective representation of protein graphs, GTs have been utilized to learn intramolecular interactions (Fig. 1B). hp_i ∈ Rdh×1 represents ith node feature, and ep_ij ∈ Rde×1 represents the edge feature between node i and j; at the beginning, it was initialed as hp_i0 and ep_ij0 with d-dimension through 2 distinct linear layers:
hp_i0=Wh0hp_i+bh0ep_ij0=We0ep_ij+be0
where Wh0Rd×dh, We0Rd×de, and bh0, be0Rd. The node and edge embeddings are first initialized and then processed through a series of 6 graph transformer layers, which produce the final embeddings. Each layer in this stack updates the embeddings through a process of message passing combined with a specialized version of multi-head self-attention (MHA), as detailed in the equations below:
qp_ik,l=WQk,lNormhp_il
kp_jk,l=WKk,lNormhp_jl
vp_jk,l=WVk,lNormhp_jl
ep_ijk,l=WEk,lNormep_ijl
wp_ijk,l=Softmaxj∈Niqp_ik,l·kp_jk,ldk·ep_ijk,l
ĥp_il+1=hp_il+Wh0lDropoutConcatk∈1,,HAggregation_Sumj∈Niwp_ijk,lvp_jk,l
êp_ijl+1=ep_ijl+We0lDropoutConcatk∈1,,Hwp_ijk,l
hp_il+1=ĥp_il+1+Wh2lDropoutSiLUWh1lNormĥp_il+1
ep_ijl+1=êp_ijl+1+We2lDropoutSiLUWe1lNormêp_ijl+1
where WQk,l, WKk,l, WVk,l, WEk,lRdk×d, Wh0l, We0lRd×d, Wh1l, We1k,lR2d×d, and Wh2l, We2lRd×2d are learnable parameters from linear layers; k∈1, …, H represents attention heads number; dimension of each head is denoted by dk; j∈N(i) denotes the set of nodes adjacent to node i; Norm refers to the application of batch normalization; Concat involves the operation of concatenation; Dropout operation involves randomly omitting units during training to prevent overfitting; SiLU refers to a specific activation function; Aggregation_Sumj∈N(i) implies the summing up of messages from edges that link node i with its neighbors j; and Softmaxj∈N(i) denotes the SoftMax function applied across the neighbors j.
In the KarmaLoop implementation, GVPs serve as the nonloop residue encoder. They utilize node embeddings updated according to topological connections and geometric data within and between residues. The fundamental component, known as the gvp layer, processes scalar features fs ∈ Rd and vector features fv ∈ Rd×3. The operation within the lth layer of this system can be described as follows:
fv1l=Wv0lfvl
fv2l=Wv1lfv1l
Sv1l=fv1l2rowwise
Sv2l=fv2l2rowwise
fsvl=Concatfsl,Sv1l
f̂sl=Wsvlfsvl+bsvl
fsl+1=σsf̂sl
fvl+1=σvSv2lfv2lrowwise multiplication
where Wv0lRdv1×dv0, Wv1lRdv2×dv1, and WsvlRds1×ds0+dv1 are learnable parameters; activation functions are denoted as σs and σv. Initially, sequence information is processed through an embedding layer and then concatenated with additional scalar node features before entering the gvp layers:
hseq=EmbeddingSequence
hs=Concaths0,hseq
In the architecture of the embedding layer, the dimension of the word table is structured as (dseq, dseq) and hs0∈Rdhs0. Following this, both node features and edge features are fed into an initialization module, which includes a layer normalization and a gvp layer.:
hs1,hv1=gvpLayerNormhs,hv
es1,ev1=gvpLayerNormes,ev
where hs∈Rdseq+dhs0, hv∈Rdhv0×3, hs1∈Rdhs1, hv1∈Rdhv1×3, es∈Rdes0, ev∈Rdev0×3, es1∈Rdes1, and ev1∈Rdev1×3. Following the initial setup, both node and edge attributes are fed into a sequence of GVPConv layers, where the process incorporates 2 stages of GVP layers for message transmission. The formulas defining the GVPConv layer operations are presented below:
ms_ijl=Concaths_il,es_ijl,hs_jl
mv_ijl=Concathv_il,ev_ijl,hv_jl
ms_ij_1l,mv_ij_1l=gvpms_ijl,mv_ijl
ms_ij_2l,mv_ij_2l=gvpms_ij_1l,mv_ij_1l
ms_ij_3l,mv_ij_3l=gvpms_ij_2l,mv_ij_2l
ĥs_jl,ĥv_jl=Aggregation_Meani∈Njms_ij_3l,mv_ij_3l
f̂s_j_0l,f̂v_j_0l=LayerNormhs_jl+Dropoutĥs_jl,hv_jl+Dropoutĥv_jl
f̂s_j_1l,f̂v_j_1l=gvpf̂s_j_0l,f̂v_j_0l
f̂s_j_2l,f̂v_j_2l=gvpf̂s_j_1l,f̂v_j_1l
hs_il+1,hv_jl+1=LayerNormhs_jl+Dropoutf̂s_j_2l,hv_jl+Dropoutf̂v_j_2l
In Eqs. 34, 35, and 39, the scalar and vector features utilize ReLU and sigmoid activation functions, respectively. Contrarily, other gvp layers operate without activation functions; ms_ijlR2dhs1+des1, mv_ijlR2dhv1+dev1×3, ms_ij_1l, ms_ij_2l, ms_ij_3l, ĥs_jl, f̂s_j_0l, f̂s_j_2l and hs_il+1Rdhs1, mv_ij_1l, mv_ij_2l, mv_ij_3l, ĥv_jl, f̂v_j_0l, f̂v_j_2l and hv_jl+1Rdhv1×3, f̂s_j_1lR4dhs1, and f̂v_j_1lR2dhv1×3; and Aggregation_Meani∈N(j) indicates the process of averaging the messages across the edges that link node j to its adjacent nodes i.
One of the innovations of KarmaLoop is the hierarchical representation of protein residues and atoms. By adopting both the advantage of atom-level representation (offering detailed descriptions but demanding high resource) and residue-level representation (requiring lower resource but providing less details) of protein, hierarchical representation achieves an effective balance between comprehensive protein characterization with reduced resource demands. As depicted in Fig. 1C, the scalar embeddings hp_i and hnl_i generated from the GVP block and GT block for the protein atoms and nonloop residues are merged. The atom node features are summed and concatenated with the corresponding residue features followed by linear layers and an activation layer for feature fusion.
mnl_i=Concathnl_i,j=kkhp_j
mnl_i=MLPLaekyReLUMLPmnl_i
where p, nl, i, j, k, and k' on the lower right (i.e., subscripts) denote protein atoms, nonloop residues, the index of nonloop residues, the index of nonloop atoms, the start index of the atom corresponding to ith residue, and the end index of the atom corresponding to ith residue, respectively.
For predicting loop conformations, we design an interaction graph [Gl,nl = (Hl,nl,  El, nl,Xl,nl)] derived from the overall protein atom graph Gp and nonloop residue graph Gnl. Previously, we constructed the loop graph and nonloop graph, with edges connecting nodes within the same region. However, it is crucial to establish connections between these regions. Therefore, we construct an interaction graph, which facilitates the exploration of interactions between loop and nonloop regions, thereby deepening our comprehension of their interconnected dynamics. We take nonloop residues and loop atoms as nodes. Then, we fully connect nonloop and loop nodes and consider the global interactions for accurate generation of loop conformations. Initial node features h0 ∈ Rdh0 are obtained from the GT encoder's output and merged feature from the output of merge block for loop atoms and nonloop residues, respectively. Edge features e0 ∈ Rde0 are generated through one-hot encoding of edge types and the distances between nodes, with actual distances used for intra-residue and intra-loop edges and −1 for others. Moreover, the coordinates of nonloop nodes x0 ∈ Rn×3 are directly inherited from the protein graphs, whereas the loop conformation are initialized randomly.
In this research, we incorporate self-attention mechanisms into the message-passing process of the E(n) equivariant model, EGNN, which is noted for its rapid processing capabilities in dynamical systems. Unlike traditional GNNs, the integration of self-attention, akin to transformations seen in RTMScore [51], greatly enhances the representational capacities of GNNs. Additionally, to enhance accuracy, we implement a recycling mechanism inspired by AlphaFold that compels the EGNN layer to predict atomic movements based on the current predicted conformation.
Initially, the scalar node embeddings for proteins and loops, h0, are set using graph normalization techniques, with updates processed via GVP and GT blocks. The initialization of edge features, e0, is updated through a dedicated linear layer:
h1=GraphNormh0
e1=We_inite0+be_init
where h0, h1 and e1∈Rdh, e0∈Rde; We_init∈Rdh × de and be_init∈Rdh are learnable within the linear layer.
The primary building block for updating loop coordinates employs a sequence of 8 sequentially stacked EGNN layers. The message-passing mechanism in the lth layers of EGNN layer is depicted as follows:
qik,lk∈1,,H=WQlhi_1l+bQl
kjk,lk∈1,,H=WKlhj_1l+bKl
vjk,lk∈1,,H=WVlhj_1l+bVl
eijl=Concateij_1l,xilxjl2
mijl=Wm2lLeakyReLUDropoutWm1leijl+bm1l+bm2l
kijk,lk∈1,,H=Concatk∈1,,Hkjk,lmijl
wijk,l=qik,lkijk,l/dk
αijk,l=Softmaxj∈Niwijk,l2
ĥi_1l=DropoutWhlConcatk∈1,,HAggregation_Sumj∈Niwijk,lvjk,l+bhl
hi_1l+1=Gate_Blockhi_1l,ĥi_1l
eij_1l+1=WelConcatk∈1,,Hαijk,l+bel
xil+1=Coords_Update_Blockwijk,l1,,H,jϵNi,xil,xjljϵNi
In linear layers αijk,lR1; hi_1l, hj_1l, eij_1l, eijl, mijl, ĥi_1l, hi_1l+1, and eij_1l+1Rdh; qik,l, kjk,l, kijk,l, vjk,l, wijk,lRdk; WQl, WKl, WVl, Wm2l, Whl, and WelRdh×dh, Wm1lRdh×dh+1, bQl, bKl, bVl, bm1l, bm2l, bhl, and belRdh are learnable; k∈1, …, H indicates the total number of attention heads; each head's dimension, dk, is the quotient of dh by Hj∈N(i) indicates the nodes adjacent to node i; Concat refers to the operation of concatenation; Dropout is an operation used to prevent overfitting; LeakyReLU represents a variant of activation functions; Aggregation_Sumj∈N(i) describes the process of aggregating messages from edges linking node i to its adjacent nodes j; and Softmaxj∈N(i) is the SoftMax function applied to the neighboring nodes j.
The loop node coordinates update was implemented by the Coords_Update_Block:
xijl=xilxjl
xijl=xijl/xijl2
xijk,l=xijl·Wx2lLeakeyReLUDropoutWx1lwijk,l+bx1l+bx2l
xijl=WHlConcatk∈1,,Hxijk,l
xil=Aggregation_Sumj∈Nixijl
xil+1=xil+xil
where xil, xjl, xijl, xijk,l, xil, and xil+1R1×3; Wx1lRdk/2×dk, Wx2lϵR1×dk/2, WHlϵR1×H, bx1lRdk/2, and bx2lR1.
Gate_Block serves as a foundational component for residual connections within EGNN layers and initiates each cycle of recycling.
g=SigmoidDropoutWgConcatĥnew,hold,ĥnewhold+bg
hnew=GraphNormgĥnew+hold
where hold, ĥnew, g, and hnew∈Rdh; Wg∈Rdh × dh and bg∈Rdh are learnable parameters.
The MDN block is used to learn the prior knowledge of distance. The process starts by sequentially concatenating the node embeddings from nonloop residues (hnl) and loop atoms (hl) produced by GVP block and GT block, respectively. This concatenated output is then passed through a linear layer, followed by batch normalization, an ELU activation function, and a dropout layer to enhance the capturing of interactions. Subsequently, 3 distinct linear layers are employed to compute the parameters: means μp,c, SDs σp,c, and mixing coefficients πp,c. These parameters are vital for defining a mixture density model that captures a distribution of n different distance distributions for each pair between nonloop and loop nodes:
hnl,l=DropoutELUBatchNormWp,cConcathnl,hl+bnl,l
μnl,l=ELUWμhnl,l+bμ+1
σnl,l=ELUWσhnl,l+bσ+1.1
πnl,l=SoftmaxWπhnl,l+bπ
where Wnl,l∈Rdnl,l×2dh, Wμ, Wσ, Wπ∈Rn×dnl,l, bp,c∈Rdnl,l, bμ, bσ, and bπ∈Rn are learnable parameters of linear layers; hnl and hl∈Rdh.
In our approach to modeling proteins, we utilize both residue-level and atom-level representations. For determining the proximity between loop atoms and nonloop atoms, we select the minimum distance between each loop atom and every atom in a residue as the key metric for assessing the distance between loop and nonloop nodes.
After KarmaLoop predicted the conformation, an optional force field (FF) optimization was implemented to remove unfavorable steric clashes and generate high-quality loop conformations. OpenMM [63] was deployed for energy minimization by using the ff14SB FF to optimize the predicted conformation. To prevent excessive changes to the protein, we applied a harmonic potential that restrains the backbone atoms (N, CA, C, and CB) to their original locations. Finally, we ran energy minimization on an A100 GPU and saved the optimized protein structure to a new PDB file (with an average speed of about 5 s per loop). Additionally, high tolerance was selected to make sure that not too much expensive computational cost was used, and in practical use, the tolerance can be selected automatically.
The surface pocket selection includes 2 processes.
Initially, a midpoint is determined between 2 anchor residues (the nonloop residues immediately before and after the loop), serving as the preliminary center of the pocket. Subsequently, 2 specific radii are calculated:
a. Half the distance between the centers of mass of the 2 anchor residues.
b. A value derived from doubling the number of loop residues, expressed in Å, with a maximum limit of 16 Å.
From these, the larger radius is chosen for the pocket.
The selected radius dictates one of 2 scenarios (Fig. S2):
a. if the radius is greater than 20 Å, pseudo-loop conformations are sampled uniformly using a linear interpolation method between the 2 anchor residue conformations. The residues located within 12 Å of these pseudo-loop conformations are included in the pocket.
b. In other cases, this radius is employed to define a smaller sphere, SS, centered at the previously determined midpoint. Nonloop residues located within SS are chosen as the initial pocket. A larger sphere, BS, is subsequently generated using an extended radius (initial radius + 12 Å), aiming to encompass all potential interacting residues. However, the BS-defined pocket may incorporate many residues distant from the loop–nonloop interaction interface, which may have minimal impact on loop conformation prediction. Hence, we reduced the pocket residues by removing the residues only selected by BS but forming similar residue-midpoint directions to these selected by SS. To further enhance the pocket, the residues located within 5 to 7 Å of the already selected pocket residues are also included.
The biennial Critical Assessment of protein Structure Prediction (CASP) is a worldwide contest for protein structure prediction and is also commonly used for loop modeling assessment [17,20,23,64]. The general loop benchmark consists of 2 datasets: CASP13+14 and CASP15 monomer datasets. CASP13+14 is the combination of CASP13 and CASP14, which contains 549 loop structures from 52 proteins (the unmatched structure and sequence sample were removed). CASP15 containing 430 loop structures from 48 proteins was collected from the latest CASP competition. Then, the pocket sequence similarity was calculated by BLAST+ [65] (blastp module), and the sequences with no more than 50% similarity between CASP13+14/CASP15 and the training data were selected as the loop benchmark. Finally, 289 loop structures from 39 proteins in CASP13+14 and 304 loop structures from 41 proteins in CASP15 were selected as the benchmark. The antibody H3 benchmark derived from DeepAb is the combination of the RosettaAntibody and therapeutic benchmark [66], and comprises 92 antibody CDR H3 loop structures from 92 antibody complex structures. According to the standard of the pocket sequence similarity of less than 90% and H3 sequence similarity of less than 50%, we finally got 50 antibodies. The distribution of the loop lengths of those benchmarks is depicted in Fig. S1.
In this study, the training data were collected from PDB and Structural Antibody Database (SAbDab) [67]. The protein data from PDB were filtered by PISCES [68] using the following criteria: solved by x-ray crystallography, sequence identity ≤ 90%, resolution < 3.0 Å, R factor ≤ 0.25, and containing no DNA, RNA, and UNK molecules. The nonsecondary structure of PDB data was defined by DSSP [69], and we defined loops as regions connecting 2 secondary structures that consist of at least 4 residues. The antibody dataset was selected to be the same as that of DeepAb. The training and validation datasets were created using random stratified sampling according to the loop length in a 9:1 ratio. The antibody data were annotated using the Chothia numbering scheme [70]. Specifically, the training data contain the CDR L1, L2, L3, H1, H2, and H3 regions, and each type of CDRs was also split following the 9:1 ratio, according to loop length. Finally, the total numbers of the data for training and validation are 250,257 and 27,832, respectively.
All the protein data used in this work were downloaded from PDB, SAbDab, and CASP. Water molecules were subsequently removed from the proteins, and the loop regions with nonstandard amino acid residues, missing residues, and incorrect valence bonds were removed. The nonloop pockets were selected in the way mentioned in the “Loop initialization and pocket selection” section.
In response to different application requirements, we formulated 2 specialized training variants for KarmaLoop. For the loop reconstruction task, KarmaLoop was trained on surface pockets by initializing loop conformations without reliance on the ground-truth data. Conversely, when optimizing loop conformations derived from less precise frameworks, such as those predicted by AlphaFold, the training of KarmaLoop focused on pockets located within 12 Å of the raw loop conformations. These conformations were centered around the raw loop's center of mass.
In the training process of the model, it is hypothesized that the MDN block's ability to fit distributions of distances between nonloop and loop node pairs enhances the model's efficiency in constraining the search space for stable conformations, thereby facilitating the identification of the most optimal conformation. For the purpose of predicting conformations, we initially train the MDN block within the KarmaLoop framework using updated node embeddings from encoders and actual loop conformation data as inputs. This training employs an MDN-specific loss function LMDN alongside 2 auxiliary cross-entropy loss functions (Latom, Lbond), targeting different aspects of the model:
L=LMDN+0.001×Latom+0.001×Lbond
LMDN=logPdl,nl|hl,hnl=logn=110πl,nl,nNdl,nl|μl,nl,n,σl,nl,n
where μl,nl,n, σl,nl,n, and πl,nl,n represent mean, SDs, and the mixing coefficients of nth distance distribution; hl, hnl, and dl,nl represent the node embeddings of loop atom nodes, nonloop residue nodes, and the distance between loop and nonloop nodes, respectively. The model was optimized using the Adam optimizer with a batch size of 64, a learning rate of 1 × 10−3, and a weight decay of 1 × 10−5. Training ceases when the validation loss rises for 70 consecutive epochs. At this stage, both the GT and GVP are partially attuned to the distance distributions. Subsequently, we initiate training of the EGNN module and the previously developed scoring module, utilizing the same dataset partitioning technique as in the initial training of the scoring module. We integrate the conformation predicted module's loss function into Eq. 1 with a coefficient of 1, correlating it to the RMSD between the predicted loop conformations and their actual counterparts. The training hyperparameters remain unchanged except for a reduction in the learning rate to 1 × 10−4 and the removal of weight decay.
L=Lrmsd+LMDN+0.001×Latom+0.001×Lbond
Lrmsd=RMSDxlpred,xllabel=n=1Nxl,npredxl,nlabel2N
where N denotes the loop node number and n represents the index of the loop nodes.
In our study, we observed that initializing the loops with different SDs led to diverse loop conformations, which partially captured the loop's conformational flexibility. Upon evaluating various SDs for loop initialization, we determined that a deviation of 1.75 Å, identical to the training set, yielded the highest success rate for single conformation. In contrast, a deviation of 7 Å was optimal for achieving success in multi-conformation predictions. Consequently, we used 1.75 Å for single-conformation generation and 7 Å for multi-conformation generation.
To assess the performance of KarmaLoop, we selected well-established traditional and DL methods renowned for their precision or efficiency. This choice was grounded in our thorough evaluation of loop modeling methods, as detailed in our previous research [71]. The general protein loop modeling methods that were considered include (a) knowledge-based method: FREAD; (b) ab initio methods: DISGRO, NGK, and Rosetta-missing-loop (RML); and (c) DL-based methods: AlphaFold RoseTTAFold and ColabFold, which is compatible with AlphaFold 2.3.1 (users can customize the template database). According to the user guide, NGK requires a start conformation of the loop region, and if there is a missing loop, RML can be used to fill it. In addition to these methods, several antibody structure prediction methods were also examined including DeepAb, RosettaAntibody-G [59], RepertoireBuilder [60], AbodyBuilder [61], AlphaFold, and AlphaFold-Multimer to assess the prediction ability of H3 loop region.
The versions of those methods and options are listed below:
FREAD (version 1.0): The “top_models” parameter was configured to 1 for single-conformation evaluations and 5 for multi-conformation evaluations, thereby enabling a streamlined assessment of both single and multi-conformational structures.
DISGRO (original version): The “pdbout” parameter was adjusted to 1 and 5 for various comparison modes, with the other options remaining at their default settings.
NGK and RML: Both integrated within the Rosetta suite (version 3.13), NGK employs the “out:nstruct” parameter, while RML utilizes “nstruct,” both set to 1 for single and 5 for multi-conformation evaluations. The other parameters for NGK matched the integration test standards, whereas RML mirrored the guidelines from the provided demo.
RoseTTAFold (version 1.1.0): The “run_pyrosetta_ver.sh” was chosen for the implementation of monomer structure prediction. This selection yields 5 results, analogous to the outcomes produced by AlphaFold, with the other options kept at the defaults.
AlphaFold (version 2.1.1): Two key configurations were made: the “db_preset” was set to “full-dbs”—a choice that, although time-intensive, ensures the acquisition of highly accurate structures. The “model_preset” was configured for “monomer” to facilitate monomer structure prediction.
AlphaFold-Multimer (version 2.1.1): This method was also evaluated. By choosing “model_preset” as “multimer” and other configurations as default.
ColabFold (version 1.5.2): ColabFold was implemented locally (LocalColabFold), and it was compatible with AlphaFold 2.3.1. By choosing “templates” and “custom-template-path” to the template database we made which consisted of all nonloop structures of CASP13+14 and CASP15 datasets.
Additionally, structural data from DeepAb (version 2.2), RosettaAntibody-G, RepertoireBuilder, and AbodyBuilder were sourced from the pdb-format results supplied by DeepAb.
For loop modeling tasks, it is essential to consider both accuracy and efficiency. Full-atom RMSD was utilized for evaluating the accuracy of prediction. Because AlphaFold, ColabFold, and RoseTTAFold model the entire protein structures, the predicted structures of proteins are aligned before calculating the RMSD values. Additionally, AlphaFold-Multimer, AlphaFold, DeepAb, RosettaAntibody-G, RepertoireBuilder, AbodyBuilder, and RosettaAntibody predict the complete structures of antibodies. Following the same guidelines as DeepAb, the heavy chains of these antibodies are aligned prior to the RMSD calculation. Moreover, the success rate of the method was introduced as a complementary metric to RMSD, which measures the proportion of cases where the RMSD is less than or equal to 2 Å compared to experimentally determined conformations. The efficiency was simply measured by computing time.
KarmaLoop was trained on 8 NVIDIA A100-SXM4-80GB and 64-core Intel Xeon Platinum 8358P CPU @ 2.60 GHz. For evaluation, KarmaLoop was evaluated on a Tesla V100S GPU. Traditional methods (FREAD, DISGRO, NGK, and RML) were accomplished in parallel with 48-core Intel Xeon Gold 6240R CPU @ 2.40 GHz. AlphaFold-Multimer, AlphaFold, ColabFold, and RoseTTAFold were executed in parallel on 20 cores of the Intel Xeon Gold 6240R CPU @ 2.40 GHz and a Tesla V100S GPU. DeepAb, RosettaAntibody-G, RepertoireBuilder, AbodyBuilder, and RosettaAntibody predicted antibody conformation was derived from DeepAb provided data.
  • National Key Research and Development Program of China(2022YFF1203000)
1.
Tang K, Zhang J, Liang J. Fast protein loop sampling and structure prediction using distance-guided sequential chain-growth Monte Carlo method. PLOS Comput Biol. 2014;10(4): Article e1003539.
2.
Rufino SD, Donate LE, Canard LH, Blundell TL. Predicting the conformational class of short and medium size loops connecting regular secondary structures: Application to comparative modelling. J Mol Biol. 1997;267(2):352–367.
3.
Lins L, Thomas A, Brasseur R. Analysis of accessible surface of residues in proteins. Protein Sci. 2003;12(7):1406–1417.
4.
Stevens AO, He Y. Benchmarking the accuracy of AlphaFold 2 in loop structure prediction. Biomolecules. 2022;12(7):985.
5.
Lee J, Lee D, Park H, Coutsias EA, Seok C. Protein loop modeling by using fragment assembly and analytical loop closure. Proteins. 2010;78(16):3428–3436.
6.
Li Z, Meng S, Nie K, Schwaneberg U, Davari MD, Xu H, Ji Y, Liu L. Flexibility regulation of loops surrounding the tunnel entrance in cytochrome P450 enhanced substrate access substantially. ACS Catal. 2022;12(20):12800–12808.
7.
Crean RM, Biler M, van der Kamp MW, Hengge AC, Kamerlin SCL. Loop dynamics and enzyme catalysis in protein tyrosine phosphatases. J Am Chem Soc. 2021;143(10):3830–3845.
8.
Borges PT, Brissos V, Hernandez G, Masgrau L, Lucas MF, Monza E, Frazão C, Cordeiro TN, Martins LO. Methionine-rich loop of multicopper oxidase McoA follows open-to-close transitions with a role in enzyme catalysis. ACS Catal. 2020;10(13):7162–7176.
9.
Jones S, Thornton JM. Prediction of protein-protein interaction sites using patch analysis11Edited by G. von Heijne. J Mol Biol. 1997;272(1):133–143.
10.
Streaker ED, Beckett D. Ligand-linked structural changes in the Escherichia coli biotin repressor: The significance of surface loops for binding and allostery. J Mol Biol. 1999;292(3):619–632.
11.
Myllykoski M, Raasakka A, Han H, Kursula P. Myelin 2′,3′-cyclic nucleotide 3′-phosphodiesterase: Active-site ligand binding and molecular conformation. PLOS ONE. 2012;7(2): Article e32336.
12.
Tamamis P, Floudas CA. Molecular recognition of CXCR4 by a dual tropic HIV-1 gp120 V3 loop. Biophys J. 2013;105(6):1502–1514.
13.
Papaleo E, Saladino G, Lambrughi M, Lindorff-Larsen K, Gervasio FL, Nussinov R. The role of protein loops and linkers in conformational dynamics and allostery. Chem Rev. 2016;116(11):6391–6423.
14.
Zhao S, Zhu K, Li J, Friesner RA. Progress in super long loop prediction. Proteins. 2011;79(10):2920–2935.
15.
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res. 2000;28(1):235–242.
16.
Marks C, Deane CM. Antibody H3 structure prediction. Comput Struct Biotechnol J. 2017;15:222–231.
17.
Choi Y, Deane CM. FREAD revisited: Accurate loop structure prediction using a database search algorithm. Proteins. 2010;78(6):1431–1440.
18.
Messih MA, Lepore R, Tramontano A. LoopIng: A template-based tool for predicting the structure of protein loops. Bioinformatics. 2015;31(23):3767–3772.
19.
Hildebrand PW, Goede A, Bauer RA, Gruening B, Ismer J, Michalsky E, Preissner R. SuperLooper—A prediction server for the modeling of loops in globular and membrane proteins. Nucleic Acids Res. 2009;37:W571–W574.
20.
Karami Y, Guyon F, De Vries S, Tufféry P. DaReUS-loop: Accurate loop modeling using fragments from remote or unrelated proteins. Sci Rep. 2018;8(1):13673.
21.
Barozet A, Chacón P, Cortés J. Current approaches to flexible loop modeling. Curr Res Struct Biol. 2021;3:187–191.
22.
Marks C, Nowak J, Klostermann S, Georges G, Dunbar J, Shi J, Kelm S, Deane CM. Sphinx: Merging knowledge-based and ab initio approaches to improve protein loop prediction. Bioinformatics. 2017;33(9):1346–1353.
23.
Liang S, Zhang C, Zhou Y. LEAP: Highly accurate prediction of protein loop conformations by integrating coarse-grained sampling and optimized energy scores with all-atom refinement of backbone and side chains. J Comput Chem. 2014;35(4):335–341.
24.
Canutescu AA, Dunbrack RL Jr. Cyclic coordinate descent: A robotics algorithm for protein loop closure. Protein Sci. 2003;12(5):963–972.
25.
Wang C, Bradley P, Baker D. Protein-protein docking with backbone flexibility. J Mol Biol. 2007;373(2):503–519.
26.
Mandell DJ, Coutsias EA, Kortemme T. Sub-angstrom accuracy in protein loop reconstruction by robotics-inspired conformational sampling. Nat Methods. 2009;6(8):551–552.
27.
Stein A, Kortemme T. Improvements to robotics-inspired conformational sampling in rosetta. PLOS ONE. 2013;8(5): Article e63090.
28.
Deane CM, Blundell TL. CODA: A combined algorithm for predicting the structurally variable regions of protein models. Protein Sci. 2001;10(3):599–612.
29.
Deane CM, Blundell TL. A novel exhaustive search algorithm for predicting the conformation of polypeptide segments in proteins. Proteins. 2000;40(1):135–144.
30.
Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596(7873):583–589.
31.
Baek M, DiMaio F, Anishchenko I, Dauparas J, Ovchinnikov S, Lee GR, Wang J, Cong Q, Kinch LN, Schaeffer RD, et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science. 2021;373(6557):871–876.
32.
Lee C, Su BH, Tseng YJ. Comparative studies of AlphaFold, RoseTTAFold and Modeller: A case study involving the use of G-protein-coupled receptors. Brief Bioinform. 2022;23(5):bbac308.
33.
Liang TJ, Jiang C, Yuan JY, Othman Y, Xie XQ, Feng Z. Differential performance of RoseTTAFold in antibody modeling. Brief Bioinform. 2022;23(5):bbac152.
34.
Terwilliger TC, Liebschner D, Croll TI, Williams CJ, McCoy AJ, Poon BK, Afonine PV, Oeffner RD, Richardson JS, Read RJ, et al. AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination. Nat Methods. 2024;21(1):110–116.
35.
Evans R, O'Neill M, Pritzel A, Antropova N, Senior A, Gree T, Žídek A, Bates R, Blackwell S, Yim J, et al. Protein complex prediction with AlphaFold-Multimer. bioRxiv. 2021. https://doi.org/10.1101/2021.10.04.463034
36.
Yin R, Pierce BG. Evaluation of AlphaFold antibody-antigen modeling with implications for improving predictive accuracy. Protein Sci. 2024;33(1):16.
37.
Abramson J, Adler J, Dunger J, Evans R, Green T, Pritzel A, Ronneberger O, Willmore L, Ballard AJ, Bambrick J, et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature. 2024;1–3.
38.
Nguyen SP, Li ZY, Xu D, Shang Y. New deep learning methods for protein loop modeling. IEEE/ACM Trans Comput Biol Bioinform. 2019;16(2):596–606.
39.
Ouyang W, Luo P, Zeng X, Qiu S, Tian Y, Li H, Yang S, Wang Z, Xiong Y, Qian C, et al. DeepID-Net: Multi-stage and deformable deep convolutional neural networks for object detection. arXiv. 2014. https://doi.org/10.48550/arXiv.1409.3505
40.
Pan F, Zhang Y, Lo C-C, Mandal A, Liu X, Zhang J. Protein loop modeling and refinement using deep learning models. bioRxiv. 2021. https://doi.org/10.1101/2021.11.03.467148
41.
Ruffolo JA, Guerra C, Mahajan SP, Sulam J, Gray JJ. Geometric potentials from deep learning improve prediction of CDR H3 loop structures. Bioinformatics. 2020;36(Suppl. 1):i268–i275.
42.
Ruffolo JA, Sulam J, Gray JJ. Antibody structure prediction using interpretable deep learning. Patterns. 2022;3(2):100406.
43.
Liang S, Zhang C, Sarmiento J, Standley DM. Protein loop modeling with optimized backbone potential functions. J Chem Theory Comput. 2012;8(5):1820–1827.
44.
Bonet J, Segura J, Planas-Iglesias J, Oliva B, Fernandez-Fuentes N. Frag'r'Us: Knowledge-based sampling of protein backbone conformations for de novo structure-based protein design. Bioinformatics. 2014;30(13):1935–1936.
45.
López-Blanco JR, Canosa-Valls AJ, Li Y, Chacón P. RCD+: Fast loop modeling server. Nucleic Acids Res. 2016;44(W1):W395–W400.
46.
Chys P, Chacón P. Random coordinate descent with spinor-matrices and geometric filters for efficient loop closure. J Chem Theory Comput. 2013;9(3):1821–1829.
47.
Coutsias EA, Seok C, Jacobson MP, Dill KA. A kinematic view of loop closure. J Comput Chem. 2004;25(4):510–528.
48.
Gray JJ, Moughon S, Wang C, Schueler-Furman O, Kuhlman B, Rohl CA, Baker D. Protein–protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. J Mol Biol. 2003;331(1):281–299.
49.
Meiler J, Baker D. ROSETTALIGAND: Protein-small molecule docking with full side-chain flexibility. Proteins. 2006;65(3):538–548.
50.
Leach AR. Ligand docking to proteins with discrete side-chain flexibility. J Mol Biol. 1994;235(1):345–356.
51.
Shen C, Zhang X, Deng Y, Gao J, Wang D, Xu L, Pan P, Hou T, Kang Y. Boosting protein–ligand binding pose prediction and virtual screening based on residue–atom distance likelihood potential and graph transformer. J Med Chem. 2022;65(15):10691–10706.
52.
Morehead A, Chen C, Cheng J. Geometric transformers for protein interface contact prediction. arXiv. 2021. https://doi.org/10.48550/arXiv.2110.02423
53.
Dwivedi VP, Bresson X. A generalization of transformer networks to graphs. arXiv. 2020. https://doi.org/10.48550/arXiv.2012.09699
54.
Jing B, Eismann S, Suriana P, Townshend RJL, Dror R. Learning from protein structure with geometric vector perceptrons. arXiv. 2020. https://doi.org/10.48550/arXiv.2009.01411
55.
Kryshtafovych A, Schwede T, Topf M, Fidelis K, Moult J. Critical assessment of methods of protein structure prediction (CASP)—Round XIII. Proteins. 2019;87(12):1011–1020.
56.
Kryshtafovych A, Schwede T, Topf M, Fidelis K, Moult J. Critical assessment of methods of protein structure prediction (CASP)—Round XIV. Proteins. 2021;89(12):1607–1617.
57.
Mirdita M, Schütze K, Moriwaki Y, Heo L, Ovchinnikov S, Steinegger M. ColabFold: Making protein folding accessible to all. Nat Methods. 2022;19(6):679–682.
58.
Pache R, Sperber M, Combs S, Rosenberger G, Model a missing loop. https://new.rosettacommons.org/demos/latest/public/model_missing_loop/README
59.
Jeliazkov JR, Frick R, Zhou J, Gray JJ. Robustification of RosettaAntibody and Rosetta SnugDock. PLOS ONE. 2021;16(3): Article e0234282.
60.
Schritt D, Li S, Rozewicki J, Katoh K, Yamashita K, Volkmuth W, Cavet G, Standley DM. Repertoire builder: High-throughput structural modeling of B and T cell receptors. Mol Syst Des Eng. 2019;4(4):761–768.
61.
Dunbar J, Krawczyk K, Leem J, Marks C, Nowak J, Regep C, Georges G, Kelm S, Popovic B, Deane CM. SAbPred: A structure-based antibody prediction server. Nucleic Acids Res. 2016;44(W1):W474–W478.
62.
Jin WG, Barzilay R, Jaakkola T. Antibody-antigen docking and design via hierarchical structure refinement. Paper presented at: 39th International Conference on Machine Learning (ICML); 2022; Baltimore, MD. p. 10217–10227.
63.
Eastman P, Friedrichs MS, Chodera JD, Radmer RJ, Bruns CM, Ku JP, Beauchamp KA, Lane TJ, Wang LP, Shukla D, et al. OpenMM 4: A reusable, extensible, hardware independent library for high performance molecular simulation. J Chem Theory Comput. 2013;9(1):461–469.
64.
Lee GR, Heo L, Seok C. Effective protein model structure refinement by loop modeling and overall relaxation. Proteins. 2016;84:293–301.
65.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: Architecture and applications. BMC Bioinform. 2009;10:421.
66.
Raybould MIJ, Marks C, Krawczyk K, Taddese B, Nowak J, Lewis AP, Bujotzek A, Shi J, Deane CM. Five computational developability guidelines for therapeutic antibody profiling. Proc Natl Acad Sci USA. 2019;116(10):4025–4030.
67.
Dunbar J, Krawczyk K, Leem J, Baker T, Fuchs A, Georges G, Shi J, Deane CM. SAbDab: The structural antibody database. Nucleic Acids Res. 2013;42:D1140–D1146.
68.
Wang G, Dunbrack RL. PISCES: Recent improvements to a PDB sequence culling server. Nucleic Acids Res. 2005;33:W94–W98.
69.
Touw WG, Baakman C, Black J, te Beek TAH, Krieger E, Joosten RP, Vriend G. A series of PDB-related databanks for everyday needs. Nucleic Acids Res. 2015;43:D364–D368.
70.
Chothia C, Lesk AM, Tramontano A, Levitt M, Smith-Gill SJ, Air G, Sheriff S, Padlan EA, Davies D, Tulip WR, et al. Conformations of immunoglobulin hypervariable regions. Nature. 1989;342(6252):877–883.
71.
Wang TY, Wang LC, Zhang XJ, Shen C, Zhang O, Wang J, Wu J, Jin R, Zhou D, Chen S, et al. Comprehensive assessment of protein loop modeling programs on large-scale datasets: Prediction accuracy and efficiency. Brief Bioinform. 2024;25(1): Article bbad486.
Year 2024 volume 7 Issue 7
PDF
186
92
Cite this Article
BibTeX
Article Info
doi: 10.34133/research.0408
  • Receive Date:2024-04-25
  • Online Date:2025-08-07
  • Published:2024-07-25
Article Data
Affiliations
History
  • Received:2024-04-25
  • Accepted:2024-05-22
Funding
National Key Research and Development Program of China(2022YFF1203000)
Affiliations
    1Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
    2 Zhejiang Laboratory, Hangzhou 311100, Zhejiang, China.
    3Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, Guangdong, China.
    4Department of Pathology, New York University Medical Center, New York, NY 10016, USA.
    5College of Life Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.

Corresponding:

* Address correspondence to: (T.H.); (C.-Y.H.); (Y.K.)
References
Share
https://castjournals.cast.org.cn/joweb/research/EN/10.34133/research.0408
Share to
QR

Scan QR to access full text

Cite this article
BibTeX
Citations
表12种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏
  • BibTeX
  • EndNote
  • RefWorks
  • TxT