{"id":1087,"date":"2023-03-21T12:43:51","date_gmt":"2023-03-21T05:43:51","guid":{"rendered":"https:\/\/conf.icgbio.ru\/bgrs98\/?page_id=1087"},"modified":"2023-04-12T14:06:34","modified_gmt":"2023-04-12T07:06:34","slug":"104_likeness-a-system-searching-for-and-aligning-similar-protein-conformations","status":"publish","type":"page","link":"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/104_likeness-a-system-searching-for-and-aligning-similar-protein-conformations\/","title":{"rendered":"LIKENESS: A SYSTEM SEARCHING FOR AND ALIGNING SIMILAR PROTEIN CONFORMATIONS"},"content":{"rendered":"<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#ponomarenko_mp\">PONOMARENKO M.P.<\/a><sup>+<\/sup>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#kol\">KOLCHANOV N.A.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#shindyalov\">SHINDYALOV I.<\/a><sup>1<\/sup>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#bourne\">BOURNE P.<\/a><sup>1<\/sup><\/p>\n<p>Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 10 Lavrentiev Ave., Novosibirsk, 630090, Russia;<\/p>\n<p>1San Diego Supercomputer Center, San Diego, CA 92186-9784, USA<\/p>\n<p>+Corresponding author<\/p>\n<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/keywords-index\/\">Keywords<\/a>: proteins, protein conformation, similarity, structure alignment, stochastic geometry, fuzzy logic, utility theory, decision making<\/p>\n<p><b>Abstract<\/b><\/p>\n<p>A method for estimation of similarity in protein conformations based on stochastic geometry, fuzzy logic, and the utility theory for decision making is proposed. This method was used to create a system LIKENESS designed to search for and align similar protein conformations from PDB. The system LIKENESS is shown to solve these problems under real-time mode. The system is accessible at<i>\u00a0http:\/\/cl.sdsc.edu\/.<\/i><b><\/b><\/p>\n<p><b>Introduction<\/b><\/p>\n<p>The databank PDB is the source of information on 3D structures of proteins [1]. Search for and alignment of similar protein conformations are typical problems of its analysis [2]. The methods that have been so far proposed for this aim [2-12] are so time-consuming that specialized databanks, such as FSSP [12], DALI [8], CATH [13, 14], SSAP [15], and Entrez\/3D [16] are developed to store the results obtained. We propose a method for estimation of protein conformation similarity that is based on stochastic geometry [17], fuzzy logic [18], and the utility theory for decision making [19]. The method allows similar protein conformations to be searched for in PDB [1] and aligned under real-time mode.<b><\/b><\/p>\n<p><b>Materials and methods<\/b><\/p>\n<p>The databank MOOSE [20-22] was used in the work (Fig. 1; an object-oriented PDB [1] where the data are systematized according to the protein structural patterns). We supplemented MOOSE [20] with the structural properties that are calculated from atomic coordinates (Table 1): accessible surface and polarity [23], secondary structure [24], <img loading=\"lazy\" class=\"alignnone wp-image-1097 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_fi.gif\" alt=\"\" width=\"7\" height=\"9\" \/> &#8211; and <img loading=\"lazy\" class=\"alignnone wp-image-1108 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_ksi.gif\" alt=\"\" width=\"9\" height=\"9\" \/> -angles [25-28], the distances and angles (Fig. 2) between <img loading=\"lazy\" class=\"alignnone wp-image-1094 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_c_alpha.gif\" alt=\"\" width=\"17\" height=\"12\" \/><b>\u00a0<\/b>atom of a given residue and the neighboring <img loading=\"lazy\" class=\"alignnone wp-image-1094 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_c_alpha.gif\" alt=\"\" width=\"17\" height=\"12\" \/><b>\u00a0<\/b>atoms or between the centers of mass of various protein fragments (totally 495 properties). The values of these properties for each residue of each protein have been calculated and are stored in the database LIKENESS (<i>http:\/\/cl.sdsc.edu\/).<\/i><\/p>\n<p>The method proposed is designed to estimate the similarity of conformation of arbitrary proteins\u00a0<b>S<\/b>\u00a0and\u00a0<b>T.<\/b> If these proteins had an &#8220;<img loading=\"lazy\" class=\"alignnone wp-image-1091 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> \/<img loading=\"lazy\" class=\"alignnone wp-image-1091 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> &#8221; folding, the distances between the <img loading=\"lazy\" class=\"alignnone wp-image-1094 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_c_alpha.gif\" alt=\"\" width=\"17\" height=\"12\" \/><b>\u00a0<\/b>atoms of their (<b>i-2<\/b>)th and (<b>i+2<\/b>)th residues (distance\u00a0<b>AE<\/b> in Table 1) would be important for estimation of their similarity; for &#8220;<img loading=\"lazy\" class=\"alignnone wp-image-1093 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> \/<img loading=\"lazy\" class=\"alignnone wp-image-1093 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> &#8221; folding, the distances between the <img loading=\"lazy\" class=\"alignnone wp-image-1094 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_c_alpha.gif\" alt=\"\" width=\"17\" height=\"12\" \/><b>\u00a0<\/b>of their (<b>i-1<\/b>)th and (<b>i+1<\/b>)th residues (distance\u00a0<b>BD<\/b>\u00a0in Table 1). However, such information is lacking for arbitrary proteins\u00a0<b>S<\/b>\u00a0and\u00a0<b>T.<\/b>\u00a0On the whole, additional information is lacking for the procedures decreasing dimensionality of the methods for data analysis in similarity estimation of any arbitrary proteins. That is why our method is based on the following idea: when the proteins\u00a0<b>S<\/b>\u00a0and\u00a0<b>T<\/b>\u00a0are similar, the majority of their properties\u00a0<b>{f<sub>n<\/sub>}<\/b>\u00a0is approximately similar\u00a0<b>{f<sub>n<\/sub>(S) <img loading=\"lazy\" class=\"alignnone wp-image-1092 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_appr.gif\" alt=\"\" width=\"6\" height=\"4\" \/>\u00a0f<sub>n<\/sub>(T)}<\/b>. Since the accuracy of such estimation increases with the number of the conformational properties involved, we take into consideration as many properties as our computer (Alpha, Digital Equipment Corporation) allows.<\/p>\n<p>Let&#8217;s consider the proteins\u00a0<b>S<\/b>\u00a0and\u00a0<b>T<\/b>\u00a0and a set of\u00a0<b>N<\/b>\u00a0properties of their conformation\u00a0<b>{f<sub>n<\/sub>}<\/b>\u00a0(where 1&lt;=\u00a0<b>n&lt;= N&lt;=<\/b>495). The\u00a0<b>matrix of similarity\u00a0<\/b>of their conformations characterizes their residues\u00a0<b>s<sub>i<\/sub><\/b>\u00a0and<b>\u00a0t<sub>j<\/sub><\/b>\u00a0at positions\u00a0<b>i\u00a0<\/b>and\u00a0<b>j<\/b>\u00a0in terms of equality of the values\u00a0<b>{f<sub>n<\/sub>(s<sub>i<\/sub>) <img loading=\"lazy\" class=\"alignnone wp-image-1092 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_appr.gif\" alt=\"\" width=\"6\" height=\"4\" \/>\u00a0 f<sub>n<\/sub>(t<sub>j<\/sub>)}<\/b>\u00a0of\u00a0<b>{f<sub>n<\/sub>}<\/b>\u00a0properties. Then the estimation of similarity of the residue codes is as follows:<\/p>\n<p align=\"RIGHT\"><img loading=\"lazy\" class=\"size-full wp-image-1098 alignleft\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image211.gif\" alt=\"\" width=\"331\" height=\"65\" \/>(1)<\/p>\n<p>&nbsp;<\/p>\n<p>where PAM (x;y) is the similarity of the residues x and y for the alignment of protein sequences [25].<\/p>\n<p>Similarity of the secondary structures f<sub>n<\/sub>(s<sub>i<\/sub>) and f<sub>n<\/sub>(t<sub>i<\/sub>) in the Kabash-Sander alphabet {h, g, t, i, b, e, s, <img loading=\"lazy\" class=\"alignnone size-full wp-image-1109\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_zero.gif\" alt=\"\" width=\"11\" height=\"12\" \/>\u00a0} [24] is:<\/p>\n<p align=\"RIGHT\"><img loading=\"lazy\" class=\"size-full wp-image-1099 alignleft\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image212.gif\" alt=\"\" width=\"286\" height=\"65\" \/>(2)<\/p>\n<p>&nbsp;<\/p>\n<p>Similarity of the conformations in their quantitative properties f<sub>n<\/sub>(s<sub>i<\/sub>) and f<sub>n<\/sub>(t<sub>i<\/sub>) {f<sub>n<\/sub>(s<sub>i<\/sub>)<b> <img loading=\"lazy\" class=\"alignnone size-full wp-image-1092\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_appr.gif\" alt=\"\" width=\"6\" height=\"4\" \/><\/b>\u00a0f<sub>n<\/sub>(t<sub>j<\/sub>)} with the range of values Range(f<sub>n<\/sub>) is estimated in a &#8220;null&#8221; approximation, when any f<sub>n<\/sub>\u00a0values are considered equally probable. In this case, the hypothesis {H<sub>0<\/sub>: f<sub>n<\/sub>(s<sub>i<\/sub>)=f<sub>n<\/sub>(t<sub>j<\/sub>)} is tested using the following equation [29]:<\/p>\n<p align=\"RIGHT\"><img loading=\"lazy\" class=\"size-full wp-image-1100 alignleft\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image213.gif\" alt=\"\" width=\"298\" height=\"43\" \/>. (3)<\/p>\n<p>&nbsp;<\/p>\n<p>This significance level <img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/>\u00a0<sub>n<\/sub>\u00a0is transformed into similarity estimation by the following equation:<\/p>\n<p align=\"RIGHT\"><img loading=\"lazy\" class=\"size-full wp-image-1101 alignleft\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image214.gif\" alt=\"\" width=\"430\" height=\"65\" \/>(4)<\/p>\n<p>&nbsp;<\/p>\n<p>The particular values of similarity estimations (equations 1-4) are used to calculate the integral estimation:<\/p>\n<p align=\"RIGHT\"><img loading=\"lazy\" class=\"size-full wp-image-1102 alignleft\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image215.gif\" alt=\"\" width=\"173\" height=\"50\" \/>. (5)<\/p>\n<p>&nbsp;<\/p>\n<p>Application of equations (1-5) to each pair of residues s<sub>i<\/sub>\u00a0and t<sub>j<\/sub>\u00a0of the proteins S and T gives the matrix {U(s<sub>i<\/sub>;t<sub>j<\/sub>)} (Fig. 3), which has the two following interpretations:<\/p>\n<p>(#)\u00a0<u>IF<\/u>\u00a0{U(s<sub>i<\/sub>;t<sub>j<\/sub>)&lt;0},\u00a0<u>THEN<\/u>\u00a0{the residues s<sub>i<\/sub>\u00a0and t<sub>j<\/sub>\u00a0dissimilar};<\/p>\n<p>($)\u00a0<u>IF<\/u>\u00a0{U(s<sub>i<\/sub>;t<sub>j<\/sub>)&gt;U(s<sub>i<\/sub>;t<sub>k<\/sub>)&gt;0},\u00a0<u>THEN<\/u>\u00a0{the conformation of the residue s<sub>i<\/sub>\u00a0is more similar to the conformation of t<sub>j<\/sub>, than of t<sub>k<\/sub>.}.<\/p>\n<p>This interpretation coincides with the maximization of Needleman-Wunsch similarity [30] for sequence aligning. Thus, LIKENESS aligns the sequences of Ca atoms and then matches it with the minimum of root-mean-square deviation RMSD [31]. For the same reason, the similarity of the fragments {s<sub>i<\/sub>,&#8230;,s<sub>i+<img loading=\"lazy\" class=\"alignnone size-full wp-image-1096\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_delta.gif\" alt=\"\" width=\"5\" height=\"6\" \/>\u00a0-1<\/sub>} and {t<sub>j<\/sub>,&#8230;,t<sub>j+<img loading=\"lazy\" class=\"alignnone size-full wp-image-1096\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_delta.gif\" alt=\"\" width=\"5\" height=\"6\" \/>\u00a0-1<\/sub>} of length D of the proteins S and T is estimated by the equation:<\/p>\n<p align=\"RIGHT\"><img loading=\"lazy\" class=\"size-full wp-image-1103 alignleft\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image216.gif\" alt=\"\" width=\"201\" height=\"50\" \/>. (6)<\/p>\n<p>&nbsp;<\/p>\n<p>While searching PDB for the proteins similar to the fragment {s<sub>i<\/sub>,&#8230;,s<sub>i+<img loading=\"lazy\" class=\"alignnone size-full wp-image-1096\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_delta.gif\" alt=\"\" width=\"5\" height=\"6\" \/>\u00a0-1<\/sub>} of the protein S, LIKENESS excludes the proteins lacking such similarity (interpretation #) and reveals one most similar fragment in each of the rest proteins (interpretation &amp;).<\/p>\n<p>LIKENESS is realized in C++, installed at the San-Diego Supercomputer Center (USA), and is available at\u00a0<i>&#8220;http:\/\/cl.sdsc.edu\/&#8221;<\/i>. Note that we adopted the description of geometrical objects with the help of hypothesis on &#8220;equality of their values of the same type&#8221; from stochastic geometry [17]; the qualitative similarity estimation, from Zadeh&#8217;s fuzzy logic [18]; and averaging of particular estimations into the integral estimation, from the utility theory for decision making [19].<b><\/b><\/p>\n<p><b>Results and discussion<\/b><\/p>\n<p>Let&#8217;s use the example of <img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> &#8211; and <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -hemoglobins [33] to describe the operation of LIKENESS. Dark cells in the similarity matrix of <img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/>&#8211; and <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -hemoglobins (Fig. 3a) indicate similarity; light cells, dissimilarity; the optimal route of alignment lies along the main diagonals of the matrix with a transition between them, indicated by arrow. This transition means a deletion in <img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> -hemoglobin compared with <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -hemoglobin. The <img loading=\"lazy\" class=\"alignnone size-full wp-image-1094\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_c_alpha.gif\" alt=\"\" width=\"17\" height=\"12\" \/> atoms of <img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> &#8211; and <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/>\u00a0-hemoglobins are matched with\u00a0<b>rmsd=2.75\u00c2\u00a0<\/b>and visualized with the program RASMOL [32] (Fig. 3b). Note that LIKENESS detected the helix D&#8217; (framed), which is the major distinction between these hemoglobins.<\/p>\n<p>LIKENESS was used to search for the proteins similar to the Greek key between positions 28 and 73 of prealbumin: four <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -turns and three\u00a0 <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/>-turns [34, 35]. Greek keys 25 to 124 residues long occur in <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> \/<img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -domains and <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -barrels [35, 36]. We succeeded in finding in PDB [1] 41 fragments: the initial Greek key of prealbumin and its homologs, the Greek key of concanavalin A and its homologs, the Greek key of lectin and its homologs, and a fragment of soluble fatty acid-binding protein, lacking the recorded occurrence of Greek keys [35-38]. Shown in Fig. 4 are (a) alignment, performed by LIKENESS, of the this discovered fragment (light) with the initial Greek key of prealbumin (dark) and (b) their similarity matrix, where the dark cells indicate the similarity of <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -strands. Thus, LIKENESS has successfully found the similarity of the known Greek keys of prealbumin, concanavalin A, and lectin and a fragment of soluble fatty acid-binding protein with a similar arrangement of <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -strands and <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/>\u00a0-turns.<\/p>\n<p>Since LIKENESS is able to find successfully the similarity of arbitrary proteins, the possibility exists to modify this system for <img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/>\/<img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> -, <img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> \/<img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -, <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> \/<img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -domains, <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/>\u00a0-barrels, and Greek keys: involvement of the available data on these local protein conformations will increase the accuracy and speed of LIKENESS operation.<b><\/b><\/p>\n<p><b>Acknowledgements<\/b><\/p>\n<p>The work was supported by the Russian Foundation for Basic Research (grants Nos. 97-04-49740, 97-07-90309, and 98-07-90126), Russian National Program on Human Genome, by grants Nos. IGSORAN 97\/13, BIR-9507625, and ASC 8902825, and through donation of computers Digital Equipment Corp.<b><\/b><\/p>\n<p><strong>References<\/strong><\/p>\n<ol>\n<li>Bernstein F.C. et al. \/\/ J. Mol. Biol., 1977, V. 112, P. 535.<\/li>\n<li>Holm L., Sander C. \/\/ Proteins, 1994, V. 19, P. 165.<\/li>\n<li>Taylor W.R., Orengo C.A. \/\/ J. Mol. Biol., 1989, V.208, P.1.<\/li>\n<li>Seto Y., Ikeuchi Y., Kanehisa M. \/\/ Proteins, 1990, V. 8, P. 341.<\/li>\n<li>Nussinov R., Wolfson H. \/\/ Proc. Natl. Acad. Sci. USA, 1991, V. 88, P.10495.<\/li>\n<li>Godzik A., Skolnik J., Kolinski A. \/\/ Protein Engineering, 1993, V. 6, P. 801.<\/li>\n<li>Zang K., Eisengberg D. \/\/ Protein Sci., 1994, V. 3, P. 687.<\/li>\n<li>Holm L., Sander C. \/\/ Trends Biochem. Sci., 1995, V. 20, P. 478.<\/li>\n<li>Mizuguchi K., Go N. \/\/ Protein Engineering, 1995, V. 8, P. 353.<\/li>\n<li>Hoffman D.L., Laiter S., Singh R.K. et al. \/\/ CABIOS, 1995, V. 11, P. 675.<\/li>\n<li>Madej T., Gibrat J.-F., Bryan S.H. \/\/ Proteins, 1995, V. 23, P. 356.<\/li>\n<li>Holm L., Sander C. \/\/ Nucl. Acids Res., 1996, V. 24, P. 206.<\/li>\n<li>Orengo C., Taylor W. \/\/ J. Mol. Biol., 1993, V. 233, P. 488.<\/li>\n<li>Orengo C., Flores T., Taylor W., Thornton J. \/\/ Protein Engineering, 1993, V. 6, P. 485.<\/li>\n<li>Taylor W., Flores T., Orengo C. \/\/ Protein Sci., 1994, V. 3, P. 1858.<\/li>\n<li>Hogue C.W.V., Ohkawa H., Bryan S.H. \/\/ Trends Biochem. Sci., 1996, V. 21, P. 226.<\/li>\n<li>N.G. Fedotov, &#8220;Methods of stochastic geometry in image recognition&#8221; (Moscow, Radio i Svyaz&#8217;, 1990).<\/li>\n<li>Zadeh, L.A. \/\/ Information and Control, 1965, V. 8, P. 338.<\/li>\n<li>Fishburn P.C. Utility theory for decision making, NY: John Wiley &amp; Sons, 1970.<\/li>\n<li>Shindyalov I.N. and Bourne P.E. \/\/ CABIOS, 1997, In Press.<\/li>\n<li>Shindyalov I.N., Bourne P.E. \/\/ J. App. Cryst. 1995, V. 28, P. 847.<\/li>\n<li>Bourne P.E., Shindyalov I.N. \/\/ Acta Cryst., 1996, Sup., P. 78.<\/li>\n<li>Lee B., Richards F.M. \/\/ J. Mol. Biol, 1971, V. 55, P. 379.<\/li>\n<li>Kabsh W., Sander C. \/\/ Biopolymers, 1983, V. 22, P. 2577.<\/li>\n<li>Dayhoff M.O. et al. \/\/ Atlas of protein sequence and structure, 1979, V. 5, Suppl., P. 345.<\/li>\n<li>Bogardt R.A. et al. \/\/ J. Mol. Evol., 1980, V. 15, P. 197.<\/li>\n<li>G. Schulz and R. Schirmer, &#8220;Principles of protein structural organization&#8221; (Moscow, Mir, 1982).<\/li>\n<li>Chou P.Y., Fasman G.D. \/\/ Biochemistry, 1974, V. 13, P. 211.<\/li>\n<li>E. Lemna, &#8220;Testing of statistical hypotheses&#8221; (Moscow, Nauka, 1979).<\/li>\n<li>Needleman S.B., Wunsch C.D. \/\/ J. Mol. Biol., 1970, V. 48, P. 443.<\/li>\n<li>Hendrickson W.A. \/\/ Acta Cryst., 1979, Ser. A, V. 35, P. 158.<\/li>\n<li>Sayle R.A., and Milner-White E. J. \/\/ Trends in Biochem. Sci., 1995, V. 20, P. 374.<\/li>\n<li>Fermi G., Perutz M.F., Shaanan B., Fourme R. \/\/ J. Mol. Biol., 1984, V. 175, P. 159.<\/li>\n<li>Blake C., Geisow M., Oatley S., Rerat B., Rerat C. \/\/ J. Mol. Biol., 1978, V. 121, P. 339.<\/li>\n<li>Hutchinson E.G., Thornton J.M.\/\/ Protein Engineering, 1993, V. 6, P. 233.<\/li>\n<li>Orengo C.A., Flores T.P., Taylor W.R., Thornton J.M. \/\/ Protein Engineering, 1993, V. 6, P. 485.<\/li>\n<li>Sacchettini J.C., Gordon J.I., Banszak L.J. \/\/ J. Mol. Biol., 1989, V. 208, P. 327.<\/li>\n<li>Scapin G., Gordon J.I., Sacchettini J.C. \/\/ J. Biol. Chem., 1992, V. 267, P. 4253.<\/li>\n<\/ol>\n<p>Table 1<b>.\u00a0<\/b>Examples of protein conformation properties used in the work<\/p>\n<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_104.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-1089 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_104.gif\" alt=\"\" width=\"619\" height=\"353\" \/><\/a><\/p>\n<p align=\"left\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_104a.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-1090 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_104a.gif\" alt=\"\" width=\"633\" height=\"287\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<table border=\"0\" width=\"687\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"TOP\" width=\"449\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image217.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-1104 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image217.gif\" alt=\"\" width=\"447\" height=\"276\" \/><\/a><\/td>\n<td valign=\"TOP\" width=\"238\">Figure 2. Model of the conformation of the residue at the position\u00a0<b>i\u00a0<\/b>of a protein:\u00a0<b>A, B, C, D,\u00a0<\/b>and\u00a0<b>E<\/b>\u00a0are the \u2014a atoms at positions from i-2 to i+2;\u00a0<b>M,\u00a0<\/b>center of their mass;\u00a0<b>G,\u00a0<\/b>center of the mass of the \u2014a atoms at a distance of not more than 25\u00c2 from this pentapeptide (broken circle);\u00a0<b>K, I,\u00a0<\/b>and\u00a0<b>J,\u00a0<\/b>the centers of mass of the tripeptides containing the\u00a0<b>i<\/b>th residue;\u00a0<b>L<\/b>\u00a0and\u00a0<b>R,\u00a0<\/b>centers of mass of the decapeptides from i-12 to i-3 and from i+3 to i+12 (dark circles); and\u00a0<b>P,<\/b> the center of mass of the protein (ellipse). Broken lines connect the centers of mass with the corresponding <a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_calpha.gif\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-1095\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_calpha.gif\" alt=\"\" width=\"17\" height=\"12\" \/><\/a>\u00a0atoms. Arrows indicate the run of the protein chain<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<table border=\"0\" width=\"687\" cellspacing=\"0\" cellpadding=\"7\">\n<tbody>\n<tr>\n<td valign=\"TOP\" width=\"333\"><b>a)<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image218.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-1105 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image218.gif\" alt=\"\" width=\"282\" height=\"294\" \/><\/a><\/b><\/p>\n<p>Figure 3. (a) Conformation similarity matrix of <img loading=\"lazy\" class=\"alignnone wp-image-1091 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> &#8211; and <img loading=\"lazy\" class=\"alignnone wp-image-1093 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/>\u00a0-chains of hemoglobin. Dark cells indicate similarity\u00a0<b>(U&gt;0)<\/b>; light, dissimilarity\u00a0<b>(U&lt;0)<\/b>; arrow marks the deletion in the <img loading=\"lazy\" class=\"alignnone wp-image-1091 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> -chain. (b) Visualization of the alignment of conformations of <img loading=\"lazy\" class=\"alignnone wp-image-1091 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> -chain (dark) and <img loading=\"lazy\" class=\"alignnone wp-image-1093 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -chain (light) of hemoglobin using the program RASMOL [32]. The rectangle contains the <img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/> -helix D\u00ed in <img loading=\"lazy\" class=\"alignnone size-full wp-image-1093\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_beta.gif\" alt=\"\" width=\"7\" height=\"14\" \/> -chain, deletion of which in <img loading=\"lazy\" class=\"alignnone size-full wp-image-1091\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_aplha.gif\" alt=\"\" width=\"10\" height=\"9\" \/>\u00a0-chain is the major distinction of their conformations.<\/td>\n<td valign=\"TOP\" width=\"326\"><b>b)<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image219.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-1106 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image219.gif\" alt=\"\" width=\"293\" height=\"291\" \/><\/a><\/b><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image221.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-1107 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis104_Image221.gif\" alt=\"\" width=\"613\" height=\"221\" \/><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>PONOMARENKO M.P.+,\u00a0KOLCHANOV N.A.,\u00a0SHINDYALOV I.1,\u00a0BOURNE P.1 Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 10 Lavrentiev Ave., Novosibirsk, 630090, Russia; 1San Diego Supercomputer Center, San Diego, CA 92186-9784, USA +Corresponding author Keywords: proteins, protein conformation, similarity, &hellip; <a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/104_likeness-a-system-searching-for-and-aligning-similar-protein-conformations\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":13,"featured_media":0,"parent":97,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/1087"}],"collection":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/comments?post=1087"}],"version-history":[{"count":4,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/1087\/revisions"}],"predecessor-version":[{"id":1444,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/1087\/revisions\/1444"}],"up":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/97"}],"wp:attachment":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/media?parent=1087"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}