{"id":672,"date":"2023-03-15T13:16:54","date_gmt":"2023-03-15T06:16:54","guid":{"rendered":"https:\/\/conf.icgbio.ru\/bgrs98\/?page_id=672"},"modified":"2023-04-11T14:32:20","modified_gmt":"2023-04-11T07:32:20","slug":"052_revealing-the-conformational-and-physico-chemical-dna-properties-applicable-for-predicting-the-activity-of-dna-functional-sites","status":"publish","type":"page","link":"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/052_revealing-the-conformational-and-physico-chemical-dna-properties-applicable-for-predicting-the-activity-of-dna-functional-sites\/","title":{"rendered":"REVEALING THE CONFORMATIONAL AND PHYSICO-CHEMICAL DNA PROPERTIES APPLICABLE FOR PREDICTING THE ACTIVITY OF DNA FUNCTIONAL SITES"},"content":{"rendered":"<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#ponomarenko_mp\">PONOMARENKO M.P.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#kol\">KOLCHANOV N.A.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#ponomarenko_jv\">PONOMARENKO J.V.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#frolov\">FROLOV A.S.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#podkolodnaya\">PODKOLODNAYA O.A.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#vorobiev\">VOROBIEV D.G.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#podkolodny\">PODKOLODNY N.L.<\/a><sup>1<\/sup>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#overton\">OVERTON G.C.<\/a><sup>2<\/sup><\/p>\n<p>Laboratory of Theoretical Genetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 10 Lavrentiev Ave., Novosibirsk, 630090, Russia;<\/p>\n<p>1Institute of Computational Mathematics and Mathematical Geophysics, Novosibirsk, Russia;<\/p>\n<p>2Center for Bioinformatics, University of Pennsylvania, Philadelphia, USA<\/p>\n<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/keywords-index\/\">Keywords<\/a>: prediction, confirmation and physico-chemical DNA properties, activity, DNA functional sites, utility theory for decision making, fuzzy logic<\/p>\n<p>&nbsp;<\/p>\n<p>We suggest an approach to predict the activity of DNA functional sites via &#8220;a probable molecular mechanism of the site functioning&#8221;. Mean values of certain DNA physico-chemical or conformational properties for a given DNA region containing the site are used. The database ACTIVITY of the functional DNA site activities is the initial data source for this approach implementation. From these data, the mean values applicable to predict the site activities were identified and their programs compiled into the knowledge base (C-code) and library (executable),\u00a0<a href=\"http:\/\/wwwmgs.bionet.nsc.ru\/systems\/Activity\/\" target=\"_blank\" rel=\"noopener\">http:\/\/wwwmgs.bionet.nsc.ru\/systems\/Activity\/<\/a>.<\/p>\n<p><b>Introduction<\/b><\/p>\n<p>Mulligan (1984) was the first to predict the DNA site activity through homology score; Stormo (1986), through weight matrixes; Berg and von Hippel (1988), through statistical mechanics generalizing the above descriptive approaches; and Jonsson (1993), through neural networks optimized heuristically. These approaches are still dominant (Kraus, 1996; Fields, 1997). Earlier, we performed such predictions using oligonucleotide content (Ponomarenko, 1997a,b) and calculated theoretically the TATA-box consensus from the DNA\/TBP-affinity: it 13 from 15 possible matches with the consensus of the actual TATA box (Bucher, 1990). Nevertheless, a probable molecular mechanism of the site functioning remains obscure in terms of either homology score (Mulligan; 1984), weighted matrix (Stormo, 1986), neural network (Jonsson, 1993) or oligonucleotide content (Ponomarenko, 1997a,b). That is why, we have studied the applicability of the conformational and physico-chemical DNA properties to predicting the site activities from their sequences.<\/p>\n<p><b>System and methods<\/b><\/p>\n<p>We are suggesting a linear regression for predicting the activity values of a given DNA functional site through the mean values of the physico-chemical and conformational DNA properties for a given DNA region containing this site (&#8220;DNA properties&#8221;). We also suggest to generate and test as many DNA properties as the computer can afford, as it was introduced by Hajek and Havranek (1978). Earlier, this \u011egenerating and testing\u011f has been successfully applied to sequence analysis (Ponomarenko, 1997a).<\/p>\n<p>Within the framework of the linear regression, the activity of the site with sequence\u00a0<b>S<\/b>\u00a0is predicted as follows:<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"90%\">\n<p align=\"center\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-674\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image1.gif\" alt=\"\" width=\"189\" height=\"34\" \/><\/p>\n<\/td>\n<td width=\"10%\"><b>(1)<\/b><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>where,\u00a0<b>F<sub>0<\/sub><\/b>\u00a0is a \u011ebasal activity\u011f;\u00a0<b>X<sub>k,a,b<\/sub>(S),<\/b>\u00a0the mean value of the\u00a0<b>q<\/b>th property\u00a0<b>R<sub>q<\/sub><\/b>\u00a0for the region from\u00a0<b>a<\/b>\u00a0to\u00a0<b>b<\/b>:<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"90%\">\n<p align=\"center\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-675\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image2.gif\" alt=\"\" width=\"173\" height=\"50\" \/><\/p>\n<\/td>\n<td width=\"10%\"><b>(2)<\/b><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Table 1 lists the 38 conformational and physico-chemical properties\u00a0<b>R<sub>q<\/sub><\/b>\u00a0utilized in this work. Each of them is examined exhausting all the possible regions (<b>a, b<\/b>) within the site; therefore, the total number of the DNA properties\u00a0<b>X<sub>q,a,b<\/sub><\/b>\u00a0generated and tested is about 10<sup>5<\/sup>. Essentially, when such a huge number of tests is carried out, the problem to exclude any insignificant choice by chance becomes crucial. We are crossing this problem within the framework of Utility Theory for Decision Making (Fishburn, 1970) and fuzzy logic (Zadeh, 1965). For fixed &#8220;<b>q,a,b<\/b>&#8220;, the\u00a0<b>X<sub>q,a,b<\/sub>(S<sub>n<\/sub>)<\/b>\u00a0value for each sequence\u00a0<b>S<sub>n<\/sub><\/b>\u00a0with the known activity\u00a0<b>F<sub>n<\/sub><\/b>\u00a0is calculated (2)<sub>.\u00a0<\/sub>When the pairs\u00a0<b>{X<sub>q,a,b<\/sub>(S<sub>n<\/sub>), F<sub>n<\/sub>}<\/b>\u00a0meet all the necessary requirements of the linear regression (1) applicability, the activity\u00a0<b>F<\/b>\u00a0is predictable through the property\u00a0<b>X<sub>q,a,b<\/sub>.<\/b>\u00a0To test these conditions of linear regression applicability, a simple regression is optimized for the pair\u00a0<b>{X<sub>q,a,b<\/sub>(S<sub>n<\/sub>), F<sub>n<\/sub>}<\/b>:<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"90%\">\n<p align=\"center\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-677\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image4.gif\" alt=\"\" width=\"204\" height=\"21\" \/><\/p>\n<\/td>\n<td width=\"10%\"><b>(3)<\/b><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>To ensure the reliability of the regression between\u00a0<b>X<sub>q,a,b<\/sub>(S<sub>n<\/sub>)<\/b>\u00a0and\u00a0<b>F<sub>n<\/sub><\/b>, 22 requirements are tested: the presence of linear, sign, and rank correlations between the predicted\u00a0<b>F<sub>q,a,b<\/sub>(S<sub>n<\/sub>)<\/b>\u00a0and the experimental\u00a0<b>F<sub>n<\/sub><\/b>\u00a0activities; the uniform distributions; the Gaussian for their deviation\u00a0<b>(F<sub>q,a,b<\/sub>(S<sub>n<\/sub>)-F<sub>n<\/sub>),<\/b>\u00a0etc. When the\u00a0<b>r<\/b>th requirement is met with significance\u00a0<b>p<sub>r<\/sub><\/b>, it is fixed on a uniform scale named \u011epartial utility of the\u00a0<b>X<sub>q,a,b<\/sub><\/b>\u00a0to predict the activity\u00a0<b>F<\/b>\u011f which is highest,\u00a0<b>u<sub>r<\/sub>=1<\/b>, when the\u00a0<b>r<\/b>th requirement is met at\u00a0<b>p<sub>r<\/sub>\u00a0&lt;0.01;<\/b>\u00a0the lowest,\u00a0<b>u<sub>r<\/sub>=<\/b>\u00a0<b>-1,<\/b>\u00a0when\u00a0<b>p<sub>r<\/sub>&gt;0.1<\/b>; negative\u00a0<b>u<sub>r<\/sub>&lt;0<\/b>, when\u00a0<b>p<sub>r<\/sub>&gt;0.05;<\/b>\u00a0positive\u00a0<b>u<sub>r<\/sub>&gt;0,<\/b>\u00a0when\u00a0<b>p<sub>r<\/sub>&lt;0.05<\/b>; and\u00a0<b>u<sub>r<\/sub>=0,<\/b>\u00a0when\u00a0<b>p<sub>r<\/sub>=0.05<\/b>, as it is recommended by fuzzy logic (Zadeh, 1965):<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"90%\">\n<p align=\"center\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-678\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image5.gif\" alt=\"\" width=\"425\" height=\"65\" \/><\/p>\n<\/td>\n<td width=\"10%\"><b>(4)<\/b><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Applying the Utility Theory for Decision Making (Fishburn, 1970), the partial utilities are averaged to produce the general utility:<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"90%\">\n<p align=\"center\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-679\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image6.gif\" alt=\"\" width=\"209\" height=\"50\" \/><\/p>\n<\/td>\n<td width=\"10%\"><b>(5)<\/b><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The linear-independent properties\u00a0<b>X<sub>q,a,b<\/sub><\/b>\u00a0with the highest positive utilities are selected,\u00a0<b>U(X<sub>q,a,b<\/sub>,F)&gt;0.<\/b>\u00a0The probability to select by chance a property\u00a0<b>X<\/b>\u00a0with a positive utility\u00a0<b>U(X, F)&gt;0<\/b>\u00a0from 10<sup>5<\/sup>\u00a0properties was approximately estimated by the binomial criterion:<\/p>\n<p align=\"CENTER\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-680\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image7.gif\" alt=\"\" width=\"345\" height=\"42\" \/>.<\/p>\n<p>Thus, the property\u00a0<b>X<sub>q,a,b<\/sub><\/b>\u00a0selected meets significantly the linear regression requirements. That is why the simplest combinatorial algorithm is applicable: all the 10<sup>5<\/sup>\u00a0possible properties\u00a0<b>X<sub>q,a,b<\/sub>(S<sub>n<\/sub>)<\/b>\u00a0for all the available site sequences\u00a0<b>S<sub>n<\/sub><\/b>\u00a0with the known activities\u00a0<b>F<sub>n<\/sub><\/b>\u00a0are calculated (2), and all the 10<sup>5<\/sup>\u00a0utilities\u00a0<b>U(X, F)<\/b>\u00a0are calculated (3-5). If all\u00a0<b>U(X<sub>q,a,b<\/sub>, P)&lt;0,<\/b>\u00a0no properties are selected; otherwise, all the possible linear-independent properties\u00a0<b>{X<sub>k<\/sub>}<\/b>\u00a0with highest utilities\u00a0<b>{U(X<sub>k<\/sub>, F)&gt;0}<\/b>\u00a0are selected; the linear regression (1) is optimized; its program is generated and stored with the SRS-based (Etzold, 1993) knowledge base.<\/p>\n<p><b>Results and discussion<\/b><\/p>\n<p>Table 1 shows the transcription activity of the mouse <img loading=\"lazy\" class=\"alignnone size-full wp-image-673\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_alpha.gif\" alt=\"\" width=\"9\" height=\"16\" \/>A-crystalline gene promoter with the PE1B\/TATA region (Sax, 1995) as an example of the initial data for the analysis described above; Table 2, the physico-chemical property &#8220;Probability to be contacting with nucleosome core&#8221; (Hogan, 1987). The output for these initial data are demonstrated in Fig. 1. The C-program predicting transcription activity of the mouse <img loading=\"lazy\" class=\"alignnone size-full wp-image-673\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_alpha.gif\" alt=\"\" width=\"9\" height=\"16\" \/>A-crystalline gene promoter from the DNA sequence of PE1B\/TATA box region (Fig. 2a) using the property \u011eProbability to be contacting with nucleosome core\u011f,\u00a0<b>Pnucl<\/b>\u00a0(Fig. 2b), the major groove distance, dist (Fig. 2c), and the Tilt angle (Fig. 2d):<\/p>\n<p align=\"center\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-681\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image8.gif\" alt=\"\" width=\"268\" height=\"16\" \/><\/p>\n<p>The negative value, -0.1, multiplying the \u011eProbability to be contacting with nucleosome core\u011f means that the tighter is the interaction of the promoter with nucleosomes, the lower is the transcription activity. That is consistent with the experimental data that the nucleosome displacement from a promoter precedes the TBP\/DNA binding (Godde, 1995; Edmondson, 1996). The linear correlation coefficient\u00a0<b>r=0.90<\/b>\u00a0shows the significant agreement between the experimental and the predicted transcription activities (Fig.2a).<\/p>\n<p>&nbsp;<\/p>\n<p>Several dozens of DNA functional sites analyzed by the Activity system are shown in Table 4 and Fig. 3 to demonstrate the universality of the linear regression (1). The DNA bending in the TBP\/TATA complex (Starr, 1995) was demonstrated to correlate with the Inclination (Fig. 3a). That agrees with the X-ray structure of the complex (Juo, 1996), where the DNA bend results from intercalation of four phenylalanines of the TBP into the DNA minor groove, which widens with the Inclination (Dickerson, 1989). Fig. 3b illustrates that USF\/DNA affinity (Bendall, 1994) is correlates negatively with the Twist angle (Table 3: r=-0.896, p&lt;10<sup>-5<\/sup>). This is consistent with the negative correlation (r=-0.766, p&lt;10<sup>-3<\/sup>) between the activity of another transcription factor (YY1) binding site (Fig. 3d). These two results are agreeing with one another and with our earlier data (Ponomarenko, 1997c) that the lowest twist is significant for the promoters. As for exotic activity, the 2-aminopurine-induced mutability (Coullondre, 1978) correlates with the DNA melting temperature in the vicinity of the mutation point (r=0.90, <img loading=\"lazy\" class=\"alignnone size-full wp-image-673\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_alpha.gif\" alt=\"\" width=\"9\" height=\"16\" \/>\u00a0&lt;10<sup>-5<\/sup>). This agrees with the commonly accepted fact (Mhaskar, 1984) that the mutability of this kind results from repair errors, the frequency of depends on the DNA melting temperature. Very close estimates (r=0.865 and r=0.860, respectively) were obtained earlier using weight matrices (Stormo, 1986) and oligonucleotide content (Ponomarenko, 1997a). These contextual correlations have not unambiguously indicated the repair errors dependent on DNA melting temperature as a possible molecular mechanism of DNA mutability.<\/p>\n<p>All the results obtained demonstrate that the linear regression (1) can be informative in molecular biological studies. Further development of this approach will be focused on involving more and more novel conformational and physico-chemical DNA properties into the analysis of the activity of DNA functional sites and to supplement the linear regression with more complex and sensitive nonlinear approaches.<\/p>\n<p>We are grateful to Ms. Galina Chirikova for help in translation. This work was supported by NIH (grant 2-R01-RR04026-08A2), Russian National Human Genome Project, Russian Ministry of Science and Technical Politics, Siberian Branch of Russian Academy of Sciences (grants IGSBRAS-97N13), and Russian Foundation for Basic Research (grants 96-04-50006, 97-07-90309, 97-04-49740, and 98-07-90126).<\/p>\n<p><b>References<\/b><\/p>\n<ol>\n<li>A.J. Bendall and Molloy, P.L.,\u00a0<i>Nucleic Acids Res.<\/i>,\u00a0<b>22<\/b>, 2801 (1994).<\/li>\n<li>O.G. Berg and P.H. von Hippel,\u00a0<i>J. Mol. Biol.<\/i>,\u00a0<b>200<\/b>, 709 (1988).<\/li>\n<li>C. Coulondre, et al.,\u00a0<i>Nature.<\/i>,\u00a0<b>274<\/b>, 775 (1978).<\/li>\n<li>R.E. Dickerson, et al.,\u00a0<i>EMBO J.<\/i>,\u00a0<b>8<\/b>, 1 (1989).<\/li>\n<li>D.G. Edmondson and S.Y. Roth,\u00a0<i>FASEB J.<\/i>,\u00a0<b>10<\/b>, 1173 (1996).<\/li>\n<li>T. Etzold and P. Argos,\u00a0<i>Comput. Appl. Biosci.<\/i>,\u00a0<b>9,<\/b>\u00a049 (1993).<\/li>\n<li>D.S. Fields, Y. He, A.Y. Al-Uzri, and G.D. Stormo,\u00a0<i>J. Mol. Biol.<\/i>,\u00a0<b>271<\/b>, 178 (1997).<\/li>\n<li>P.C. Fishburn, Utility Theory for Decision Making, NY: Jonh Wiley &amp; Sons, (1970).<\/li>\n<li>J.S. Godde, Y. Nakatani, and A.P. Wolffe,\u00a0<i>Nucleic Acids Res.,<\/i>\u00a0<b>23<\/b>, 4557 (1995).<\/li>\n<li>P. Hajek, and T. Havranek, Mechanizing hypothesis formation. Heidelberg, Springer Verlag, (1978).<\/li>\n<li>M.E. Hogan and R.H. Austin,\u00a0<i>Nature<\/i>,\u00a0<b>329<\/b>, 263 (1987).<\/li>\n<li>R. Hyde-DeRuyscher, E. Jennings, and T. Shenk,\u00a0<i>Nucleic Acids Res.<\/i>,\u00a0<b>23<\/b>, 4457 (1995).<\/li>\n<li>J. Jonsson, et al.,\u00a0<i>Nucleic Acids Res.<\/i>,\u00a0<b>21<\/b>, 733 (1993).<\/li>\n<li>Z.S. Juo, T.K. Chiu, et al.,\u00a0<i>J. Mol. Biol.,\u00a0<\/i><b>261<\/b>, 239 (1996).<\/li>\n<li>R.J. Kraus, et al.,\u00a0<i>Nucleic Acids Res.<\/i>,\u00a0<b>24<\/b>, 1531 (1996).<\/li>\n<li>D.N. Mhaskar and M.F. Goodman,\u00a0<i>J. Biol. Chem.<\/i>,\u00a0<b>259<\/b>, 11713 (1984).<\/li>\n<li>M.E. Mulligan, et al.,\u00a0<i>Nucleic Acids Res.<\/i>\u00a0<b>12<\/b>, 789 (1984).<\/li>\n<li>M.P. Ponomarenko, A.N. Kolchanova, and N.A. Kolchanov,.\u00a0<i>J. Comput. Biol.,<\/i>\u00a0<b>4,<\/b>\u00a083 (1997a).<\/li>\n<li>M.P. Ponomarenko, L.K. Savinkova, et al.,\u00a0<i>Mol. Biol. (Mosk.).<\/i>,\u00a0<b>31<\/b>, 726 (1997b).<\/li>\n<li>M.P. Ponomarenko, J.V. Ponomarenko, et al.,\u00a0<i>Mol Biol (Mosk).<\/i>,\u00a0<b>31<\/b>, 733 (1997c).<\/li>\n<li>C.M. Sax, A. Cvelk, et al.,\u00a0<i>Nucleic Acids Res.<\/i>,\u00a0<b>23<\/b>, 442 (1995).<\/li>\n<li>D.B. Starr, B.C. Hoopes, and D.K. Hawley,\u00a0<i>J. Mol. Biol.<\/i>,\u00a0<b>250<\/b>, 434 (1995).<\/li>\n<li>G.D. Stormo, T.D. Schneider, and L. Gold,\u00a0<i>Nucleic Acids Res.,<\/i>\u00a0<b>14<\/b>, 6661 (1986).<\/li>\n<li>L.A. Zadeh,\u00a0<i>Information and Control<\/i>.,\u00a0<b>8<\/b>, 338 (1965).<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n<p>Table 1. An example of the experimental data that are analyzed in this work<\/p>\n<table border=\"1\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">Name<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">Sequence of the PE1B\/TATA region of mouse <img loading=\"lazy\" class=\"alignnone size-full wp-image-673\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_alpha.gif\" alt=\"\" width=\"9\" height=\"16\" \/>A-crystalline gene promoter (Sax, 1995)<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">Act<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">M18<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">CACGCATAGG GAGTTCTGGA ACGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">0.78<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\" height=\"16\">\n<p align=\"CENTER\">M15<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\" height=\"16\">\n<p align=\"CENTER\">CACGCATACC GAGGGCTGGA ACGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\" height=\"16\">\n<p align=\"CENTER\">0.75<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">M17<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">CACGCATAGG GCTGGCTGGA ACGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">0.75<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">&nbsp;<\/p>\n<p align=\"CENTER\"><b>WT<\/b><\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">&nbsp;<\/p>\n<p align=\"CENTER\"><b>CATATATAGG GAGGGCTGGA ACGCTAGCTC ACCACC<\/b><\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">&nbsp;<\/p>\n<p align=\"CENTER\"><b>0.00<\/b><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">M6<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">CACGCATAGG GAGGGCTGGA ACGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">-0.13<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">M16<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">CACGCATAGC CAGGGCTGGA ACGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">-0.72<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">M5<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">GCAGGCATAT ATAGGGAGGG CTGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">-0.75<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">M2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">TGCAGGCAGG GAGGGCTGGA ACGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">-1.00<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">M1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">CATAGATAGG GAGGGCTGGA ACGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">-1.15<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">M4<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">TGCAGGCATA TATAGGGAGA ACGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">-1.15<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"7%\">\n<p align=\"CENTER\">M3<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"80%\">\n<p align=\"CENTER\">GTGCAGGCAT ATATACTGGA ACGCTAGCTC ACCACC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"13%\">\n<p align=\"CENTER\">-1.34<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>Table 2.<b>\u00a0<\/b>Example of a DNA property used: Probability to be contacting with nucleosome core (Hogan, 1987)<\/p>\n<table border=\"1\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"TOP\" width=\"15%\">\n<p align=\"CENTER\">Dinucleotide<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">%<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">Dinucleotide<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">%<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">Dinucleotide<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">%<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">Dinucleotide<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">%<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"15%\">\n<p align=\"CENTER\">AA<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">18.4<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">AT<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">7.2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">AG<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">14.5<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">AC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">10.2<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"15%\">\n<p align=\"CENTER\">TA<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">6.2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">TT<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">18.4<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">TG<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">15.7<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">TC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">11.3<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"15%\">\n<p align=\"CENTER\">GA<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">11.3<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">GT<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">10.2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">GG<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">10.2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">GC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">5.2<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"15%\">\n<p align=\"CENTER\">CA<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">15.7<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">CT<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">14.5<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">CG<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">CC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">10.2<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"top\" width=\"30%\">Fig.1. The examples of the C-programs generated in this work for predicting the activity values of DNA functional sites.<b><\/b><b>A.\u00a0<\/b>The program calculating the mean value of a given DNA property over the site region that has been found significant for predicting activity of this site.<\/p>\n<p><b><\/b><b>B.\u00a0<\/b>The linear regression for the site activity prediction using all the significant DNA properties of the site.<\/td>\n<td width=\"70%\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image3.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-676 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image3.gif\" alt=\"\" width=\"397\" height=\"255\" \/><\/a><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"50%\">\n<p align=\"CENTER\"><b>A)<\/b><\/p>\n<p align=\"CENTER\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image9.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-682 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image9.gif\" alt=\"\" width=\"256\" height=\"204\" \/><\/a><\/p>\n<\/td>\n<td width=\"50%\">\n<p align=\"center\"><b>B)<\/b><\/p>\n<p align=\"center\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image10.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-683 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image10.gif\" alt=\"\" width=\"256\" height=\"205\" \/><\/a><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"50%\">\n<p align=\"CENTER\"><b>C)<\/b><\/p>\n<p align=\"CENTER\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image11.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-684 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image11.gif\" alt=\"\" width=\"255\" height=\"205\" \/><\/a><\/p>\n<\/td>\n<td width=\"50%\">\n<p align=\"center\"><b>D)<\/b><\/p>\n<p align=\"center\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image12.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-685 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image12.gif\" alt=\"\" width=\"255\" height=\"204\" \/><\/a><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td colspan=\"2\" width=\"100%\">Fig. 2. The transcription activity of mouse <img loading=\"lazy\" class=\"alignnone size-full wp-image-673\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_alpha.gif\" alt=\"\" width=\"9\" height=\"16\" \/>A-crystalline gene promoter with the PE1B\/TATA region (Sax, 1995) correlates with (a) the probability to be contacting with nucleosome core, (b) the major groove distance, and (c) the Tilt angle, and can be reliably predicted employing these properties.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>Table 3. Examples of the functional DNA sites analyzed in this work<\/p>\n<table border=\"1\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"TOP\" width=\"20%\"><\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">Site<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"TOP\" width=\"14%\"><\/td>\n<td colspan=\"3\" valign=\"TOP\" width=\"27%\">\n<p align=\"CENTER\">DNA property found<\/p>\n<\/td>\n<td colspan=\"3\" valign=\"TOP\" width=\"23%\">\n<p align=\"CENTER\">Significance<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">Name<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">Position #1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">\n<p align=\"CENTER\">n<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">Activity, F<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">X<sub>k<\/sub><\/td>\n<td valign=\"TOP\" width=\"12%\">\n<p align=\"CENTER\">Region<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">Property<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">U<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">r<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">p<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">PE1B TATA box<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">Transc-<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">\n<p align=\"CENTER\">11<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">Transcription<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">X<sub>1<\/sub><\/td>\n<td valign=\"TOP\" width=\"12%\">\n<p align=\"CENTER\">-32; -25<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">Pnucl<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.36<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-0.77<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-2<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">(Sax, 1995)<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">ription<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">activity<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">X<sub>2<\/sub><\/td>\n<td valign=\"TOP\" width=\"12%\">\n<p align=\"CENTER\">-29; -19<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">DIST<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.41<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.86<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-3<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\"><\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">start<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">of alphaA-<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">X<sub>3<\/sub><\/td>\n<td valign=\"TOP\" width=\"12%\">\n<p align=\"CENTER\">-31; -25<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">Tilt<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.38<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-0.78<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-2<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\"><\/td>\n<td valign=\"TOP\" width=\"11%\"><\/td>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">crystalline<\/p>\n<\/td>\n<td colspan=\"4\" valign=\"TOP\" width=\"34%\">F=-39-0.1*X<sub>1<\/sub>+12*X<sub>2<\/sub>-X<sub>3<\/sub><\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.90<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-4<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">TATA box (mutant)<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">TATA<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">\n<p align=\"CENTER\">9<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">DNA bending<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">X<sub>1<\/sub><\/td>\n<td valign=\"TOP\" width=\"12%\">\n<p align=\"CENTER\">0, 9<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">Inclination<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.19<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.76<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.05<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">(Starr, 1995)<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">box start<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">in TBP\/TATA<\/p>\n<\/td>\n<td colspan=\"4\" valign=\"TOP\" width=\"34%\">F=120.15+70.32*X<sub>1<\/sub><\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.76<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.05<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">USF-binding site<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">Synthetic<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">\n<p align=\"CENTER\">14<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">USF\/DNA<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">X<sub>1<\/sub><\/td>\n<td valign=\"TOP\" width=\"12%\">\n<p align=\"CENTER\">11, 15<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">Depth<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.22<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-0.78<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-3<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">(Bendall, 1994)<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">DNA<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">affinity<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">X<sub>2<\/sub><\/td>\n<td valign=\"TOP\" width=\"12%\">\n<p align=\"CENTER\">11; 20<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">Twist<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.23<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-0.86<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-4<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\"><\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">start<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"TOP\" width=\"14%\"><\/td>\n<td colspan=\"4\" valign=\"TOP\" width=\"34%\">F=170-16.3*X<sub>1<\/sub>-0.7*X<sub>2<\/sub><\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.91<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-5<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">YY1-binding site<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">site start<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">\n<p align=\"CENTER\">21<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">Transcription<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">X<sub>1<\/sub><\/td>\n<td valign=\"TOP\" width=\"12%\">\n<p align=\"CENTER\">1, 12<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">Twist<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.27<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-0.76<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-2<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">(Hyde-DeRuyscher, 1995)<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\"><\/td>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">repression<\/p>\n<\/td>\n<td colspan=\"4\" valign=\"TOP\" width=\"34%\">F= 47.97 -1.37*X<sub>1<\/sub><\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.76<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-2<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">2AP-induced mutation<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">Mutation<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">\n<p align=\"CENTER\">26<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">Mutation<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\">X<sub>1<\/sub><\/td>\n<td valign=\"TOP\" width=\"12%\">\n<p align=\"CENTER\">-1, 2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">Tmelt<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">0,20<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.90<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-5<\/sup><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"20%\">\n<p align=\"CENTER\">(Coullondre, 1978)<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"11%\">\n<p align=\"CENTER\">point<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"TOP\" width=\"14%\">\n<p align=\"CENTER\">frequency<\/p>\n<\/td>\n<td colspan=\"4\" valign=\"TOP\" width=\"34%\">F=-8.5568+0.1585*X<sub>1<\/sub><\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0.90<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">10<sup>-5<\/sup><\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><b>n<\/b>, total number of the site variants;\u00a0<b>X<sub>k<\/sub><\/b>, property selected;\u00a0<b>U<\/b>, utility;\u00a0<b>r, p<\/b>, linear correlation coefficient and its significance;\u00a0<b>Pnucl<\/b>, probability to be contacting with nucleosome core;\u00a0<b>Tmelt<\/b>, melting temperature;\u00a0<b>depth<\/b>, minor groove depth;\u00a0<b>width<\/b>, minor groove width;\u00a0<b>WIDTH<\/b>, major groove width;\u00a0<b>DIST<\/b>, major groove width distance; and <img loading=\"lazy\" class=\"alignnone size-full wp-image-690\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image17.gif\" alt=\"\" width=\"73\" height=\"16\" \/>, the linear regression (Equation 1) predicting the site activity.<\/p>\n<p>&nbsp;<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"50%\">\n<p align=\"CENTER\"><b>A)<\/b><\/p>\n<p align=\"CENTER\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image13.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-686 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image13.gif\" alt=\"\" width=\"255\" height=\"204\" \/><\/a><\/p>\n<\/td>\n<td width=\"50%\">\n<p align=\"center\"><b>B)<\/b><\/p>\n<p align=\"center\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image14.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-687 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image14.gif\" alt=\"\" width=\"255\" height=\"205\" \/><\/a><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"50%\">\n<p align=\"CENTER\"><b>C)<\/b><\/p>\n<p align=\"CENTER\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image15.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-688 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image15.gif\" alt=\"\" width=\"259\" height=\"205\" \/><\/a><\/p>\n<\/td>\n<td width=\"50%\">\n<p align=\"center\"><b>D)<\/b><\/p>\n<p align=\"center\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image16.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-689 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis52_Image16.gif\" alt=\"\" width=\"255\" height=\"205\" \/><\/a><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td colspan=\"2\" width=\"100%\">Fig. 3. Examples of the ACTIVITY predictions: (a) the DNA bend within the TBP\/TATA complex (Hogan, 1987) correlates with the inclination; (b) USF\/DNA affinity (Bendall, 1994) correlates with the twist; (c) YY1 transcription repression activity (Hyde-DeRuyscher, 1995) correlates with the twist; and (d) the mutability induced by 2-aminopurine (Coullondre, 1978) correlates with the melting temperature<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n","protected":false},"excerpt":{"rendered":"<p>PONOMARENKO M.P.,\u00a0KOLCHANOV N.A.,\u00a0PONOMARENKO J.V.,\u00a0FROLOV A.S.,\u00a0PODKOLODNAYA O.A.,\u00a0VOROBIEV D.G.,\u00a0PODKOLODNY N.L.1,\u00a0OVERTON G.C.2 Laboratory of Theoretical Genetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 10 Lavrentiev Ave., Novosibirsk, 630090, Russia; 1Institute of Computational Mathematics and Mathematical Geophysics, Novosibirsk, &hellip; <a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/052_revealing-the-conformational-and-physico-chemical-dna-properties-applicable-for-predicting-the-activity-of-dna-functional-sites\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":13,"featured_media":0,"parent":97,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/672"}],"collection":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/comments?post=672"}],"version-history":[{"count":5,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/672\/revisions"}],"predecessor-version":[{"id":1389,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/672\/revisions\/1389"}],"up":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/97"}],"wp:attachment":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/media?parent=672"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}