{"id":695,"date":"2023-03-15T13:41:24","date_gmt":"2023-03-15T06:41:24","guid":{"rendered":"https:\/\/conf.icgbio.ru\/bgrs98\/?page_id=695"},"modified":"2023-04-11T14:33:00","modified_gmt":"2023-04-11T07:33:00","slug":"053_classification-of-eukaryotic-transcription-factors-based-on-significant-b-dna-conformational-and-physico-chemical-properties-of-their-binding-sites","status":"publish","type":"page","link":"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/053_classification-of-eukaryotic-transcription-factors-based-on-significant-b-dna-conformational-and-physico-chemical-properties-of-their-binding-sites\/","title":{"rendered":"CLASSIFICATION OF EUKARYOTIC TRANSCRIPTION FACTORS BASED ON SIGNIFICANT B-DNA CONFORMATIONAL AND PHYSICO-CHEMICAL PROPERTIES OF THEIR BINDING SITES"},"content":{"rendered":"<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#ponomarenko_jv\">PONOMARENKO J.V.<\/a><\/p>\n<p>Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 10 Lavrentiev Ave., Novosibirsk, 630090, Russia<\/p>\n<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/keywords-index\/\">Keywords<\/a>: eukaryotic transcription factors, conformational and physico-chemical properties of DNA, classification<\/p>\n<p>&nbsp;<\/p>\n<p>Local nonuniformity of DNA conformational and physico-chemical properties and their dependence on nucleotide content have been demonstrated in numerous studies [1, 2]. X-ray structure analysis of B-DNA duplexes and complexes with proteins provided for determination of the mean conformational parameters of each dinucleotide [3-5]. Mean values of a number of physico-chemical properties have been determined for each dinucleotide too [6-10].<\/p>\n<p>A method for classification of transcription factors based on significant context-dependent B-DNA conformational and physico-chemical properties of their binding sites, revealed by the system B-DNA-VIDEO [11] is proposed in this work. Sets of nucleotide sequences of transcription factor binding sites from the SAMPLES database without 100% homologues [12] (Table 1) were used for the analysis. The sequences in each set are aligned relative to the center of the experimentally determined binding sites of a definite transcription factor; their location was determined according to the information contained in the TRANSFAC database and published data (in case of YY1 site, the TRRD database, v 3.5, was also employed). The sequences considered was 120 bp long with flanks of 60 bp from the center of the site.<\/p>\n<p>Table 1. Eukaryotic transcription factor binding sites<b>\u00a0<\/b>involved in the analysis<\/p>\n<table border=\"1\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"TOP\" width=\"5%\"><\/td>\n<td valign=\"middle\" width=\"11%\">\n<p align=\"CENTER\">Factor<\/p>\n<\/td>\n<td colspan=\"2\" valign=\"middle\" width=\"46%\">\n<p align=\"CENTER\">Class according to TRANSFAC<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"10%\">\n<p align=\"CENTER\">Number of sequences in<\/p>\n<\/td>\n<td colspan=\"3\" valign=\"TOP\" width=\"29%\">\n<p align=\"CENTER\">Size of footprint or TRANSFAC sequence element, bp<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\"><\/td>\n<td width=\"11%\"><\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">Number<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Name<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">set<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">Mean<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">Min<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">Max<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">1<\/td>\n<td width=\"11%\">AP-1<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Leucine zipper factors (bZIP)<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">69<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">14.9<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">6<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">53<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">2<\/td>\n<td width=\"11%\">c-Fos<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">19<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">13.9<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">6<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">29<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">3<\/td>\n<td width=\"11%\">c-Jun<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">28<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">14.6<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">6<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">43<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">4<\/td>\n<td width=\"11%\">NF-E2<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">12<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">10.7<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">3<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">20<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">5<\/td>\n<td width=\"11%\">CRE-BP1<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">22<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">12.7<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">6<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">25<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">6<\/td>\n<td width=\"11%\">ATF<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">25<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">12.9<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">25<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">7<\/td>\n<td width=\"11%\">CREB<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">37<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">13.6<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">37<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">8<\/td>\n<td width=\"11%\">C\/EBP<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">108<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">19.1<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">96<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">9<\/td>\n<td width=\"11%\">NF-IL6<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">21<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">19.9<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">8<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">41<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">10<\/td>\n<td width=\"11%\">MyoD<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.2<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Helix-loop-helix factors (bHLH)<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">16<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">18.5<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">8<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">59<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">11<\/td>\n<td width=\"11%\">E2F<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.3<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">(bHLH-ZIP)<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">9<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">13.2<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">7<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">30<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">12<\/td>\n<td width=\"11%\">USF<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.3<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">25<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">14.1<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">30<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">13<\/td>\n<td width=\"11%\">NF-1<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.4<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">NF-1<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">101<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">17.2<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">53<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">14<\/td>\n<td width=\"11%\">RF-X<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.5<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">RF-X<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">12<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">17.0<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">11<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">24<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">15<\/td>\n<td width=\"11%\">CP1<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">1.6<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Heteromeric CCAAT factors<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">33<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">17.2<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">41<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">16<\/td>\n<td width=\"11%\">ER<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Cys4 zinc finger of nuclear receptor type<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">25<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">17.5<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">40<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">17<\/td>\n<td width=\"11%\">GR<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">54<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">12.6<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">39<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">18<\/td>\n<td width=\"11%\">PR<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">20<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">11.7<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">20<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">19<\/td>\n<td width=\"11%\">RAR<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">16<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">24.6<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">7<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">56<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">20<\/td>\n<td width=\"11%\">RXR<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">21<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">20.9<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">10<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">28<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">21<\/td>\n<td width=\"11%\">T3R<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">21<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">20.5<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">10<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">28<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">22<\/td>\n<td width=\"11%\">COUP<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">17<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">20.6<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">11<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">35<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">23<\/td>\n<td width=\"11%\">GATA-1<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.2<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Diverse Cys4 zinc fingers<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">76<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">16.1<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">41<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">24<\/td>\n<td width=\"11%\">Sp1<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.3<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Cys2His2 zinc finger domain<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">176<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">13.1<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">96<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">25<\/td>\n<td width=\"11%\">YY1<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.3<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">27<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">13.6<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">6<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">27<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">26<\/td>\n<td width=\"11%\">GAL4<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">2.4<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Cys6 cysteine-zinc cluster<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">16<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">14.7<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">3<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">22<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">27<\/td>\n<td width=\"11%\">EN<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">3.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Homeodomain<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">12<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">11.4<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">6<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">20<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">28<\/td>\n<td width=\"11%\">HNF-1<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">3.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">38<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">21.0<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">96<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">29<\/td>\n<td width=\"11%\">OCT<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">3.1<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">73<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">14.9<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">7<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">46<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">30<\/td>\n<td width=\"11%\">HNF-3<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">3.3<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Fork head \/ winged helix<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">10<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">24.6<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">8<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">96<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">31<\/td>\n<td width=\"11%\">c-Myb<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">3.5<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">Tryptophan clusters<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">19<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">18.9<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">31<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">32<\/td>\n<td width=\"11%\">Ets<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">3.5<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">15<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">20.9<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">9<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">45<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">33<\/td>\n<td width=\"11%\">IRF-1<\/td>\n<td width=\"9%\">\n<p align=\"CENTER\">3.5<\/p>\n<\/td>\n<td width=\"36%\">\n<p align=\"CENTER\">&#8211; \u201c &#8211;<\/p>\n<\/td>\n<td width=\"10%\">\n<p align=\"CENTER\">7<\/p>\n<\/td>\n<td width=\"14%\">\n<p align=\"CENTER\">16.1<\/p>\n<\/td>\n<td width=\"6%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<td width=\"8%\">\n<p align=\"CENTER\">22<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>Table2. Significant conformational and physico-chemical properties of the DNA transcription factor binding sites.<\/p>\n<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image1.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-697 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image1.gif\" alt=\"\" width=\"600\" height=\"400\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>Listed in Table 2 are the conformational and physico-chemical properties from the PROPERTY database used in this work. Basing on these data, the computer system B-DNA-VIDEO automatically performs the search for significant conformational and physico-chemical properties and the regions of sites where the mean value of a conformational and physico-chemical property in question differs significantly from the value characteristic of random nucleotide sequences.<\/p>\n<p>The system B-DNA-VIDEO employs the following algorithm. Consider a nucleotide sequence S={s<sub>1<\/sub>&#8230;s<sub>i<\/sub>&#8230;s<sub>L<\/sub>}of length L. There is the dinucleotide s<sub>i<\/sub>s<sub>i+1<\/sub>\u00a0at the i-th position. The mean of the k-th property from Table 2, X<sub>k<\/sub>, averaged for a region [a, b] (1&lt;= a&lt;= b&lt;= L) with the starting position \u011ea\u011f and the terminal position \u011eb\u011f of the sequence S is calculated as follows:<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"90%\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image2.gif\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-698\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image2.gif\" alt=\"\" width=\"194\" height=\"45\" \/><\/a><\/td>\n<td width=\"10%\">(1)<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Equation (1) was applied to each of the 27 properties and to each of the 33 sites listed in Table 2. All the possible sequence regions [a, b] not smaller than one dinucleotide were taken into account. Their number for a sequence L long was n(L)=L x (L-1)\/2. The total number of sequence regions was n(120)=120 x (120-1)\/2=7140 for a sample. A number of properties X<sub>k,a,b<\/sub>\u00a0can be calculated N(L)=27 x n(L) for a sequence L long. Thus, a total number of N(70)=27 x 7140=192780 properties was tested for each sample.<\/p>\n<p>Applying Equation (1) to the set of site sequences {S} at a fixed k, a and b, we yield the distribution X<sub>k,a,b<\/sub>{S} for the site. Similarly, the distribution X<sub>k,a,b<\/sub>{R} is generated for random sequences {R} with the same nucleotide frequencies as in the real sequences. The difference between these distributions X<sub>k,a,b<\/sub>{S} and X<sub>k,a,b<\/sub>{R} is tested for significance using four statistical criteria [13]. B-DNA-VIDEO [11] processes the sets {S} and {R} and outputs the list of the significant B-DNA properties {X<sub>k,a,b<\/sub>}.<\/p>\n<p>The results obtained by the described analysis of transcription factor binding sites are stored in the knowledge base on functional sites B-DNA-FEATURES, which is a constituent of the computer system B-DNA-VIDEO.<\/p>\n<p>We have analyzed 33 transcription factor binding sites; the results are summarized in Table 2. The conformational and physico-chemical properties, the mean value of which within certain regions of a site differs significantly from the values characteristic of random sequences, have been discovered for each site considered. If the mean value of a significant property for the site exceeded that for random sequences, it was indicates by &#8220;+&#8221; in Table 2; otherwise, &#8220;-&#8220;. The lengths of the significant regions vary in the range of 10 to 25 bp; this corresponds to 1-2.5 coils of B-DNA helix as well as to the mean length of the region of DNA interaction with transcription factors (see Table 1).<\/p>\n<p>An approach to classification of transcription factors based on the revealed sets of significant conformational and physico-chemical properties of their binding sites is suggested. The classification of transcription factor DNA-binding domains proposed by Wingender [14] distinguishes four superclasses: I superclass containing basic domains; II superclass, zinc-coordinated DNA-binding domains; III superclass, helix-turn-helix; and IV superclass, beta-scaffold factors with minor groove contacts. Fifteen transcription factors of thirty three considered belong to various classes of the I superclass according to the type of their domain; eleven sites, to the II superclass; and seven, to the III supercalss (Table 1).<\/p>\n<p>The information contained in Table 2 was processed by the program Cluster Analysis from software package STATISTICA (Windows\u201995) to construct the similarity tree for the transcription factors analyzed (Figure). Euclidean distance was used as a measure of similarity between two sites i and j:<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"90%\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image3.gif\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-699\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image3.gif\" alt=\"\" width=\"150\" height=\"50\" \/><\/a><\/td>\n<td width=\"10%\">(2)<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>where x<sub>i,k<\/sub>\u00a0= 1, if the mean value of kth property on the region of the site i exceeds significantly the corresponding mean value for random sequences (indicated by &#8220;+&#8221; in Table 2); x<sub>i,k\u00a0<\/sub>= -1, if the mean value of the kth property on the region of the site i is significantly lower than the corresponding mean value for random sequences (indicated by &#8220;-&#8221; in Table 2); x<sub>i,k<\/sub>\u00a0= 0, if the mean value of the kth property on the region of site i equals the corresponding mean value for random sequences; and k is the number of the property considered, k = 1,&#8230;,27. The distances between clusters are determined by the greatest distance between any two objects in separate clusters (i.e., by the &#8220;furthest neighbors&#8221;).<\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image4.gif\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" class=\"alignnone wp-image-700 size-full\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image4.gif\" alt=\"\" width=\"499\" height=\"265\" \/><\/a><\/p>\n<p>Figure. The similarity tree for transcription factors.<\/p>\n<p>&nbsp;<\/p>\n<p>The tree obtained has two branches (Figure). The left branch unites all transcription factors that belong to the III superclass according to the type of DNA-binding domains and eight transcription factors of the I and II superclass according to the classification of Wingender [14]. The factors belonging to the III superclass contain the DNA-binding domain of HTH type. The probability that a transcription factor is attributed to the III superclass accidentally is a &lt;0.004. In turn, the right branch divides in two branches: one containing six transcription factors of the II superclass; the other, eleven factors of the I superclass. The probabilities of accidental attribution to these classes are <img loading=\"lazy\" class=\"alignnone size-full wp-image-696\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_alpha.gif\" alt=\"\" width=\"9\" height=\"16\" \/>&lt;0.0014 and <img loading=\"lazy\" class=\"alignnone size-full wp-image-696\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_alpha.gif\" alt=\"\" width=\"9\" height=\"16\" \/>&lt;0.0012, respectively. These probabilities were calculated basin of binomial distribution as follows:<\/p>\n<table border=\"0\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td width=\"90%\"><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image5.gif\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-701\" src=\"https:\/\/conf.icgbio.ru\/bgrs98\/wp-content\/uploads\/sites\/111\/2023\/03\/Thesis53_Image5.gif\" alt=\"\" width=\"179\" height=\"45\" \/><\/a><\/td>\n<td width=\"10%\">(3)<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>where m is the number of factors in the tree branch considered; n, the number of factors of a definite superclass contained in this branch; N, total number of the factors in this superclass; M, total number of factors considered (33); and P = N\/M, the probability of accidental attribution of one factor to the superclass considered.<\/p>\n<p>The classification of transcription factor binding sites obtained by this procedure is consistent with the X-ray and NMR structures of DNA-protein complexes as well as with other available data on DNA-protein interactions. Thus, the transcription factors of the III superclass interact with the DNA as a rule in a form of monomers forming abundant H-bonds and van der Waals (hydrophobic) contacts with bases and sugar-phosphate backbone of the DNA in both major and minor grooves as well as a plenty of water-mediated contacts. The transcription factors of the I superclass and class 2.1 (Cys4 zinc finger of nuclear receptor type) bind to the DNA in major groove in a form of homo- and heterodimers mainly through electrostatic interactions. The transcription factors belonging to the rest classes of the second superclass interact with the major groove mainly as monomers. Interaction of GATA-1 with DNA is more likely to the interaction type of HTH domains with DNA. The NMR structure of GATA-1\/DNA complex indicate that the protein binds to both grooves mainly through hydrophobic interactions.<\/p>\n<p>Thus, the result obtained indicates a certain correlation between the transcription binding site classification based on their significant conformational and physico-chemical properties, on the one hand, and the classification of the transcription factors binding to these sites according to their DNA-binding domains, on the other.<\/p>\n<p>I am grateful to Ms. Galina Chirikova for help in translation. This work was supported by Russian Foundation for Basic Research<\/p>\n<p><b>REFERENCES<\/b><\/p>\n<ol>\n<li>Suzuki M., Amano N., Kakinuma J., Tateno M. (1997) J Mol Biol., 274, 421-435<\/li>\n<li>el Hassan H.A., Calladine C.R. (1996) J. Mol. Biol., 259, 95-103.<\/li>\n<li>Suzuki M., Yagi N., Finch J.T. (1996) FEBS L., 379, 148-152.<\/li>\n<li>Shpigelman E.S., Trifonov E.N., Bolshoy A. (1993) Comput. Appl. Biosci., 9, 435-140.<\/li>\n<li>Gorin, A.A., Zhurkin, V.B., and Olson, W.K., (1995) J. Mol. Biol., 247, 34-48.<\/li>\n<li>Hogan, M.E., and Austin, R.H., (1987) Nature, 329, 263-266.<\/li>\n<li>Gotoh,O. and Tagashira,Y. (1981) Biopolymers, 20, 1043-1058.<\/li>\n<li>Satchwell S.C., Drew H.R., Travers A.A. (1986) J. Mol. Biol., 191, 659-675<\/li>\n<li>Gartenberg, M.R., and Crothers, D.M., (1988) Nature, 333, 824-829.<\/li>\n<li>Sugimoto, N., Nakano, S., Yoneyama, M., and Honda, K. (1996) Nucleic Acids Res., 24, 4501-4505.<\/li>\n<li>Ponomarenko M.P. et al., 1998, \u201cB-DNA-VIDEO\u201d in this issue.<\/li>\n<li>Vorobjev D.G. and Ponomarenko J.V.,1998, \u201cSAMPLES\u201d in this issue.<\/li>\n<li>Lehman,E.L. (1959) Testing statistical hypotheses. Willey. New York.<\/li>\n<li>Wingender E (1997) Mol Biol (Mosk), 31, 584-600.<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>PONOMARENKO J.V. Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 10 Lavrentiev Ave., Novosibirsk, 630090, Russia Keywords: eukaryotic transcription factors, conformational and physico-chemical properties of DNA, classification &nbsp; Local nonuniformity of DNA conformational and physico-chemical &hellip; <a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/053_classification-of-eukaryotic-transcription-factors-based-on-significant-b-dna-conformational-and-physico-chemical-properties-of-their-binding-sites\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":13,"featured_media":0,"parent":97,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/695"}],"collection":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/comments?post=695"}],"version-history":[{"count":3,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/695\/revisions"}],"predecessor-version":[{"id":1390,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/695\/revisions\/1390"}],"up":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/97"}],"wp:attachment":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/media?parent=695"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}