{"id":864,"date":"2023-03-20T13:19:00","date_gmt":"2023-03-20T06:19:00","guid":{"rendered":"https:\/\/conf.icgbio.ru\/bgrs98\/?page_id=864"},"modified":"2023-09-04T15:21:52","modified_gmt":"2023-09-04T08:21:52","slug":"075_pre-mrna-splicing-in-eukaryotes-intron-structure-intron-detection-algorithms-and-data-structures","status":"publish","type":"page","link":"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/075_pre-mrna-splicing-in-eukaryotes-intron-structure-intron-detection-algorithms-and-data-structures\/","title":{"rendered":"PRE-MRNA SPLICING IN EUKARYOTES &#8211; INTRON STRUCTURE, INTRON DETECTION, ALGORITHMS AND DATA STRUCTURES"},"content":{"rendered":"<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#chekmenev\">CHEKMENEV D.S.<\/a><\/p>\n<p>GNII genetika, 1<sup>st<\/sup>\u00a0Dorozhny proezd, Moscow, 113545, Russia;<br \/>\ne-mail:\u00a0chicha@mail.cir.ru;<\/p>\n<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/keywords-index\/\">Keywords<\/a>: pre-mRNA, splicing, gene expression, intron structure, intron detection, RNA secondary structure<\/p>\n<p><b>Introduction<\/b><\/p>\n<p>More than 20 years have passed since the discovery of the pre-mRNA splicing. Many components of spliceosome and snRNP moieties interactions were identified, but intron\/exon detection algorithms have not reached accuracy of splicing machinery. In this abstract I sugest the way to improve intron detection methods with respect to identified pathways of splicing process.<b><\/b><\/p>\n<p><strong>1. PRE-mRNA splicing<\/strong><\/p>\n<p><b><i>1.1. Role of splicing in gene expression<\/i><\/b><\/p>\n<p>Genes in eukaryotes are often interrupted by intervening sequences (IVSs or introns) that must be removed during gene expression. RNA splicing is the process by which these intrervening sequences are precisely removed and the flanking, functional sequences (exons) are joined together [1, 2]. So RNA splicing as significant part of pre-mRNA processing is one of the major steps in the control of gene expression in eukaryotes.<\/p>\n<p>Regulated mechanism of alternative splicing allow multiple different proteins to be translated from the single RNA transcript. By alternative splicing, a single sequence has been found to be able to code for dozen different proteins, depending on how its exons are assembled. Alternative splicing is regulated in a developmental or tissue specific manner. For example, a gene in thyroid tissue produces calcitonin in rats. The same gene in brain tissue produces a neuropeptide by using a different exon combination.<\/p>\n<p>Mutations can affect splicing of certain introns, leading to abnormal conditions. For example a form of thalassemia, a blood disorder, is due to a mutation causing splicing failure of an intron in a globin transcript, which then becomes untranslatable. Abnormal beta-amyloid in Alzheimer disease is result of intron mutation that impair splicing.<\/p>\n<p>So elucidation of splicing mechanism will help us to find new ways in genetic diseases treatment and better understanding of genetic information organisation and gene expression.<i><b><\/b><\/i><\/p>\n<p><i><b>1.2. Splicing mechanism<\/b><\/i><\/p>\n<p>Splicing of nuclear introns occurrs by a two step pathway. In the first step, the phosphodiester bond at the 5\u00ed splice site is attacked by the 2\u00ed-OH of an adenosine residue in the intron, the branch point. This reaction produces a free upstream exon and a lariat intermediate molecule containing both the downstream exon and the intron with its 5\u00ed end covalently linked to the branch nucleotide. During the second cleavage-ligation step, the 3\u00ed hydroxyl of the 5\u00ed exon attacks the phosphate at the 3\u00ed splice site. This results in the ligation of the two exons and the release of the intron in lariat form [1, 2, 3].<\/p>\n<p>Removal of introns is catalysed by a large ribonucleoprotein complex called the spliceosome, which consists of four small nuclear ribonucleoprotein particles (U1, U2, U5, and U4\/U6 snRNPs) and auxiliary protein factors [3, 4]. A minor type of AT-AC introns require U11, U12, U5, U4atac and U6atac snRNAs [5, 6].<\/p>\n<p>Before the two steps of splicing, the pre-mRNA has to be assembled into a highly complex ribonucleoprotein structure, the spliceosome. RNA interactions are thought to be central to the splicing process and may play an important role in the catalytic core of the active spliceosome [3, 4]. U1 snRNP interacts with the 5\u00ed splice site [10, 11] and U2 snRNP with the branch site of pre-mRNA [12, 13] both of this interactions involve Watson-Crick base pairing. U1 binds at an early step in spliceosome assembly and commits the pre-mRNA to the splicing pathway [14, 15, 16]. Genetic and biochemical data place U5 in close proximity to the 5\u00ed and 3\u00ed exon sequences [17, 18, 19]. Crosslinking experiments in mammalian [18] and yeast [20] extracts revealed 5\u00ed splice site &#8211; conserved domain of U6 (ACAGAG) interaction [21, 22]. The conserved domain of U6 is immediately upstream of a helix formed by base-pairing interactions between U6 and U2 [23, 24]. This helix juxtapose 5\u00ed splice site with the branch point interaction domain of U2 [25]. So active site of spliceosome are formed by Watson-Crick RNA-RNA interaction.<b><\/b><\/p>\n<p><strong>2. Role of RNA &#8211; RNA interactions<\/strong><\/p>\n<p><b><i>2.1. snRNA &#8211; intron interactions (intron primary structure)<\/i><\/b><\/p>\n<p>The hypothesis that, splicing is RNA-catalyzed process mediated by the spliceosomal snRNAs, was galvanized by the observation that Group II self-splicing introns are removed by a two-step chemical pathway that is highly similar if not identical to that which accomplishes nuclear pre-mRNA splicing [7, 8, 9, 26, 27]. Most actual for intron detection is interaction of snRNAs and pre-mRNA (intron). The early U1 snRNA interaction with the 5\u00ed splice site is important to recruit RNA sequences into commitment complexes and pre-spliceosomes [15]. The presence of U1 at 5\u00ed splice site is necessary for binding U2 snRNA to branch point of intron. All these initial intron &#8211; snRNA interaction require Watson-Crick basepairing.<\/p>\n<p>As might be expected from the fact that exons must encode diverse sequences, conserved information at the 5\u00ed and 3\u00ed splice sites residues almost completely in the intron [3, 28]. For yeast introns it are \/GUauGu for 5\u00ed and YAG\/ for 3\u00ed (\/ &#8211; splice site; upper case &#8211; most conserved, lower case &#8211; less conserved). Branch point has UACUA<u>A<\/u>CA (branch point adenosine is underlined). Mammalian introns are less conservative especially in branch point [3].<i><b><\/b><\/i><\/p>\n<p><i><b>2.2. Methods applied in splice sites detection<\/b><\/i><\/p>\n<p>A common approach to locating sites of all kinds is to search for similarities to \u00ebconsensus sequences\u00ed. The method suffers from the fact that individual sites are not usually identical to the consensus, and different positions vary in their importance within the consensus. For example, realy conserved are dinucleotides at 3\u00ed and 5\u00ed splice sites, G at position 5 of intron and branch point adenosine. Other nucleotides are significantly lower conserved. A superior method is to search using a matrix. The matrix contains an element for each posible base at every position within a site. The evaluation of each potential site involves summing the elements that correspond to the sequence at that site. Such matrix can be used to find all sites within some range of similarity (Tables 1 and 2. Method from [Mount S.M. Nucleic Acids Res.\u00a0<b>10<\/b>, 459 (1982)] was used for calculation).<\/p>\n<p>Table 1. Matrix to find 5\u00ed splice sites. Intron begins at position 0.<\/p>\n<table border=\"1\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">Pos:<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-3<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">0<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">3<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">A<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">9<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-11<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">9<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">10<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-11<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-4<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">C<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">5<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-8<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-15<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-24<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-10<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-7<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">G<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-10<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-8<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">11<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">14<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-8<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">12<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-11<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"16%\">\n<p align=\"CENTER\">T<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-12<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-7<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-7<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">14<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-14<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-8<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">-16<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"9%\">\n<p align=\"CENTER\">9<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Table 2. Matrix to find 3\u00ed splice site Exon begins at position 0.<\/p>\n<table border=\"1\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">Pos:<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-11<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-10<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-9<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-8<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-7<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-6<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-5<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-4<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-3<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">0<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">1<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">A<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-14<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-5<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-8<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-3<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-7<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-21<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-9<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-19<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">14<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-4<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">C<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">0<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">3<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-1<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">9<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-3<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-1<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">G<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-9<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-15<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-13<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-11<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-13<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-17<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-17<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">0<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">14<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">7<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">0<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">T<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">9<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">7<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">7<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">6<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">6<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">8<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">8<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">2<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"8%\">\n<p align=\"CENTER\">-35<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">-11<\/p>\n<\/td>\n<td valign=\"TOP\" width=\"6%\">\n<p align=\"CENTER\">4<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Unfortunately there is quite a lot of overlap between the values of real sites and unused sites. There are too many posible sites found by the matrices.<\/p>\n<p>Real algorithms using for finding splice sites are using other information than intron conserved sequences. It is coding potential of exons. Base\/position preferences and codon bias also can be taken in consideration for detrmining proper reading frame in exon. But this features hardly reflect real processes that take place in spliceosome operation. Exon mutations do not impair splicing [19].<\/p>\n<p>Probably other information besides the primary sequence is used, such as the secondary structure of the RNA [32].<i><b><\/b><\/i><\/p>\n<p><i><b>2.3. Role of intron secondary structure in splicing<\/b><\/i><\/p>\n<p>Conserved secondary structure motifs of intron can play importanat role in intron recognition by spliceosome. This can be supported by possible evolutionary origin of pre-mRNA splicing from self-excised introns of group II. Self-splicing of group II introns mediated only by it secondary structure. A number of secondary structure motifs from group II introns were found in snRNAs. U2-U6 snRNA helix is similar to domain 5 of group II intron [25]. U6 &#8211; 5\u00ed splice site helix functioning analogously to epsilon, a sequence that pairs with intron nucleotides near the 5\u00ed splice site of group II self-splicing [29]. The U5 conserved loop can be viewed as the spliceosomal counterpart of the exon binding site (EBS1) of group II introns [17, 30]. In Group II introns, the branchpoint adenosine is found bulged out of a duplex, termed domain 6. Similarly, in nuclear pre-mRNAs, the branchpoint is identified in part through a basepairing interaction with U2 snRNA in which the adenosine nucleophile is bulged out of the U2 &#8211; pre-mRNA duplex [31]. I suppose that in pre-mRNA intron can be found secondary structure motifs similar to those in group II introns that will help us significantly improve algorithms of intron detection.<b><\/b><\/p>\n<p><strong>3. Conserved structures search method requirements<\/strong><i><\/i><\/p>\n<p><b><i>3.1. Requirements for conserved structures search methods<\/i><\/b><\/p>\n<p>For finding conserved secondary structure motifs some enhancements to the search matrices can be applied.<\/p>\n<p>Conserved RNA secondary structures motives can be treated as RNA double helices hairpins in conserved positions of intron. The existence of such helices can\u00edt be found by analysing of nucleotide positions in RNA sequence.<\/p>\n<p>To identify conserved secondary structure we must add probabilities for nucleotide complementarity to the search matrices. So every element of search matrix must contain not only four probabilities for nucleotide appearance, but also vector of complementarity of current position to other sites of intron. Calculation of such probability vector can take great computational resources, because intron length is high enough. (up to several thousand of nucleotides).<\/p>\n<p>Some restriction can be taken in consideration to reduce computational resources. First, length of hairpin loops can be reduce to dozen of nucleotides. Hairpins with very long internal loops hardly can be formed during pre-mRNA splicing, because it is relatively fast running, almost co-transcriptional event [34]. Second, such conserved structures can be awayted strictly in particular regions of intron. It must be found near splice sites and especially near branch point or in the polypyrimidine tract between branch point and 3\u00ed splice site. Third, search of conserved structures can be fulfiled on the relatively short introns (up to one hundred nucleotides).<i><b><\/b><\/i><\/p>\n<p><i><b>3.2 Requirements for splice site detection method<\/b><\/i><\/p>\n<p>The idea of high role of RNA secondary structure contribution to particular RNA\/DNA site detection is not only restricted by splice site. RNA secondary structures can also contribute to determination of translation initiation sites and even to transcription initiation and promoter regions [33]. So information on conserved structural motifs must be integrated in methods of particular regions detection and posible in genetic data banks.<b><\/b><\/p>\n<p>References<\/p>\n<ol>\n<li>M.R. Green, &#8220;Biochemical mechanisms of constitutive and regulated pre-mRNA splicing&#8221; Annu. Rev. Cell Biol.\u00a0<b>7<\/b>, 559-599 (1991)<\/li>\n<li>M.J. Moore, C.C. Query, P.A. Sharp, &#8220;Splicing of precursors to mRNA by the spliceosome&#8221; p.p. 303-358 in R.F. Gesteland and J.F. Atkins (ed.) &#8220;The RNA World&#8221; (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., (1993)<\/li>\n<li>H.D. Madhani, C. Guthrie, &#8220;Dynamic RNA &#8211; RNA interactions in the spliceosome&#8221; Annu. Rev. Genet.\u00a0<b>28<\/b>, 1-26 (1994)<\/li>\n<li>J.A. Steitz, D.L. Black, V. Gerke, K.A. Parker, A. Kramer, &#8220;Functions of the abundant U-snRNPs&#8221; in &#8220;Structure and Function of Major and Minor Small Nuclear Ribonucleoprotein Particles&#8221;, 115-154 (1988)<\/li>\n<li>W.-Y. Tarn, J.A. Steitz, &#8220;Highly Diverged U4 and U6 Small Nuclear RNAs Required for Splicing Rare AT-AC Introns&#8221;, Science\u00a0<b>273<\/b>, 1824-1833 (1996)<\/li>\n<li>T.W. Nilsen, &#8220;A Parallel Spliceosome&#8221;, Science\u00a0<b>273<\/b>, 1813 (1996)<\/li>\n<li>P.A. Sharp, &#8220;On the Origin of RNA Splicing and Introns&#8221;, Cell\u00a0<b>42<\/b>, 397-400 (1985)<\/li>\n<li>A.M. Weiner &#8220;mRNA Splicing and Autocatalytic Introns: Distant Cousins or the Products of Chemical Determinism?&#8221;, Cell\u00a0<b>72<\/b>, 161-164 (1993)<\/li>\n<li>M.Belfort, M.E. Reaban, T. Coetzee, J.Z. Dalgaard, &#8220;Prokaryotic Introns and Inteins: a Panoply of Form and Function&#8221;, J. of Bacteriology\u00a0<b>177<\/b>, 3897-3903 (1995)<\/li>\n<li>B. Seraphin, L. Kretzner, M. Rosbash, &#8220;A U1 snRNA: pre-mRNA base pairing interaction is required early in yeast spliceosome assembly but does not uniquely define the 5\u00ed cleavage site&#8221;, EMBO J.\u00a0<b>7<\/b>, 2533-2538 (1988)<\/li>\n<li>Y. Zhuang, A.M. Weiner, &#8220;A compensatory base change in U1 snRNA suppresses a 5\u00ed splice site mutation&#8221;, Cell\u00a0<b>46<\/b>, 827-835 (1986)<\/li>\n<li>R. Parker, P.G. Siliciano, C. Guthrie, &#8220;Recognition of the TACTAAC box during mRNA splicing in yeast involves base pairing to the U2-like snRNA&#8221;, Cell\u00a0<b>49<\/b>, 229-239 (1987)<\/li>\n<li>J.A. Wu, J.L. Manley, &#8220;Mammalian pre-mRNA branch site selection by U2 snRNP involves base pairing&#8221;, Genes Dev.\u00a0<b>3<\/b>, 1553-1561 (1989)<\/li>\n<li>P. Legrain, B. Seraphin, M. Rosbash, &#8220;Early commitment of yeast pre-mRNA to the spliceosome pathway&#8221;, Mol. Cell Biol.\u00a0<b>8<\/b>, 3755-3760 (1988)<\/li>\n<li>S.W. Ruby, J.N. Abelson, &#8220;An early hierarchic role of U1 small nuclear ribonucleoprotein in spliceosome assembly&#8221;, Science\u00a0<b>242<\/b>, 1028-1035 (1988)<\/li>\n<li>B. Seraphin, M. Rosbash, &#8220;Identification of functional U1 snRNA &#8211; pre-mRNA complexes committed to spliceosome assembly and splicing&#8221;, Cell\u00a0<b>59<\/b>, 349-358 (1989)<\/li>\n<li>A.J.Newman, C. Norman, &#8220;U5 snRNA interacts with exon sequences at 5\u00ed and 3\u00ed splice sites&#8221;, Cell\u00a0<b>68<\/b>, 743-754 (1992)<\/li>\n<li>D.A. Wassarman, J.A. Steitz, &#8220;Interactions of small nuclear RNAs with precursor messenger RNA during in vitro splicing&#8221;, Science\u00a0<b>257<\/b>, 1918-1925 (1992)<\/li>\n<li>J.R. Wyatt, E.J. Sontheimer, J.A. Steitz, &#8220;Site-specific cross-linking of mammalian U5 snRNP to the 5\u00ed splice site before the first step of pre-mRNA splicing&#8221;, Genes Dev.\u00a0<b>6<\/b>, 2542-2553 (1992)<\/li>\n<li>H. Sawa, J.N. Abelson, &#8220;Evidence for a base-pairing interaction between U6 small nuclear RNA and 5\u00ed splice site during the splicing reaction in yeast&#8221;, Proc. Natl. Acaad. Sci. USA\u00a0<b>89<\/b>, 11269-11273 (1992)<\/li>\n<li>S. Kandels-Lewis, B. Seraphin, &#8220;Role of U6 snRNA in 5\u00ed splice site selection&#8221;, Science\u00a0<b>262<\/b>, 2035-2039 (1993)<\/li>\n<li>C.F. Lesser, C. Guthrie, &#8220;Mutations in U6 snRNA that alter splice site specificity: implications for the active site&#8221;, Science\u00a0<b>262<\/b>, 1982-1988 (1993)<\/li>\n<li>J.A. Wu, J.L. Manley, &#8220;Base pairing between U2 and U6 snRNAs is necessary for splicing of a mammalian pre-mRNA&#8221;, Nature\u00a0<b>352<\/b>, 818-821 (1991)<\/li>\n<li>B. Datta, A.M. Weiner, &#8220;Genetic evidence for base pairing between U2 and U6 anRNA in mammalian mRNA splicing&#8221;, Nature\u00a0<b>352<\/b>, 821-824, (1991)<\/li>\n<li>H.D. Madhani, C. Guthrie, &#8220;A novel base-pairing interaction between U2 and U6 snRNAs suggests a mechanism for the catalytic activation of the spliceosome&#8221;, Cell\u00a0<b>71<\/b>, 803-817 (1992)<\/li>\n<li>C.L. Peebles, P.S. Perlman, K.L. Mecklenburg, M.L. Petrillo, J.H. Tabor, &#8220;A self-splicing RNA excises an intron lariat&#8221;, Cell<b>\u00a044<\/b>, 213-223 (1986)<\/li>\n<li>R. van der Veen, A.C. Arnbegr, G. van der Horst, L. Bonen, H.F. Tabak, L.A. Grivell, &#8220;Excised group II introns in yeast mitochondria are lariats and can be formed by self-splicing in vitro&#8221;, Cell\u00a0<b>44<\/b>, 225-234 (1986)<\/li>\n<li>F.E. Penotti, &#8220;Human pre-mRNA splicing signals&#8221; J. Theor. Biol.\u00a0<b>150<\/b>, 385-420 (1991)<\/li>\n<li>E.J. Sontheimer, J.A. Steitz &#8220;The U5 and U6 small nuclear RNAs as active site components of the spliceosome&#8221;, Science\u00a0<b>262<\/b>, 1989-1996 (1993)<\/li>\n<li>A. Jacquier, N. Jacquesson-Breuleux &#8220;Splice site selection and role of the lariat in a group II intron&#8221;, J. Mol. Biol.\u00a0<b>219<\/b>, 415-428 (1991)<\/li>\n<li>C.C. Query, M.J. Moore, P.A. Sharp &#8220;Branch nucleophile selection in pre-mRNA splicing: evidence for the bulged duplex model&#8221;, Genes Dev.\u00a0<b>8<\/b>, 587-597 (1994)<\/li>\n<li>G.D. Stormo &#8220;Identifying coding sequences&#8221; in &#8220;Nucleic acid and protein sequence analysis, a practical approach&#8221; (IRL Press, Oxford Washington DC, ed. M.J. Bishop, C.J. Rawlings), 231-258 (1987)<\/li>\n<li>M. Gouy &#8220;Secondary structure prediction of RNA&#8221; in &#8220;Nucleic acid and protein sequence analysis, a practical approach&#8221; (IRL Press, Oxford Washington DC, ed. M.J. Bishop, C.J. Rawlings), 259-284 (1987)<\/li>\n<li>G. Zhang, K.L. Taneja, R.H. Singer, M.R. Green &#8220;Localization of pre-mRNA splicing in mammalian nuclei&#8221;, Nature\u00a0<b>372<\/b>, 809-812 (1994)<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>CHEKMENEV D.S. GNII genetika, 1st\u00a0Dorozhny proezd, Moscow, 113545, Russia; e-mail:\u00a0chicha@mail.cir.ru; Keywords: pre-mRNA, splicing, gene expression, intron structure, intron detection, RNA secondary structure Introduction More than 20 years have passed since the discovery of the pre-mRNA splicing. Many components of spliceosome &hellip; <a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/075_pre-mrna-splicing-in-eukaryotes-intron-structure-intron-detection-algorithms-and-data-structures\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":13,"featured_media":0,"parent":97,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/864"}],"collection":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/comments?post=864"}],"version-history":[{"count":4,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/864\/revisions"}],"predecessor-version":[{"id":1505,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/864\/revisions\/1505"}],"up":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/97"}],"wp:attachment":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/media?parent=864"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}