{"id":941,"date":"2023-03-20T14:47:38","date_gmt":"2023-03-20T07:47:38","guid":{"rendered":"https:\/\/conf.icgbio.ru\/bgrs98\/?page_id=941"},"modified":"2023-09-04T15:25:11","modified_gmt":"2023-09-04T08:25:11","slug":"080_characterization-of-the-compact-model-genome-of-the-japanese-puffer-fish-fugu-rubripes-using-a-cosmid-sequence-scanning-approach","status":"publish","type":"page","link":"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/080_characterization-of-the-compact-model-genome-of-the-japanese-puffer-fish-fugu-rubripes-using-a-cosmid-sequence-scanning-approach\/","title":{"rendered":"CHARACTERIZATION OF THE COMPACT MODEL GENOME OF THE JAPANESE PUFFER FISH (FUGU RUBRIPES) USING A COSMID SEQUENCE SCANNING APPROACH"},"content":{"rendered":"<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#elgar\">ELGAR G.<\/a><sup>+<\/sup>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#clark\">CLARK M.S.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#edwards\">EDWARDS Y.J.K.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#meek\">MEEK S.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#smith\">SMITH S.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#umrania\">UMRANIA Y.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#warner\">WARNER S.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#williams\">WILLIAMS G.<\/a>,\u00a0<a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/authors-index\/#bishop\">BISHOP M.J.<\/a><\/p>\n<p>UK Human Genome Mapping Project Resource Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SB, UK.;<br \/>\ne-mail:\u00a0gelgar@hgmp.mrc.ac.uk;<\/p>\n<p>+Corresponding author<\/p>\n<p><a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/keywords-index\/\">Keywords<\/a>: japanese puffer fish genome, cosmid sequence scanning, genome characterization, minimalist model genome<\/p>\n<p><b>Abstract<\/b><\/p>\n<p>In comparison with other vertebrate genomes, the genome of\u00a0<i>Fugu rubripes<\/i>\u00a0(<i>Fugu<\/i>) comprises less repetitive DNA sequence, smaller intergenic regions, smaller introns and is gene dense [1,2,3]. In the light of the above, a landmark map is being generated for the genome of\u00a0<i>Fugu<\/i>.<i>\u00a0<\/i>The aim of the\u00a0<i>Fugu<\/i>\u00a0Landmark Mapping Project is to generate partial sequences from 1000 cosmids using a random shotgun approach to provide reference points covering 10% of the\u00a0<i>Fugu<\/i>\u00a0genome. The\u00a0<i>Fugu<\/i>\u00a0Landmark Mapping Project was initiated in November 1996. This DNA sequence will cover 10% of the genome and provide data on short range linkage in\u00a0<i>Fugu<\/i>\u00a0and synteny with higher vertebrates. The\u00a0<i>Fugu<\/i>\u00a0Landmark Mapping Project has been running for over two years and 1000 cosmids will be completed around mid August 1998. The general characteristics of the minimalist model genome are described.<b><\/b><\/p>\n<p><b>1. Introduction<\/b><i><\/i><\/p>\n<p><i>Fugu<\/i>\u00a0has one of the smallest vertebrate genome which comprises approximately 400Mb. As an established model vertebrate genome, it is an economical tool on which to perform comparative sequence analyses both in hybridization experiments and molecular bioinformatics studies [1,2,3]. The\u00a0<i>Fugu<\/i>\u00a0Landmark Mapping Project primarily involves large scale single pass sequencing and sequence analysis on\u00a0<i>Fugu<\/i>\u00a0DNA. Sequence analyses have identified genomic regions likely to code for proteins, syntenic genes similar to that observed in the genome of other species, repetitive DNA sequences such as microsatellites and minisatellites and conserved regulatory regions in equivalent gene structures. The\u00a0<i>Fugu<\/i>\u00a0DNA sequence database at the time of writing this document contains over 22Mb of DNA and this is estimated to cover about 7% of the genome.<b><\/b><\/p>\n<p><b>2. Constructing the<i>\u00a0Fugu<\/i>\u00a0DNA Sequence Database<\/b><\/p>\n<p>Cosmids from the<i>\u00a0Fugu<\/i>\u00a0genomic library are selected randomly and sequenced using a shotgun scanning approach. DNA is prepared from cosmids by alkaline lysis and sonicated to an average size of 500-1000bp. These fragments are cloned into pBluescript II KS and sequenced from T3\/T7 primer generated PCR products with KS primer giving an average clone length of 480bp. Approximately 50 clones are sequenced per cosmid. Primer and vector sequences are identified and clipped off. Sequences that comprise less than 60 bases or have sequence ambiguities of 10% or more are discarded. The sequences are screened for significant similarities using BLAST [4] matches against the EMBL, TrEMBL, and SWISSPROT databases. Identical or near identical matches with E. coli sequences are removed. The resulting cosmid-subcloned sequences are searched systematically for various types of protein coding regions, microsatellite, minisatellite sequences. Detailed information connected with the<i>\u00a0Fugu<\/i>\u00a0Landmark Mapping Project such as the project description, project protocols, cosmid sequences, clone sequences and the BLAST matches to EMBL, TrEMBL and SWISSPROT databases is available via the UK Human Genome Mapping Project Resource Centre\u00eds World Wide Web site at the following URL\u00a0http:\/\/<i>Fugu<\/i>.hgmp.mrc.ac.uk.<b><\/b><\/p>\n<p><b>3. Identification and Characterization of Protein Coding Regions<\/b><\/p>\n<p>Each cosmid contains on average 50 clones approximating to 50% single pass coverage per cosmid. There are on average 3 strong gene hits per cosmid idetified using BLAST. Whilst the majority of searches reveal homology to mammalian genes, a significant proportion match\u00a0<i>yeast S. cerevisiae<\/i>\u00a0and\u00a0<i>C. elegans<\/i>\u00a0hypothetical genes. An increased number of conserved linkages in the genomes of\u00a0<i>Fugu<\/i>\u00a0and human have been discovered (Table 1) along with several evolutionary breakpoint regions.<\/p>\n<p>Apart from the significant similarities using BLAST matches against the protein sequence databases, several gene prediction programs were tested on the<i>\u00a0Fugu<\/i>\u00a0Landmark Mapping DNA sequences. This type of analyses is useful to identify new genes likely to be present in the genomic sequences of<i>\u00a0Fugu<\/i>. The Genemark program [5] contains a<i>\u00a0Fugu-<\/i>DNA matrix that performed most successfully. A random subset of output files were subjected to further analysis using the information of known genes established from BLAST. Genemark proved effective at predicting the region of an ORF in the correct reading frame. This is particularly impressive considering that the maximum length of each clone is 650bp. Analysis of the Genemark ORFs enabled a re-estimation of the total percentage of protein coding regions in the<i>\u00a0Fugu<\/i>\u00a0genome. 14% of the genome is predicted to code for protein. This prediction is based on 529 cosmids (26,220 clones; 11.338Mb of\u00a0<i>Fugu<\/i>\u00a0DNA) which have 10,986 ORFs assigned by Genemark. The predicted quantity of protein coding in\u00a0<i>Fugu<\/i>\u00a0here is similar with a previous estimate in an earlier study in which 128Kb\u00a0<i>Fugu<\/i>\u00a0DNA was used [1]. This preliminary analysis of ORFs using the Genemark program indicates that about 60% of ORFs show no significant sequence homology to genes currently in any database and this finding is typical of other genomic sequencing projects; an example being the complete DNA sequence of yeast\u00a0<i>Saccharomyces cerevisiae<\/i>\u00a0[6].<\/p>\n<p>Table 1. Examples of Conserved Synteny in\u00a0<i>Fugu<\/i><\/p>\n<table border=\"1\" width=\"100%\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"TOP\" width=\"42%\"><b>Physically linked genes in\u00a0<i>Fugu<\/i><\/b><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><b>Distance apart (kb)<\/b><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\"><b>Human chromosome assignment<\/b><\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Cannabis receptor<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">Both on one cosmid<sup>a<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">6q14-q15<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">GABA rho receptor<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">6q14-q21<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Tyrosine hydoxylase<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">10kb of each other<sup>b<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">11p15.5<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Nucleosome assemby protein<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">11p<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Insulin-like growth factor receptor II<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">11p15.5<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Phenylalanine hydroxylase<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">20kb of each other<sup>c<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">12q22<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Insulin-like growth factor receptor I<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">12q22-q24<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">TSC2<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">1kb of each other<sup>d<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">16p13.3<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">ADPKDI<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">16p13.3<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">wntI<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<sup>e<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">12q13<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">erb-b3<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">12q13<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">ADP ribosyl factor 3<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">Unassigned<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">wnt 10b<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">Unassigned<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">fos<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<sup>f<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">14q24.3<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">s31 Golgi transport protein<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">14q24.3<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">s20i15 transcription factor<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">14q24.3<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">7SL RNA gene<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">Unassigned<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Activating factor 3<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">Unassigned<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">fos-like gene<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">14q24.3<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Dihydrolipoamide succinyl transferase<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">14q24.3<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Hox b cluster (nine genes)<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">90kb<sup>g<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">17q21-22<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Hox c cluster (nine genes)<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">66kb<sup>h<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">12q12-q13<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Ig VH genes (at least 6 genes)<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<sup>i<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">14q32.3<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Surf-2<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<sup>j<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">9q34<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Surf-4<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">9q34<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">ASS<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">9q34<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Dynamin<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">9q34<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Golgin-95<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">Unassigned<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Basic transcription factor<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">11p15<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Mucin 2<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">11p15.5<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Eosinophil peroxidase precursor<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">11p12-p2<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">3-oxo-5-alpha-steroid-dehydrogenase I<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">5p15<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Adenylate cyclase II<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">5p15.2-15.1<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Wt I<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<sup>k<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">11p13<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">RCN<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">11p13<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Pax 6<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">11p13<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">72KD type IV collagenase precursor<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">16q<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Sodium-dependentNor-adrenalin transporter<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">16q<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Tyrosine kinase CAK precursor<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">1q33<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Regulator of G-protein signalling<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">1q33<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Topoisomerase I<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<sup>l<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">20q<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Phospholipase C gamma I<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">20q<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">KIAA0181<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">20q<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Complement C8 alpha<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<sup>m<\/sup><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">1p32<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Complement C8 beta<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">1p32<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">MTF-1<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on one cosmid<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">1p32-34<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">IT5-P<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">1p34<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">HCGV<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\">All on two cosmids<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">6p21.3<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Tenascin-like<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">6<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">Valyl tRNA synthetase<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">6<\/td>\n<\/tr>\n<tr>\n<td valign=\"TOP\" width=\"42%\">AA008813<\/td>\n<td align=\"center\" valign=\"TOP\" width=\"28%\"><\/td>\n<td align=\"center\" valign=\"TOP\" width=\"30%\">6p21.3<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Table References:\u00a0<sup>a<\/sup>Yamaguchi (unpublished data);\u00a0<sup>b<\/sup>Sandford (unpublished data);\u00a0<sup>c<\/sup>Sandford (unpublished data);\u00a0<sup>d<\/sup>Sandford et al (1996) Genomics 38:84-86;\u00a0<sup>e<\/sup>Gellner (unpublished data);\u00a0<sup>f<\/sup>Trower et al (1996) Proc. Natl Acad. Sci. (USA) 93:1366-1369;\u00a0<sup>g<\/sup>Aparicio et al (1997) Nat. genet. 16:79-83;\u00a0<sup>h<\/sup>Aparicio et al (1997) Nat. genet. 16:79-83;\u00a0<sup>i<\/sup>Peixoto (unpublished data);\u00a0<sup>j<\/sup>Bouchireb (unpublished data);\u00a0<sup>k<\/sup>Miles et al (unpublished data);\u00a0<sup>l<\/sup>Smith (unpublished data);\u00a0<sup>m<\/sup>Yeo (unpublished data).<b><\/b><\/p>\n<p><b>4. Identification and Characterization of Microsatellites and Minisatellites<\/b><\/p>\n<p>501 theoretically possible microsatellites with a repeat unit of 1-6 bp were used to query 11.338Mb of<i>\u00a0Fugu<\/i>\u00a0DNA [7] using a Smith Waterman based algorithm in Censor [8]. In decreasing order, the twenty most frequently occurring microsatellites are AC, A, C, AGG, AG, AGC, AAT, AAAT, ACAG, ACGC, ATCC, AAC, ATC, AGGG, AAAG, AAG, AAAC, AT, CCG and TTAGGG. The twenty most frequently occurring microsatellites represent 81.79% of the total microsatellites identified. One microsatellite occurs every 1.876Kb of DNA in<i>\u00a0Fugu<\/i>. 11.55% of the microsatellites are detected in open reading frames (ORFs) that are predicted protein coding regions by Genemark [5]. With respect to the proportion of microsatellites present in ORFs and the total abundance (bp) of the total microsatellites, the genome of<i>\u00a0Fugu\u00a0<\/i>is similar to the genome of other vertebrate species. Previous estimates performed on genomic sequnces of primates, human, rat, rabbit, pig and chicken indicate that approximately 1% of many vertebrate genomes are comprised of microsatellites. However, many differences prevail in the abundance and frequency of the individual microsatellite classes. Many of the frequently occurring microsatellites in\u00a0<i>Fugu<\/i>\u00a0are known to code in other species for regions in proteins such as transcription factors, whilst others are associated with known functions, such as transcription factor binding sites in DNA sequences. Therefore, it is likely that microsatellites have a very significant and major role in evolution of genes, genomes and organisms.<\/p>\n<p>The identification of microsatellite DNA in<i>\u00a0Fugu<\/i>\u00a0has proved essential for the identification and characterization of other repetitive structural units such as minisatellites. For the identification of similar sequences within the<i>\u00a0Fugu<\/i>\u00a0genome that are eukaryotic, vertebrate, fish or<i>\u00a0Fugu<\/i>\u00a0specific, cluster analysis is required. Such clustering of genomic sequences requires the identification and masking out of microsatellites which would otherwise hinder clustering studies. Clustering analysis is carried out using ICAtools suite of programs [9]. The minisatellite classes identified in<i>\u00a0Fugu<\/i>\u00a0appear few in number and repeated at low to medium numbers. These are being characterised further. The considerations given to the two broad classes of repeat structures in<i>\u00a0Fugu<\/i>\u00a0i.e., the minisatellites and transposable elements, suggest that these two main classes of repeats comprise less than 10% of the genome. It is these two classes of repetitive elements that are significantly reduced in<i>\u00a0Fugu<\/i>\u00a0compared with other vertebrate species such as human and primates, and not microsatellite sequences.<b><\/b><\/p>\n<p><b>5. Conclusions &amp; The Future<\/b><\/p>\n<p>The general properties of the<i>\u00a0Fugu<\/i>\u00a0genome are well established and these include a conserved gene structure with other vertebrate species, small introns, similar codon usage and splice signals and homologous coding sequences. With respect to the analyses of the regulatory networks associated with equivalent and completely sequenced gene systems in different species, comparative sequence analysis are useful in understanding functions and functional relationships. For example, there are a growing number of examples where conserved elements in the 5\u00ed regions of equivalent gene structures present in different species were subsequently shown to be functional regulatory regions in mammals and the pufferfish as seen in the Hox [10], the Otx2 [11] and the oxytocin gene structures [12]. The\u00a0<i>Fugu<\/i>\u00a0Landmark Mapping Project has been primarily concerned with the partial single-pass sequence scanning of cosmids and now the genomic research has moved towards the complete cosmid sequencing, assembling and finishing of key genomic regions identified from sequence scanning in\u00a0<i>Fugu<\/i>\u00a0to permit more detailed comparison with the human genome especially with regards to studying regulatory networks for genomic structure and syntenic genes.<b><\/b><\/p>\n<p><b>Acknowledgements<\/b><\/p>\n<p>This work is financially supported by the MRC grant awarded to Dr. Greg Elgar and Professor Sydney Brenner for the<i>\u00a0Fugu<\/i>\u00a0Landmark Mapping Project and the EC TRAnscription Databases and Analysis Tools (TRADAT) project (Contract: BIO4-CT95-0226).<b><\/b><\/p>\n<p><strong>References<\/strong><\/p>\n<ol>\n<li>S. Brenner, G. Elgar, R. Sandford, A. Macrae, B. Venkatesh, &amp; S. Aparicio. (1993). Characterization of the pufferfish (<i>Fugu<\/i>) genome as a compact model vertebrate genome. Nature,\u00a0<b>366<\/b>:265-268.<\/li>\n<li>G. Elgar (1996). Quality not quantity: the pufferfish genome. Human Molecular Genetics,\u00a0<b>5<\/b>: 1437-1442.<\/li>\n<li>G. Elgar, R. Sandford, S. Aparicio, A. Macrae, B. Venkatesh, &amp; S. Brenner (1996). Small is beautiful &#8211; comparative genomics with the pufferfish (Fugu rubripes). Trends in Genetics,\u00a0<b>12<\/b>:145-150.<\/li>\n<li>S.F. Altschul, W. Gish, W. Miller, E.W. Myers, &amp; D.J. Lipman (1990). Basic local alignment search tool. J. Mol. Biol.,\u00a0<b>215<\/b>:403-410.<\/li>\n<li>M. Borodovsky &amp; J. McIninch (1993). GENEMARK &#8211; Parallel gene recognition for both dna strands. Comp. Chem.,\u00a0<b>17<\/b>:123-133.<\/li>\n<li>D. Botstein, S.A. Chervitz, &amp; J.M. Cherry (1997). Yeast as a model organism. Science,\u00a0<b>277<\/b>:1259-1260.<\/li>\n<li>Y.J.K. Edwards, G. Elgar, M.S. Clark &amp; M.J. Bishop (1998). The identification and characterization of microsatellites in the compact genome of the Japanese pufferfish,<i>\u00a0Fugu rubripes<\/i>: perspectives in functional and comparative genomic analyses. J. Mol. Biol.,\u00a0<b>278<\/b>:843-854.<\/li>\n<li>J. Jurka, P. Klonowski, V. Dagman, &amp; P. Pelton (1996). Censor &#8211; a program for identification and elimination of repetitive elements from DNA sequences. Comp. Chem.\u00a0<b>20<\/b>:119-121.<\/li>\n<li>9a. J.D. Parsons, S. Brenner &amp; M.J. Bishop (1992). Clustering cDNA sequences. CABIOS,\u00a0<b>8<\/b>:461-466. 9b. J.D. Parsons (1995). Improved tools for DNA comparison and clustering. CABIOS,\u00a0<b>11<\/b>:603-613.<\/li>\n<li>S. Aparicio, A. Morrison, A. Gould, J. Gilthorpe, C. Chaudhuri, P. Rigby, R. Krumlauf, &amp; S. Brenner. (1995). Detecting conserved regulatory elements with the model genome of the Japanese puffer fish,<i>\u00a0Fugu rubripes<\/i>. Proc. Natl. Acad. Sci. USA,\u00a0<b>92<\/b>:1684-1688.<\/li>\n<li>C. Kimura, N. Takeda, M. Suzuki, M. Oshimura, S. Aizawa &amp; I. Matsuo. (1997). Cis-acting elements conserved between mouse and pufferfish Otx2 genes govern the expression in mesencephalic neural crest. Cells Development,\u00a0<b>124<\/b>:3929-3941.<\/li>\n<li>B. Venkatesh, S.L. Sihoe, D. Murphy &amp; S. Brenner (1997). Transgenic rats reveal functional conservation of regulatory controls between the<i>\u00a0Fugu<\/i>\u00a0isotocin and rat oxytocin genes. Proc. Natl. Acad. Sci. USA,\u00a0<b>94<\/b>:12462-12466.<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>ELGAR G.+,\u00a0CLARK M.S.,\u00a0EDWARDS Y.J.K.,\u00a0MEEK S.,\u00a0SMITH S.,\u00a0UMRANIA Y.,\u00a0WARNER S.,\u00a0WILLIAMS G.,\u00a0BISHOP M.J. UK Human Genome Mapping Project Resource Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SB, UK.; e-mail:\u00a0gelgar@hgmp.mrc.ac.uk; +Corresponding author Keywords: japanese puffer fish genome, cosmid sequence scanning, genome characterization, minimalist &hellip; <a href=\"https:\/\/conf.icgbio.ru\/bgrs98\/abstracts\/abstract-list\/080_characterization-of-the-compact-model-genome-of-the-japanese-puffer-fish-fugu-rubripes-using-a-cosmid-sequence-scanning-approach\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":13,"featured_media":0,"parent":97,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/941"}],"collection":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/comments?post=941"}],"version-history":[{"count":4,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/941\/revisions"}],"predecessor-version":[{"id":1509,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/941\/revisions\/1509"}],"up":[{"embeddable":true,"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/pages\/97"}],"wp:attachment":[{"href":"https:\/\/conf.icgbio.ru\/bgrs98\/wp-json\/wp\/v2\/media?parent=941"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}