Home | Genome Browser | Blast | Search | Submissions | Help

Schmidtea mediterranea Genome Database

Nomenclature

I.              Nomenclature

a.     contigs: v31.019651

                                               i.     v31              assembly version 3.1

                                             ii.     019651          contig number padded to 6 digits 

b.     genes: mk4.019651.00

                                               i.     mk4             maker run 4

                                             ii.     0196510         contig number padded to 6 digits

                                            iii.     00               gene number padded to 2 digits

c.     transcripts: mk4.019651.00.01

                                               i.     mk4             maker run 4

                                             ii.     019651 contig number padded to 6 digits

                                            iii.     00               gene number padded to 2 digits

                                            iv.     01                transcript number padded to 2 digits

d.     est contigs: ec1.00159.004

                                               i.     ec1              est contig version 1

                                             ii.     00159           est contig number padded to 5 digits

                                            iii.     004              number of reads padded to 3 digits

e.     454 contigs: fc1.08641.005

                                               i.     fc1               454 contig version 1

                                             ii.     08641           454 contig number padded to 5 digits

                                            iii.     005              number of reads padded to 3 digits

II.            Browser Overview (Top to Bottom)

a.     Genome locator: Showing 1.071 kbp from v31.019651, positions 1 to 10,150

b.     Instruction Panel with search examples.

c.     Search Box

                                               i.     Terms for protein homology (piwi)

                                             ii.     Contig names (v31.019651)

                                            iii.     Gene names (mk4.019651.00)

                                            iv.     Use of wildcards(*) (mk4.0196*)

d.     Data source: Choose which dataset to view (SmedGD v1.0)

e.     Overview Panel

                                               i.     Simplified view of the entire genomic contig. 

                                             ii.     Red box indicates the area of the contig which is being viewed in Details panel (see below).  The red box can be recentered on a different area of the contig by clicking on a new area in the overview panel.  The size of the box and therefore the number of basepairs viewed in the details panel can be altered by changing the parameters in the Scroll/Zoom dropdown menu or by adding the desired region to be viewed in the Search Box of the contig. (v31.019651:1375..2445)

f.     Details Panel

                                               i.     Information that aligns to genome, everything from predicted genes to RNAi phenotype information.

g.     Tracks

                                               i.     The information in the Details panel is controlled by the selection of tracks. Tracks are types of information that are generated biologically and bioinformatically that have been aligned to the genomic contigs.

III.         Tracks

a.     Overview (v31.000001)

                                               i.     Viewed in the Overview panel

                                             ii.     HQ Genes

                                            iii.     Maker Genes

                                            iv.     Human Curated Genes

b.     Analysis

                                               i.     Restriction Sites: maps restriction sites to the genome.

c.     Genes

                                               i.     Predicted Genes/mRNAs with Evidence: This track is composed of genes/mRNAs predicted by Mark Yandell's Maker that are supported by High Quality evidence, such as est and protein homology.

                                             ii.     Predicted Genes/mRNAs: Any maker predictions that do not have High Quality evidence as support are located in this track.

                                            iii.     Human Curated Genes/mRNAs: Any human curated genes/mRNAs.  Currently, only genes investigated by Alejandro S‡nchez Alvarado with Apollo are presented in this track, and are identified by the three letter designation ASA, followed by a gene number and transcript number (ASA.00084.01).

d.     Phenotype/Expression:

                                               i.     cDNA microarray:  All ESTs were aligned to the genome using blat (see ESTs aligned with Blat).  The microarray result associated with a particular EST is mapped to every alignment of that EST to the genome.  A arrow centered on an EST indicates up- or down-regulation in comparison to the control (green for up, red for down, gray for no change) of that EST in the experiment listed in the arrow's label (above the arrow).  The logbase 2 change is displayed below the arrow. (AY066135.Irradiation.TimeCourse.24hr)

                                             ii.     RNAi Phenotype:  A bar that spans the length of the EST that was used to disrupt gene function by RNA intereference is employed in this track.  The description of the resulting phenotype is located under the bar. (RNAi:AY967490)

                                            iii.     in situ: A track that links to in situ images of the EST that is aligned to this particular region of the genome. (AY967481:insitu)

e.     Sequence Similarity:

                                               i.     454 contigs aligned with BLAT: Contigs prepared from 454 sequencing reads were aligned to the genome using the Blat algorithm and standard psl output. BLAT is designed to quickly find sequences of 90% and greater a score of 30. 

                                             ii.     BLASTX Hits: These are WUBLASTX (nucleotide to protein, via six-frame translation) similarity hits of genomic sequence, run against reference protein datasets from the genomes of E. coli, Saccharomyces cerevisiae, Drosophila melanogaster, C. elegans, platyhelminthes, Ciona intestinalis, mouse and human. Cutoffs of 40% identity and 1e-5 were used.

                                            iii.     BLASTX Exonerate: These are the outputs from BLASTX Hits alignments of the S. mediterranea genome to reference protein databases and polished with the splice-site aware algorithm Exonerate (protein2genome) to align protein to genomic sequences (Comput Appl Biosci 1997 Aug; 13(4) 477-8. pmid:9283765).

                                            iv.     EST BLASTN: These are contigs of S. mediterranea expressed sequence tags (ESTs) aligned to the S. mediterranea genome. Cutoffs of 85% identity and 1e-10 were used.

                                              v.     EST BLASTN Exonerate: These are contigs of S. mediterranea expressed sequence tags (ESTs) aligned to the S. mediterranea genome and polished with Exonerate (EST2genome) to align spliced DNA sequences to unspliced genomic DNA (Comput Appl Biosci 1997 Aug; 13(4) 477-8. pmid:9283765).

                                            vi.     ESTs aligned with BLAT: These are all available individual S. mediterranea expressed sequence tags (ESTs) aligned to the S. mediterranea genome using James Kent's BLAT program [ http://genome.cse.ucsc.edu/cgi-bin/hgBlat]. This track shows the best unique location for each EST (1e-95 with at least 45% alignment). Output is in the WUBLAST format.

                                           vii.     Maker predictions BLASTP to Swissprot: These are alignments of predicted proteins from Maker gene models against Swissprot. A cutoff of 1e-3 and 40% alignment with the protein hit was used.

                                         viii.     Maker predictions RPS-BLAST to PFAM domains: These are alignments of predicted proteins from Maker gene models against PFAM using NCBI RPS-Blast. A cutoff of 1e-3 and 40% alignment with the protein hit was used.

                                           ix.     Maker predictions RPS-BLAST to SMART domains: These are alignments of predicted proteins from Maker gene models against SMART using NCBI RPS-Blast. A cutoff of 1e-3 and 40% alignment with the protein hit was used.

                                             x.     Repeat Regions: This tracks shows areas of the genome containing interspersed repeats and low complexity DNA sequences as identified by RepeatMasker using a S. mediterranea specific repeat library.

f.     Species TBLASTN

                                               i.     C.elegans proteome TBLASTN against Genome: The C.elegans proteome aligned to the translated S.mediterranea genome.  Cutoffs of 1e-3 and 30% alignment with the protein query were used.

                                             ii.     D.mel proteome TBLASTN against Genome: The D.melanogaster proteome aligned to the translated S.mediterranea genome.  Cutoffs of 1e-3 and 30% alignment with the protein query were used.

                                            iii.     Human proteome TBLASTN against Genome: The H.sapien proteome aligned to the translated S.mediterranea genome.  Cutoffs of 1e-3 and 30% alignment with the protein query were used.

                                            iv.     M.mus proteome TBLASTN against Genome: The M.musculus proteome aligned to the translated S.mediterranea genome.  Cutoffs of 1e-3 and 30% alignment with the protein query were used.

g.     Tools

                                               i.     6-frame translation: Indicates the translation of sequence in all possible reading frames of the genomic sequence.

                                             ii.     Coding Segments: This track shows the reading frames of coding segments (also known as "CDS" features).

                                            iii.     DNA/GC Content: Shows the percentage of GC content along genomic DNA.  The S. mediterranea genome is AT rich.  If the zoom is set to 100bp, the actual sequence of the genomic contig can be viewed.

h.     Non-Coding RNA: mapping of mature miRNA sequences (RNA. 2006 Sep;12(9):1640-9. Epub 2006 Jul 18) to the genome with the use of BLAST (Marc Friedlaender). (sme-miR-2b)

IV.           Display Settings:

a.     Image Width

b.     Position of the Key or track names

V.             Add your own tracks: (help and more general help)

VI.           Details Pages and Links

a.     Gene: (mk4.000158.08) Details about the gene structure (exons, CDS, UTRs), location and sequence.

                                               i.     Position: links back to the browser with a zoomed in view of the gene.

b.     mRNA: (mk4.000158.06.01) Details about the mRNA structure (exons, CDS, UTRs) and location.

                                               i.     Name: retrieves protein and nucleotide seqeunce.

1.      Links to NCBI Blast and will auto-fill NCBI-Blast form with the retrieved sequence.

                                             ii.     Position: links back to the browser with a zoomed in view of the mRNA.

c.     miRNA: (sme-miR-2b) Details about the mature miRNA length and location.

                                               i.     Name: links to miRBase.

                                             ii.     Position: links back to the browser with a zoomed in view of the miRNA.

d.     cDNA Microarray: (AY066135.Irradiation.TimeCourse.24hr) Details about log2 ratio, aliases, and position in genome.

                                               i.     Position: links back to the browser with a zoomed view of the EST

e.     RNAi Phenotype: (RNAi:AY967490) Details about RNAi phenotypes, images, aliases and position in genome.

                                               i.     Position: links back to the browser with a zoomed view of the EST

f.     In Situ: (AY967481:insitu) Details about aliases, images and position in genome.

                                               i.     Position: links back to the browser with a zoomed view of the EST

g.     Blastn/Exonerate Blastn: (ec1.03089.014) Details about the EST alignment to the genome.

                                               i.     Name: retrieves nucleotide sequence of the EST or EST contig

                                             ii.     Position: links back to the browser with a zoomed in view of the EST or EST contig.

                                            iii.     Matches: searches for alignments of this EST/EST contig to other regions of the genome.

h.     ESTs Aligned with Blat: (DQ186986) Details about EST aliases and the EST alignment to the genome.

                                               i.     Name: retrieves nucleotide sequence of the EST

                                             ii.     Position: links back to the browser with a zoomed in view of the EST or EST contig.

                                            iii.     Matches: searches for alignments of this EST/EST contig to other regions of the genome.

                                            iv.     Target: links to NCBI

i.      454 contigs Aligned with Blat: (fc1.05410.002) Details about the 454 contig alignment to the genome.

                                               i.     Name: retrieves nucleotide sequence of the 454 contig

                                             ii.     Position: links back to the browser with a zoomed in view of the 454 contig.

                                            iii.     Matches: searches for alignments of this 454 contig to other regions of the genome.

j.      Repeat Regions: Details about the repeat and where it is found in the genome.

                                               i.     Position: links back to the browser with a zoomed in view of the repeat.

                                             ii.     Matches: searches for alignments of this repeat to other regions of the genome.

k.     Blastx/Exonerate Blastx: (gi|52345394|ref|NP_001004365.1| and ci0100130549) Details about the protein and the alignment to the genome.

                                               i.     Position: links back to the browser with a zoomed in view of the protein.

                                             ii.     Target: links to the NCBI for all gi numbers and to the JGI for all ci numbers

                                            iii.     Matches: searches for alignments of this protein to other regions of the genome.

l.      Maker predictions RPS-BLAST/BLASTP to PFAM/SMART/SWISSPROT (mk4.001111.00.01.pfam00069, mk4.001111.00.01.smart00220, and mk4.001111.01.01.O08795) Details about the protein/domain, the location in the genome, and Gene ontology terms.

                                               i.     Position: links back to the browser with a zoomed in view of the protein.

                                             ii.     Target: links to Pfam/Smart/Swissprot websites.

                                            iii.     Matches: searches for alignments of this protein to other regions of the genome.

m.    Species proteome TBLASTN against S.med Genome: (gi|17538133|ref|NP_496073.1| gi|48255891|ref|NP_001001329.1| gi|6679465|ref|NP_032951.1| gi|7298396|gb|AAF53621.1| ) Details about the protein and the location in the genome.

                                               i.     Position: links back to the browser with a zoomed in view of the protein.

                                             ii.     Target: links to NCBI for all gi numbers.

                                            iii.     Matches: searches for alignments of this protein to other regions of the genome.

                                            iv.     Note: links to FlyBase/WormBase/MouseEnsembl/HumanEntrezGene websites

VII.        Other Tools

a.     Blast

                                               i.     blastp/blastn/blastx against maker protein predicted sequences, maker predicted transcript sequences, or human curated protein sequences.

b.     Blat

                                               i.     Nucleotide vs genome sequence

                                             ii.     Results link to genome browser

c.     Search

                                               i.     Protein homology

                                             ii.     Gene Ontology

                                            iii.     RNAi phenotypes