Microexon ID Sm_GL377619:235408-235416:+
Species Selaginella moellendorffii
Coordinates GL377619:235408..235416
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TGTACTTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAATACTTGCCGAAAGGAAGGACAGACGAGGAAAGGAAGACAAGTGTACTTAGGGGGATATGATGCGGAGGAGAAGGCAGCAAGAGCTTACGATCTGGCAGCT
Microexon-tag Amino Acid Seq WDNTCRKEGQTRKGRQVYLGGYDAEEKAARAYDLAA
Microexon-tag spanning region235244-235537
Microexon-tag prediction score0.9749
Overlapped with the annotated transcript (%) 91.67
New Transcript ID EFJ16224x
Reference Transcript ID EFJ16224
Gene ID SELMODRAFT_17095
Gene Name NA
Sm_GL377619:235408-235416:+ does not have available information here.
Microexon DNA seq TGTACTTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAATACTTGCCGAAAGGAAGGACAGACGAGGAAAGGAAGACAAGTGTACTTAGGGGGATATGATGCGGAGGAGAAGGCAGCAAGAGCTTACGATCTGGCAGCT
Microexon-tag Amino Acid seq WDNTCRKEGQTRKGRQVYLGGYDAEEKAARAYDLAA
Transcript ID Sm.21177.1
Gene ID Sm.21177
Gene Name NA
Pfam domain motif AP2
Motif E-value 1e-12
Motif start 252
Motif end 311
Protein seq >Sm.21177.1
MQQPVEDCQRGSSIRSDNTFQSSSSPALPIQMQHHDFMGQGTLEHPHGFVDHHYHLSPGMISTRYGAISNSMVASLQSPY
NFQLLRRTLLPVSAFAEQQRAQVSSSPPSPLLSDSHRQIDQTSAQQQQQHGTTNLSTLKSLLRQPQENSHDDHLPSMDQS
HHQSAIVNHQSLTLSMSSGSQSSNTDSSSIAAHHDHQPLVTFQDQELPPPPSSSATITAINLPRTTGSKKRLGKNKAPNL
RKSIDTFGQRTSVFRGVTRHRWTGRFEAHLWDNTCRKEGQTRKGRQVYLGGYDAEEKAARAYDLAALKYWGPTTTTNFPA
VEYHSKLNEMKSMSRQAFVAALRRKSSGFARGASRFRGVTRHHQQGRWQARIGRVAGNKDLYLGTFSTEEEAAEAYDIAA
IKFRGASAVTNFDMSHYDLRRICSSPSLLLGDTARIKHKEAPDAANDQSATTAADEPICDDGGALVTSSVPQDSLANQEH
NYAGAAGPGVLRNLIGLDTFSCQADHPEAPDTSSSDGGGGGGGVPSSSVAQHDDQLYTPAPPDYSTGSLMPSPLQTLHAM
RSTIPAHMPYFSAWNESE*
CDS seq >Sm.21177.1
ATGCAGCAGCCGGTAGAAGATTGCCAGAGGGGTTCTTCCATTCGCAGCGACAACACCTTCCAGAGCTCCAGTTCTCCAGC
GCTCCCGATCCAGATGCAGCACCACGATTTCATGGGCCAAGGTACCCTGGAGCATCCCCACGGCTTCGTCGATCACCACT
ACCATCTCTCGCCAGGGATGATCAGCACAAGATACGGCGCTATTTCCAACTCCATGGTCGCCTCACTGCAGTCGCCATAC
AACTTTCAGCTATTAAGGAGAACGCTTCTTCCGGTAAGTGCCTTCGCAGAACAGCAGCGAGCACAAGTCTCTTCGTCTCC
TCCTTCGCCGCTTCTCAGCGATTCTCACCGCCAGATCGATCAGACGAGCGCGCAGCAGCAGCAGCAGCACGGCACTACCA
ATTTGTCCACTCTCAAATCTTTACTGAGGCAGCCGCAAGAGAATAGCCACGACGATCATCTTCCTTCCATGGATCAAAGC
CACCACCAGAGCGCTATCGTCAATCATCAGTCGCTGACTTTGTCCATGAGTTCCGGCTCACAGTCGAGCAACACCGATTC
GTCTTCCATCGCCGCCCACCACGATCACCAGCCGCTCGTGACCTTCCAGGACCAGGAGCTACCACCGCCGCCATCCTCGT
CCGCCACCATCACCGCCATCAATCTTCCAAGGACCACCGGTAGCAAGAAGAGGCTCGGCAAGAACAAGGCGCCCAACTTG
CGCAAATCGATCGATACTTTCGGGCAGCGAACATCGGTCTTTCGGGGTGTGACCAGGCATCGCTGGACTGGGCGTTTTGA
GGCTCATCTCTGGGACAATACTTGCCGAAAGGAAGGACAGACGAGGAAAGGAAGACAAGTGTACTTAGGGGGATATGATG
CGGAGGAGAAGGCAGCAAGAGCTTACGATCTGGCAGCTCTCAAGTACTGGGGGCCTACGACAACTACAAACTTTCCGGCT
GTGGAGTATCACTCCAAGTTAAATGAGATGAAGAGTATGAGCAGACAGGCATTTGTTGCTGCGCTCAGGAGGAAGAGCAG
TGGCTTCGCGAGAGGTGCATCGAGGTTCAGGGGTGTAACAAGGCATCACCAGCAAGGAAGATGGCAAGCTCGGATCGGAC
GAGTTGCCGGGAATAAAGATCTGTACCTCGGAACTTTCAGTACTGAAGAAGAAGCTGCCGAAGCTTACGACATCGCAGCC
ATCAAGTTCCGCGGTGCAAGCGCGGTGACAAACTTCGACATGAGTCACTACGACTTGAGGCGGATTTGCTCCAGCCCGAG
CTTGCTACTGGGTGACACCGCCAGGATCAAGCACAAGGAAGCTCCCGACGCAGCGAACGATCAGAGCGCCACCACAGCAG
CTGACGAACCGATATGTGATGATGGAGGAGCTCTCGTAACTTCTTCCGTTCCTCAAGACTCTCTCGCTAATCAGGAGCAC
AACTATGCTGGTGCCGCGGGGCCGGGAGTTCTTCGCAACCTCATTGGACTGGACACGTTCTCTTGTCAAGCTGATCATCC
TGAAGCTCCCGATACGAGCTCCTCGGATGGTGGTGGTGGAGGTGGCGGAGTTCCAAGTTCCTCCGTAGCTCAGCACGACG
ATCAGCTTTATACTCCGGCGCCGCCAGATTACAGCACCGGATCCCTGATGCCATCACCGCTGCAAACTCTGCACGCGATG
AGGTCCACAATCCCGGCACACATGCCTTACTTCTCAGCCTGGAACGAGTCGGAGTGA