Microexon ID Sm_GL377586:1405808-1405816:+
Species Selaginella moellendorffii
Coordinates GL377586:1405808..1405816
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq AGCGAGAGAACGGCATTCCATTTCCAACCGCGGAACAACTGGATGAACGATCCAAATGGTCCTCTCTTCTACAAAGGCTACTACCACTTGTTCTACCAGTACAATCCA
Microexon-tag Amino Acid Seq SERTAFHFQPRNNWMNDPNGPLFYKGYYHLFYQYNP
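The fields above can be cross-checked against one another. The following is a minimal sketch, not the database's own pipeline: it splits the microexon-tag by the listed structure (49,9,50), translates the tag in frame 0 with the standard genetic code, and reports which residues overlap the microexon. The helper names (`split_tag`, `translate`) are hypothetical, and the reading of "Phase" as the number of codon bases contributed by the upstream flanking exon is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Values copied from the record.
TAG_DNA = (
    "AGCGAGAGAACGGCATTCCATTTCCAACCGCGGAACAACTGGATGAACG"
    "ATCCAAATG"
    "GTCCTCTCTTCTACAAAGGCTACTACCACTTGTTCTACCAGTACAATCCA"
)
STRUCTURE = (49, 9, 50)  # flanking exon, microexon, flanking exon


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes (hypothetical helper)."""
    if sum(sizes) != len(tag):
        raise ValueError("structure sizes do not add up to the tag length")
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    return parts


def translate(dna):
    """Translate in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


upstream, microexon, downstream = split_tag(TAG_DNA, STRUCTURE)
protein = translate(TAG_DNA)

# Residues whose codons contain at least one microexon base.
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
microexon_peptide = protein[first_codon:last_codon + 1]

# Assumed meaning of "Phase": codon bases supplied by the upstream exon.
phase = len(upstream) % 3

print(microexon, protein, microexon_peptide, phase)
```

Under these assumptions the 9-nt microexon touches four codons, which is why the record lists a four-residue peptide for it: the first and last codons are completed by bases from the flanking exons.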
Microexon-tag spanning region 1405653-1405929
Microexon-tag prediction score 0.9461
Overlapped with the annotated transcript (%) 91.67
New Transcript ID EFJ25584x
Reference Transcript ID EFJ25584
Gene ID SELMODRAFT_98949
Gene Name NA
Transcript ID Sm.12385.1
Gene ID Sm.12385
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.8e-104
Motif start 85
Motif end 405
Protein seq
>Sm.12385.1
MDPESQRSYAELPSGEEAEEIHSNPTAATENSRNRRIDLVLTLVGICCVVAGTFFWISLPSSNTENFSRIAPDGGFLASE
RTAFHFQPRNNWMNDPNGPLFYKGYYHLFYQYNPYGVEWGNISWGHAVSTDLLHWQHMDLAMQPDKWYDADGVWSGSATI
LPNGQVIMLYTGSTNASVQVQNLALPLNTSDPLLREWIKIPENPILVPPPGIAPKDFRDPTTAWLEADGLWRIAIGAKKG
RAGLALIYKTFDFLHWELEEEYLHTVQGTGMWECIDFYPVSTATSNGLDTSKVQTNELTKHILKASLDDDKHDYYAIGLY
SESSHTWIPDALDNDVGLGLRYDYGKYYASKTFFDSKHQRRILWGWANESDSLQDDIRKGWSSVQTLPRILYLDNLTGTN
LIQWPIEEVDALRHDKVSRSNVLLKGGDVVEVDAAQGAQLDIEVGFEYPDASKLDALPESENYDCSQGGATHRGVYGPFG
LLVLAEDKLQEMTAVYFYMTLKRDGSWETRVCSDQSRSSLEPGIDTTVYGTLFHRLPTEDSLSLRVIVDHSIVETFVQGG
RACITSRVYPTLATGDKARLFMFNNGTQPVVVKNLDAWKMRSTTLSVLPVTEWRLAARQS*
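To relate the microexon to the Pfam motif coordinates, the sketch below locates the microexon-tag peptide and the microexon peptide in the protein and compares their positions with the motif start and end. It assumes that the motif coordinates are 1-based, inclusive residue positions on Sm.12385.1, which the record does not state. Only the first 120 residues of the protein are embedded to keep the example short, and `locate` is a hypothetical helper.

```python
# First 120 residues of Sm.12385.1, copied from the record.
PROTEIN_PREFIX = (
    "MDPESQRSYAELPSGEEAEEIHSNPTAATENSRNRRIDLVLTLVGICCVVAGTFFWISLPSSNTENFSRIAPDGGFLASE"
    "RTAFHFQPRNNWMNDPNGPLFYKGYYHLFYQYNPYGVEWG"
)
TAG_PEPTIDE = "SERTAFHFQPRNNWMNDPNGPLFYKGYYHLFYQYNP"
MICROEXON_PEPTIDE = "DPNG"

# Assumption: 1-based, inclusive positions on the protein.
MOTIF_START, MOTIF_END = 85, 405


def locate(peptide, protein):
    """Return the 1-based inclusive span of the first occurrence of peptide."""
    index = protein.find(peptide)
    if index < 0:
        raise ValueError("peptide not found in protein")
    return index + 1, index + len(peptide)


tag_span = locate(TAG_PEPTIDE, PROTEIN_PREFIX)
microexon_span = locate(MICROEXON_PEPTIDE, PROTEIN_PREFIX)

# True when the whole microexon peptide lies within the motif boundaries.
microexon_in_motif = (
    MOTIF_START <= microexon_span[0] and microexon_span[1] <= MOTIF_END
)

print(tag_span, microexon_span, microexon_in_motif)
```

If the coordinate assumption holds, the microexon-encoded residues fall near the N-terminal end of the Glyco_hydro_32N match, and the microexon-tag begins a few residues upstream of the motif start.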
CDS seq
>Sm.12385.1
ATGGATCCCGAATCTCAGCGGAGCTATGCGGAGCTCCCATCAGGGGAGGAGGCGGAGGAGATTCATTCAAATCCCACTGC
GGCGACGGAGAATTCCAGGAACAGACGCATTGATCTAGTCCTGACGCTGGTGGGGATTTGCTGCGTCGTCGCTGGAACTT
TTTTCTGGATCTCACTTCCTTCGAGCAACACCGAGAATTTCTCACGGATTGCTCCGGACGGGGGTTTCTTGGCCAGCGAG
AGAACGGCATTCCATTTCCAACCGCGGAACAACTGGATGAACGATCCAAATGGTCCTCTCTTCTACAAAGGCTACTACCA
CTTGTTCTACCAGTACAATCCATATGGAGTAGAGTGGGGAAACATCTCCTGGGGCCATGCTGTGTCCACTGATCTCCTCC
ACTGGCAACATATGGATCTGGCAATGCAGCCAGACAAGTGGTATGATGCAGATGGAGTGTGGTCTGGCTCTGCAACAATC
CTCCCCAATGGTCAAGTGATCATGCTCTACACTGGCTCCACAAATGCCTCTGTTCAGGTCCAAAATCTCGCTCTTCCACT
CAACACCTCAGATCCACTCCTGAGGGAGTGGATCAAAATCCCAGAGAATCCCATTCTGGTGCCACCACCAGGGATTGCTC
CTAAAGACTTCAGGGATCCAACAACAGCATGGCTGGAGGCTGATGGGCTGTGGAGAATTGCAATTGGTGCCAAAAAGGGC
AGGGCTGGACTGGCTCTCATCTACAAAACCTTTGATTTCCTGCACTGGGAGCTGGAGGAGGAGTATCTACACACTGTTCA
AGGCACAGGGATGTGGGAGTGCATTGATTTCTATCCAGTCTCAACTGCCACCAGCAATGGGTTGGATACATCCAAAGTTC
AGACCAATGAGCTCACTAAGCACATTCTCAAGGCCAGCCTGGATGATGACAAGCATGACTACTATGCAATTGGGCTGTAT
TCTGAGAGCTCTCATACTTGGATCCCTGATGCCCTTGACAATGATGTTGGGCTGGGGCTGAGGTATGATTATGGCAAGTA
TTATGCCTCCAAGACTTTTTTTGATTCCAAGCACCAGAGGAGGATCTTGTGGGGATGGGCTAATGAATCGGACAGTCTGC
AAGATGATATCAGGAAGGGGTGGTCTTCGGTGCAGACACTGCCCAGGATCTTGTATCTTGACAACTTGACTGGTACCAAC
TTGATCCAATGGCCAATTGAGGAGGTAGATGCCCTCAGGCATGACAAAGTGAGCAGATCAAATGTGCTTCTCAAAGGTGG
TGACGTGGTAGAAGTTGATGCAGCTCAAGGGGCACAGCTTGACATTGAGGTGGGATTTGAGTATCCAGATGCAAGCAAGC
TGGACGCACTTCCAGAGAGCGAGAATTACGATTGCAGCCAAGGTGGGGCAACTCACAGAGGTGTTTATGGCCCTTTTGGC
CTCCTTGTTCTTGCTGAAGACAAGTTGCAAGAGATGACAGCAGTCTACTTCTACATGACACTAAAACGAGATGGAAGTTG
GGAAACAAGAGTTTGCAGCGACCAAAGCAGATCATCTCTAGAACCCGGAATTGACACAACCGTGTACGGAACGCTCTTCC
ACAGGCTCCCAACGGAGGATTCGCTCTCGTTGCGTGTCATAGTGGATCACTCCATCGTGGAGACTTTCGTGCAAGGGGGA
CGCGCTTGTATCACGTCTCGCGTCTATCCAACGCTTGCTACTGGTGACAAGGCTCGCCTCTTCATGTTTAACAATGGAAC
GCAACCGGTTGTCGTGAAGAACTTGGATGCGTGGAAGATGAGATCGACCACGCTGAGTGTCTTGCCCGTCACTGAATGGA
GGCTCGCTGCTCGACAAAGTTGA