Microexon ID Sm_GL377596:355608-355616:-
Species Selaginella moellendorffii
Coordinates GL377596:355608..355616
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq AGCGAGAGAACGGCATTCCATTTCCAACCGCGGAACAACTGGATGAACGATCCAAACGGTCCTCTCTTCCACAAAGGCTACTACCACTTGTTCTACCAGTACAATCCA
Microexon-tag Amino Acid Seq SERTAFHFQPRNNWMNDPNGPLFHKGYYHLFYQYNP
Microexon-tag spanning region355495-355771
Microexon-tag prediction score0.9426
Overlapped with the annotated transcript (%) 91.67
New Transcript ID EFJ22309x
Reference Transcript ID EFJ22309
Gene ID SELMODRAFT_10966
Gene Name NA
Sm_GL377596:355608-355616:- does not have available information here.
Microexon DNA seq ATCCAAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq AGCGAGAGAACGGCATTCCATTTCCAACCGCGGAACAACTGGATGAACGATCCAAACGGTCCTCTCTTCCACAAAGGCTACTACCACTTGTTCTACCAGTACAATCCA
Microexon-tag Amino Acid seq SERTAFHFQPRNNWMNDPNGPLFHKGYYHLFYQYNP
Transcript ID Sm.15386.1
Gene ID Sm.15386
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 9.9e-104
Motif start 85
Motif end 405
Protein seq >Sm.15386.1
MDPESQRSYAELPSGEEAEEIHSNPTAVAENSRNRRIDLVLTLVGICCVAAGTFFWISLPSSNTENFLRIAPEGGFLASE
RTAFHFQPRNNWMNDPNGPLFHKGYYHLFYQYNPYGVEWGNISWGHAVSTDLLHWQHMDLAMQPDKWYDADGVWSGSATI
LPNGQVIMLYTGSTNASVQVQNLALPLNTSDPLLREWIKIPENPILVPPPGIAPKDFRDPTTAWLEADGLWRIAIGAKKG
RAGLALIYKTFDFLHWELEEEYLHTVQGTGMWECIDFYPVSTATSNGLDTSKVQTNELTKHILKASLDDDKHDYYAIGLY
SESSHTWIPDALDNDVGLGLRYDYGKYYASKTFFDSKHQKRILWGWANESDSLQDDIRKGWSSVQTLPRILYLDNLTGTN
LIQWPIEEVEALRHDKVSRSNVLLKGGDVVEVDAAQGAQLDIEVGFEYPDASKLDALPESEIYDCSQGGATHRGVYGPFG
LLVLAEDKLQEMTAVYFYMTLKRDGSWETRVCSDQSRSSLEPGIDTTVYGTLFHRLPTEDSLSLRVIVDHSIVETFVQGG
RACITSRVYPTLATGDKARLFMFNNGTQPVFVKNLDAWKMRSTTLSVLPVTEWRLAARQS*
CDS seq >Sm.15386.1
ATGGATCCCGAATCTCAGCGGAGCTATGCGGAGCTCCCATCAGGGGAGGAGGCGGAGGAGATTCATTCGAATCCCACGGC
GGTGGCGGAGAATTCCAGGAACAGACGCATTGATCTAGTCCTGACGCTGGTGGGGATTTGCTGCGTCGCCGCTGGAACTT
TTTTCTGGATCTCACTTCCTTCGAGCAACACCGAGAATTTCTTACGGATTGCTCCGGAAGGGGGTTTCTTGGCCAGCGAG
AGAACGGCATTCCATTTCCAACCGCGGAACAACTGGATGAACGATCCAAACGGTCCTCTCTTCCACAAAGGCTACTACCA
CTTGTTCTACCAGTACAATCCATATGGGGTAGAGTGGGGAAACATCTCCTGGGGCCATGCTGTGTCCACTGATCTCCTCC
ACTGGCAACATATGGATCTGGCAATGCAGCCAGACAAGTGGTATGATGCAGATGGAGTGTGGTCTGGCTCTGCAACAATC
CTCCCCAATGGTCAAGTGATCATGCTCTACACTGGCTCCACAAATGCCTCTGTTCAGGTCCAAAATCTCGCTCTTCCGCT
CAACACCTCAGATCCACTCCTGAGAGAATGGATCAAAATCCCAGAGAATCCCATCCTGGTGCCACCACCAGGGATTGCTC
CTAAAGACTTCAGGGATCCAACAACAGCATGGCTGGAGGCTGATGGGTTGTGGAGAATTGCAATTGGTGCCAAAAAGGGC
AGGGCTGGACTGGCTCTCATCTACAAAACCTTTGATTTCCTGCACTGGGAGCTGGAGGAGGAGTATCTACACACTGTTCA
AGGCACAGGGATGTGGGAGTGCATTGATTTCTATCCAGTCTCAACTGCCACCAGCAATGGGTTGGACACATCCAAAGTTC
AGACCAATGAGCTCACTAAGCACATTCTCAAGGCCAGCCTGGATGATGACAAGCATGACTACTATGCAATTGGGCTGTAT
TCTGAGAGCTCTCATACTTGGATCCCTGATGCCCTTGACAATGATGTTGGGCTGGGGCTAAGGTATGATTATGGTAAGTA
TTATGCCTCTAAGACCTTTTTTGATTCTAAGCACCAAAAGAGAATCTTGTGGGGATGGGCTAATGAATCAGACAGTCTGC
AAGATGATATCAGGAAGGGGTGGTCTTCGGTGCAGACACTGCCCAGGATCTTGTATCTTGACAACTTGACTGGCACCAAC
TTGATCCAATGGCCAATTGAGGAGGTAGAGGCCCTCAGGCATGACAAAGTGAGCAGATCAAATGTGCTTCTCAAAGGTGG
TGATGTGGTGGAAGTTGATGCAGCTCAAGGGGCACAGCTTGACATTGAGGTGGGATTTGAGTATCCAGATGCAAGCAAGC
TGGACGCACTTCCAGAGAGTGAGATTTACGATTGCAGCCAAGGTGGGGCAACTCACAGAGGTGTTTATGGCCCTTTTGGC
CTCCTTGTTCTTGCTGAAGACAAGTTGCAAGAGATGACAGCAGTCTACTTCTACATGACACTAAAACGAGATGGAAGTTG
GGAAACAAGGGTTTGCAGCGACCAAAGCAGATCATCTCTAGAACCCGGAATTGACACAACCGTGTACGGAACGCTCTTCC
ACAGGCTCCCAACGGAGGATTCGCTCTCGTTGCGTGTCATAGTGGATCACTCCATCGTGGAGACTTTCGTGCAAGGGGGA
CGCGCTTGTATCACGTCTCGCGTCTATCCAACGCTTGCTACTGGTGACAAGGCTCGCCTCTTCATGTTTAACAATGGAAC
GCAACCGGTTTTCGTGAAGAACTTGGATGCGTGGAAGATGAGATCGACCACGCTGAGTGTCTTGCCCGTCACTGAATGGA
GGCTCGCTGCTCGACAAAGTTGA