Microexon ID Sm_GL377684:264957-264965:-
Species Selaginella moellendorffii
Coordinates GL377684:264957..264965
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGCGCACGGGATTCCACTTCCAGCCAGTGAAGAACTGGATGAACGATCCAAATGGACCGATGCTCTACAAAGGATTGTATCACCTCTTCTTCCAGTACAATCCC
Microexon-tag Amino Acid Seq WQRTGFHFQPVKNWMNDPNGPMLYKGLYHLFFQYNP
Microexon-tag spanning region255284-265115
Microexon-tag prediction score0.9607
Overlapped with the annotated transcript (%) 41.67
New Transcript ID EFJ07564x
Reference Transcript ID EFJ07564
Gene ID SELMODRAFT_429742
Gene Name NA
Sm_GL377684:264957-264965:- does not have available information here.
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGCGCACGGGATTCCACTTCCAGCCAGTGAAGAACTGGATGAACGATCCAAATGGTCCCCTGTTTTACAAAGGCGTGTATCATCTCTTCTATCAATGGAACCCC
Microexon-tag Amino Acid seq WQRTGFHFQPVKNWMNDPNGPLFYKGVYHLFYQWNP
Transcript ID Sm.29659.1
Gene ID Sm.29659
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 6e-100
Motif start 58
Motif end 380
Protein seq >Sm.29659.1
MAPLPSSLVPFSLLFFALLPSILSLPDRYDVQTLAAIDQSPDSQRNDLDWEWQRTGFHFQPVKNWMNDPNGPLFYKGVYH
LFYQWNPYAAVWGNITWGHAVSTDLIHWKYVKELALVPDRWYDIKGVWSGSATIVNGKPILLYTGWTNSSTQVQNKAVPK
NSSDPLLREWIKVDAENPFAVPPPGINTSDFRDPTTAWIGQDGLWRTAVGSKYRANDTGIILQYRSKDFAKWELLDESLH
AVNGTGMWECPDFFPVAVHGQQGSENYLGEENAIQKFVIKVSLDETRFDTYVVGDYDPASEKFLPSFEALDIGTALRYDY
GIYYASKSFYDPHKKRRVLLGWINEADKPTSDIRKGWASVQAIPRVVWLDENQHSLRQWPVPEINSLRKHPIRHTDLLLK
QGEVFKVNGSQGSQLDIEVTFQIPKAHANDESDEFNFASSRVEGIPNNTLIYCNGSFPEAEQIIGPFGVHVLASEDLRER
TSVFFKFLKFKGSWKTMVCNDLTSSSLASDATKGVYGGLVSLSSYKNRQALTMRILVDHSIVETFAQGGRTCITARSYPL
LGSDNNAHIFVFNNGSLPVKATHLAVWKMDKIRYTTV*
CDS seq >Sm.29659.1
ATGGCTCCTCTTCCCTCCTCCTTGGTCCCCTTTTCCTTGCTGTTCTTCGCCCTTCTCCCTTCGATTCTGTCCCTCCCCGA
TCGATACGACGTGCAGACGCTCGCGGCGATCGATCAATCCCCGGATTCACAGAGGAACGATCTCGATTGGGAATGGCAGC
GCACGGGATTCCACTTCCAGCCAGTGAAGAACTGGATGAACGATCCAAATGGTCCCCTGTTTTACAAAGGCGTGTATCAT
CTCTTCTATCAATGGAACCCCTACGCTGCCGTGTGGGGGAACATCACTTGGGGCCACGCTGTCTCCACGGATCTAATCCA
CTGGAAGTACGTGAAGGAGCTAGCTCTCGTGCCGGATCGATGGTATGACATCAAGGGCGTGTGGTCGGGATCCGCGACGA
TCGTCAATGGCAAGCCCATTCTTCTCTATACTGGATGGACAAATTCCTCCACGCAAGTCCAAAACAAGGCTGTGCCCAAG
AACTCCTCCGATCCGCTACTGCGCGAATGGATCAAAGTCGACGCGGAGAATCCTTTCGCGGTGCCGCCGCCGGGGATCAA
TACCAGCGATTTCCGGGACCCGACGACGGCGTGGATTGGGCAAGATGGGCTATGGCGTACTGCAGTGGGATCCAAGTATC
GCGCCAACGACACAGGGATCATCCTGCAGTACCGCAGCAAGGATTTCGCGAAATGGGAGCTCCTGGATGAATCGCTCCAC
GCGGTGAATGGGACGGGGATGTGGGAGTGCCCGGATTTCTTCCCGGTTGCGGTCCATGGCCAGCAGGGCTCGGAGAACTA
TCTTGGCGAGGAGAATGCGATCCAGAAGTTCGTGATCAAGGTCAGCCTGGATGAGACGCGATTTGACACGTATGTGGTGG
GCGATTACGACCCGGCGTCGGAGAAGTTCTTGCCCAGCTTCGAGGCGCTCGATATTGGAACGGCGCTGCGCTATGACTAT
GGGATTTACTACGCGTCCAAGTCGTTCTATGATCCTCACAAGAAGAGGAGAGTTTTGCTAGGATGGATCAACGAGGCCGA
CAAGCCCACGTCGGACATTCGCAAGGGATGGGCTTCTGTCCAGGCAATTCCCAGAGTTGTATGGCTCGACGAAAACCAGC
ATTCCCTTCGGCAGTGGCCCGTGCCAGAGATCAATTCTCTAAGAAAGCATCCAATTCGTCATACAGATTTGCTCTTGAAG
CAGGGAGAAGTTTTCAAAGTAAATGGCTCTCAAGGATCGCAGCTCGATATCGAGGTGACTTTCCAAATCCCAAAGGCTCA
CGCCAACGATGAGAGCGATGAATTTAACTTTGCATCGTCCAGAGTCGAGGGAATTCCAAACAACACTCTCATATACTGCA
ATGGCAGCTTTCCAGAGGCCGAGCAGATAATCGGCCCGTTCGGAGTCCACGTCCTTGCGTCGGAGGATCTACGTGAGAGA
ACTTCCGTGTTCTTTAAGTTCCTCAAGTTCAAAGGTTCCTGGAAAACTATGGTTTGCAACGATCTCACAAGCTCTTCCCT
GGCATCTGATGCTACAAAAGGCGTCTACGGTGGACTAGTAAGCCTCTCGAGCTACAAGAATCGTCAGGCTTTGACAATGA
GAATCTTGGTGGATCATTCCATCGTCGAAACGTTTGCTCAAGGCGGCAGGACGTGCATCACTGCTCGATCCTATCCTCTT
CTCGGGAGCGACAACAACGCTCATATATTCGTGTTCAACAATGGAAGTCTTCCAGTCAAGGCAACACACTTGGCTGTGTG
GAAGATGGATAAGATTCGCTACACCACAGTCTAG