Microexon ID Sm_GL377687:351056-351059:-
Species Selaginella moellendorffii
Coordinates GL377687:351056..351059
Microexon Cluster ID MEP05
Size 4
Phase 2
Pfam Domain Motif Helicase_C
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 53,4,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GATGAYTGGMTTTCWKCSARRAYWCAAGTTGTWGTKGCHACWGTRGCWTTTGGRATGGGWATWGATARRMARGATGTYMGDATTGTKTGYCAYTTYAAYWTKCCWAAR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GATG
Microexon Amino Acid seq GM
Microexon-tag DNA Seq CAAGACTGGGTTCTTGGAGAAGTTCATATTATTGTGGCAACTATCGCATTTGGGATGGGAATAGATCGGAAGGATGTTAGGATGGTGTGTCATTTTAACATGCCCAAG
Microexon-tag Amino Acid Seq QDWVLGEVHIIVATIAFGMGIDRKDVRMVCHFNMPK
Microexon-tag spanning region350950-351183
Microexon-tag prediction score0.9055
Overlapped with the annotated transcript (%) 92.86
New Transcript ID EFJ07289x
Reference Transcript ID EFJ07289
Gene ID SELMODRAFT_41332
Gene Name NA
Sm_GL377687:351056-351059:- does not have available information here.
Microexon DNA seq GATG
Microexon Amino Acid seq GM
Microexon-tag DNA Seq CAAGACTGGGTTCTTGGAGAAGTTCATATTATTGTGGCAACTATCGCATTTGGGATGGGAATAGATCGGAAGGATGTTAGGATGGTGTGTCATTTTAACATGCCCAAG
Microexon-tag Amino Acid seq QDWVLGEVHIIVATIAFGMGIDRKDVRMVCHFNMPK
Transcript ID Sm.29906.1
Gene ID Sm.29906
Gene Name NA
Pfam domain motif Helicase_C
Motif E-value 9.4e-17
Motif start 101
Motif end 206
Protein seq >Sm.29906.1
MQKLRKLHERSLLSLIAIDEAHCISSWGHDFRPSYRKLSALRTSLPDIPILALTATASKKVQEDIIKSLSLQKAAVLISS
FNRANIFYEVRFKDLMKSAYEDLRNIITTAPTRCMIIYCHARAMCDEIGSRLKSDGISCRVYHAGINVKARSQALQDWVL
GEVHIIVATIAFGMGIDRKDVRMVCHFNMPKSLESFYQESGRAGRDGKPAKSILYYSVDDKRTMEYVIRSSSQRQQAGIS
ENGENELLKKNIEAFEKVVAYCEEASCRRRRVLEHFGENVSPLLCSKTCDACKWPEKLSRDLKELADASCFNSVWQSGVR
IKSDCSSPNDKSEFWNYDNEDVDEHDAEDDISDSEDEKAREAASKGGKKVEKRVAALLRAEEEQKAKQPKKKSAQNNLVT
EELRSTSRARLENTVRAAIERLGCADAVNTTAAASALEIECHEKFGKFGRSFYHSQVASKVRWLSACSASELIAFKIS*
CDS seq >Sm.29906.1
ATGCAAAAGCTGAGAAAGCTTCACGAGAGGAGCCTTCTGTCTCTGATTGCCATTGATGAGGCGCATTGTATATCATCCTG
GGGACACGACTTCAGACCAAGCTATCGGAAGTTGTCAGCCTTGAGAACAAGCCTACCTGACATTCCAATACTAGCTCTAA
CAGCGACAGCCAGCAAGAAGGTGCAAGAAGACATCATCAAATCACTTAGTCTGCAAAAGGCAGCAGTTTTAATATCGTCT
TTTAATCGTGCAAATATTTTCTATGAAGTGCGCTTCAAGGACTTGATGAAGAGTGCTTACGAGGATCTGCGAAATATAAT
CACGACAGCTCCGACGCGGTGCATGATTATTTACTGTCATGCTCGCGCCATGTGCGATGAAATAGGCTCGCGGTTAAAAT
CAGATGGGATCTCGTGTCGAGTATATCACGCGGGTATCAACGTTAAAGCTCGGAGCCAGGCATTACAAGACTGGGTTCTT
GGAGAAGTTCATATTATTGTGGCAACTATCGCATTTGGGATGGGAATAGATCGGAAGGATGTTAGGATGGTGTGTCATTT
TAACATGCCCAAGTCGCTCGAATCCTTTTACCAAGAGTCGGGACGCGCAGGTCGTGATGGTAAACCCGCCAAAAGTATAC
TTTATTACAGTGTCGACGACAAGCGAACCATGGAGTACGTCATAAGGAGTTCTTCTCAAAGGCAACAGGCTGGAATTAGT
GAGAATGGAGAGAATGAGCTGCTAAAGAAAAATATCGAAGCTTTTGAAAAGGTAGTGGCCTACTGCGAAGAAGCAAGCTG
CCGAAGGCGTAGAGTACTGGAACACTTCGGAGAAAATGTCTCTCCTTTACTATGCAGTAAGACGTGTGATGCATGTAAGT
GGCCGGAGAAACTCTCTAGAGACTTGAAGGAGCTTGCGGATGCTTCGTGTTTCAACTCCGTATGGCAGTCCGGTGTACGC
ATAAAATCGGATTGTTCTTCGCCGAATGATAAGAGCGAGTTTTGGAATTACGACAATGAAGATGTGGACGAACACGATGC
GGAAGACGATATATCGGACTCTGAAGACGAGAAAGCCAGGGAAGCTGCGAGTAAAGGTGGAAAGAAAGTCGAGAAAAGAG
TTGCAGCTTTGCTGCGTGCCGAGGAAGAACAAAAAGCCAAACAACCAAAGAAAAAATCAGCTCAAAACAATTTGGTAACC
GAGGAGCTCCGCTCTACATCACGAGCACGCCTAGAAAACACAGTGCGAGCAGCCATCGAGCGCCTTGGTTGTGCGGACGC
TGTCAACACAACAGCAGCAGCATCAGCTTTGGAGATCGAGTGCCACGAAAAGTTCGGCAAATTTGGGAGGAGTTTTTACC
ACTCCCAAGTTGCTAGCAAAGTTAGATGGCTCTCCGCTTGCTCGGCCTCAGAGCTAATCGCTTTCAAGATCTCATAG