Microexon ID At_2:17402099-17402107:-
Species Arabidopsis thaliana
Coordinates 2:17402099..17402107
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTATCTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAAGAGTACCTGGAACCAAAACCAGAACAAGAAGGGAAAACAAGTTTATCTAGGAGCATATGATGATGAAGAGGCTGCTGCTAGAGCTTACGACCTTGCTGCC
Microexon-tag Amino Acid Seq WDKSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAA
Microexon-tag spanning region17401960-17402302
Microexon-tag prediction score0.975
Overlapped with the annotated transcript (%) 100
New Transcript ID AT2G41710.1x
Reference Transcript ID AT2G41710.1
Gene ID AT2G41710
Gene Name NA
Transcript ID AT2G41710.1
Protein ID AT2G41710.1
Gene ID AT2G41710
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.1e-13
Motif start 70
Motif end 128
Protein seq >AT2G41710.1
MASVSSSDQGPKTEAGCSGGGGGESSETVAASDQMLLYRGFKKAKKERGCTAKERISKMPPCTAGKRSSIYRGVTRHRWT
GRYEAHLWDKSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAALKYWGPGTLINFPVTDYTRDLEEMQNLSREEYLASLR
RKSSGFSRGIAKYRGLQSRWDASASRMPGPEYFSNIHYGAGDDRGTEGDFLGSFCLERKIDLTGYIKWWGANKNRQPESS
SKASEDANVEDAGTELKTLEHTSHATEPYKAPNLGVLCGTQRKEKEISSPSSSSALSILSQSPAFKSLEEKVLKIQESCN
NENDENANRNIINMEKNNGKAIEKPVVSHGVALGGAAALSLQKSMYPLTSLLTAPLLTNYNTLDPLADPILWTPFLPSGS
SLTSEVTKTETSCSTYSYLPQEK*
CDS seq >AT2G41710.1
ATGGCGTCGGTGTCGTCGTCGGATCAAGGACCTAAGACAGAAGCAGGATGTAGCGGCGGAGGAGGAGGAGAGAGCTCGGA
GACAGTGGCGGCGAGTGATCAGATGTTGTTGTATAGAGGTTTTAAGAAGGCGAAGAAGGAGAGAGGTTGTACAGCTAAGG
AGCGTATTAGTAAAATGCCTCCGTGCACTGCTGGGAAAAGGAGTTCCATATACCGGGGAGTCACCAGACATAGATGGACA
GGTCGTTATGAAGCTCACCTTTGGGATAAGAGTACCTGGAACCAAAACCAGAACAAGAAGGGAAAACAAGTTTATCTAGG
AGCATATGATGATGAAGAGGCTGCTGCTAGAGCTTACGACCTTGCTGCCTTAAAATATTGGGGTCCTGGGACACTTATAA
ATTTTCCGGTGACTGATTATACCAGGGATTTAGAAGAAATGCAAAATCTCTCAAGGGAAGAATACCTTGCATCTTTACGT
AGAAAAAGCAGCGGTTTCTCTAGGGGAATAGCGAAATATCGTGGACTTCAAAGCCGATGGGACGCATCAGCCAGTCGTAT
GCCTGGACCTGAATACTTCAGTAACATTCATTACGGGGCAGGTGATGATCGTGGAACAGAAGGTGACTTTCTAGGTAGCT
TTTGTCTGGAAAGAAAGATTGATCTAACAGGATACATAAAGTGGTGGGGAGCCAACAAGAACCGTCAACCAGAATCTTCA
TCAAAAGCATCAGAGGATGCAAACGTCGAAGATGCTGGTACTGAGCTTAAAACACTGGAACACACATCCCATGCAACAGA
ACCATACAAGGCGCCAAACCTTGGCGTCCTTTGTGGAACTCAGAGAAAAGAAAAAGAAATATCATCACCATCAAGCTCTT
CTGCTTTAAGCATCTTGTCTCAGTCGCCTGCCTTCAAGAGCCTAGAGGAGAAAGTGTTGAAGATCCAAGAAAGCTGCAAT
AATGAAAACGATGAGAATGCAAACCGTAACATCATCAATATGGAGAAGAATAACGGCAAGGCAATAGAGAAACCAGTTGT
GAGTCATGGAGTTGCTTTAGGCGGTGCTGCTGCTTTGTCTCTTCAGAAAAGCATGTACCCACTTACCTCTCTCTTAACGG
CTCCATTGCTCACCAACTACAATACATTGGATCCTCTTGCAGACCCTATTCTCTGGACACCATTTCTTCCTTCAGGATCC
TCTCTTACTTCAGAGGTGACAAAGACAGAGACCAGCTGTTCCACGTACAGCTACCTCCCACAAGAGAAATGA
Microexon DNA seq TTTATCTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAAGAGTACCTGGAACCAAAACCAGAACAAGAAGGGAAAACAAGTTTATCTAGGAGCATATGATGATGAAGAGGCTGCTGCTAGAGCTTACGACCTTGCTGCC
Microexon-tag Amino Acid seq WDKSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAA
Transcript ID AT2G41710.1
Gene ID At.11149
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.1e-13
Motif start 70
Motif end 128
Protein seq >AT2G41710.1
MASVSSSDQGPKTEAGCSGGGGGESSETVAASDQMLLYRGFKKAKKERGCTAKERISKMPPCTAGKRSSIYRGVTRHRWT
GRYEAHLWDKSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAALKYWGPGTLINFPVTDYTRDLEEMQNLSREEYLASLR
RKSSGFSRGIAKYRGLQSRWDASASRMPGPEYFSNIHYGAGDDRGTEGDFLGSFCLERKIDLTGYIKWWGANKNRQPESS
SKASEDANVEDAGTELKTLEHTSHATEPYKAPNLGVLCGTQRKEKEISSPSSSSALSILSQSPAFKSLEEKVLKIQESCN
NENDENANRNIINMEKNNGKAIEKPVVSHGVALGGAAALSLQKSMYPLTSLLTAPLLTNYNTLDPLADPILWTPFLPSGS
SLTSEVTKTETSCSTYSYLPQEK*
CDS seq >AT2G41710.1
ATGGCGTCGGTGTCGTCGTCGGATCAAGGACCTAAGACAGAAGCAGGATGTAGCGGCGGAGGAGGAGGAGAGAGCTCGGA
GACAGTGGCGGCGAGTGATCAGATGTTGTTGTATAGAGGTTTTAAGAAGGCGAAGAAGGAGAGAGGTTGTACAGCTAAGG
AGCGTATTAGTAAAATGCCTCCGTGCACTGCTGGGAAAAGGAGTTCCATATACCGGGGAGTCACCAGACATAGATGGACA
GGTCGTTATGAAGCTCACCTTTGGGATAAGAGTACCTGGAACCAAAACCAGAACAAGAAGGGAAAACAAGTTTATCTAGG
AGCATATGATGATGAAGAGGCTGCTGCTAGAGCTTACGACCTTGCTGCCTTAAAATATTGGGGTCCTGGGACACTTATAA
ATTTTCCGGTGACTGATTATACCAGGGATTTAGAAGAAATGCAAAATCTCTCAAGGGAAGAATACCTTGCATCTTTACGT
AGAAAAAGCAGCGGTTTCTCTAGGGGAATAGCGAAATATCGTGGACTTCAAAGCCGATGGGACGCATCAGCCAGTCGTAT
GCCTGGACCTGAATACTTCAGTAACATTCATTACGGGGCAGGTGATGATCGTGGAACAGAAGGTGACTTTCTAGGTAGCT
TTTGTCTGGAAAGAAAGATTGATCTAACAGGATACATAAAGTGGTGGGGAGCCAACAAGAACCGTCAACCAGAATCTTCA
TCAAAAGCATCAGAGGATGCAAACGTCGAAGATGCTGGTACTGAGCTTAAAACACTGGAACACACATCCCATGCAACAGA
ACCATACAAGGCGCCAAACCTTGGCGTCCTTTGTGGAACTCAGAGAAAAGAAAAAGAAATATCATCACCATCAAGCTCTT
CTGCTTTAAGCATCTTGTCTCAGTCGCCTGCCTTCAAGAGCCTAGAGGAGAAAGTGTTGAAGATCCAAGAAAGCTGCAAT
AATGAAAACGATGAGAATGCAAACCGTAACATCATCAATATGGAGAAGAATAACGGCAAGGCAATAGAGAAACCAGTTGT
GAGTCATGGAGTTGCTTTAGGCGGTGCTGCTGCTTTGTCTCTTCAGAAAAGCATGTACCCACTTACCTCTCTCTTAACGG
CTCCATTGCTCACCAACTACAATACATTGGATCCTCTTGCAGACCCTATTCTCTGGACACCATTTCTTCCTTCAGGATCC
TCTCTTACTTCAGAGGTGACAAAGACAGAGACCAGCTGTTCCACGTACAGCTACCTCCCACAAGAGAAATGA