Microexon ID At_1:4641419-4641428:-
Species Arabidopsis thaliana
Coordinates 1:4641419..4641428
Microexon Cluster ID MEP25
Size 10
Phase 2
Pfam Domain Motif CDP-OH_P_transf
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 50,10,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YTSCARCCYTTYTGGASYCGHTKYGTYAMYYTCTTCCCYCTTTGGATGCCRCCAAAYATGATWACACTTAYRGGATTYATGTTYYTRSTBAYATCTGCAYTGCTTGGC
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ACCAAACATG
Microexon Amino Acid seq PPNM
Microexon-tag DNA Seq CTCCAACCTTTTTGGACTCGATTTGTCAAAGTCTTCCCTCTATGGATGCCACCAAACATGATAACGCTTATGGGGTTTATGTTTCTAGTCACTTCCTCCCTGCTAGGC
Microexon-tag Amino Acid Seq LQPFWTRFVKVFPLWMPPNMITLMGFMFLVTSSLLG
Microexon-tag spanning region4641279-4641604
Microexon-tag prediction score0.9664
Overlapped with the annotated transcript (%) 100
New Transcript ID AT1G13560.1x
Reference Transcript ID AT1G13560.1
Gene ID AT1G13560
Gene Name AAPT1
Transcript ID AT1G13560.1
Protein ID AT1G13560.1
Gene ID AT1G13560
Gene Name AAPT1
Pfam domain motif CDP-OH_P_transf
Motif E-value 9e-17
Motif start 47
Motif end 120
Protein seq >AT1G13560.1
MGYIGAHGVAALHRYKYSGVDHSYLAKYVLQPFWTRFVKVFPLWMPPNMITLMGFMFLVTSSLLGYIYSPQLDSPPPRWV
HFAHGLLLFLYQTFDAVDGKQARRTNSSSPLGELFDHGCDALACAFEAMAFGSTAMCGRDTFWFWVISAIPFYGATWEHY
FTNTLILPVINGPTEGLALIFVSHFFTAIVGAEWWAQQLGQSIPLFSWVPFVNEIQTSRAVLYMMIAFAVIPTVAFNVTN
VYKVVRSRNGSMVLALAMLYPFVVLLGGVLIWDYLSPINLIATYPHLVVLGTGLAFGFLVGRMILAHLCDEPKGLKTNMC
MSLLYLPFALANALTARLNAGVPLVDELWVLLGYCIFTVSLYLHFATSVIHEITEALGIYCFRITRKEA*
CDS seq >AT1G13560.1
ATGGGTTACATAGGAGCTCATGGTGTAGCAGCTCTTCATAGGTACAAATACAGTGGAGTGGATCACTCTTATCTTGCCAA
ATACGTCCTCCAACCTTTTTGGACTCGATTTGTCAAAGTCTTCCCTCTATGGATGCCACCAAACATGATAACGCTTATGG
GGTTTATGTTTCTAGTCACTTCCTCCCTGCTAGGCTATATATATTCACCTCAGTTGGATTCTCCTCCTCCACGATGGGTT
CACTTCGCACATGGTTTACTTCTCTTCTTGTATCAGACATTTGATGCGGTTGATGGGAAGCAAGCAAGAAGGACAAATTC
CTCTAGCCCCCTAGGAGAGCTCTTCGATCATGGTTGTGACGCGCTTGCTTGTGCGTTTGAAGCCATGGCATTTGGAAGCA
CTGCAATGTGTGGAAGAGATACTTTCTGGTTCTGGGTTATTTCAGCTATTCCATTTTATGGAGCTACATGGGAACACTAT
TTCACCAACACACTTATTCTTCCGGTTATCAATGGGCCTACAGAGGGGCTTGCACTTATATTTGTCAGCCACTTCTTCAC
AGCCATCGTCGGTGCTGAATGGTGGGCTCAGCAGTTAGGGCAGTCAATACCATTGTTTAGTTGGGTGCCATTTGTGAATG
AGATTCAAACTTCTAGAGCAGTGCTATACATGATGATCGCTTTTGCTGTTATACCAACCGTTGCATTCAATGTAACAAAT
GTCTACAAAGTCGTTCGATCAAGAAACGGGAGCATGGTGTTAGCGTTAGCTATGCTGTATCCCTTCGTTGTCTTACTTGG
AGGAGTTTTGATATGGGATTACTTGTCTCCAATCAATCTCATAGCAACATATCCTCACTTAGTTGTACTCGGAACTGGAC
TTGCATTTGGATTTTTAGTGGGAAGAATGATTCTTGCTCACTTGTGTGATGAGCCTAAAGGACTAAAAACAAACATGTGC
ATGTCACTACTCTATCTTCCCTTTGCACTTGCAAATGCGCTAACCGCAAGATTGAATGCTGGGGTTCCTCTAGTCGACGA
ATTATGGGTTCTTCTTGGCTACTGCATATTCACAGTGTCATTATACTTGCACTTTGCAACATCAGTCATCCATGAGATCA
CTGAGGCCCTTGGAATCTACTGCTTTAGGATCACGCGTAAAGAAGCTTGA
Microexon DNA seq ACCAAACATG
Microexon Amino Acid seq PPNM
Microexon-tag DNA Seq CTCCAACCTTTTTGGACTCGATTTGTCAAAGTCTTCCCTCTATGGATGCCACCAAACATGATAACGCTTATGGGGTTTATGTTTCTAGTCACTTCCTCCCTGCTAGGC
Microexon-tag Amino Acid seq LQPFWTRFVKVFPLWMPPNMITLMGFMFLVTSSLLG
Transcript ID AT1G13560.1
Gene ID At.1426
Gene Name AAPT1
Pfam domain motif CDP-OH_P_transf
Motif E-value 9e-17
Motif start 47
Motif end 120
Protein seq >AT1G13560.1
MGYIGAHGVAALHRYKYSGVDHSYLAKYVLQPFWTRFVKVFPLWMPPNMITLMGFMFLVTSSLLGYIYSPQLDSPPPRWV
HFAHGLLLFLYQTFDAVDGKQARRTNSSSPLGELFDHGCDALACAFEAMAFGSTAMCGRDTFWFWVISAIPFYGATWEHY
FTNTLILPVINGPTEGLALIFVSHFFTAIVGAEWWAQQLGQSIPLFSWVPFVNEIQTSRAVLYMMIAFAVIPTVAFNVTN
VYKVVRSRNGSMVLALAMLYPFVVLLGGVLIWDYLSPINLIATYPHLVVLGTGLAFGFLVGRMILAHLCDEPKGLKTNMC
MSLLYLPFALANALTARLNAGVPLVDELWVLLGYCIFTVSLYLHFATSVIHEITEALGIYCFRITRKEA*
CDS seq >AT1G13560.1
ATGGGTTACATAGGAGCTCATGGTGTAGCAGCTCTTCATAGGTACAAATACAGTGGAGTGGATCACTCTTATCTTGCCAA
ATACGTCCTCCAACCTTTTTGGACTCGATTTGTCAAAGTCTTCCCTCTATGGATGCCACCAAACATGATAACGCTTATGG
GGTTTATGTTTCTAGTCACTTCCTCCCTGCTAGGCTATATATATTCACCTCAGTTGGATTCTCCTCCTCCACGATGGGTT
CACTTCGCACATGGTTTACTTCTCTTCTTGTATCAGACATTTGATGCGGTTGATGGGAAGCAAGCAAGAAGGACAAATTC
CTCTAGCCCCCTAGGAGAGCTCTTCGATCATGGTTGTGACGCGCTTGCTTGTGCGTTTGAAGCCATGGCATTTGGAAGCA
CTGCAATGTGTGGAAGAGATACTTTCTGGTTCTGGGTTATTTCAGCTATTCCATTTTATGGAGCTACATGGGAACACTAT
TTCACCAACACACTTATTCTTCCGGTTATCAATGGGCCTACAGAGGGGCTTGCACTTATATTTGTCAGCCACTTCTTCAC
AGCCATCGTCGGTGCTGAATGGTGGGCTCAGCAGTTAGGGCAGTCAATACCATTGTTTAGTTGGGTGCCATTTGTGAATG
AGATTCAAACTTCTAGAGCAGTGCTATACATGATGATCGCTTTTGCTGTTATACCAACCGTTGCATTCAATGTAACAAAT
GTCTACAAAGTCGTTCGATCAAGAAACGGGAGCATGGTGTTAGCGTTAGCTATGCTGTATCCCTTCGTTGTCTTACTTGG
AGGAGTTTTGATATGGGATTACTTGTCTCCAATCAATCTCATAGCAACATATCCTCACTTAGTTGTACTCGGAACTGGAC
TTGCATTTGGATTTTTAGTGGGAAGAATGATTCTTGCTCACTTGTGTGATGAGCCTAAAGGACTAAAAACAAACATGTGC
ATGTCACTACTCTATCTTCCCTTTGCACTTGCAAATGCGCTAACCGCAAGATTGAATGCTGGGGTTCCTCTAGTCGACGA
ATTATGGGTTCTTCTTGGCTACTGCATATTCACAGTGTCATTATACTTGCACTTTGCAACATCAGTCATCCATGAGATCA
CTGAGGCCCTTGGAATCTACTGCTTTAGGATCACGCGTAAAGAAGCTTGA