Microexon ID At_1:1972267-1972279:+
Species Arabidopsis thaliana
Coordinates 1:1972267..1972279
Microexon Cluster ID MEP31
Size 13
Phase 0
Pfam Domain Motif TPT
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,13,47
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCTGTYATGTCTGGKTTYCGYTGGTSYATGACTCARATTCTTYTGCAGAAAGAARMHTAYGGTYTRAARAATCCAYTTACCTTGATGAGYTATGTKACYCCAGTGATG
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AAAGAAACATTTG
Microexon Amino Acid seq KETFG
Microexon-tag DNA Seq GCTGTCATGTCTGGTTTCCGCTGGTGTATGACGCAAGTTCTTTTGCAGAAAGAAACATTTGGACTGAAAAATCCATTCATATTCATGAGCTGTGTGGCACCAGTGATG
Microexon-tag Amino Acid Seq AVMSGFRWCMTQVLLQKETFGLKNPFIFMSCVAPVM
Microexon-tag spanning region1972084-1972434
Microexon-tag prediction score0.9516
Overlapped with the annotated transcript (%) 100
New Transcript ID AT1G06470.1x
Reference Transcript ID AT1G06470.1
Gene ID AT1G06470
Gene Name NA
Transcript ID AT1G06470.1
Protein ID AT1G06470.1
Gene ID AT1G06470
Gene Name NA
Pfam domain motif TPT
Motif E-value 1.1e-30
Motif start 74
Motif end 373
Protein seq >AT1G06470.1
MEQRVQLRVGTMETISNEGDVDREQVLETFGIENETGKETNGSRSFDVGYSSGDTLETLPKASKVDISPADVLKTLFFIL
VWYTFSTFLTLYNKTLLGDDLGKFPAPLLMNTIHFSIQAVLSKMITWYWSGRFQPDVTISWRDYFVRVVPTALGTAMDIN
LSNESLVFISVTFATMCKSAAPIFLLLFAFAFRLESPSLKLFGIISVISAGVLLTVAKETEFEFWGFVFVMLAAVMSGFR
WCMTQVLLQKETFGLKNPFIFMSCVAPVMAIATGLLSLLLDPWSEFRDNKYFDSGAHFARTCFLMLFGGALAFCMVLTEY
VLVSVTSAVTVTIAGVVKEAVTIVVAVFYFHDEFTWLKGVGLMIIMVGVSLFNWYKYDKLQKGHKTEEEKQLQAPSQTGK
YVILDEMDDQENSP*
CDS seq >AT1G06470.1
ATGGAGCAAAGAGTGCAGCTCCGTGTTGGTACAATGGAAACCATATCCAATGAGGGAGATGTAGATAGAGAACAGGTTTT
AGAGACTTTTGGTATTGAAAATGAGACCGGGAAGGAGACAAACGGCTCCAGAAGTTTTGATGTTGGTTACAGTTCCGGTG
ATACCCTTGAAACATTGCCTAAAGCGTCAAAAGTAGATATCTCTCCTGCTGACGTGCTTAAAACACTCTTCTTTATACTT
GTATGGTACACTTTCAGCACATTTCTGACTCTGTACAACAAAACTCTCTTAGGAGATGATTTAGGGAAATTTCCTGCTCC
CTTATTAATGAATACCATTCACTTTTCTATTCAAGCTGTTTTATCAAAGATGATAACATGGTATTGGTCGGGGAGATTTC
AGCCTGATGTTACCATATCATGGAGAGACTATTTCGTTAGAGTTGTACCAACAGCGCTTGGAACTGCTATGGATATAAAC
CTAAGTAACGAGTCACTTGTCTTCATATCGGTTACATTTGCAACAATGTGCAAATCTGCAGCCCCAATATTTCTTCTTCT
GTTTGCTTTTGCCTTCAGGCTGGAGTCTCCAAGCCTGAAACTTTTTGGCATTATTTCAGTGATCTCAGCAGGAGTGTTGT
TAACAGTTGCAAAGGAGACAGAATTTGAGTTTTGGGGTTTCGTTTTTGTCATGCTTGCTGCTGTCATGTCTGGTTTCCGC
TGGTGTATGACGCAAGTTCTTTTGCAGAAAGAAACATTTGGACTGAAAAATCCATTCATATTCATGAGCTGTGTGGCACC
AGTGATGGCTATAGCGACTGGTCTTCTTTCTCTCCTTCTGGATCCATGGAGTGAATTTAGAGACAACAAATACTTTGATA
GTGGAGCACATTTTGCTCGAACTTGTTTTTTGATGCTTTTTGGCGGAGCACTGGCTTTTTGTATGGTTTTGACAGAGTAT
GTCCTTGTTTCCGTAACTAGTGCTGTAACCGTCACAATAGCGGGAGTTGTCAAAGAGGCTGTCACCATAGTGGTTGCAGT
GTTTTACTTCCATGACGAATTTACGTGGCTGAAAGGTGTGGGTCTGATGATCATTATGGTTGGTGTCAGTTTGTTCAACT
GGTACAAATATGATAAACTACAAAAGGGGCACAAAACAGAGGAAGAGAAGCAGTTACAAGCTCCAAGTCAGACTGGAAAA
TATGTGATTCTTGATGAGATGGATGATCAAGAAAATAGTCCCTAA
Microexon DNA seq AAAGAAACATTTG
Microexon Amino Acid seq KETFG
Microexon-tag DNA Seq GCTGTCATGTCTGGTTTCCGCTGGTGTATGACGCAAGTTCTTTTGCAGAAAGAAACATTTGGACTGAAAAATCCATTCATATTCATGAGCTGTGTGGCACCAGTGATG
Microexon-tag Amino Acid seq AVMSGFRWCMTQVLLQKETFGLKNPFIFMSCVAPVM
Transcript ID AT1G06470.2
Gene ID At.633
Gene Name NA
Pfam domain motif TPT
Motif E-value 1.1e-30
Motif start 74
Motif end 373
Protein seq >AT1G06470.2
MEQRVQLRVGTMETISNEGDVDREQVLETFGIENETGKETNGSRSFDVGYSSGDTLETLPKASKVDISPADVLKTLFFIL
VWYTFSTFLTLYNKTLLGDDLGKFPAPLLMNTIHFSIQAVLSKMITWYWSGRFQPDVTISWRDYFVRVVPTALGTAMDIN
LSNESLVFISVTFATMCKSAAPIFLLLFAFAFRLESPSLKLFGIISVISAGVLLTVAKETEFEFWGFVFVMLAAVMSGFR
WCMTQVLLQKETFGLKNPFIFMSCVAPVMAIATGLLSLLLDPWSEFRDNKYFDSGAHFARTCFLMLFGGALAFCMVLTEY
VLVSVTSAVTVTIAGVVKEAVTIVVAVFYFHDEFTWLKGVGLMIIMVGVSLFNWYKYDKLQKGHKTEEEKQLQAPSQTGK
YVILDEMDDQENSP*
CDS seq >AT1G06470.2
ATGGAGCAAAGAGTGCAGCTCCGTGTTGGTACAATGGAAACCATATCCAATGAGGGAGATGTAGATAGAGAACAGGTTTT
AGAGACTTTTGGTATTGAAAATGAGACCGGGAAGGAGACAAACGGCTCCAGAAGTTTTGATGTTGGTTACAGTTCCGGTG
ATACCCTTGAAACATTGCCTAAAGCGTCAAAAGTAGATATCTCTCCTGCTGACGTGCTTAAAACACTCTTCTTTATACTT
GTATGGTACACTTTCAGCACATTTCTGACTCTGTACAACAAAACTCTCTTAGGAGATGATTTAGGGAAATTTCCTGCTCC
CTTATTAATGAATACCATTCACTTTTCTATTCAAGCTGTTTTATCAAAGATGATAACATGGTATTGGTCGGGGAGATTTC
AGCCTGATGTTACCATATCATGGAGAGACTATTTCGTTAGAGTTGTACCAACAGCGCTTGGAACTGCTATGGATATAAAC
CTAAGTAACGAGTCACTTGTCTTCATATCGGTTACATTTGCAACAATGTGCAAATCTGCAGCCCCAATATTTCTTCTTCT
GTTTGCTTTTGCCTTCAGGCTGGAGTCTCCAAGCCTGAAACTTTTTGGCATTATTTCAGTGATCTCAGCAGGAGTGTTGT
TAACAGTTGCAAAGGAGACAGAATTTGAGTTTTGGGGTTTCGTTTTTGTCATGCTTGCTGCTGTCATGTCTGGTTTCCGC
TGGTGTATGACGCAAGTTCTTTTGCAGAAAGAAACATTTGGACTGAAAAATCCATTCATATTCATGAGCTGTGTGGCACC
AGTGATGGCTATAGCGACTGGTCTTCTTTCTCTCCTTCTGGATCCATGGAGTGAATTTAGAGACAACAAATACTTTGATA
GTGGAGCACATTTTGCTCGAACTTGTTTTTTGATGCTTTTTGGCGGAGCACTGGCTTTTTGTATGGTTTTGACAGAGTAT
GTCCTTGTTTCCGTAACTAGTGCTGTAACCGTCACAATAGCGGGAGTTGTCAAAGAGGCTGTCACCATAGTGGTTGCAGT
GTTTTACTTCCATGACGAATTTACGTGGCTGAAAGGTGTGGGTCTGATGATCATTATGGTTGGTGTCAGTTTGTTCAACT
GGTACAAATATGATAAACTACAAAAGGGGCACAAAACAGAGGAAGAGAAGCAGTTACAAGCTCCAAGTCAGACTGGAAAA
TATGTGATTCTTGATGAGATGGATGATCAAGAAAATAGTCCCTAA