Microexon ID At_5:3244674-3244686:-
Species Arabidopsis thaliana
Coordinates 5:3244674..3244686
Microexon Cluster ID MEP33
Size 13
Phase 2
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 47,13,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GAVTTYYTRTCARTHTCRGARTTYCTWGTKGARRTWTCTGATGAYYTGTWTGAYTAYGAGGATGAYGTKTTRRAGAAYAAYTTCAAYATTYTGCGCATGTTTGTYRRA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTTCGATTACGAG
Microexon Amino Acid seq LFDYE
Microexon-tag DNA Seq GACTTCTTGTCAATCTCAGAGTTTCTGGTTGAAGTAGCGGATGATCTGTTCGATTACGAGGATGATGTTCTTGAGAACAATTTCAATGTATTACGTATGTTTGTGGGA
Microexon-tag Amino Acid Seq DFLSISEFLVEVADDLFDYEDDVLENNFNVLRMFVG
Microexon-tag spanning region3244557-3244819
Microexon-tag prediction score0.9486
Overlapped with the annotated transcript (%) 100
New Transcript ID AT5G10320.1x
Reference Transcript ID AT5G10320.1
Gene ID AT5G10320
Gene Name NA
Transcript ID AT5G10320.1
Protein ID AT5G10320.1
Gene ID AT5G10320
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >AT5G10320.1
MEELKRLEKFQTVVSSITCHGLLSSSSSDSASSSRFLSNLVLFLVQPCGELDLDSKLGLLSEFLPKISGPFLEEISRSLE
LDDEATTPLNTISAECQNSCVKRSVMDNVDPYMSQKHQEVVAMVGLDAMKRANSTLEDFSRSYFMFHRLDINEPQSIFRY
LPVLSFTESYIYQMDALNEKIVSESACGSQVIYSSHGWNAESRVLFETNPLKPLGDLLEREALLTNRVQQEFNSGEEYWA
LERKLCHALSNKNKICLEDVMRAIHLKSFDYRVLNLLLYKLRGEEVNELHMDFLSISEFLVEVADDLFDYEDDVLENNFN
VLRMFVGIFGSSNAPTELAKRISEAEEKYEEIMKSLDPHLSSNYQRRCEEATKEGGKISGHSLGTWNIPAVISDEEAYRS
AQR*
CDS seq >AT5G10320.1
ATGGAAGAGCTAAAGAGACTGGAGAAGTTTCAGACTGTTGTATCATCCATTACTTGTCACGGTTTACTTTCTTCTTCATC
TTCCGATTCAGCTTCTTCTTCTCGCTTCCTCTCCAATCTAGTCCTTTTCCTGGTACAACCATGCGGCGAACTCGATTTAG
ATTCGAAACTGGGTCTACTCTCTGAGTTCTTACCGAAAATCTCAGGACCCTTCCTGGAAGAGATATCGAGATCACTTGAG
CTAGACGATGAAGCTACTACACCACTTAACACAATTTCAGCAGAGTGCCAAAATAGTTGTGTGAAGAGGAGTGTCATGGA
CAATGTTGATCCTTACATGTCACAGAAGCACCAAGAAGTTGTGGCAATGGTCGGGCTAGATGCAATGAAGAGAGCAAACT
CTACACTTGAGGATTTCTCCAGGTCTTATTTCATGTTTCATCGGTTGGATATTAACGAACCGCAATCAATATTCAGATAT
TTACCTGTGCTTTCATTCACAGAGAGTTACATATATCAGATGGATGCTCTTAATGAGAAGATAGTCAGTGAATCAGCTTG
TGGATCCCAAGTCATTTATTCAAGTCATGGATGGAATGCTGAATCACGAGTTCTGTTTGAAACTAATCCGTTGAAGCCGC
TTGGAGATCTGCTTGAACGTGAGGCCCTTTTGACAAACAGAGTACAACAAGAGTTTAATTCGGGTGAAGAGTATTGGGCG
TTAGAAAGAAAGCTTTGTCATGCACTTTCAAACAAAAACAAGATCTGTCTCGAAGATGTGATGAGAGCAATTCATTTGAA
GTCCTTTGATTACAGGGTGTTGAATCTTCTACTATACAAGTTGAGAGGAGAAGAGGTGAATGAGTTGCATATGGACTTCT
TGTCAATCTCAGAGTTTCTGGTTGAAGTAGCGGATGATCTGTTCGATTACGAGGATGATGTTCTTGAGAACAATTTCAAT
GTATTACGTATGTTTGTGGGAATATTCGGCTCATCGAATGCACCCACTGAACTGGCGAAGCGGATATCGGAAGCTGAAGA
GAAGTATGAAGAGATTATGAAATCCCTGGATCCACATTTGTCTTCAAATTACCAAAGAAGATGTGAAGAAGCTACTAAAG
AAGGTGGGAAAATTTCTGGTCATTCGCTGGGAACATGGAACATACCGGCGGTTATCTCCGACGAAGAAGCATACCGATCT
GCTCAACGGTAA
Microexon DNA seq GTTCGATTACGAG
Microexon Amino Acid seq LFDYE
Microexon-tag DNA Seq GACTTCTTGTCAATCTCAGAGTTTCTGGTTGAAGTAGCGGATGATCTGTTCGATTACGAGGATGATGTTCTTGAGAACAATTTCAATGTATTACGTATGTTTGTGGGA
Microexon-tag Amino Acid seq DFLSISEFLVEVADDLFDYEDDVLENNFNVLRMFVG
Transcript ID AT5G10320.2
Gene ID At.22640
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >AT5G10320.2
MEELKRLEKFQTVVSSITCHGLLSSSSSDSASSSRFLSNLVLFLVQPCGELDLDSKLGLLSEFLPKISGPFLEEISRSLE
LDDEATTPLNTISECQNSCVKRSVMDNVDPYMSQKHQEVVAMVGLDAMKRANSTLEDFSRSYFMFHRLDINEPQSIFRYL
PVLSFTESYIYQMDALNEKIVSESACGSQVIYSSHGWNAESRVLFETNPLKPLGDLLEREALLTNRVQQEFNSGEEYWAL
ERKLCHALSNKNKICLEDVMRAIHLKSFDYRVLNLLLYKLRGEEVNELHMDFLSISEFLVEVADDLFDYEDDVLENNFNV
LRMFVGIFGSSNAPTELAKRISEAEEKYEEIMKSLDPHLSSNYQRRCEEATKEGGKISGHSLGTWNIPAVISDEEAYRSA
QR*
CDS seq >AT5G10320.2
ATGGAAGAGCTAAAGAGACTGGAGAAGTTTCAGACTGTTGTATCATCCATTACTTGTCACGGTTTACTTTCTTCTTCATC
TTCCGATTCAGCTTCTTCTTCTCGCTTCCTCTCCAATCTAGTCCTTTTCCTGGTACAACCATGCGGCGAACTCGATTTAG
ATTCGAAACTGGGTCTACTCTCTGAGTTCTTACCGAAAATCTCAGGACCCTTCCTGGAAGAGATATCGAGATCACTTGAG
CTAGACGATGAAGCTACTACACCACTTAACACAATTTCAGAGTGCCAAAATAGTTGTGTGAAGAGGAGTGTCATGGACAA
TGTTGATCCTTACATGTCACAGAAGCACCAAGAAGTTGTGGCAATGGTCGGGCTAGATGCAATGAAGAGAGCAAACTCTA
CACTTGAGGATTTCTCCAGGTCTTATTTCATGTTTCATCGGTTGGATATTAACGAACCGCAATCAATATTCAGATATTTA
CCTGTGCTTTCATTCACAGAGAGTTACATATATCAGATGGATGCTCTTAATGAGAAGATAGTCAGTGAATCAGCTTGTGG
ATCCCAAGTCATTTATTCAAGTCATGGATGGAATGCTGAATCACGAGTTCTGTTTGAAACTAATCCGTTGAAGCCGCTTG
GAGATCTGCTTGAACGTGAGGCCCTTTTGACAAACAGAGTACAACAAGAGTTTAATTCGGGTGAAGAGTATTGGGCGTTA
GAAAGAAAGCTTTGTCATGCACTTTCAAACAAAAACAAGATCTGTCTCGAAGATGTGATGAGAGCAATTCATTTGAAGTC
CTTTGATTACAGGGTGTTGAATCTTCTACTATACAAGTTGAGAGGAGAAGAGGTGAATGAGTTGCATATGGACTTCTTGT
CAATCTCAGAGTTTCTGGTTGAAGTAGCGGATGATCTGTTCGATTACGAGGATGATGTTCTTGAGAACAATTTCAATGTA
TTACGTATGTTTGTGGGAATATTCGGCTCATCGAATGCACCCACTGAACTGGCGAAGCGGATATCGGAAGCTGAAGAGAA
GTATGAAGAGATTATGAAATCCCTGGATCCACATTTGTCTTCAAATTACCAAAGAAGATGTGAAGAAGCTACTAAAGAAG
GTGGGAAAATTTCTGGTCATTCGCTGGGAACATGGAACATACCGGCGGTTATCTCCGACGAAGAAGCATACCGATCTGCT
CAACGGTAA