Microexon ID At_1:23200643-23200651:+
Species Arabidopsis thaliana
Coordinates 1:23200643..23200651
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAACGCACGGCGTTTCATTTCCAGCCTGAGAAAAATTGGATGAACGATCCTAATGGTCCATTGTTCTATAAGGGATGGTACCATTTCTTCTATCAATACAACCCA
Microexon-tag Amino Acid Seq WQRTAFHFQPEKNWMNDPNGPLFYKGWYHFFYQYNP
Microexon-tag spanning region23200258-23201481
Microexon-tag prediction score0.9719
Overlapped with the annotated transcript (%) 100
New Transcript ID AT1G62660.1x
Reference Transcript ID AT1G62660.1
Gene ID AT1G62660
Gene Name BFRUCT3
Transcript ID AT1G62660.1
Protein ID AT1G62660.1
Gene ID AT1G62660
Gene Name BFRUCT3
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.2e-107
Motif start 110
Motif end 430
Protein seq >AT1G62660.1
MASTEALLPVTSLQDPLSESRSDQIPETRRRRPIKVHLAVYSGLLLIALYVTLIVTHDGSKAEIATESRPRMAGVSEKSN
DGVWISSDDGKVEAFPWNNTILSWQRTAFHFQPEKNWMNDPNGPLFYKGWYHFFYQYNPNAAVWGDIVWGHAVSKDLIHW
LYLPIAMVPDQWYDANGVWTGSATFLDDGSIVMLYTGSTDEFVQVQNLAYPEDPSDPLLLKWVKFSGNPVLVPPPGIGAK
DFRDPTTAWKTSSGKWRITIGSKINRTGISLIYDTTDFKTYEKHETLLHQVPNTGMWECVDFYPVSKTQLNGLDTSVNGP
DVKHVIKASMDDTRIDHYAIGTYDDSNATWVPDNPSIDVGISTGLRYDYGKYYASKTFYDQNKGRRILWGWIGESDSEAA
DVQKGWSSVQGIPRTVVLDTRTHKNLVQWPVEEIKSLRLSSKKFDMTIGPGTVVPVDVGSATQLDIEAEFEIKTDDLKLF
FDDDSVEADNKFSCETNGGSTARGALGPFGFSVLADEGLSEQTPVYFYVTKGKHSKLNTVFCTDTSRSTLANDVVKPIYG
SFVPVLKGEKLTMRILVDHSIVEGFAQGGRSCITSRVYPTKAIYGATKLFLFNNAIDATVTASFTVWQMNNAFIHPYSSD
DLGVPSST*
CDS seq >AT1G62660.1
ATGGCGAGCACGGAAGCTCTCTTACCCGTCACGTCCCTACAAGATCCATTATCTGAGTCAAGATCCGACCAGATCCCGGA
GACCCGTCGGAGACGACCCATCAAAGTTCATCTTGCAGTCTATTCCGGTTTGCTCTTGATCGCTTTGTACGTCACTCTCA
TCGTCACACACGACGGCTCCAAGGCTGAAATAGCGACGGAGTCACGTCCTCGTATGGCCGGCGTGTCGGAGAAGAGCAAC
GACGGGGTTTGGATATCATCCGATGATGGGAAAGTTGAAGCGTTCCCGTGGAACAATACTATTTTGTCGTGGCAACGCAC
GGCGTTTCATTTCCAGCCTGAGAAAAATTGGATGAACGATCCTAATGGTCCATTGTTCTATAAGGGATGGTACCATTTCT
TCTATCAATACAACCCAAATGCAGCTGTGTGGGGTGACATTGTTTGGGGTCACGCCGTGTCAAAAGACCTTATCCACTGG
CTTTATCTCCCAATAGCCATGGTTCCTGACCAATGGTACGATGCAAACGGTGTCTGGACCGGTTCAGCCACTTTTCTAGA
TGATGGCTCCATTGTCATGCTCTACACCGGTTCCACTGACGAATTCGTACAGGTTCAAAACCTTGCCTATCCTGAAGACC
CAAGCGACCCACTTTTGTTGAAATGGGTCAAGTTCTCCGGTAACCCTGTCCTCGTACCGCCTCCAGGTATTGGTGCAAAG
GACTTCCGTGACCCAACAACAGCCTGGAAGACATCTTCTGGAAAATGGCGAATCACCATCGGTTCCAAAATCAATAGAAC
CGGAATATCTCTCATTTATGACACTACCGATTTCAAAACCTACGAGAAACACGAAACCTTGTTGCACCAAGTCCCCAACA
CCGGAATGTGGGAGTGCGTTGATTTTTACCCGGTGTCGAAGACTCAGCTCAATGGGCTCGATACTTCGGTCAACGGACCA
GATGTCAAGCATGTCATCAAGGCTAGCATGGACGATACTAGAATTGACCATTATGCCATTGGGACGTACGATGATTCAAA
CGCTACATGGGTCCCCGATAATCCTTCTATCGATGTCGGAATCAGTACCGGTTTGAGATACGATTACGGGAAATATTATG
CGTCAAAGACGTTTTACGATCAAAATAAGGGACGAAGAATCTTATGGGGTTGGATCGGTGAATCTGACAGTGAAGCTGCT
GATGTACAAAAGGGTTGGTCTTCTGTTCAGGGCATCCCAAGAACTGTTGTATTGGACACAAGGACGCATAAAAACTTAGT
CCAGTGGCCAGTTGAGGAAATCAAATCATTGAGACTAAGCAGCAAGAAATTTGACATGACTATTGGACCAGGGACTGTGG
TTCCGGTCGATGTGGGTTCCGCCACTCAGCTAGACATAGAGGCTGAGTTCGAGATCAAGACCGATGATCTCAAGTTATTC
TTTGATGATGACTCTGTGGAGGCCGACAATAAATTCAGCTGCGAAACAAACGGAGGCTCCACAGCGCGTGGTGCTTTAGG
GCCTTTTGGATTCTCGGTTCTCGCTGACGAGGGCTTGTCAGAACAAACTCCGGTTTACTTCTATGTGACTAAGGGAAAAC
ATTCAAAACTCAATACTGTCTTCTGCACTGACACCTCAAGGTCGACTTTGGCAAACGATGTGGTGAAACCAATCTATGGA
AGCTTCGTACCGGTCTTAAAAGGAGAGAAATTGACAATGAGAATCTTGGTTGATCATTCGATCGTAGAAGGATTCGCACA
AGGTGGAAGATCATGTATTACCTCAAGAGTATATCCCACAAAAGCTATCTATGGAGCTACCAAGCTCTTCTTGTTCAATA
ACGCCATTGATGCGACCGTTACGGCGTCGTTTACGGTCTGGCAAATGAACAATGCTTTTATTCATCCTTACTCTTCAGAC
GATCTCGGTGTTCCTTCCAGCACCTGA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAACGCACGGCGTTTCATTTCCAGCCTGAGAAAAATTGGATGAACGATCCTAATGGTCCATTGTTCTATAAGGGATGGTACCATTTCTTCTATCAATACAACCCA
Microexon-tag Amino Acid seq WQRTAFHFQPEKNWMNDPNGPLFYKGWYHFFYQYNP
Transcript ID AT1G62660.1
Gene ID At.5379
Gene Name BFRUCT3
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.2e-107
Motif start 110
Motif end 430
Protein seq >AT1G62660.1
MASTEALLPVTSLQDPLSESRSDQIPETRRRRPIKVHLAVYSGLLLIALYVTLIVTHDGSKAEIATESRPRMAGVSEKSN
DGVWISSDDGKVEAFPWNNTILSWQRTAFHFQPEKNWMNDPNGPLFYKGWYHFFYQYNPNAAVWGDIVWGHAVSKDLIHW
LYLPIAMVPDQWYDANGVWTGSATFLDDGSIVMLYTGSTDEFVQVQNLAYPEDPSDPLLLKWVKFSGNPVLVPPPGIGAK
DFRDPTTAWKTSSGKWRITIGSKINRTGISLIYDTTDFKTYEKHETLLHQVPNTGMWECVDFYPVSKTQLNGLDTSVNGP
DVKHVIKASMDDTRIDHYAIGTYDDSNATWVPDNPSIDVGISTGLRYDYGKYYASKTFYDQNKGRRILWGWIGESDSEAA
DVQKGWSSVQGIPRTVVLDTRTHKNLVQWPVEEIKSLRLSSKKFDMTIGPGTVVPVDVGSATQLDIEAEFEIKTDDLKLF
FDDDSVEADNKFSCETNGGSTARGALGPFGFSVLADEGLSEQTPVYFYVTKGKHSKLNTVFCTDTSRSTLANDVVKPIYG
SFVPVLKGEKLTMRILVDHSIVEGFAQGGRSCITSRVYPTKAIYGATKLFLFNNAIDATVTASFTVWQMNNAFIHPYSSD
DLGVPSST*
CDS seq >AT1G62660.1
ATGGCGAGCACGGAAGCTCTCTTACCCGTCACGTCCCTACAAGATCCATTATCTGAGTCAAGATCCGACCAGATCCCGGA
GACCCGTCGGAGACGACCCATCAAAGTTCATCTTGCAGTCTATTCCGGTTTGCTCTTGATCGCTTTGTACGTCACTCTCA
TCGTCACACACGACGGCTCCAAGGCTGAAATAGCGACGGAGTCACGTCCTCGTATGGCCGGCGTGTCGGAGAAGAGCAAC
GACGGGGTTTGGATATCATCCGATGATGGGAAAGTTGAAGCGTTCCCGTGGAACAATACTATTTTGTCGTGGCAACGCAC
GGCGTTTCATTTCCAGCCTGAGAAAAATTGGATGAACGATCCTAATGGTCCATTGTTCTATAAGGGATGGTACCATTTCT
TCTATCAATACAACCCAAATGCAGCTGTGTGGGGTGACATTGTTTGGGGTCACGCCGTGTCAAAAGACCTTATCCACTGG
CTTTATCTCCCAATAGCCATGGTTCCTGACCAATGGTACGATGCAAACGGTGTCTGGACCGGTTCAGCCACTTTTCTAGA
TGATGGCTCCATTGTCATGCTCTACACCGGTTCCACTGACGAATTCGTACAGGTTCAAAACCTTGCCTATCCTGAAGACC
CAAGCGACCCACTTTTGTTGAAATGGGTCAAGTTCTCCGGTAACCCTGTCCTCGTACCGCCTCCAGGTATTGGTGCAAAG
GACTTCCGTGACCCAACAACAGCCTGGAAGACATCTTCTGGAAAATGGCGAATCACCATCGGTTCCAAAATCAATAGAAC
CGGAATATCTCTCATTTATGACACTACCGATTTCAAAACCTACGAGAAACACGAAACCTTGTTGCACCAAGTCCCCAACA
CCGGAATGTGGGAGTGCGTTGATTTTTACCCGGTGTCGAAGACTCAGCTCAATGGGCTCGATACTTCGGTCAACGGACCA
GATGTCAAGCATGTCATCAAGGCTAGCATGGACGATACTAGAATTGACCATTATGCCATTGGGACGTACGATGATTCAAA
CGCTACATGGGTCCCCGATAATCCTTCTATCGATGTCGGAATCAGTACCGGTTTGAGATACGATTACGGGAAATATTATG
CGTCAAAGACGTTTTACGATCAAAATAAGGGACGAAGAATCTTATGGGGTTGGATCGGTGAATCTGACAGTGAAGCTGCT
GATGTACAAAAGGGTTGGTCTTCTGTTCAGGGCATCCCAAGAACTGTTGTATTGGACACAAGGACGCATAAAAACTTAGT
CCAGTGGCCAGTTGAGGAAATCAAATCATTGAGACTAAGCAGCAAGAAATTTGACATGACTATTGGACCAGGGACTGTGG
TTCCGGTCGATGTGGGTTCCGCCACTCAGCTAGACATAGAGGCTGAGTTCGAGATCAAGACCGATGATCTCAAGTTATTC
TTTGATGATGACTCTGTGGAGGCCGACAATAAATTCAGCTGCGAAACAAACGGAGGCTCCACAGCGCGTGGTGCTTTAGG
GCCTTTTGGATTCTCGGTTCTCGCTGACGAGGGCTTGTCAGAACAAACTCCGGTTTACTTCTATGTGACTAAGGGAAAAC
ATTCAAAACTCAATACTGTCTTCTGCACTGACACCTCAAGGTCGACTTTGGCAAACGATGTGGTGAAACCAATCTATGGA
AGCTTCGTACCGGTCTTAAAAGGAGAGAAATTGACAATGAGAATCTTGGTTGATCATTCGATCGTAGAAGGATTCGCACA
AGGTGGAAGATCATGTATTACCTCAAGAGTATATCCCACAAAAGCTATCTATGGAGCTACCAAGCTCTTCTTGTTCAATA
ACGCCATTGATGCGACCGTTACGGCGTCGTTTACGGTCTGGCAAATGAACAATGCTTTTATTCATCCTTACTCTTCAGAC
GATCTCGGTGTTCCTTCCAGCACCTGA