Microexon ID At_1:4154430-4154438:+
Species Arabidopsis thaliana
Coordinates 1:4154430..4154438
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAACGAACGGCGTTTCATTTTCAGCCTGAGCAAAATTGGATGAACGATCCTAACGGTCCATTGTTCTACAAGGGATGGTACCATTTCTTCTACCAATATAACCCA
Microexon-tag Amino Acid Seq WQRTAFHFQPEQNWMNDPNGPLFYKGWYHFFYQYNP
Microexon-tag spanning region4154053-4155139
Microexon-tag prediction score0.9709
Overlapped with the annotated transcript (%) 100
New Transcript ID AT1G12240.1x
Reference Transcript ID AT1G12240.1
Gene ID AT1G12240
Gene Name BFRUCT4
Transcript ID AT1G12240.1
Protein ID AT1G12240.1
Gene ID AT1G12240
Gene Name BFRUCT4
Pfam domain motif Glyco_hydro_32N
Motif E-value 8.7e-105
Motif start 125
Motif end 445
Protein seq >AT1G12240.1
MASSDALLPISAREEEPLCPYTRLPMADPNQETHGPRRRRPFKGLLAVSFGLLFIAFYVALIATHDGSRSNDEGIDETET
ITSRARLAGVSEKRNDGLWKLSGDRNTPAFEWNNSMLSWQRTAFHFQPEQNWMNDPNGPLFYKGWYHFFYQYNPNAAVWG
DIVWGHAVSRDLIHWVHLPIAMVADQWYDSNGVWTGSATFLPDGSIVMLYTGSTDKAVQVQNLAYPEDPNDPLLLKWVKF
PGNPVLVPPPGILPKDFRDPTTAWKTSEGKWRITIGSKLNKTGISLVYDTIDFKTYEKLDTLLHRVPNTGMWECVDFYPV
SKTAGNGLDTSVNGPDVKHIVKASMDDTRFDHYAVGTYFDSNGTWIPDDPTIDVGMTASLRYDYGKFYASKSFYDQNKGR
RVLWSWIGESDSEASDVQKGWSSLQGIPRTVVLDTKTGKNLVQWPVEEIKSLRLSSKQFDLEVGPGSVVPVDVGSAAQLD
IEAEFEINKESLDKIIGNASVVAEAEEFSCEKSGGSTVRGALGPFGFSVLATESLSEQTPVYFYVAKGKDSELKTFFCTD
TSRSSVANDVVKPIYGSVVPVLKGEKLTMRILVDHSIVEAFGQGGRTCITSRVYPTTAIYGAAKLFLFNNALDATVTASF
TVWQMNSAFIHPYSDEAVRALSRT*
CDS seq >AT1G12240.1
ATGGCGAGCTCCGATGCTCTCTTGCCAATCTCCGCCAGAGAAGAAGAACCATTATGTCCTTACACGAGATTACCAATGGC
CGACCCGAATCAAGAAACCCATGGCCCCCGGAGAAGAAGACCCTTTAAAGGTCTCCTCGCCGTCTCATTTGGTCTCTTGT
TCATCGCCTTTTACGTCGCTCTCATCGCCACACACGACGGATCTAGATCCAACGACGAAGGGATCGATGAAACAGAGACG
ATAACGTCACGTGCACGTCTTGCTGGTGTGTCGGAGAAACGTAACGATGGGTTATGGAAACTTTCCGGTGATCGGAACAC
GCCGGCGTTTGAATGGAACAATAGTATGTTGTCGTGGCAACGAACGGCGTTTCATTTTCAGCCTGAGCAAAATTGGATGA
ACGATCCTAACGGTCCATTGTTCTACAAGGGATGGTACCATTTCTTCTACCAATATAACCCAAACGCAGCCGTATGGGGT
GACATTGTTTGGGGTCACGCCGTGTCTAGGGACCTAATCCATTGGGTCCATTTGCCCATAGCCATGGTCGCTGATCAATG
GTACGACTCCAACGGTGTGTGGACCGGCTCAGCCACATTTCTCCCTGATGGCTCTATAGTCATGCTCTATACCGGTTCCA
CCGACAAAGCGGTGCAGGTCCAAAACCTTGCCTACCCTGAAGACCCCAACGACCCACTTCTGTTGAAATGGGTCAAGTTC
CCGGGGAACCCGGTTCTAGTACCTCCGCCCGGTATCCTCCCTAAGGACTTCCGTGACCCAACGACTGCATGGAAGACATC
AGAAGGAAAATGGCGGATCACGATTGGTTCCAAGCTCAACAAAACTGGAATCTCACTCGTGTACGACACAATCGACTTTA
AAACATACGAGAAACTTGACACATTGTTGCACCGAGTTCCCAACACTGGAATGTGGGAGTGTGTTGACTTTTACCCGGTG
TCTAAGACTGCGGGCAATGGGCTTGACACATCGGTCAATGGACCGGATGTGAAGCATATCGTGAAGGCTAGCATGGACGA
CACGAGGTTCGATCATTATGCTGTAGGCACGTATTTCGATTCAAACGGAACATGGATCCCCGATGATCCTACTATCGATG
TTGGGATGACTGCCAGTTTAAGATATGATTACGGAAAGTTCTATGCTTCAAAGTCGTTTTACGACCAGAACAAGGGTCGA
AGAGTCTTGTGGAGTTGGATTGGTGAGTCTGATAGTGAGGCTTCTGATGTACAAAAGGGTTGGTCTTCTCTCCAGGGTAT
CCCAAGAACCGTTGTCCTCGACACAAAGACAGGAAAGAACTTGGTTCAATGGCCAGTAGAAGAAATCAAATCTCTTAGAC
TAAGCAGCAAGCAATTTGATCTCGAGGTCGGTCCCGGGTCAGTGGTACCGGTCGATGTAGGTTCCGCAGCTCAGCTAGAC
ATCGAAGCAGAATTCGAGATTAACAAAGAATCTCTAGACAAAATCATCGGAAACGCTTCGGTAGTGGCTGAAGCCGAGGA
ATTTAGCTGCGAAAAAAGCGGAGGCTCCACCGTCCGTGGTGCTTTAGGGCCATTCGGATTCTCGGTACTTGCCACAGAGA
GCTTGTCTGAGCAAACACCGGTTTACTTCTATGTAGCTAAGGGAAAAGATTCAGAGCTCAAAACTTTCTTCTGCACAGAC
ACCTCAAGGTCATCTGTTGCAAACGATGTCGTTAAACCGATATACGGTAGCGTCGTACCGGTTCTAAAAGGGGAGAAACT
GACCATGAGAATTTTGGTGGATCATTCGATAGTAGAAGCATTCGGACAAGGTGGAAGAACATGTATAACATCAAGAGTCT
ATCCAACAACTGCAATCTATGGAGCAGCCAAGCTCTTCTTGTTCAATAATGCTCTTGATGCGACGGTTACGGCGTCGTTT
ACAGTTTGGCAAATGAACAGTGCTTTTATTCATCCTTACTCTGACGAAGCTGTCCGTGCTCTCTCCCGTACCTGA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAACGAACGGCGTTTCATTTTCAGCCTGAGCAAAATTGGATGAACGATCCTAACGGTCCATTGTTCTACAAGGGATGGTACCATTTCTTCTACCAATATAACCCA
Microexon-tag Amino Acid seq WQRTAFHFQPEQNWMNDPNGPLFYKGWYHFFYQYNP
Transcript ID At.1292.1
Gene ID At.1292
Gene Name BFRUCT4
Pfam domain motif Glyco_hydro_32N
Motif E-value 8.7e-105
Motif start 125
Motif end 445
Protein seq >At.1292.1
MASSDALLPISAREEEPLCPYTRLPMADPNQETHGPRRRRPFKGLLAVSFGLLFIAFYVALIATHDGSRSNDEGIDETET
ITSRARLAGVSEKRNDGLWKLSGDRNTPAFEWNNSMLSWQRTAFHFQPEQNWMNDPNGPLFYKGWYHFFYQYNPNAAVWG
DIVWGHAVSRDLIHWVHLPIAMVADQWYDSNGVWTGSATFLPDGSIVMLYTGSTDKAVQVQNLAYPEDPNDPLLLKWVKF
PGNPVLVPPPGILPKDFRDPTTAWKTSEGKWRITIGSKLNKTGISLVYDTIDFKTYEKLDTLLHRVPNTGMWECVDFYPV
SKTAGNGLDTSVNGPDVKHIVKASMDDTRFDHYAVGTYFDSNGTWIPDDPTIDVGMTASLRYDYGKFYASKSFYDQNKGR
RVLWSWIGESDSEASDVQKGWSSLQGIPRTVVLDTKTGKNLVQWPVEEIKSLRLSSKQFDLEVGPGSVVPVDVGSAAQLD
IEAEFEINKESLDKIIGNASVVAEAEEFSCEKSGGSTVRGALGPFGFSVLATESLSEQTPVYFYVAKGKDSELKTFFCTD
TSRSSVANDVVKPIYGSVVPVLKGEKLTMRILVDHSIVEAFGQGGRTCITSRVYPTTAIYGAAKLFLFNNALDATVTASF
TVWQMNSAFIHPYSDEAVRALSRT*
CDS seq >At.1292.1
ATGGCGAGCTCCGATGCTCTCTTGCCAATCTCCGCCAGAGAAGAAGAACCATTATGTCCTTACACGAGATTACCAATGGC
CGACCCGAATCAAGAAACCCATGGCCCCCGGAGAAGAAGACCCTTTAAAGGTCTCCTCGCCGTCTCATTTGGTCTCTTGT
TCATCGCCTTTTACGTCGCTCTCATCGCCACACACGACGGATCTAGATCCAACGACGAAGGGATCGATGAAACAGAGACG
ATAACGTCACGTGCACGTCTTGCTGGTGTGTCGGAGAAACGTAACGATGGGTTATGGAAACTTTCCGGTGATCGGAACAC
GCCGGCGTTTGAATGGAACAATAGTATGTTGTCGTGGCAACGAACGGCGTTTCATTTTCAGCCTGAGCAAAATTGGATGA
ACGATCCTAACGGTCCATTGTTCTACAAGGGATGGTACCATTTCTTCTACCAATATAACCCAAACGCAGCCGTATGGGGT
GACATTGTTTGGGGTCACGCCGTGTCTAGGGACCTAATCCATTGGGTCCATTTGCCCATAGCCATGGTCGCTGATCAATG
GTACGACTCCAACGGTGTGTGGACCGGCTCAGCCACATTTCTCCCTGATGGCTCTATAGTCATGCTCTATACCGGTTCCA
CCGACAAAGCGGTGCAGGTCCAAAACCTTGCCTACCCTGAAGACCCCAACGACCCACTTCTGTTGAAATGGGTCAAGTTC
CCGGGGAACCCGGTTCTAGTACCTCCGCCCGGTATCCTCCCTAAGGACTTCCGTGACCCAACGACTGCATGGAAGACATC
AGAAGGAAAATGGCGGATCACGATTGGTTCCAAGCTCAACAAAACTGGAATCTCACTCGTGTACGACACAATCGACTTTA
AAACATACGAGAAACTTGACACATTGTTGCACCGAGTTCCCAACACTGGAATGTGGGAGTGTGTTGACTTTTACCCGGTG
TCTAAGACTGCGGGCAATGGGCTTGACACATCGGTCAATGGACCGGATGTGAAGCATATCGTGAAGGCTAGCATGGACGA
CACGAGGTTCGATCATTATGCTGTAGGCACGTATTTCGATTCAAACGGAACATGGATCCCCGATGATCCTACTATCGATG
TTGGGATGACTGCCAGTTTAAGATATGATTACGGAAAGTTCTATGCTTCAAAGTCGTTTTACGACCAGAACAAGGGTCGA
AGAGTCTTGTGGAGTTGGATTGGTGAGTCTGATAGTGAGGCTTCTGATGTACAAAAGGGTTGGTCTTCTCTCCAGGGTAT
CCCAAGAACCGTTGTCCTCGACACAAAGACAGGAAAGAACTTGGTTCAATGGCCAGTAGAAGAAATCAAATCTCTTAGAC
TAAGCAGCAAGCAATTTGATCTCGAGGTCGGTCCCGGGTCAGTGGTACCGGTCGATGTAGGTTCCGCAGCTCAGCTAGAC
ATCGAAGCAGAATTCGAGATTAACAAAGAATCTCTAGACAAAATCATCGGAAACGCTTCGGTAGTGGCTGAAGCCGAGGA
ATTTAGCTGCGAAAAAAGCGGAGGCTCCACCGTCCGTGGTGCTTTAGGGCCATTCGGATTCTCGGTACTTGCCACAGAGA
GCTTGTCTGAGCAAACACCGGTTTACTTCTATGTAGCTAAGGGAAAAGATTCAGAGCTCAAAACTTTCTTCTGCACAGAC
ACCTCAAGGTCATCTGTTGCAAACGATGTCGTTAAACCGATATACGGTAGCGTCGTACCGGTTCTAAAAGGGGAGAAACT
GACCATGAGAATTTTGGTGGATCATTCGATAGTAGAAGCATTCGGACAAGGTGGAAGAACATGTATAACATCAAGAGTCT
ATCCAACAACTGCAATCTATGGAGCAGCCAAGCTCTTCTTGTTCAATAATGCTCTTGATGCGACGGTTACGGCGTCGTTT
ACAGTTTGGCAAATGAACAGTGCTTTTATTCATCCTTACTCTGACGAAGCTGTCCGTGCTCTCTCCCGTACCTGA