Microexon ID Ha_3:151369945-151369953:+
Species Helianthus annuus
Coordinates 3:151369945..151369953
Microexon Cluster ID MEP22
Size 9
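The ID, coordinate, and size fields above encode the same interval, so one can be checked against the others. The sketch below is a minimal illustration, assuming the ID follows the layout `<prefix>_<chromosome>:<start>-<end>:<strand>` and that coordinates are 1-based and inclusive. Both points are inferred from this record, not taken from a documented specification, and `parse_microexon_id` is a hypothetical helper name.

```python
import re

# Assumed ID layout (inferred from this record): <prefix>_<chrom>:<start>-<end>:<strand>
ID_PATTERN = re.compile(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])")

def parse_microexon_id(microexon_id):
    """Split a microexon ID into its fields and derive the exon size."""
    match = ID_PATTERN.fullmatch(microexon_id)
    if match is None:
        raise ValueError(f"unrecognized microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "prefix": prefix,
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes 1-based inclusive coordinates.
        "size": end - start + 1,
    }

record = parse_microexon_id("Ha_3:151369945-151369953:+")
print(record["chrom"], record["start"], record["end"], record["strand"], record["size"])
```

Under these assumptions the derived size is 9, which agrees with the Size field.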
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (consensus, IUPAC ambiguity codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq [figure: NT60 Logo]
Alignment of exons MSA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAATGCCCGCTTTCCATTTCACTCCGGGAAAGAACTGGATGAACGATCCAAATGGTCCGGTATTTTACAAGGGATGGTACCATTTATTTTACCAATACAGTCCA
Microexon-tag Amino Acid Seq WQMPAFHFTPGKNWMNDPNGPVFYKGWYHLFYQYSP
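The microexon and amino acid fields above can be re-derived from the tag sequence and its 49,9,50 structure. The following sketch is an illustration and not the database's own pipeline. It assumes the tag starts on a codon boundary and uses the standard genetic code; `split_tag`, `translate`, and `microexon_residues` are hypothetical helper names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def split_tag(tag, sizes):
    """Cut the tag into consecutive segments of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("segment sizes do not add up to the tag length")
    parts, pos = [], 0
    for n in sizes:
        parts.append(tag[pos:pos + n])
        pos += n
    return parts

def microexon_residues(tag, sizes):
    """Return the residues whose codons overlap the middle segment."""
    start = sizes[0]
    end = start + sizes[1]  # exclusive
    protein = translate(tag)
    return protein[start // 3:(end - 1) // 3 + 1]

TAG = ("TGGCAAATGCCCGCTTTCCATTTCACTCCGGGAAAGAACTGGATGAACG"  # upstream flank, 49 nt
       "ATCCAAATG"                                          # microexon, 9 nt
       "GTCCGGTATTTTACAAGGGATGGTACCATTTATTTTACCAATACAGTCCA")  # downstream flank, 50 nt
SIZES = (49, 9, 50)

upstream, microexon, downstream = split_tag(TAG, SIZES)
print(microexon)                       # ATCCAAATG
print(translate(TAG))                  # WQMPAFHFTPGKNWMNDPNGPVFYKGWYHLFYQYSP
print(microexon_residues(TAG, SIZES))  # DPNG
print(len(upstream) % 3)               # 1
```

The 9 nt microexon overlaps four codons because the upstream flank contributes one base to the first codon (49 mod 3 = 1). If the tag does begin in frame, this matches the listed Phase 1 and explains why a 9 nt exon is reported with the four residues DPNG.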
Microexon-tag spanning region 151369605-151370127
Microexon-tag prediction score 0.9271
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG32544x
Reference Transcript ID OTG32544
Gene ID HannXRQ_Chr03g0087861
Gene Name NA
Transcript ID OTG32544
Protein ID OTG32544
Pfam domain motif Glyco_hydro_32N
Motif E-value 2e-103
Motif start 120
Motif end 438
Protein seq >OTG32544
MTSQTSSDVENPTHAPLLDDDYGYVPSLDASDQPSQPQETNRQPSKPVLLLISGLLAIGLLVALIVGNVRIETHDNDAPL
PERRDPIEQGVSEKSFGLPPNVSSFSWTDDMLSWQMPAFHFTPGKNWMNDPNGPVFYKGWYHLFYQYSPAAPVWGLIVWG
HAVSRDMVHWRHLPIAMETDKWYDVNGVWTGSTTILPNNELVVLYTGSTNESVQVQNLAYPANPSDPLLVNWVKDPANPV
LVPPKWINIKDFRDPTTAWLTPEGRWRMVIGTKVNRTGIAILYDTDDFKSYDLLDTKLHQVDGTGMWECVDFFPVSKNES
GGLDTGVIGPGIKHVLKASMDDDRCDYYAIGDYNPKTGTWVPDNPKIDVGIGLRYDYGIYYASKTFYDQNKERRILWSWI
KETDSESSDIKKGWASLMAVPRTVALDPLTGSNLLQWPIEELENIRSNLNEFNEVKLEPGSLVPLNVGPTSQLDIMVEFE
LDKKLTDSLLPLKAGSVPYNCAGHGGAGVRDALGPFGVLVLANKNLTEHTPAYFYIVKGKKGELDTFFCIDQSRSSIAKD
VDKSIYGSTVPVLEGEKLSMRILVDHSIVEGYAQGGRSCITSRVYPTEAINDDAQIFLFNNATSITVTASLKTWQMGVSS
TNGSGWVWIGPSLILVVVFFFVVWTWQNQRNNRS*
CDS seq >OTG32544
ATGACATCCCAAACTTCATCCGACGTCGAAAACCCAACTCACGCCCCCCTTCTCGATGATGATTATGGCTATGTCCCGTC
CCTAGACGCAAGTGATCAACCCTCTCAACCTCAAGAAACCAACCGACAACCTAGCAAACCCGTTCTCCTTTTGATATCCG
GGTTGTTGGCCATTGGCTTGTTGGTGGCCTTGATAGTCGGAAATGTGCGTATTGAAACCCATGACAACGATGCTCCATTG
CCCGAAAGGAGGGATCCAATCGAACAAGGTGTTTCCGAAAAATCATTCGGGTTACCGCCTAATGTATCTTCTTTTTCGTG
GACCGACGACATGTTGAGCTGGCAAATGCCCGCTTTCCATTTCACTCCGGGAAAGAACTGGATGAACGATCCAAATGGTC
CGGTATTTTACAAGGGATGGTACCATTTATTTTACCAATACAGTCCAGCGGCCCCAGTATGGGGTCTAATCGTGTGGGGT
CATGCGGTATCCAGAGACATGGTTCATTGGCGACACCTTCCAATAGCAATGGAGACAGACAAATGGTACGATGTGAATGG
TGTATGGACCGGGTCAACAACCATCCTTCCAAATAATGAACTCGTGGTACTTTACACCGGTTCAACCAATGAATCGGTTC
AGGTCCAAAACCTAGCGTACCCAGCCAACCCATCTGACCCCCTCCTTGTTAACTGGGTCAAGGACCCCGCTAACCCGGTC
TTGGTCCCACCAAAATGGATCAACATAAAGGACTTCCGTGACCCGACCACGGCCTGGCTCACACCTGAAGGCAGATGGCG
AATGGTGATCGGAACAAAAGTAAATAGAACCGGAATTGCGATATTGTACGATACCGATGATTTCAAAAGTTACGACCTTC
TAGACACCAAGCTTCATCAGGTCGATGGTACGGGAATGTGGGAATGTGTCGATTTTTTTCCGGTTTCGAAAAATGAGTCT
GGTGGTCTTGATACCGGGGTTATTGGGCCTGGTATAAAGCATGTTCTTAAAGCTAGCATGGATGATGATAGATGCGATTA
TTATGCGATTGGAGATTATAACCCGAAAACCGGAACATGGGTCCCGGATAATCCAAAAATCGACGTTGGAATCGGTTTAA
GATATGATTATGGGATATACTACGCGTCTAAAACGTTCTATGACCAAAACAAAGAACGAAGGATTTTATGGAGTTGGATC
AAGGAGACCGATAGCGAAAGCTCGGATATCAAGAAGGGTTGGGCTTCTCTTATGGCGGTTCCAAGAACGGTTGCGCTAGA
TCCATTAACCGGTAGCAATTTGCTTCAATGGCCCATTGAAGAGTTGGAAAACATTAGGTCTAATCTGAATGAGTTCAATG
AAGTTAAACTTGAGCCCGGCTCACTCGTTCCACTTAATGTGGGCCCAACATCTCAGTTGGATATAATGGTCGAGTTTGAG
TTGGACAAGAAGCTTACCGATAGCCTACTACCCTTAAAAGCTGGTAGTGTCCCATACAACTGTGCGGGCCACGGTGGAGC
TGGTGTGAGAGACGCGCTAGGCCCATTTGGCGTGTTGGTTCTTGCAAACAAGAACCTAACCGAACACACGCCTGCTTACT
TCTACATTGTCAAAGGCAAAAAAGGTGAACTGGATACATTCTTTTGCATAGATCAATCAAGATCTTCCATTGCTAAGGAT
GTCGATAAGTCTATTTACGGTAGCACCGTTCCTGTTCTTGAAGGCGAAAAACTCTCCATGAGAATCTTGGTGGATCATTC
GATAGTCGAAGGTTACGCACAAGGAGGGCGATCATGCATCACCTCACGTGTTTATCCAACAGAAGCGATTAACGACGATG
CTCAAATATTCTTATTCAACAATGCCACTAGCATTACCGTGACAGCTTCACTCAAGACATGGCAAATGGGCGTCTCATCA
ACGAACGGCAGCGGATGGGTTTGGATTGGCCCATCACTTATCTTAGTTGTCGTCTTTTTCTTCGTGGTATGGACGTGGCA
AAATCAGAGAAACAACCGGTCCTAA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAATGCCCGCTTTCCATTTCACTCCGGGAAAGAACTGGATGAACGATCCAAATGGTCCGGTATTTTACAAGGGATGGTACCATTTATTTTACCAATACAGTCCA
Microexon-tag Amino Acid Seq WQMPAFHFTPGKNWMNDPNGPVFYKGWYHLFYQYSP
Transcript ID OTG32544
Gene ID Ha.38994
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.1e-103
Motif start 120
Motif end 438
Protein seq >OTG32544
MTSQTSSDVENPTHAPLLDDDYGYVPSLDASDQPSQPQETNRQPSKPVLLLISGLLAIGLLVALIVGNVRIETHDNDAPL
PERRDPIEQGVSEKSFGLPPNVSSFSWTDDMLSWQMPAFHFTPGKNWMNDPNGPVFYKGWYHLFYQYSPAAPVWGLIVWG
HAVSRDMVHWRHLPIAMETDKWYDVNGVWTGSTTILPNNELVVLYTGSTNESVQVQNLAYPANPSDPLLVNWVKDPANPV
LVPPKWINIKDFRDPTTAWLTPEGRWRMVIGTKVNRTGIAILYDTDDFKSYDLLDTKLHQVDGTGMWECVDFFPVSKNES
GGLDTGVIGPGIKHVLKASMDDDRCDYYAIGDYNPKTGTWVPDNPKIDVGIGLRYDYGIYYASKTFYDQNKERRILWSWI
KETDSESSDIKKGWASLMAVPRTVALDPLTGSNLLQWPIEELENIRSNLNEFNEVKLEPGSLVPLNVGPTSQLDIMVEFE
LDKKLTDSLLPLKAGSVPYNCAGHGGAGVRDALGPFGVLVLANKNLTEHTPAYFYIVKGKKGELDTFFCIDQSRSSIAKD
VDKSIYGSTVPVLEGEKLSMRILVDHSIVEGYAQGGRSCITSRVYPTEAINDDAQIFLFNNATSITVTASLKTWQMGVSS
TNGSGWVWIGPSLILVVVFFFVVWTWQNQRNNRS*
CDS seq >OTG32544
ATGACATCCCAAACTTCATCCGACGTCGAAAACCCAACTCACGCCCCCCTTCTCGATGATGATTATGGCTATGTCCCGTC
CCTAGACGCAAGTGATCAACCCTCTCAACCTCAAGAAACCAACCGACAACCTAGCAAACCCGTTCTCCTTTTGATATCCG
GGTTGTTGGCCATTGGCTTGTTGGTGGCCTTGATAGTCGGAAATGTGCGTATTGAAACCCATGACAACGATGCTCCATTG
CCCGAAAGGAGGGATCCAATCGAACAAGGTGTTTCCGAAAAATCATTCGGGTTACCGCCTAATGTATCTTCTTTTTCGTG
GACCGACGACATGTTGAGCTGGCAAATGCCCGCTTTCCATTTCACTCCGGGAAAGAACTGGATGAACGATCCAAATGGTC
CGGTATTTTACAAGGGATGGTACCATTTATTTTACCAATACAGTCCAGCGGCCCCAGTATGGGGTCTAATCGTGTGGGGT
CATGCGGTATCCAGAGACATGGTTCATTGGCGACACCTTCCAATAGCAATGGAGACAGACAAATGGTACGATGTGAATGG
TGTATGGACCGGGTCAACAACCATCCTTCCAAATAATGAACTCGTGGTACTTTACACCGGTTCAACCAATGAATCGGTTC
AGGTCCAAAACCTAGCGTACCCAGCCAACCCATCTGACCCCCTCCTTGTTAACTGGGTCAAGGACCCCGCTAACCCGGTC
TTGGTCCCACCAAAATGGATCAACATAAAGGACTTCCGTGACCCGACCACGGCCTGGCTCACACCTGAAGGCAGATGGCG
AATGGTGATCGGAACAAAAGTAAATAGAACCGGAATTGCGATATTGTACGATACCGATGATTTCAAAAGTTACGACCTTC
TAGACACCAAGCTTCATCAGGTCGATGGTACGGGAATGTGGGAATGTGTCGATTTTTTTCCGGTTTCGAAAAATGAGTCT
GGTGGTCTTGATACCGGGGTTATTGGGCCTGGTATAAAGCATGTTCTTAAAGCTAGCATGGATGATGATAGATGCGATTA
TTATGCGATTGGAGATTATAACCCGAAAACCGGAACATGGGTCCCGGATAATCCAAAAATCGACGTTGGAATCGGTTTAA
GATATGATTATGGGATATACTACGCGTCTAAAACGTTCTATGACCAAAACAAAGAACGAAGGATTTTATGGAGTTGGATC
AAGGAGACCGATAGCGAAAGCTCGGATATCAAGAAGGGTTGGGCTTCTCTTATGGCGGTTCCAAGAACGGTTGCGCTAGA
TCCATTAACCGGTAGCAATTTGCTTCAATGGCCCATTGAAGAGTTGGAAAACATTAGGTCTAATCTGAATGAGTTCAATG
AAGTTAAACTTGAGCCCGGCTCACTCGTTCCACTTAATGTGGGCCCAACATCTCAGTTGGATATAATGGTCGAGTTTGAG
TTGGACAAGAAGCTTACCGATAGCCTACTACCCTTAAAAGCTGGTAGTGTCCCATACAACTGTGCGGGCCACGGTGGAGC
TGGTGTGAGAGACGCGCTAGGCCCATTTGGCGTGTTGGTTCTTGCAAACAAGAACCTAACCGAACACACGCCTGCTTACT
TCTACATTGTCAAAGGCAAAAAAGGTGAACTGGATACATTCTTTTGCATAGATCAATCAAGATCTTCCATTGCTAAGGAT
GTCGATAAGTCTATTTACGGTAGCACCGTTCCTGTTCTTGAAGGCGAAAAACTCTCCATGAGAATCTTGGTGGATCATTC
GATAGTCGAAGGTTACGCACAAGGAGGGCGATCATGCATCACCTCACGTGTTTATCCAACAGAAGCGATTAACGACGATG
CTCAAATATTCTTATTCAACAATGCCACTAGCATTACCGTGACAGCTTCACTCAAGACATGGCAAATGGGCGTCTCATCA
ACGAACGGCAGCGGATGGGTTTGGATTGGCCCATCACTTATCTTAGTTGTCGTCTTTTTCTTCGTGGTATGGACGTGGCA
AAATCAGAGAAACAACCGGTCCTAA