Microexon ID Ha_7:91537214-91537222:+
Species Helianthus annuus
Coordinates 7:91537214..91537222
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGGAACGGACCGCTTTTCATTTTCAGCCTGCCAAGAATTTTATTTACGATCCAAATGGTCCGTTGTTTCACATGGGCTGGTACCATTTGTTCTATCAATACAACCCA
Microexon-tag Amino Acid Seq WERTAFHFQPAKNFIYDPNGPLFHMGWYHLFYQYNP
Microexon-tag spanning region91537043-91538034
Microexon-tag prediction score0.9501
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG21394x
Reference Transcript ID OTG21394
Gene ID HannXRQ_Chr07g0203661
Gene Name NA
Transcript ID OTG21394
Protein ID OTG21394
Gene ID HannXRQ_Chr07g0203661
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.4e-86
Motif start 88
Motif end 406
Protein seq >OTG21394
MQTPEPFTDLDHEPLLDHHHPPQQTTTKPLFTRVVSGVTFVVFFFAFAIVFIVLNQQNSSVHIDTNSDKSFIRYSKADRL
SWERTAFHFQPAKNFIYDPNGPLFHMGWYHLFYQYNPYAPVWGNMSWGHSVSKDMINWYELPVAMVPTEWYDIEGVLSGS
TTVLPNGQIFALYTGNANDFSQLQCKAVPVNLSDPLLIEWVKYDDNPILYTPPGIGLMDYRDPSTVWTGPDGKHRMIMGS
KRGNTGMILVYHTTDYTNYELLDEPLHSVPNTDMWECVDFYPVSLTNDSALDMAAYGSGIKHVIKESWEGHGMDWYSIGT
YDAINDKWTPDNPELDVGIGLRCDYGKFFASKSLYDPLKKRRITWAYVGESDSVDQDLSRGWATVYNVGRTIVLDRKTGT
HLLHWPVEEVESLRYNGQEFKEIELEPGSIIPLDIGTATQLDIVATFEVDQAALNATSETDDIYGCTTSLGAAQRGSLGP
FGLAVLADGTLSELTPVYFYIAKKADGGLSTHFCTDKLRSSLDYDGERVVYGSTVPVLDDEELTMRLLVDHSIVEGFAQG
GRTVITSRVYPTKAIYEQAKLFLFNNATGTSVKASLKIWQMASAQIHQYSF*
CDS seq >OTG21394
ATGCAAACCCCTGAACCCTTTACAGACCTTGATCATGAACCCCTACTGGACCACCACCACCCACCACAACAAACCACCAC
AAAACCTTTGTTCACCAGGGTTGTGTCCGGTGTCACCTTTGTTGTATTCTTCTTTGCTTTCGCTATCGTATTCATTGTTC
TCAACCAACAGAATTCTTCTGTTCATATCGACACCAATTCGGATAAATCTTTTATAAGGTATTCGAAGGCCGATCGCTTG
TCGTGGGAACGGACCGCTTTTCATTTTCAGCCTGCCAAGAATTTTATTTACGATCCAAATGGTCCGTTGTTTCACATGGG
CTGGTACCATTTGTTCTATCAATACAACCCATACGCACCGGTTTGGGGCAATATGTCATGGGGTCACTCAGTGTCCAAAG
ACATGATCAACTGGTACGAGCTGCCAGTCGCTATGGTCCCGACCGAATGGTATGATATCGAGGGCGTCTTATCCGGGTCT
ACCACGGTCCTTCCAAACGGGCAGATCTTTGCATTGTATACTGGGAACGCTAATGATTTTTCCCAATTACAATGCAAAGC
TGTGCCCGTAAACTTATCTGACCCGCTTCTTATTGAGTGGGTCAAGTATGACGATAACCCAATCCTGTACACTCCACCAG
GGATTGGGTTAATGGACTACCGGGACCCGTCAACAGTCTGGACAGGTCCCGATGGAAAGCATAGGATGATCATGGGATCT
AAACGTGGCAATACAGGCATGATACTCGTTTACCATACCACCGATTACACGAACTACGAGTTGTTGGATGAGCCGTTGCA
CTCCGTTCCCAACACCGATATGTGGGAATGCGTCGACTTTTACCCGGTTTCGTTAACCAATGATAGTGCACTTGATATGG
CGGCCTATGGGTCGGGTATCAAACACGTTATTAAAGAAAGTTGGGAGGGACATGGAATGGATTGGTATTCAATCGGGACA
TATGACGCGATAAATGATAAATGGACTCCCGATAACCCGGAACTAGATGTCGGTATCGGGTTACGGTGCGATTACGGGAA
GTTTTTTGCATCAAAGAGTCTTTATGACCCATTGAAGAAAAGGAGGATCACTTGGGCTTATGTTGGAGAATCAGATAGTG
TTGACCAGGACCTCTCTAGAGGATGGGCTACTGTTTATAATGTTGGAAGAACAATTGTACTAGATAGAAAAACCGGGACC
CATTTACTTCATTGGCCCGTTGAGGAGGTCGAGAGTTTGAGATACAACGGTCAGGAGTTTAAAGAGATCGAGCTAGAGCC
CGGTTCAATCATTCCACTCGACATAGGCACGGCTACACAGTTGGACATAGTTGCAACATTTGAGGTGGATCAAGCAGCGT
TGAACGCGACAAGTGAAACCGATGATATTTATGGTTGCACCACTAGCTTAGGTGCAGCCCAAAGGGGAAGTTTGGGACCA
TTTGGTCTTGCGGTTCTAGCCGATGGAACCCTTTCTGAGTTAACTCCGGTTTATTTCTACATTGCTAAAAAGGCCGATGG
AGGTTTGTCGACACATTTTTGTACCGATAAGCTAAGGTCATCACTGGATTATGATGGAGAGAGAGTGGTGTATGGGAGCA
CTGTTCCTGTGTTAGATGATGAAGAACTCACAATGAGGCTATTGGTGGATCATTCGATAGTAGAGGGGTTTGCGCAAGGA
GGAAGGACGGTTATAACATCAAGGGTGTATCCAACAAAAGCGATATACGAACAAGCGAAGTTGTTCTTGTTCAACAACGC
TACAGGTACGAGTGTGAAGGCATCTCTCAAGATTTGGCAAATGGCTTCTGCACAAATTCATCAATACTCGTTTTAA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGGAACGGACCGCTTTTCATTTTCAGCCTGCCAAGAATTTTATTTACGATCCAAATGGTCCGTTGTTTCACATGGGCTGGTACCATTTGTTCTATCAATACAACCCA
Microexon-tag Amino Acid seq WERTAFHFQPAKNFIYDPNGPLFHMGWYHLFYQYNP
Transcript ID OTG21394
Gene ID Ha.51038
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.4e-86
Motif start 88
Motif end 406
Protein seq >OTG21394
MQTPEPFTDLDHEPLLDHHHPPQQTTTKPLFTRVVSGVTFVVFFFAFAIVFIVLNQQNSSVHIDTNSDKSFIRYSKADRL
SWERTAFHFQPAKNFIYDPNGPLFHMGWYHLFYQYNPYAPVWGNMSWGHSVSKDMINWYELPVAMVPTEWYDIEGVLSGS
TTVLPNGQIFALYTGNANDFSQLQCKAVPVNLSDPLLIEWVKYDDNPILYTPPGIGLMDYRDPSTVWTGPDGKHRMIMGS
KRGNTGMILVYHTTDYTNYELLDEPLHSVPNTDMWECVDFYPVSLTNDSALDMAAYGSGIKHVIKESWEGHGMDWYSIGT
YDAINDKWTPDNPELDVGIGLRCDYGKFFASKSLYDPLKKRRITWAYVGESDSVDQDLSRGWATVYNVGRTIVLDRKTGT
HLLHWPVEEVESLRYNGQEFKEIELEPGSIIPLDIGTATQLDIVATFEVDQAALNATSETDDIYGCTTSLGAAQRGSLGP
FGLAVLADGTLSELTPVYFYIAKKADGGLSTHFCTDKLRSSLDYDGERVVYGSTVPVLDDEELTMRLLVDHSIVEGFAQG
GRTVITSRVYPTKAIYEQAKLFLFNNATGTSVKASLKIWQMASAQIHQYSF*
CDS seq >OTG21394
ATGCAAACCCCTGAACCCTTTACAGACCTTGATCATGAACCCCTACTGGACCACCACCACCCACCACAACAAACCACCAC
AAAACCTTTGTTCACCAGGGTTGTGTCCGGTGTCACCTTTGTTGTATTCTTCTTTGCTTTCGCTATCGTATTCATTGTTC
TCAACCAACAGAATTCTTCTGTTCATATCGACACCAATTCGGATAAATCTTTTATAAGGTATTCGAAGGCCGATCGCTTG
TCGTGGGAACGGACCGCTTTTCATTTTCAGCCTGCCAAGAATTTTATTTACGATCCAAATGGTCCGTTGTTTCACATGGG
CTGGTACCATTTGTTCTATCAATACAACCCATACGCACCGGTTTGGGGCAATATGTCATGGGGTCACTCAGTGTCCAAAG
ACATGATCAACTGGTACGAGCTGCCAGTCGCTATGGTCCCGACCGAATGGTATGATATCGAGGGCGTCTTATCCGGGTCT
ACCACGGTCCTTCCAAACGGGCAGATCTTTGCATTGTATACTGGGAACGCTAATGATTTTTCCCAATTACAATGCAAAGC
TGTGCCCGTAAACTTATCTGACCCGCTTCTTATTGAGTGGGTCAAGTATGACGATAACCCAATCCTGTACACTCCACCAG
GGATTGGGTTAATGGACTACCGGGACCCGTCAACAGTCTGGACAGGTCCCGATGGAAAGCATAGGATGATCATGGGATCT
AAACGTGGCAATACAGGCATGATACTCGTTTACCATACCACCGATTACACGAACTACGAGTTGTTGGATGAGCCGTTGCA
CTCCGTTCCCAACACCGATATGTGGGAATGCGTCGACTTTTACCCGGTTTCGTTAACCAATGATAGTGCACTTGATATGG
CGGCCTATGGGTCGGGTATCAAACACGTTATTAAAGAAAGTTGGGAGGGACATGGAATGGATTGGTATTCAATCGGGACA
TATGACGCGATAAATGATAAATGGACTCCCGATAACCCGGAACTAGATGTCGGTATCGGGTTACGGTGCGATTACGGGAA
GTTTTTTGCATCAAAGAGTCTTTATGACCCATTGAAGAAAAGGAGGATCACTTGGGCTTATGTTGGAGAATCAGATAGTG
TTGACCAGGACCTCTCTAGAGGATGGGCTACTGTTTATAATGTTGGAAGAACAATTGTACTAGATAGAAAAACCGGGACC
CATTTACTTCATTGGCCCGTTGAGGAGGTCGAGAGTTTGAGATACAACGGTCAGGAGTTTAAAGAGATCGAGCTAGAGCC
CGGTTCAATCATTCCACTCGACATAGGCACGGCTACACAGTTGGACATAGTTGCAACATTTGAGGTGGATCAAGCAGCGT
TGAACGCGACAAGTGAAACCGATGATATTTATGGTTGCACCACTAGCTTAGGTGCAGCCCAAAGGGGAAGTTTGGGACCA
TTTGGTCTTGCGGTTCTAGCCGATGGAACCCTTTCTGAGTTAACTCCGGTTTATTTCTACATTGCTAAAAAGGCCGATGG
AGGTTTGTCGACACATTTTTGTACCGATAAGCTAAGGTCATCACTGGATTATGATGGAGAGAGAGTGGTGTATGGGAGCA
CTGTTCCTGTGTTAGATGATGAAGAACTCACAATGAGGCTATTGGTGGATCATTCGATAGTAGAGGGGTTTGCGCAAGGA
GGAAGGACGGTTATAACATCAAGGGTGTATCCAACAAAAGCGATATACGAACAAGCGAAGTTGTTCTTGTTCAACAACGC
TACAGGTACGAGTGTGAAGGCATCTCTCAAGATTTGGCAAATGGCTTCTGCACAAATTCATCAATACTCGTTTTAA