Microexon ID Ha_17:67873460-67873468:-
Species Helianthus annuus
Coordinates 17:67873460..67873468
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGAGAACTGGTTTTCATTTTCAACCCGAGAAGAACTGGATGAATGATCCTAACGGTCCTGTTTTCTACAAAGGATGGTACCATTTGTTTTACCAATATCATCCG
Microexon-tag Amino Acid Seq WQRTGFHFQPEKNWMNDPNGPVFYKGWYHLFYQYHP
Microexon-tag spanning region 67871607-67873743
Microexon-tag prediction score 0.957
Overlapped with the annotated transcript (%) 100
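The fields above are mutually consistent, and the relationship between them can be checked with a short script. The following is a minimal sketch, not the pipeline that produced this record. It splits the microexon-tag by the listed exon sizes (49,9,50), translates it with the standard genetic code, and recovers the residues whose codons overlap the microexon. It assumes that the tag starts in frame and that "Phase" counts the codon bases contributed by the upstream flanking exon. The helper names (`translate`, `split_tag`) are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    if pos != len(tag):
        raise ValueError("exon sizes do not add up to the tag length")
    return parts


# Values copied from the record above.
TAG = ("TGGCAGAGAACTGGTTTTCATTTTCAACCCGAGAAGAACTGGATGAATGATCCTAACGGTCCTGTTTTC"
       "TACAAAGGATGGTACCATTTGTTTTACCAATATCATCCG")
SIZES = (49, 9, 50)
COORD_START, COORD_END = 67873460, 67873468

upstream, microexon, downstream = split_tag(TAG, SIZES)
peptide = translate(TAG)

# Assumption: phase = number of codon bases that lie in the upstream exon.
phase = len(upstream) % 3

# Residues whose codons overlap the microexon (0-based, end-exclusive span).
start = len(upstream)
end = start + len(microexon)
micro_peptide = peptide[start // 3:(end - 1) // 3 + 1]

# Genomic coordinates are inclusive, so the size is end - start + 1.
coord_size = COORD_END - COORD_START + 1

print(microexon, phase, micro_peptide, coord_size)
print(peptide)
```

Because the microexon begins one base into a codon, its 9 nt overlap four codons rather than three, which is why the microexon amino acid sequence lists four residues.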
New Transcript ID OTF86433x
Reference Transcript ID OTF86433
Gene ID HannXRQ_Chr17g0550731
Gene Name NA
Transcript ID OTF86433
Protein ID OTF86433
Gene ID HannXRQ_Chr17g0550731
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 5.6e-106
Motif start 126
Motif end 444
Protein seq >OTF86433
MASSSDLENPNASSALPYSYTPLPDGEQTAEPAVVLHPKKAVAFLVCAFLSVGFLVALIGNNGPLAPKNLNTNVAPSTVA
TTAKTTPLSRGVDKGVSEKAFRPLLGADNSFPWSSNMLDWQRTGFHFQPEKNWMNDPNGPVFYKGWYHLFYQYHPDAPVW
GKIVWGHAVSKDLINWRHLPIAMETDQWYDEQGVWTGSATILPDGQLVVLYTGSTNESVQVQNLAYPADPNDPLLINWVK
YPGNPVLVPPPGIDDKDFRDPTTAWKTPEGKWRITIGSKINKTGISLVYDTEDFKTFELLDGVLHAVPGTGMWECVDFYP
ISKQTENGLDTSVDGPGVKHIVKASMDDDRHDYYAIGTYDAYKGKWTPDNPTLDVGIGLRYDYGIYYASKTFYDQNKNRR
VLWSWIKETDTEASDISKGWASLMGVPRTILLDKKTKSNIIQWPVEEITALRTDATVFKNLVLEAGALVPLNLPAASQLD
IVAEFELDEATVQRLNGADVAYDCAQSGGAATRGALGPFGLSVLAHDGLVEHTPVYFYVAKGVDGNLKTFFCADQSRSST
ATDVDKSIYGNIVPVLKGEKLSMRILVDHSIVESFAQEGRTCITSRVYPTKAIYNNARLFLFNNATAATVTASINVWQMK
SADI*
CDS seq >OTF86433
ATGGCTTCTTCCTCAGATCTTGAAAACCCAAATGCCTCATCTGCCCTTCCTTACTCCTACACCCCATTGCCCGACGGCGA
ACAAACCGCCGAACCCGCCGTTGTCCTCCACCCCAAAAAGGCAGTTGCCTTTCTAGTGTGTGCGTTTTTGTCTGTAGGTT
TTCTAGTGGCCCTTATAGGAAACAACGGACCATTAGCTCCTAAGAACTTAAACACAAATGTTGCACCTTCAACGGTGGCC
ACAACCGCGAAGACGACTCCATTGTCTCGTGGAGTGGATAAAGGTGTGTCCGAGAAAGCTTTCCGGCCTTTGTTGGGTGC
GGATAATTCGTTTCCATGGAGCTCTAACATGTTGGATTGGCAGAGAACTGGTTTTCATTTTCAACCCGAGAAGAACTGGA
TGAATGATCCTAACGGTCCTGTTTTCTACAAAGGATGGTACCATTTGTTTTACCAATATCATCCGGATGCTCCAGTGTGG
GGAAAAATTGTTTGGGGTCATGCAGTCTCGAAAGATCTAATCAACTGGCGCCACCTTCCAATCGCGATGGAAACCGACCA
ATGGTACGACGAGCAGGGTGTGTGGACAGGTTCGGCCACCATCCTTCCAGACGGTCAACTTGTCGTTCTCTACACTGGAT
CCACCAACGAATCGGTCCAAGTTCAAAACCTCGCCTATCCAGCCGACCCAAATGACCCACTTTTAATCAACTGGGTCAAG
TACCCTGGAAACCCGGTCCTTGTCCCACCACCCGGTATTGATGACAAGGACTTCCGTGACCCCACAACTGCGTGGAAGAC
CCCAGAGGGAAAATGGCGAATCACTATTGGTTCAAAGATCAACAAAACCGGTATCTCTCTGGTTTACGACACCGAAGACT
TTAAAACATTTGAGCTATTAGATGGAGTGCTCCATGCTGTCCCGGGCACAGGTATGTGGGAATGCGTTGACTTTTACCCT
ATTTCGAAACAAACCGAAAATGGTCTTGATACATCCGTAGATGGACCGGGGGTCAAACATATAGTTAAAGCAAGCATGGA
TGATGATAGGCACGACTACTACGCGATTGGTACTTATGACGCGTATAAGGGAAAATGGACACCGGATAATCCTACATTAG
ACGTCGGGATCGGGTTGAGATACGATTACGGAATATACTATGCCTCCAAGACATTTTACGACCAAAACAAAAATAGAAGA
GTTTTGTGGAGTTGGATCAAGGAGACCGATACTGAAGCCTCGGACATTAGTAAGGGTTGGGCTTCTCTCATGGGTGTTCC
AAGAACAATTCTACTAGACAAAAAAACTAAAAGCAACATAATCCAATGGCCTGTTGAAGAAATCACCGCATTGCGAACCG
ATGCAACAGTTTTCAAGAATCTAGTACTGGAAGCCGGCGCACTTGTACCACTGAACCTGCCTGCAGCATCCCAACTAGAC
ATTGTTGCTGAGTTCGAACTTGATGAGGCGACCGTGCAACGACTAAATGGAGCTGATGTCGCATACGACTGTGCCCAAAG
TGGTGGGGCAGCAACACGAGGCGCTTTAGGACCTTTCGGTCTTAGTGTCCTCGCACACGATGGCCTCGTTGAGCACACTC
CTGTCTATTTCTACGTTGCCAAAGGCGTTGATGGAAATCTAAAAACTTTCTTTTGTGCAGACCAATCAAGATCATCCACT
GCTACAGATGTCGATAAGTCGATCTATGGAAACATTGTCCCCGTACTGAAAGGCGAAAAACTATCAATGAGAATTTTGGT
GGATCATTCGATCGTAGAAAGCTTTGCACAAGAAGGAAGAACATGTATTACTTCACGAGTTTATCCAACAAAGGCTATAT
ACAATAATGCTCGGTTGTTCTTGTTCAACAATGCTACAGCAGCAACAGTTACTGCTTCAATCAATGTTTGGCAAATGAAG
TCGGCCGATATTTAG
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGAGAACTGGTTTTCATTTTCAACCCGAGAAGAACTGGATGAATGATCCTAACGGTCCTGTTTTCTACAAAGGATGGTACCATTTGTTTTACCAATATCATCCG
Microexon-tag Amino Acid seq WQRTGFHFQPEKNWMNDPNGPVFYKGWYHLFYQYHP
Transcript ID OTF86433
Gene ID Ha.30987
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 5.6e-106
Motif start 126
Motif end 444