
Microexon ID | Ha_17:67873460-67873468:- |
Species | Helianthus annuus |
Coordinates | 17:67873460..67873468 |
Microexon Cluster ID | MEP22 |
Size | 9 |
Phase | 1 |
Pfam Domain Motif | Glyco_hydro_32N |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 49,9,50 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq (degenerate, IUPAC ambiguity codes) | YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV |
Logo of Microexon-tag DNA Seq | [figure: sequence logo] |
Alignment of exons | [figure: exon alignment] |
Microexon DNA seq | ATCCTAACG |
Microexon Amino Acid seq | DPNG |
Microexon-tag DNA Seq | TGGCAGAGAACTGGTTTTCATTTTCAACCCGAGAAGAACTGGATGAATGATCCTAACGGTCCTGTTTTCTACAAAGGATGGTACCATTTGTTTTACCAATATCATCCG |
Microexon-tag Amino Acid Seq | WQRTGFHFQPEKNWMNDPNGPVFYKGWYHLFYQYHP |
Microexon-tag spanning region | 67871607-67873743 |
Microexon-tag prediction score | 0.957 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | OTF86433x |
Reference Transcript ID | OTF86433 |
Gene ID | HannXRQ_Chr17g0550731 |
Gene Name | NA |
Transcript ID | OTF86433 |
Protein ID | OTF86433 |
Gene ID | HannXRQ_Chr17g0550731 |
Gene Name | NA |
Pfam domain motif | Glyco_hydro_32N |
Motif E-value | 5.6e-106 |
Motif start | 126 |
Motif end | 444 |
Protein seq | >OTF86433 MASSSDLENPNASSALPYSYTPLPDGEQTAEPAVVLHPKKAVAFLVCAFLSVGFLVALIGNNGPLAPKNLNTNVAPSTVA TTAKTTPLSRGVDKGVSEKAFRPLLGADNSFPWSSNMLDWQRTGFHFQPEKNWMNDPNGPVFYKGWYHLFYQYHPDAPVW GKIVWGHAVSKDLINWRHLPIAMETDQWYDEQGVWTGSATILPDGQLVVLYTGSTNESVQVQNLAYPADPNDPLLINWVK YPGNPVLVPPPGIDDKDFRDPTTAWKTPEGKWRITIGSKINKTGISLVYDTEDFKTFELLDGVLHAVPGTGMWECVDFYP ISKQTENGLDTSVDGPGVKHIVKASMDDDRHDYYAIGTYDAYKGKWTPDNPTLDVGIGLRYDYGIYYASKTFYDQNKNRR VLWSWIKETDTEASDISKGWASLMGVPRTILLDKKTKSNIIQWPVEEITALRTDATVFKNLVLEAGALVPLNLPAASQLD IVAEFELDEATVQRLNGADVAYDCAQSGGAATRGALGPFGLSVLAHDGLVEHTPVYFYVAKGVDGNLKTFFCADQSRSST ATDVDKSIYGNIVPVLKGEKLSMRILVDHSIVESFAQEGRTCITSRVYPTKAIYNNARLFLFNNATAATVTASINVWQMK SADI* |
CDS seq | >OTF86433 ATGGCTTCTTCCTCAGATCTTGAAAACCCAAATGCCTCATCTGCCCTTCCTTACTCCTACACCCCATTGCCCGACGGCGA ACAAACCGCCGAACCCGCCGTTGTCCTCCACCCCAAAAAGGCAGTTGCCTTTCTAGTGTGTGCGTTTTTGTCTGTAGGTT TTCTAGTGGCCCTTATAGGAAACAACGGACCATTAGCTCCTAAGAACTTAAACACAAATGTTGCACCTTCAACGGTGGCC ACAACCGCGAAGACGACTCCATTGTCTCGTGGAGTGGATAAAGGTGTGTCCGAGAAAGCTTTCCGGCCTTTGTTGGGTGC GGATAATTCGTTTCCATGGAGCTCTAACATGTTGGATTGGCAGAGAACTGGTTTTCATTTTCAACCCGAGAAGAACTGGA TGAATGATCCTAACGGTCCTGTTTTCTACAAAGGATGGTACCATTTGTTTTACCAATATCATCCGGATGCTCCAGTGTGG GGAAAAATTGTTTGGGGTCATGCAGTCTCGAAAGATCTAATCAACTGGCGCCACCTTCCAATCGCGATGGAAACCGACCA ATGGTACGACGAGCAGGGTGTGTGGACAGGTTCGGCCACCATCCTTCCAGACGGTCAACTTGTCGTTCTCTACACTGGAT CCACCAACGAATCGGTCCAAGTTCAAAACCTCGCCTATCCAGCCGACCCAAATGACCCACTTTTAATCAACTGGGTCAAG TACCCTGGAAACCCGGTCCTTGTCCCACCACCCGGTATTGATGACAAGGACTTCCGTGACCCCACAACTGCGTGGAAGAC CCCAGAGGGAAAATGGCGAATCACTATTGGTTCAAAGATCAACAAAACCGGTATCTCTCTGGTTTACGACACCGAAGACT TTAAAACATTTGAGCTATTAGATGGAGTGCTCCATGCTGTCCCGGGCACAGGTATGTGGGAATGCGTTGACTTTTACCCT ATTTCGAAACAAACCGAAAATGGTCTTGATACATCCGTAGATGGACCGGGGGTCAAACATATAGTTAAAGCAAGCATGGA TGATGATAGGCACGACTACTACGCGATTGGTACTTATGACGCGTATAAGGGAAAATGGACACCGGATAATCCTACATTAG ACGTCGGGATCGGGTTGAGATACGATTACGGAATATACTATGCCTCCAAGACATTTTACGACCAAAACAAAAATAGAAGA GTTTTGTGGAGTTGGATCAAGGAGACCGATACTGAAGCCTCGGACATTAGTAAGGGTTGGGCTTCTCTCATGGGTGTTCC AAGAACAATTCTACTAGACAAAAAAACTAAAAGCAACATAATCCAATGGCCTGTTGAAGAAATCACCGCATTGCGAACCG ATGCAACAGTTTTCAAGAATCTAGTACTGGAAGCCGGCGCACTTGTACCACTGAACCTGCCTGCAGCATCCCAACTAGAC ATTGTTGCTGAGTTCGAACTTGATGAGGCGACCGTGCAACGACTAAATGGAGCTGATGTCGCATACGACTGTGCCCAAAG TGGTGGGGCAGCAACACGAGGCGCTTTAGGACCTTTCGGTCTTAGTGTCCTCGCACACGATGGCCTCGTTGAGCACACTC CTGTCTATTTCTACGTTGCCAAAGGCGTTGATGGAAATCTAAAAACTTTCTTTTGTGCAGACCAATCAAGATCATCCACT GCTACAGATGTCGATAAGTCGATCTATGGAAACATTGTCCCCGTACTGAAAGGCGAAAAACTATCAATGAGAATTTTGGT GGATCATTCGATCGTAGAAAGCTTTGCACAAGAAGGAAGAACATGTATTACTTCACGAGTTTATCCAACAAAGGCTATAT ACAATAATGCTCGGTTGTTCTTGTTCAACAATGCTACAGCAGCAACAGTTACTGCTTCAATCAATGTTTGGCAAATGAAG TCGGCCGATATTTAG |
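The size, phase, tag structure (49,9,50), and sequence fields of this record can be cross-checked against each other. The sketch below is a minimal consistency check, not the database's own pipeline. It assumes that "Phase" means the number of codon bases contributed by the upstream flanking exon (here 49 mod 3), and that the microexon amino acid sequence lists every residue whose codon overlaps the microexon. The names `split_tag` and `translate` are illustrative only.

```python
# Values copied from this record.
TAG_DNA = "TGGCAGAGAACTGGTTTTCATTTTCAACCCGAGAAGAACTGGATGAATGATCCTAACGGTCCTGTTTTCTACAAAGGATGGTACCATTTGTTTTACCAATATCATCCG"
STRUCTURE = (49, 9, 50)          # flanking exon, microexon, flanking exon
START, END = 67873460, 67873468  # genomic coordinates, inclusive

# Standard genetic code, enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 0, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(dna, structure):
    """Cut the tag into (upstream exon, microexon, downstream exon)."""
    up, micro, down = structure
    if up + micro + down != len(dna):
        raise ValueError("structure does not add up to the tag length")
    return dna[:up], dna[up:up + micro], dna[up + micro:]


upstream, microexon, downstream = split_tag(TAG_DNA, STRUCTURE)
tag_aa = translate(TAG_DNA)

size = END - START + 1        # inclusive coordinate span
phase = len(upstream) % 3     # assumed definition of "Phase"

# Residues whose codons overlap the microexon (0-based codon indices).
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
microexon_aa = tag_aa[first_codon:last_codon + 1]

print(microexon, size, phase)  # ATCCTAACG 9 1
print(microexon_aa)            # DPNG
print(tag_aa)                  # WQRTGFHFQPEKNWMNDPNGPVFYKGWYHLFYQYHP
```

Under these assumptions the nine microexon bases overlap four codons, which is why a 9 nt exon in phase 1 is reported with a four-residue amino acid sequence.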
Transcript ID | OTF86433 |
Gene ID | Ha.30987 |
Gene Name | NA |
Pfam domain motif | Glyco_hydro_32N |
Motif E-value | 5.6e-106 |
Motif start | 126 |
Motif end | 444 |
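The motif start and end fields can be compared with the position of the microexon-tag in the protein. The following sketch assumes that "Motif start" and "Motif end" are 1-based, inclusive protein coordinates, which the record does not state explicitly. It uses only the first 160 residues of the OTF86433 protein sequence listed in this record, and the helper `locate` is a hypothetical name introduced for illustration.

```python
# First 160 residues of the OTF86433 protein sequence from this record.
PROTEIN_PREFIX = (
    "MASSSDLENPNASSALPYSYTPLPDGEQTAEPAVVLHPKKAVAFLVCAFLSVGFLVALIGNNGPLAPKNLNTNVAPSTVA"
    "TTAKTTPLSRGVDKGVSEKAFRPLLGADNSFPWSSNMLDWQRTGFHFQPEKNWMNDPNGPVFYKGWYHLFYQYHPDAPVW"
)
TAG_AA = "WQRTGFHFQPEKNWMNDPNGPVFYKGWYHLFYQYHP"
MICROEXON_AA = "DPNG"
MOTIF_START, MOTIF_END = 126, 444  # Glyco_hydro_32N, assumed 1-based inclusive


def locate(sub, seq):
    """Return the 1-based inclusive (start, end) of the first match of sub in seq."""
    idx = seq.find(sub)
    if idx < 0:
        raise ValueError("subsequence not found")
    return idx + 1, idx + len(sub)


tag_start, tag_end = locate(TAG_AA, PROTEIN_PREFIX)

# Position of the microexon-encoded residues, taken relative to the tag so
# that an unrelated DPNG elsewhere in the protein cannot be picked up.
offset = TAG_AA.index(MICROEXON_AA)
me_start = tag_start + offset
me_end = me_start + len(MICROEXON_AA) - 1

inside_motif = MOTIF_START <= me_start and me_end <= MOTIF_END

print(tag_start, tag_end)  # 120 155
print(me_start, me_end)    # 136 139
print(inside_motif)        # True
```

Under that coordinate assumption, the tag starts six residues upstream of the reported motif start, while the four microexon-encoded residues fall inside the Glyco_hydro_32N match.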