| Microexon ID | Ha_13:133081036-133081044:+ |
| Species | Helianthus annuus | Coordinates | 13:133081036..133081044 |
| Microexon Cluster ID | MEP22 |
| Size | 9 |
| Phase | 1 |
| Pfam Domain Motif | Glyco_hydro_32N |
| Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 49,9,50 |
| Microexon location in the Microexon-tag | 2 |
| Microexon-tag DNA Seq | YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV |
| Logo of Microexon-tag DNA Seq | ![]() |
| Alignment of exons | ![]() |
| Microexon DNA seq | ATCCTAATG |
| Microexon Amino Acid seq | DPNG |
| Microexon-tag DNA Seq | CCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATTTCAATGGAGTATACCATCTGTTCTACCAGTACAATCCT |
| Microexon-tag Amino Acid Seq | PYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNP |
| Microexon-tag spanning region | 133080781-133081178 |
| Microexon-tag prediction score | 0.9659 |
| Overlapped with the annotated transcript (%) | 100 |
| New Transcript ID | OTG02162x |
| Reference Transcript ID | OTG02162 |
| Gene ID | HannXRQ_Chr13g0409961 |
| Gene Name | INV1 |
| Transcript ID | OTG02162 |
| Protein ID | OTG02162 |
| Gene ID | HannXRQ_Chr13g0409961 |
| Gene Name | INV1 |
| Pfam domain motif | Glyco_hydro_32N |
| Motif E-value | 1.4e-96 |
| Motif start | 37 |
| Motif end | 354 |
| Protein seq | >OTG02162 MVKEMAGWVLSFCILLVVNGVGVHASEDLQPYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNPGGPLWGNISWGHSI SHDLVNWFILEPALSPKEPYDIGGCFTGSSTILHGSKPIILYTAQDVDGAQVQNLALPKNRSDPLLKDWIKWSGNPILTP VNDINTSQFRDPSTAWMGPDGKWRIVIGSEIIKGQATALLYYSTDGFNWTRSDKPLKFSRETNMWECPDFYPVSNTGKDG IDTSFQGNNTMHVLKVSFDSHDYYVIGMYDPQMDQFLLATSDFNVSNTQLQYDYGRFYASKSFYDGAKKRRVLWGWVNEG DNPSDAFKKGWSGLQSFPRSVWLSDTRKQLVQWPVEEIKKLRAKQVNMESRELKGGSLLEVPGISGSQADIEVVFSLSNL SDLELINSDMSDPQHLCDQKNVSTSGSYGPFGVLVFASQNLTEQTAVFFRVFKGPNKFQVLMCSDQSRSSIAQGVDKSTY GAFLDLDPLHDKISLRSLVDHSIVESFGGEGLACITARVYPKLAIHEHAKLYVFNNGTKSVTMLSLNAWNMNKAQIVPMD * |
| CDS seq | >OTG02162 ATGGTAAAGGAGATGGCTGGCTGGGTTCTTTCTTTTTGCATCCTTTTGGTTGTTAATGGTGTCGGAGTTCATGCATCAGA AGATTTGCAGCCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATT TCAATGGAGTATACCATCTGTTCTACCAGTACAATCCTGGGGGTCCACTATGGGGCAACATTTCATGGGGTCATTCCATT TCACATGATCTTGTGAACTGGTTTATTCTTGAACCTGCTCTCAGTCCAAAAGAACCCTACGACATTGGCGGCTGCTTCAC AGGTTCAAGCACAATCCTACACGGTTCAAAACCGATAATTCTCTATACCGCTCAAGACGTAGATGGCGCCCAGGTTCAAA ACCTGGCCCTCCCCAAAAACCGCTCTGACCCCCTTCTAAAAGACTGGATCAAATGGTCAGGTAACCCTATTCTGACCCCT GTCAATGACATCAACACGTCCCAATTCCGTGACCCTTCAACGGCTTGGATGGGTCCAGATGGAAAATGGAGGATTGTGAT TGGAAGTGAGATCATCAAGGGCCAGGCAACCGCGCTTTTGTATTATAGTACAGACGGTTTTAACTGGACCAGGTCGGATA AACCTTTGAAGTTTTCGAGAGAAACAAATATGTGGGAATGTCCTGACTTCTATCCGGTTAGTAATACTGGCAAAGATGGT ATTGATACATCTTTTCAAGGGAATAACACAATGCATGTGCTAAAAGTAAGCTTCGATAGTCATGATTATTATGTCATCGG GATGTATGATCCGCAAATGGATCAGTTTCTCCTTGCTACTAGTGATTTTAACGTTAGCAACACACAACTTCAATACGATT ATGGGAGGTTTTATGCATCAAAGTCATTCTATGATGGTGCGAAAAAAAGAAGGGTGTTATGGGGATGGGTTAATGAAGGT GATAATCCCTCGGATGCCTTCAAGAAAGGGTGGTCTGGCCTTCAGTCGTTTCCAAGGAGCGTTTGGCTTAGCGATACTCG AAAACAGTTAGTACAGTGGCCAGTGGAGGAAATCAAGAAGCTAAGAGCAAAACAAGTGAATATGGAGAGTAGAGAACTGA AGGGTGGGTCCCTCCTTGAAGTACCAGGCATAAGTGGTTCACAGGCGGATATAGAAGTCGTTTTTAGTCTATCGAATCTG AGTGATTTAGAGCTAATAAATTCAGATATGTCTGATCCACAACATCTTTGTGACCAAAAGAATGTTTCTACGAGTGGGAG TTATGGTCCATTCGGCGTGCTAGTTTTCGCTTCACAGAACTTGACAGAACAAACCGCGGTTTTCTTTCGAGTATTTAAAG GTCCTAACAAATTTCAAGTGCTCATGTGCAGTGATCAAAGCAGGTCCTCAATTGCACAAGGAGTGGACAAAAGCACATAT GGAGCTTTCCTTGATTTGGACCCTCTTCATGATAAGATCTCACTAAGAAGCTTGGTAGATCATTCGATTGTCGAAAGTTT TGGTGGAGAAGGACTTGCTTGTATCACCGCTAGAGTATATCCAAAGCTAGCGATTCATGAACACGCCAAGCTCTATGTAT TCAATAATGGCACAAAGAGTGTTACTATGTTGAGTCTAAATGCTTGGAACATGAACAAGGCTCAAATAGTTCCTATGGAT TGA |
| Microexon DNA seq | ATCCTAATG |
| Microexon Amino Acid seq | DPNG |
| Microexon-tag DNA Seq | CCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATTTCAATGGAGTATACCATCTGTTCTACCAGTACAATCCT |
| Microexon-tag Amino Acid seq | PYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNP |
| Transcript ID | OTG02162 |
| Gene ID | Ha.16650 |
| Gene Name | INV1 |
| Pfam domain motif | Glyco_hydro_32N |
| Motif E-value | 1.4e-96 |
| Motif start | 37 |
| Motif end | 354 |
| Protein seq | >OTG02162 MVKEMAGWVLSFCILLVVNGVGVHASEDLQPYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNPGGPLWGNISWGHSI SHDLVNWFILEPALSPKEPYDIGGCFTGSSTILHGSKPIILYTAQDVDGAQVQNLALPKNRSDPLLKDWIKWSGNPILTP VNDINTSQFRDPSTAWMGPDGKWRIVIGSEIIKGQATALLYYSTDGFNWTRSDKPLKFSRETNMWECPDFYPVSNTGKDG IDTSFQGNNTMHVLKVSFDSHDYYVIGMYDPQMDQFLLATSDFNVSNTQLQYDYGRFYASKSFYDGAKKRRVLWGWVNEG DNPSDAFKKGWSGLQSFPRSVWLSDTRKQLVQWPVEEIKKLRAKQVNMESRELKGGSLLEVPGISGSQADIEVVFSLSNL SDLELINSDMSDPQHLCDQKNVSTSGSYGPFGVLVFASQNLTEQTAVFFRVFKGPNKFQVLMCSDQSRSSIAQGVDKSTY GAFLDLDPLHDKISLRSLVDHSIVESFGGEGLACITARVYPKLAIHEHAKLYVFNNGTKSVTMLSLNAWNMNKAQIVPMD * |
| CDS seq | >OTG02162 ATGGTAAAGGAGATGGCTGGCTGGGTTCTTTCTTTTTGCATCCTTTTGGTTGTTAATGGTGTCGGAGTTCATGCATCAGA AGATTTGCAGCCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATT TCAATGGAGTATACCATCTGTTCTACCAGTACAATCCTGGGGGTCCACTATGGGGCAACATTTCATGGGGTCATTCCATT TCACATGATCTTGTGAACTGGTTTATTCTTGAACCTGCTCTCAGTCCAAAAGAACCCTACGACATTGGCGGCTGCTTCAC AGGTTCAAGCACAATCCTACACGGTTCAAAACCGATAATTCTCTATACCGCTCAAGACGTAGATGGCGCCCAGGTTCAAA ACCTGGCCCTCCCCAAAAACCGCTCTGACCCCCTTCTAAAAGACTGGATCAAATGGTCAGGTAACCCTATTCTGACCCCT GTCAATGACATCAACACGTCCCAATTCCGTGACCCTTCAACGGCTTGGATGGGTCCAGATGGAAAATGGAGGATTGTGAT TGGAAGTGAGATCATCAAGGGCCAGGCAACCGCGCTTTTGTATTATAGTACAGACGGTTTTAACTGGACCAGGTCGGATA AACCTTTGAAGTTTTCGAGAGAAACAAATATGTGGGAATGTCCTGACTTCTATCCGGTTAGTAATACTGGCAAAGATGGT ATTGATACATCTTTTCAAGGGAATAACACAATGCATGTGCTAAAAGTAAGCTTCGATAGTCATGATTATTATGTCATCGG GATGTATGATCCGCAAATGGATCAGTTTCTCCTTGCTACTAGTGATTTTAACGTTAGCAACACACAACTTCAATACGATT ATGGGAGGTTTTATGCATCAAAGTCATTCTATGATGGTGCGAAAAAAAGAAGGGTGTTATGGGGATGGGTTAATGAAGGT GATAATCCCTCGGATGCCTTCAAGAAAGGGTGGTCTGGCCTTCAGTCGTTTCCAAGGAGCGTTTGGCTTAGCGATACTCG AAAACAGTTAGTACAGTGGCCAGTGGAGGAAATCAAGAAGCTAAGAGCAAAACAAGTGAATATGGAGAGTAGAGAACTGA AGGGTGGGTCCCTCCTTGAAGTACCAGGCATAAGTGGTTCACAGGCGGATATAGAAGTCGTTTTTAGTCTATCGAATCTG AGTGATTTAGAGCTAATAAATTCAGATATGTCTGATCCACAACATCTTTGTGACCAAAAGAATGTTTCTACGAGTGGGAG TTATGGTCCATTCGGCGTGCTAGTTTTCGCTTCACAGAACTTGACAGAACAAACCGCGGTTTTCTTTCGAGTATTTAAAG GTCCTAACAAATTTCAAGTGCTCATGTGCAGTGATCAAAGCAGGTCCTCAATTGCACAAGGAGTGGACAAAAGCACATAT GGAGCTTTCCTTGATTTGGACCCTCTTCATGATAAGATCTCACTAAGAAGCTTGGTAGATCATTCGATTGTCGAAAGTTT TGGTGGAGAAGGACTTGCTTGTATCACCGCTAGAGTATATCCAAAGCTAGCGATTCATGAACACGCCAAGCTCTATGTAT TCAATAATGGCACAAAGAGTGTTACTATGTTGAGTCTAAATGCTTGGAACATGAACAAGGCTCAAATAGTTCCTATGGAT TGA |

