
Microexon ID | Ha_13:133081036-133081044:+ |
Species | Helianthus annuus | Coordinates | 13:133081036..133081044 |
Microexon Cluster ID | MEP22 |
Size | 9 |
Phase | 1 |
Pfam Domain Motif | Glyco_hydro_32N |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 49,9,50 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq | YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Microexon DNA seq | ATCCTAATG |
Microexon Amino Acid seq | DPNG |
Microexon-tag DNA Seq | CCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATTTCAATGGAGTATACCATCTGTTCTACCAGTACAATCCT |
Microexon-tag Amino Acid Seq | PYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNP |
Microexon-tag spanning region | 133080781-133081178 |
Microexon-tag prediction score | 0.9659 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | OTG02162x |
Reference Transcript ID | OTG02162 |
Gene ID | HannXRQ_Chr13g0409961 |
Gene Name | INV1 |
Transcript ID | OTG02162 |
Protein ID | OTG02162 |
Gene ID | HannXRQ_Chr13g0409961 |
Gene Name | INV1 |
Pfam domain motif | Glyco_hydro_32N |
Motif E-value | 1.4e-96 |
Motif start | 37 |
Motif end | 354 |
Protein seq | >OTG02162 MVKEMAGWVLSFCILLVVNGVGVHASEDLQPYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNPGGPLWGNISWGHSI SHDLVNWFILEPALSPKEPYDIGGCFTGSSTILHGSKPIILYTAQDVDGAQVQNLALPKNRSDPLLKDWIKWSGNPILTP VNDINTSQFRDPSTAWMGPDGKWRIVIGSEIIKGQATALLYYSTDGFNWTRSDKPLKFSRETNMWECPDFYPVSNTGKDG IDTSFQGNNTMHVLKVSFDSHDYYVIGMYDPQMDQFLLATSDFNVSNTQLQYDYGRFYASKSFYDGAKKRRVLWGWVNEG DNPSDAFKKGWSGLQSFPRSVWLSDTRKQLVQWPVEEIKKLRAKQVNMESRELKGGSLLEVPGISGSQADIEVVFSLSNL SDLELINSDMSDPQHLCDQKNVSTSGSYGPFGVLVFASQNLTEQTAVFFRVFKGPNKFQVLMCSDQSRSSIAQGVDKSTY GAFLDLDPLHDKISLRSLVDHSIVESFGGEGLACITARVYPKLAIHEHAKLYVFNNGTKSVTMLSLNAWNMNKAQIVPMD * |
CDS seq | >OTG02162 ATGGTAAAGGAGATGGCTGGCTGGGTTCTTTCTTTTTGCATCCTTTTGGTTGTTAATGGTGTCGGAGTTCATGCATCAGA AGATTTGCAGCCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATT TCAATGGAGTATACCATCTGTTCTACCAGTACAATCCTGGGGGTCCACTATGGGGCAACATTTCATGGGGTCATTCCATT TCACATGATCTTGTGAACTGGTTTATTCTTGAACCTGCTCTCAGTCCAAAAGAACCCTACGACATTGGCGGCTGCTTCAC AGGTTCAAGCACAATCCTACACGGTTCAAAACCGATAATTCTCTATACCGCTCAAGACGTAGATGGCGCCCAGGTTCAAA ACCTGGCCCTCCCCAAAAACCGCTCTGACCCCCTTCTAAAAGACTGGATCAAATGGTCAGGTAACCCTATTCTGACCCCT GTCAATGACATCAACACGTCCCAATTCCGTGACCCTTCAACGGCTTGGATGGGTCCAGATGGAAAATGGAGGATTGTGAT TGGAAGTGAGATCATCAAGGGCCAGGCAACCGCGCTTTTGTATTATAGTACAGACGGTTTTAACTGGACCAGGTCGGATA AACCTTTGAAGTTTTCGAGAGAAACAAATATGTGGGAATGTCCTGACTTCTATCCGGTTAGTAATACTGGCAAAGATGGT ATTGATACATCTTTTCAAGGGAATAACACAATGCATGTGCTAAAAGTAAGCTTCGATAGTCATGATTATTATGTCATCGG GATGTATGATCCGCAAATGGATCAGTTTCTCCTTGCTACTAGTGATTTTAACGTTAGCAACACACAACTTCAATACGATT ATGGGAGGTTTTATGCATCAAAGTCATTCTATGATGGTGCGAAAAAAAGAAGGGTGTTATGGGGATGGGTTAATGAAGGT GATAATCCCTCGGATGCCTTCAAGAAAGGGTGGTCTGGCCTTCAGTCGTTTCCAAGGAGCGTTTGGCTTAGCGATACTCG AAAACAGTTAGTACAGTGGCCAGTGGAGGAAATCAAGAAGCTAAGAGCAAAACAAGTGAATATGGAGAGTAGAGAACTGA AGGGTGGGTCCCTCCTTGAAGTACCAGGCATAAGTGGTTCACAGGCGGATATAGAAGTCGTTTTTAGTCTATCGAATCTG AGTGATTTAGAGCTAATAAATTCAGATATGTCTGATCCACAACATCTTTGTGACCAAAAGAATGTTTCTACGAGTGGGAG TTATGGTCCATTCGGCGTGCTAGTTTTCGCTTCACAGAACTTGACAGAACAAACCGCGGTTTTCTTTCGAGTATTTAAAG GTCCTAACAAATTTCAAGTGCTCATGTGCAGTGATCAAAGCAGGTCCTCAATTGCACAAGGAGTGGACAAAAGCACATAT GGAGCTTTCCTTGATTTGGACCCTCTTCATGATAAGATCTCACTAAGAAGCTTGGTAGATCATTCGATTGTCGAAAGTTT TGGTGGAGAAGGACTTGCTTGTATCACCGCTAGAGTATATCCAAAGCTAGCGATTCATGAACACGCCAAGCTCTATGTAT TCAATAATGGCACAAAGAGTGTTACTATGTTGAGTCTAAATGCTTGGAACATGAACAAGGCTCAAATAGTTCCTATGGAT TGA |
Microexon DNA seq | ATCCTAATG |
Microexon Amino Acid seq | DPNG |
Microexon-tag DNA Seq | CCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATTTCAATGGAGTATACCATCTGTTCTACCAGTACAATCCT |
Microexon-tag Amino Acid seq | PYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNP |
Transcript ID | OTG02162 |
Gene ID | Ha.16650 |
Gene Name | INV1 |
Pfam domain motif | Glyco_hydro_32N |
Motif E-value | 1.4e-96 |
Motif start | 37 |
Motif end | 354 |
Protein seq | >OTG02162 MVKEMAGWVLSFCILLVVNGVGVHASEDLQPYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNPGGPLWGNISWGHSI SHDLVNWFILEPALSPKEPYDIGGCFTGSSTILHGSKPIILYTAQDVDGAQVQNLALPKNRSDPLLKDWIKWSGNPILTP VNDINTSQFRDPSTAWMGPDGKWRIVIGSEIIKGQATALLYYSTDGFNWTRSDKPLKFSRETNMWECPDFYPVSNTGKDG IDTSFQGNNTMHVLKVSFDSHDYYVIGMYDPQMDQFLLATSDFNVSNTQLQYDYGRFYASKSFYDGAKKRRVLWGWVNEG DNPSDAFKKGWSGLQSFPRSVWLSDTRKQLVQWPVEEIKKLRAKQVNMESRELKGGSLLEVPGISGSQADIEVVFSLSNL SDLELINSDMSDPQHLCDQKNVSTSGSYGPFGVLVFASQNLTEQTAVFFRVFKGPNKFQVLMCSDQSRSSIAQGVDKSTY GAFLDLDPLHDKISLRSLVDHSIVESFGGEGLACITARVYPKLAIHEHAKLYVFNNGTKSVTMLSLNAWNMNKAQIVPMD * |
CDS seq | >OTG02162 ATGGTAAAGGAGATGGCTGGCTGGGTTCTTTCTTTTTGCATCCTTTTGGTTGTTAATGGTGTCGGAGTTCATGCATCAGA AGATTTGCAGCCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATT TCAATGGAGTATACCATCTGTTCTACCAGTACAATCCTGGGGGTCCACTATGGGGCAACATTTCATGGGGTCATTCCATT TCACATGATCTTGTGAACTGGTTTATTCTTGAACCTGCTCTCAGTCCAAAAGAACCCTACGACATTGGCGGCTGCTTCAC AGGTTCAAGCACAATCCTACACGGTTCAAAACCGATAATTCTCTATACCGCTCAAGACGTAGATGGCGCCCAGGTTCAAA ACCTGGCCCTCCCCAAAAACCGCTCTGACCCCCTTCTAAAAGACTGGATCAAATGGTCAGGTAACCCTATTCTGACCCCT GTCAATGACATCAACACGTCCCAATTCCGTGACCCTTCAACGGCTTGGATGGGTCCAGATGGAAAATGGAGGATTGTGAT TGGAAGTGAGATCATCAAGGGCCAGGCAACCGCGCTTTTGTATTATAGTACAGACGGTTTTAACTGGACCAGGTCGGATA AACCTTTGAAGTTTTCGAGAGAAACAAATATGTGGGAATGTCCTGACTTCTATCCGGTTAGTAATACTGGCAAAGATGGT ATTGATACATCTTTTCAAGGGAATAACACAATGCATGTGCTAAAAGTAAGCTTCGATAGTCATGATTATTATGTCATCGG GATGTATGATCCGCAAATGGATCAGTTTCTCCTTGCTACTAGTGATTTTAACGTTAGCAACACACAACTTCAATACGATT ATGGGAGGTTTTATGCATCAAAGTCATTCTATGATGGTGCGAAAAAAAGAAGGGTGTTATGGGGATGGGTTAATGAAGGT GATAATCCCTCGGATGCCTTCAAGAAAGGGTGGTCTGGCCTTCAGTCGTTTCCAAGGAGCGTTTGGCTTAGCGATACTCG AAAACAGTTAGTACAGTGGCCAGTGGAGGAAATCAAGAAGCTAAGAGCAAAACAAGTGAATATGGAGAGTAGAGAACTGA AGGGTGGGTCCCTCCTTGAAGTACCAGGCATAAGTGGTTCACAGGCGGATATAGAAGTCGTTTTTAGTCTATCGAATCTG AGTGATTTAGAGCTAATAAATTCAGATATGTCTGATCCACAACATCTTTGTGACCAAAAGAATGTTTCTACGAGTGGGAG TTATGGTCCATTCGGCGTGCTAGTTTTCGCTTCACAGAACTTGACAGAACAAACCGCGGTTTTCTTTCGAGTATTTAAAG GTCCTAACAAATTTCAAGTGCTCATGTGCAGTGATCAAAGCAGGTCCTCAATTGCACAAGGAGTGGACAAAAGCACATAT GGAGCTTTCCTTGATTTGGACCCTCTTCATGATAAGATCTCACTAAGAAGCTTGGTAGATCATTCGATTGTCGAAAGTTT TGGTGGAGAAGGACTTGCTTGTATCACCGCTAGAGTATATCCAAAGCTAGCGATTCATGAACACGCCAAGCTCTATGTAT TCAATAATGGCACAAAGAGTGTTACTATGTTGAGTCTAAATGCTTGGAACATGAACAAGGCTCAAATAGTTCCTATGGAT TGA |