Microexon ID Ha_13:133081036-133081044:+
Species Helianthus annuus
Coordinates 13:133081036..133081044
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATTTCAATGGAGTATACCATCTGTTCTACCAGTACAATCCT
Microexon-tag Amino Acid Seq PYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNP
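
The fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record: it splits the 108-nt microexon-tag by the listed structure (49,9,50), translates it with the standard genetic code, and reads off the residues whose codons overlap the microexon. The helper names (`split_tag`, `translate`) are hypothetical, and the assumption that "Phase 1" means the microexon begins one nucleotide into a codon is an interpretation, not something the record states.

```python
from itertools import product

# Microexon-tag DNA sequence and structure, copied from the record above.
TAG = ("CCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATG"
       "ATCCTAATG"
       "GTCCAATGTATTTCAATGGAGTATACCATCTGTTCTACCAGTACAATCCT")
STRUCTURE = (49, 9, 50)  # flanking exon, microexon, flanking exon sizes

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def split_tag(tag, sizes):
    """Cut the tag into consecutive segments of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    return parts


def translate(dna):
    """Translate complete codons in frame 0; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


upstream, microexon, downstream = split_tag(TAG, STRUCTURE)
tag_protein = translate(TAG)

# Offset of the microexon within the tag, and its position inside a codon
# (assumed here to correspond to the record's "Phase" field).
offset = len(upstream)
codon_offset = offset % 3

# Residues whose codons overlap the microexon, including split codons.
first_codon = offset // 3
last_codon = (offset + len(microexon) - 1) // 3
microexon_peptide = tag_protein[first_codon:last_codon + 1]

print(microexon, codon_offset, microexon_peptide)
print(tag_protein)
```

Under these assumptions, the four residues reported for the 9-nt microexon arise because its first two and last one nucleotides complete codons shared with the flanking exons.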
Microexon-tag spanning region 133080781-133081178
Microexon-tag prediction score 0.9659
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG02162x
Reference Transcript ID OTG02162
Gene ID HannXRQ_Chr13g0409961
Gene Name INV1
Transcript ID OTG02162
Protein ID OTG02162
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.4e-96
Motif start 37
Motif end 354
Protein seq
>OTG02162
MVKEMAGWVLSFCILLVVNGVGVHASEDLQPYRTAFHFQPLKNWMNDPNGPMYFNGVYHLFYQYNPGGPLWGNISWGHSI
SHDLVNWFILEPALSPKEPYDIGGCFTGSSTILHGSKPIILYTAQDVDGAQVQNLALPKNRSDPLLKDWIKWSGNPILTP
VNDINTSQFRDPSTAWMGPDGKWRIVIGSEIIKGQATALLYYSTDGFNWTRSDKPLKFSRETNMWECPDFYPVSNTGKDG
IDTSFQGNNTMHVLKVSFDSHDYYVIGMYDPQMDQFLLATSDFNVSNTQLQYDYGRFYASKSFYDGAKKRRVLWGWVNEG
DNPSDAFKKGWSGLQSFPRSVWLSDTRKQLVQWPVEEIKKLRAKQVNMESRELKGGSLLEVPGISGSQADIEVVFSLSNL
SDLELINSDMSDPQHLCDQKNVSTSGSYGPFGVLVFASQNLTEQTAVFFRVFKGPNKFQVLMCSDQSRSSIAQGVDKSTY
GAFLDLDPLHDKISLRSLVDHSIVESFGGEGLACITARVYPKLAIHEHAKLYVFNNGTKSVTMLSLNAWNMNKAQIVPMD
*
CDS seq
>OTG02162
ATGGTAAAGGAGATGGCTGGCTGGGTTCTTTCTTTTTGCATCCTTTTGGTTGTTAATGGTGTCGGAGTTCATGCATCAGA
AGATTTGCAGCCTTACCGAACTGCTTTTCACTTTCAGCCTCTCAAAAACTGGATGAATGATCCTAATGGTCCAATGTATT
TCAATGGAGTATACCATCTGTTCTACCAGTACAATCCTGGGGGTCCACTATGGGGCAACATTTCATGGGGTCATTCCATT
TCACATGATCTTGTGAACTGGTTTATTCTTGAACCTGCTCTCAGTCCAAAAGAACCCTACGACATTGGCGGCTGCTTCAC
AGGTTCAAGCACAATCCTACACGGTTCAAAACCGATAATTCTCTATACCGCTCAAGACGTAGATGGCGCCCAGGTTCAAA
ACCTGGCCCTCCCCAAAAACCGCTCTGACCCCCTTCTAAAAGACTGGATCAAATGGTCAGGTAACCCTATTCTGACCCCT
GTCAATGACATCAACACGTCCCAATTCCGTGACCCTTCAACGGCTTGGATGGGTCCAGATGGAAAATGGAGGATTGTGAT
TGGAAGTGAGATCATCAAGGGCCAGGCAACCGCGCTTTTGTATTATAGTACAGACGGTTTTAACTGGACCAGGTCGGATA
AACCTTTGAAGTTTTCGAGAGAAACAAATATGTGGGAATGTCCTGACTTCTATCCGGTTAGTAATACTGGCAAAGATGGT
ATTGATACATCTTTTCAAGGGAATAACACAATGCATGTGCTAAAAGTAAGCTTCGATAGTCATGATTATTATGTCATCGG
GATGTATGATCCGCAAATGGATCAGTTTCTCCTTGCTACTAGTGATTTTAACGTTAGCAACACACAACTTCAATACGATT
ATGGGAGGTTTTATGCATCAAAGTCATTCTATGATGGTGCGAAAAAAAGAAGGGTGTTATGGGGATGGGTTAATGAAGGT
GATAATCCCTCGGATGCCTTCAAGAAAGGGTGGTCTGGCCTTCAGTCGTTTCCAAGGAGCGTTTGGCTTAGCGATACTCG
AAAACAGTTAGTACAGTGGCCAGTGGAGGAAATCAAGAAGCTAAGAGCAAAACAAGTGAATATGGAGAGTAGAGAACTGA
AGGGTGGGTCCCTCCTTGAAGTACCAGGCATAAGTGGTTCACAGGCGGATATAGAAGTCGTTTTTAGTCTATCGAATCTG
AGTGATTTAGAGCTAATAAATTCAGATATGTCTGATCCACAACATCTTTGTGACCAAAAGAATGTTTCTACGAGTGGGAG
TTATGGTCCATTCGGCGTGCTAGTTTTCGCTTCACAGAACTTGACAGAACAAACCGCGGTTTTCTTTCGAGTATTTAAAG
GTCCTAACAAATTTCAAGTGCTCATGTGCAGTGATCAAAGCAGGTCCTCAATTGCACAAGGAGTGGACAAAAGCACATAT
GGAGCTTTCCTTGATTTGGACCCTCTTCATGATAAGATCTCACTAAGAAGCTTGGTAGATCATTCGATTGTCGAAAGTTT
TGGTGGAGAAGGACTTGCTTGTATCACCGCTAGAGTATATCCAAAGCTAGCGATTCATGAACACGCCAAGCTCTATGTAT
TCAATAATGGCACAAAGAGTGTTACTATGTTGAGTCTAAATGCTTGGAACATGAACAAGGCTCAAATAGTTCCTATGGAT
TGA
Transcript ID OTG02162
Gene ID Ha.16650
Gene Name INV1
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.4e-96
Motif start 37
Motif end 354