Microexon ID Ha_3:58733954-58733962:-
Species Helianthus annuus
Coordinates 3:58733954..58733962
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAACAGTTGCAGAAGAGAAGGTCAAAGTAGGAAAGGAAGACAAGTTTACTTGGGTGGATACGACAAGGAAGACAAAGCAGCTAGAGCTTATGATTTAGCAGCC
Microexon-tag Amino Acid Seq WDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAA
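The structure, location, phase and sequence fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it assumes the structure sizes (49,9,50) partition the microexon-tag DNA sequence from left to right, that the standard genetic code applies, and that "phase" means the number of codon bases contributed by the upstream flanking exon. The helper names (`split_tag`, `translate`, `codons_touching`) are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

TAG = (
    "TGGGATAACAGTTGCAGAAGAGAAGGTCAAAGTAGGAAAGGAAGACAAG"    # upstream flank (49)
    "TTTACTTGG"                                            # microexon (9)
    "GTGGATACGACAAGGAAGACAAAGCAGCTAGAGCTTATGATTTAGCAGCC"   # downstream flank (50)
)
SIZES = (49, 9, 50)  # flanking exon, microexon, flanking exon


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    if pos != len(tag):
        raise ValueError("sizes do not add up to the tag length")
    return parts


def translate(dna):
    """Translate complete codons in frame 0; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def codons_touching(tag, start, end):
    """Translate every codon that overlaps the half-open slice tag[start:end]."""
    first = start // 3 * 3
    last = -(-end // 3) * 3  # round up to the next codon boundary
    return translate(tag[first:last])


upstream, microexon, downstream = split_tag(TAG, SIZES)
phase = len(upstream) % 3  # assumed definition of phase
microexon_aa = codons_touching(TAG, len(upstream), len(upstream) + len(microexon))
print(microexon, phase, microexon_aa, translate(TAG))
```

Because the upstream flank ends one base into a codon, the 9-nt microexon touches four codons, which is why its amino acid field lists four residues rather than three.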
Microexon-tag spanning region 58733597-58734327
Microexon-tag prediction score 0.98
Overlapped with the annotated transcript (%) 100
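The coordinate fields can be checked with simple interval arithmetic. This sketch assumes the coordinates are 1-based and inclusive (which is consistent with the reported size of 9) and that the microexon ID follows the pattern prefix_chromosome:start-end:strand. The `Locus` type and `parse_microexon_id` function are hypothetical names introduced for illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    prefix: str
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # 1-based inclusive coordinates (an assumption about this record)
        return self.end - self.start + 1


ID_PATTERN = re.compile(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])")


def parse_microexon_id(text: str) -> Locus:
    """Split an ID such as 'Ha_3:58733954-58733962:-' into its parts."""
    match = ID_PATTERN.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {text!r}")
    prefix, chrom, start, end, strand = match.groups()
    return Locus(prefix, chrom, int(start), int(end), strand)


microexon = parse_microexon_id("Ha_3:58733954-58733962:-")
span = Locus("Ha", "3", 58733597, 58734327, "-")  # microexon-tag spanning region

TAG_LENGTH = 49 + 9 + 50                   # exonic bases in the microexon-tag
non_tag_bases = span.length - TAG_LENGTH   # presumably intronic sequence

print(microexon.length, span.length, non_tag_bases)
```

If the spanning region is bounded by the two ends of the tag, the difference between its genomic length and the 108 exonic bases would correspond to the combined length of the two introns flanking the microexon.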
New Transcript ID OTG30855x
Reference Transcript ID OTG30855
Gene ID HannXRQ_Chr03g0069061
Gene Name AIL5
Transcript ID OTG30855
Protein ID OTG30855
Gene ID HannXRQ_Chr03g0069061
Gene Name AIL5
Pfam domain motif AP2
Motif E-value 2.3e-12
Motif start 155
Motif end 213
Protein seq >OTG30855
MNMDSSQSHHQNLANNWLAFSLSNTNTLFHHPPATVFHHHHDANTTGTTSHHGLSVITDGSPKLEDFLGGCGSAASTGSG
SDVHHFQDESQPSMQQPHDTTSVYDSELKTIAASLFSGFINNNNQQGQEVAPTPPQQENSPAKKAVDNFGHRTSIYRGVT
RHRWTGRYEAHLWDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAALKYWGPTTTTNFPVCNYEKELEEMKNMTRQEF
VASLRRKSSGFSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEAAAEAYDIAAIKFRGLNAVTNFDMSRYD
VNIIANKNLPVGGMCSKSKISTDQSVPDVRDLTSTSEHPTQTTHANNVLSFGIPKKQDPCTDYWDSVLGYNQNYQCTTPY
NMEYPSNTTTNNGYYIGGEFIQQEYSNGSTIALSSASTIPMGTPISLNGSSYGTWVEQSFHSYQPAKQNLSVIQTPIFGM
E*
CDS seq >OTG30855
ATGAACATGGATTCTTCTCAATCTCATCATCAGAATCTAGCAAACAACTGGCTTGCCTTCTCTCTTTCCAACACCAACAC
CCTCTTCCACCACCCACCCGCCACCGTCTTCCACCACCACCATGATGCAAACACCACCGGAACAACCAGCCATCATGGTT
TGTCTGTTATCACCGACGGCAGCCCGAAACTGGAAGACTTCCTCGGCGGCTGTGGCTCCGCCGCCTCAACCGGATCTGGT
TCCGATGTTCATCATTTTCAAGATGAATCACAACCCTCTATGCAGCAACCACATGACACCACATCGGTTTATGATTCTGA
GTTGAAGACAATCGCCGCTAGCTTGTTCAGTGGGTTCATCAACAACAACAACCAGCAAGGGCAGGAGGTTGCTCCTACGC
CGCCACAGCAAGAGAACTCTCCGGCGAAAAAAGCTGTTGATAACTTCGGTCATCGAACTTCCATTTACCGTGGTGTTACA
AGGCATAGATGGACAGGGAGATATGAAGCTCATTTATGGGATAACAGTTGCAGAAGAGAAGGTCAAAGTAGGAAAGGAAG
ACAAGTTTACTTGGGTGGATACGACAAGGAAGACAAAGCAGCTAGAGCTTATGATTTAGCAGCCCTCAAGTATTGGGGTC
CCACCACCACCACAAACTTTCCTGTTTGCAACTATGAGAAAGAGCTTGAAGAAATGAAGAACATGACTAGGCAAGAGTTT
GTTGCTTCACTTAGAAGGAAAAGCAGTGGTTTTTCTAGAGGAGCCTCAATTTATAGAGGTGTCACAAGGCACCATCAACA
TGGACGTTGGCAAGCAAGAATAGGAAGAGTAGCTGGAAACAAAGATCTCTACCTCGGAACCTTTAGCACACAAGAAGCAG
CGGCCGAGGCATACGACATAGCTGCCATCAAATTCCGAGGGTTAAATGCGGTCACCAATTTCGACATGAGCCGATACGAC
GTGAATATTATCGCCAACAAAAACCTCCCGGTTGGTGGTATGTGTAGCAAATCCAAGATCTCAACAGACCAATCAGTTCC
AGACGTTCGAGACCTCACCTCTACTTCCGAACATCCTACACAAACAACACACGCAAACAACGTACTTAGCTTCGGGATCC
CCAAGAAACAAGACCCGTGTACCGATTACTGGGATTCGGTCCTTGGTTACAACCAAAACTACCAATGTACAACACCATAC
AACATGGAATACCCTTCAAATACTACAACCAACAATGGGTACTACATTGGTGGAGAGTTCATTCAACAAGAGTACAGTAA
TGGTAGTACTATTGCTTTAAGTAGTGCATCAACAATTCCTATGGGTACCCCTATAAGTTTGAATGGATCTAGTTATGGCA
CTTGGGTAGAACAGTCTTTTCACTCATATCAACCTGCCAAGCAAAATCTATCAGTTATTCAGACACCCATTTTTGGAATG
GAATGA
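The motif start and end fields can be related to the microexon-tag by locating the tag peptide in the protein sequence. The sketch below assumes that motif positions are 1-based, inclusive residue numbers; only the first 240 residues of the protein are included because the tag lies within them. `residue_span` and `contains` are hypothetical helpers.

```python
# First three 80-residue lines of the OTG30855 protein sequence.
PROTEIN_PREFIX = (
    "MNMDSSQSHHQNLANNWLAFSLSNTNTLFHHPPATVFHHHHDANTTGTTSHHGLSVITDGSPKLEDFLGGCGSAASTGSG"
    "SDVHHFQDESQPSMQQPHDTTSVYDSELKTIAASLFSGFINNNNQQGQEVAPTPPQQENSPAKKAVDNFGHRTSIYRGVT"
    "RHRWTGRYEAHLWDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAALKYWGPTTTTNFPVCNYEKELEEMKNMTRQEF"
)
TAG_AA = "WDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAA"
MICROEXON_AA = "VYLG"
AP2_MOTIF = (155, 213)  # motif start, motif end (assumed 1-based inclusive)


def residue_span(protein, peptide):
    """Return the 1-based inclusive (start, end) of the first match of peptide."""
    index = protein.find(peptide)
    if index < 0:
        raise ValueError("peptide not found in protein")
    return index + 1, index + len(peptide)


def contains(outer, inner):
    """True if the inner interval lies entirely within the outer interval."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


tag_span = residue_span(PROTEIN_PREFIX, TAG_AA)
microexon_span = residue_span(PROTEIN_PREFIX, MICROEXON_AA)

print(tag_span, contains(AP2_MOTIF, tag_span))
print(microexon_span, contains(AP2_MOTIF, microexon_span))
```

Under these assumptions the whole microexon-tag, including the residues encoded by the microexon, falls inside the reported AP2 motif interval.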
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAACAGTTGCAGAAGAGAAGGTCAAAGTAGGAAAGGAAGACAAGTTTACTTGGGTGGATACGACAAGGAAGACAAAGCAGCTAGAGCTTATGATTTAGCAGCC
Microexon-tag Amino Acid seq WDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAA
Transcript ID OTG30855
Gene ID Ha.37035
Gene Name AIL5
Pfam domain motif AP2
Motif E-value 2.3e-12
Motif start 155
Motif end 213
Protein seq >OTG30855
MNMDSSQSHHQNLANNWLAFSLSNTNTLFHHPPATVFHHHHDANTTGTTSHHGLSVITDGSPKLEDFLGGCGSAASTGSG
SDVHHFQDESQPSMQQPHDTTSVYDSELKTIAASLFSGFINNNNQQGQEVAPTPPQQENSPAKKAVDNFGHRTSIYRGVT
RHRWTGRYEAHLWDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAALKYWGPTTTTNFPVCNYEKELEEMKNMTRQEF
VASLRRKSSGFSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEAAAEAYDIAAIKFRGLNAVTNFDMSRYD
VNIIANKNLPVGGMCSKSKISTDQSVPDVRDLTSTSEHPTQTTHANNVLSFGIPKKQDPCTDYWDSVLGYNQNYQCTTPY
NMEYPSNTTTNNGYYIGGEFIQQEYSNGSTIALSSASTIPMGTPISLNGSSYGTWVEQSFHSYQPAKQNLSVIQTPIFGM
E*
CDS seq >OTG30855
ATGAACATGGATTCTTCTCAATCTCATCATCAGAATCTAGCAAACAACTGGCTTGCCTTCTCTCTTTCCAACACCAACAC
CCTCTTCCACCACCCACCCGCCACCGTCTTCCACCACCACCATGATGCAAACACCACCGGAACAACCAGCCATCATGGTT
TGTCTGTTATCACCGACGGCAGCCCGAAACTGGAAGACTTCCTCGGCGGCTGTGGCTCCGCCGCCTCAACCGGATCTGGT
TCCGATGTTCATCATTTTCAAGATGAATCACAACCCTCTATGCAGCAACCACATGACACCACATCGGTTTATGATTCTGA
GTTGAAGACAATCGCCGCTAGCTTGTTCAGTGGGTTCATCAACAACAACAACCAGCAAGGGCAGGAGGTTGCTCCTACGC
CGCCACAGCAAGAGAACTCTCCGGCGAAAAAAGCTGTTGATAACTTCGGTCATCGAACTTCCATTTACCGTGGTGTTACA
AGGCATAGATGGACAGGGAGATATGAAGCTCATTTATGGGATAACAGTTGCAGAAGAGAAGGTCAAAGTAGGAAAGGAAG
ACAAGTTTACTTGGGTGGATACGACAAGGAAGACAAAGCAGCTAGAGCTTATGATTTAGCAGCCCTCAAGTATTGGGGTC
CCACCACCACCACAAACTTTCCTGTTTGCAACTATGAGAAAGAGCTTGAAGAAATGAAGAACATGACTAGGCAAGAGTTT
GTTGCTTCACTTAGAAGGAAAAGCAGTGGTTTTTCTAGAGGAGCCTCAATTTATAGAGGTGTCACAAGGCACCATCAACA
TGGACGTTGGCAAGCAAGAATAGGAAGAGTAGCTGGAAACAAAGATCTCTACCTCGGAACCTTTAGCACACAAGAAGCAG
CGGCCGAGGCATACGACATAGCTGCCATCAAATTCCGAGGGTTAAATGCGGTCACCAATTTCGACATGAGCCGATACGAC
GTGAATATTATCGCCAACAAAAACCTCCCGGTTGGTGGTATGTGTAGCAAATCCAAGATCTCAACAGACCAATCAGTTCC
AGACGTTCGAGACCTCACCTCTACTTCCGAACATCCTACACAAACAACACACGCAAACAACGTACTTAGCTTCGGGATCC
CCAAGAAACAAGACCCGTGTACCGATTACTGGGATTCGGTCCTTGGTTACAACCAAAACTACCAATGTACAACACCATAC
AACATGGAATACCCTTCAAATACTACAACCAACAATGGGTACTACATTGGTGGAGAGTTCATTCAACAAGAGTACAGTAA
TGGTAGTACTATTGCTTTAAGTAGTGCATCAACAATTCCTATGGGTACCCCTATAAGTTTGAATGGATCTAGTTATGGCA
CTTGGGTAGAACAGTCTTTTCACTCATATCAACCTGCCAAGCAAAATCTATCAGTTATTCAGACACCCATTTTTGGAATG
GAATGA
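The first microexon-tag DNA sequence in this record contains IUPAC ambiguity codes and appears to be a consensus for the microexon cluster, while the later one is the sequence for this transcript. The sketch below compares the two position by position. The reading of the degenerate sequence as a cluster consensus is an assumption, and `mismatches` is a hypothetical helper.

```python
# IUPAC nucleotide codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

CONSENSUS = (
    "TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGK"
    "GSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW"
)
TAG = (
    "TGGGATAACAGTTGCAGAAGAGAAGGTCAAAGTAGGAAAGGAAGACAAGTTTACTTGGGT"
    "GGATACGACAAGGAAGACAAAGCAGCTAGAGCTTATGATTTAGCAGCC"
)


def mismatches(consensus, sequence):
    """Return 1-based positions where the base is not allowed by the consensus."""
    if len(consensus) != len(sequence):
        raise ValueError("sequences must have the same length")
    return [
        position
        for position, (code, base) in enumerate(zip(consensus, sequence), start=1)
        if base not in IUPAC[code]
    ]


tag_mismatches = mismatches(CONSENSUS, TAG)
microexon_mismatches = mismatches(CONSENSUS[49:58], TAG[49:58])  # microexon slice

print(tag_mismatches, microexon_mismatches)
```

With the sequences as given, three tag positions (66, 78 and 108, 1-based) fall outside the consensus, and all nine microexon bases match it.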