Microexon ID Ha_15:35523211-35523219:+
Species Helianthus annuus
Coordinates 15:35523211..35523219
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTATCTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAACAGCTGCAAAAAAGAAGGCCAAAGCAGGAAGGGAAGACAAGTTTATCTGGGTGGTTATGATATGGAAGAGAAAGCTGCTAGAGCTTATGATTTAGCTGCA
Microexon-tag Amino Acid Seq WDNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAA
Microexon-tag spanning region35523024-35523375
Microexon-tag prediction score0.9851
Overlapped with the annotated transcript (%) 100
New Transcript ID OTF94597x
Reference Transcript ID OTF94597
Gene ID HannXRQ_Chr15g0473861
Gene Name ANT
Transcript ID OTF94597
Protein ID OTF94597
Gene ID HannXRQ_Chr15g0473861
Gene Name ANT
Pfam domain motif AP2
Motif E-value 1.1e-13
Motif start 302
Motif end 360
Protein seq >OTF94597
MKFMNNSNNTTVNNDNNTNNNWLGFSLSPHINTNPTTSTMEGSPLPPPSSSNPTTYFNLPSHFNYTNMYGVEGENGNGIY
TAFPIMPLKSDGSLCLMEAITRSQSQGMVTSAPPKLENFFGGVTMGTPDFDRGGGATMGLGLDSSTMYYNQNLDHQETLQ
HNYRHQQNYPDYSSFKPMYQSVQQEVKEDHLSTDNLHLPTIGEDDDITGMKNWISRNYHTGVSGDHGGGYGDLQSLSLSM
SFGCSQPSPCVTASPRQEVVPAANVTDCVVMDTKKRGSEKVDQQKQIVHRKSLDTFGQRTSQYRGVTRHRWTGRYEAHLW
DNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAALKYWGPATHINFPLENYEQEIVEMKNMSRQEYVAHLRRRSSGFSR
GASVYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDVAAIKFRGVNAVTNFDISRYNVEKIMASNTLLAG
ELARRTKEVDTTTTNELLSDQLLAHNSEPKTILSDPTADMNMLDWKMTLYDQNRTRPAGLEGDDHSSKLGPHLSNTSSLV
TSLSSSREHSPDRNNNLPMVLETPTSSGSRFLGNPSGNTWVPTAVTATQMRSHIPVFAAWTDA*
CDS seq >OTF94597
ATGAAGTTTATGAACAACAGCAACAACACAACAGTCAACAATGACAACAACACAAACAACAACTGGTTGGGCTTCTCTCT
TTCTCCTCATATAAACACAAATCCCACAACTTCAACCATGGAAGGCTCTCCACTTCCTCCTCCTTCTTCTTCAAACCCCA
CAACTTATTTTAATCTTCCATCTCATTTCAACTACACAAACATGTATGGAGTTGAAGGTGAAAATGGCAATGGGATTTAC
ACTGCTTTTCCCATCATGCCCTTGAAGTCTGATGGTTCCCTTTGTTTGATGGAAGCCATCACAAGATCACAATCACAAGG
AATGGTTACTTCAGCTCCACCAAAACTTGAGAATTTTTTTGGTGGTGTAACAATGGGGACCCCTGACTTTGACAGAGGGG
GTGGAGCTACAATGGGTCTTGGTTTAGATAGCAGTACAATGTACTACAACCAAAATCTTGATCATCAAGAGACTCTACAA
CACAACTACAGGCATCAACAAAACTATCCAGATTACTCTAGTTTCAAACCTATGTACCAAAGTGTACAACAAGAAGTCAA
GGAGGATCATCTTTCTACAGATAACCTTCATCTACCAACTATTGGTGAAGATGATGACATCACTGGCATGAAGAACTGGA
TTTCAAGAAACTATCATACTGGTGTCAGTGGTGATCATGGTGGTGGTTATGGAGATCTACAGTCACTTAGCTTGTCAATG
AGCTTTGGTTGTTCACAACCATCCCCATGTGTGACTGCATCACCACGGCAGGAAGTTGTACCTGCTGCTAATGTCACTGA
CTGTGTGGTTATGGATACAAAGAAAAGAGGGTCTGAGAAAGTTGACCAACAAAAGCAAATTGTTCATAGGAAATCTTTGG
ATACGTTTGGTCAAAGAACCTCTCAGTATAGAGGTGTTACCAGGCATAGATGGACTGGTAGATATGAAGCACATTTGTGG
GATAACAGCTGCAAAAAAGAAGGCCAAAGCAGGAAGGGAAGACAAGTTTATCTGGGTGGTTATGATATGGAAGAGAAAGC
TGCTAGAGCTTATGATTTAGCTGCACTCAAGTATTGGGGTCCTGCAACTCACATCAATTTTCCATTGGAGAACTATGAGC
AAGAAATTGTGGAAATGAAGAACATGTCTAGACAAGAATATGTAGCTCACTTGAGAAGGAGAAGCAGTGGTTTCTCTAGG
GGAGCCTCAGTCTATAGAGGAGTAACAAGACACCATCAACATGGCCGATGGCAAGCGCGGATAGGCCGTGTGGCGGGGAA
CAAGGATCTTTACTTAGGCACATTTAGCACCCAAGAAGAAGCTGCTGAGGCTTATGATGTGGCTGCCATTAAATTTCGAG
GCGTTAATGCAGTTACAAATTTCGACATTTCAAGGTATAACGTCGAAAAGATCATGGCGAGTAATACCCTTTTAGCCGGT
GAACTAGCTAGGAGAACCAAAGAAGTAGACACCACAACCACAAATGAGCTCTTATCAGACCAACTACTAGCACATAATAG
TGAACCCAAGACTATATTAAGTGATCCCACCGCGGACATGAACATGTTGGATTGGAAAATGACACTCTATGATCAAAACC
GGACCAGGCCTGCAGGGCTCGAGGGGGATGATCACTCCTCTAAACTTGGACCACATTTATCAAATACTTCTTCTTTGGTG
ACTAGTTTGAGCAGCTCCAGAGAACATAGCCCAGATAGAAACAACAATCTCCCAATGGTTCTTGAAACTCCAACATCGTC
TGGTTCTAGGTTTCTTGGTAATCCATCTGGAAACACATGGGTTCCAACAGCAGTAACAGCAACCCAAATGAGGTCTCATA
TACCAGTATTTGCTGCATGGACAGATGCATGA
Microexon DNA seq TTTATCTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAACAGCTGCAAAAAAGAAGGCCAAAGCAGGAAGGGAAGACAAGTTTATCTGGGTGGTTATGATATGGAAGAGAAAGCTGCTAGAGCTTATGATTTAGCTGCA
Microexon-tag Amino Acid seq WDNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAA
Transcript ID OTF94597
Gene ID Ha.23025
Gene Name ANT
Pfam domain motif AP2
Motif E-value 1.1e-13
Motif start 302
Motif end 360
Protein seq >OTF94597
MKFMNNSNNTTVNNDNNTNNNWLGFSLSPHINTNPTTSTMEGSPLPPPSSSNPTTYFNLPSHFNYTNMYGVEGENGNGIY
TAFPIMPLKSDGSLCLMEAITRSQSQGMVTSAPPKLENFFGGVTMGTPDFDRGGGATMGLGLDSSTMYYNQNLDHQETLQ
HNYRHQQNYPDYSSFKPMYQSVQQEVKEDHLSTDNLHLPTIGEDDDITGMKNWISRNYHTGVSGDHGGGYGDLQSLSLSM
SFGCSQPSPCVTASPRQEVVPAANVTDCVVMDTKKRGSEKVDQQKQIVHRKSLDTFGQRTSQYRGVTRHRWTGRYEAHLW
DNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAALKYWGPATHINFPLENYEQEIVEMKNMSRQEYVAHLRRRSSGFSR
GASVYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDVAAIKFRGVNAVTNFDISRYNVEKIMASNTLLAG
ELARRTKEVDTTTTNELLSDQLLAHNSEPKTILSDPTADMNMLDWKMTLYDQNRTRPAGLEGDDHSSKLGPHLSNTSSLV
TSLSSSREHSPDRNNNLPMVLETPTSSGSRFLGNPSGNTWVPTAVTATQMRSHIPVFAAWTDA*
CDS seq >OTF94597
ATGAAGTTTATGAACAACAGCAACAACACAACAGTCAACAATGACAACAACACAAACAACAACTGGTTGGGCTTCTCTCT
TTCTCCTCATATAAACACAAATCCCACAACTTCAACCATGGAAGGCTCTCCACTTCCTCCTCCTTCTTCTTCAAACCCCA
CAACTTATTTTAATCTTCCATCTCATTTCAACTACACAAACATGTATGGAGTTGAAGGTGAAAATGGCAATGGGATTTAC
ACTGCTTTTCCCATCATGCCCTTGAAGTCTGATGGTTCCCTTTGTTTGATGGAAGCCATCACAAGATCACAATCACAAGG
AATGGTTACTTCAGCTCCACCAAAACTTGAGAATTTTTTTGGTGGTGTAACAATGGGGACCCCTGACTTTGACAGAGGGG
GTGGAGCTACAATGGGTCTTGGTTTAGATAGCAGTACAATGTACTACAACCAAAATCTTGATCATCAAGAGACTCTACAA
CACAACTACAGGCATCAACAAAACTATCCAGATTACTCTAGTTTCAAACCTATGTACCAAAGTGTACAACAAGAAGTCAA
GGAGGATCATCTTTCTACAGATAACCTTCATCTACCAACTATTGGTGAAGATGATGACATCACTGGCATGAAGAACTGGA
TTTCAAGAAACTATCATACTGGTGTCAGTGGTGATCATGGTGGTGGTTATGGAGATCTACAGTCACTTAGCTTGTCAATG
AGCTTTGGTTGTTCACAACCATCCCCATGTGTGACTGCATCACCACGGCAGGAAGTTGTACCTGCTGCTAATGTCACTGA
CTGTGTGGTTATGGATACAAAGAAAAGAGGGTCTGAGAAAGTTGACCAACAAAAGCAAATTGTTCATAGGAAATCTTTGG
ATACGTTTGGTCAAAGAACCTCTCAGTATAGAGGTGTTACCAGGCATAGATGGACTGGTAGATATGAAGCACATTTGTGG
GATAACAGCTGCAAAAAAGAAGGCCAAAGCAGGAAGGGAAGACAAGTTTATCTGGGTGGTTATGATATGGAAGAGAAAGC
TGCTAGAGCTTATGATTTAGCTGCACTCAAGTATTGGGGTCCTGCAACTCACATCAATTTTCCATTGGAGAACTATGAGC
AAGAAATTGTGGAAATGAAGAACATGTCTAGACAAGAATATGTAGCTCACTTGAGAAGGAGAAGCAGTGGTTTCTCTAGG
GGAGCCTCAGTCTATAGAGGAGTAACAAGACACCATCAACATGGCCGATGGCAAGCGCGGATAGGCCGTGTGGCGGGGAA
CAAGGATCTTTACTTAGGCACATTTAGCACCCAAGAAGAAGCTGCTGAGGCTTATGATGTGGCTGCCATTAAATTTCGAG
GCGTTAATGCAGTTACAAATTTCGACATTTCAAGGTATAACGTCGAAAAGATCATGGCGAGTAATACCCTTTTAGCCGGT
GAACTAGCTAGGAGAACCAAAGAAGTAGACACCACAACCACAAATGAGCTCTTATCAGACCAACTACTAGCACATAATAG
TGAACCCAAGACTATATTAAGTGATCCCACCGCGGACATGAACATGTTGGATTGGAAAATGACACTCTATGATCAAAACC
GGACCAGGCCTGCAGGGCTCGAGGGGGATGATCACTCCTCTAAACTTGGACCACATTTATCAAATACTTCTTCTTTGGTG
ACTAGTTTGAGCAGCTCCAGAGAACATAGCCCAGATAGAAACAACAATCTCCCAATGGTTCTTGAAACTCCAACATCGTC
TGGTTCTAGGTTTCTTGGTAATCCATCTGGAAACACATGGGTTCCAACAGCAGTAACAGCAACCCAAATGAGGTCTCATA
TACCAGTATTTGCTGCATGGACAGATGCATGA