Microexon ID Ha_9:45365022-45365030:+
Species Helianthus annuus
Coordinates 9:45365022..45365030
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAGAAGAGAAGGGCAAAGTAGGAAAGGAAGACAAGTTTACTTGGGTGGATACGACAAAGAAGACAAAGCAGCTCGAGCTTATGATCTTGCAGCT
Microexon-tag Amino Acid Seq WDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAA
Microexon-tag spanning region45364639-45365441
Microexon-tag prediction score0.9791
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG13977x
Reference Transcript ID OTG13977
Gene ID HannXRQ_Chr09g0244261
Gene Name NA
Transcript ID OTG13977
Protein ID OTG13977
Gene ID HannXRQ_Chr09g0244261
Gene Name NA
Pfam domain motif AP2
Motif E-value 2.8e-12
Motif start 134
Motif end 192
Protein seq >OTG13977
MNMNPSDQPINLQDHNNWLTFSLSNTTTTILQPPHAAALHHHHHHDADVTPKLEDFLGGSSGSGHDVCQFSDESQQPHDT
TSIYDSELKTIATSFLHGFSSDNQQQLAVTPAPPQQENSPVRKAVDTFGQRTSIYRGVTRHRWTGRYEAHLWDNSCRREG
QSRKGRQVYLGGYDKEDKAARAYDLAALKYWGLTTTTNFPVCNYQKEIEEMKNMTRQEFVASLRRKSSGFSRGASIYRGV
TRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDMSRYDVNSIANKNLPIGGMCSKSKTS
MDQTVSDNNQISLNQDLATTSEQPHSNILSFALPVKQDPCPDYWNAVFGYTQSNSYQCTSTMPYVMEYPSSTTNNGYYNS
EVIQQENDNGTTVALSTESAIPMGTPVGLNGSSYGSWIEQSFHSNPPAKQNLSVFQTPIFGME*
CDS seq >OTG13977
ATGAACATGAATCCTTCTGATCAACCCATTAATCTCCAGGATCACAACAACTGGCTCACCTTCTCTCTCTCCAACACCAC
CACCACCATCTTACAACCACCACACGCCGCCGCCTTACACCACCACCACCACCATGATGCAGATGTTACTCCAAAACTGG
AAGACTTTCTCGGCGGCTCTAGCGGATCTGGTCATGATGTGTGTCAGTTTTCAGATGAATCTCAGCAACCCCATGACACA
ACATCAATATATGATTCTGAGTTGAAGACAATAGCGACTAGTTTTCTTCATGGGTTCTCATCGGACAACCAGCAGCAACT
GGCGGTGACACCGGCGCCGCCACAGCAAGAAAACTCTCCGGTGAGAAAGGCTGTTGATACCTTTGGCCAACGTACTTCTA
TTTACCGTGGTGTTACAAGGCATAGATGGACGGGGAGATATGAAGCTCATTTATGGGATAATAGTTGCAGAAGAGAAGGG
CAAAGTAGGAAAGGAAGACAAGTTTACTTGGGTGGATACGACAAAGAAGACAAAGCAGCTCGAGCTTATGATCTTGCAGC
TCTTAAGTACTGGGGTCTGACCACCACGACAAATTTTCCGGTTTGCAACTACCAGAAAGAGATTGAAGAAATGAAGAACA
TGACTAGGCAAGAATTTGTTGCTTCACTTAGAAGGAAAAGCAGTGGGTTTTCTAGAGGAGCCTCCATTTATAGGGGTGTC
ACAAGGCACCATCAACATGGACGTTGGCAAGCGAGAATAGGACGAGTAGCCGGAAACAAAGATCTCTACCTCGGAACCTT
TAGCACACAAGAAGAAGCAGCGGAGGCATACGACATTGCAGCCATCAAATTCCGAGGGTTAAACGCGGTCACAAATTTCG
ACATGAGCCGATACGACGTGAACAGCATCGCCAACAAAAACCTCCCAATCGGTGGCATGTGTAGCAAATCCAAAACCTCA
ATGGATCAGACTGTGTCCGACAACAACCAAATATCCCTCAACCAAGATCTCGCCACAACTTCCGAACAACCACACTCGAA
CATCCTTAGCTTCGCCTTGCCCGTGAAACAAGACCCGTGCCCGGATTACTGGAATGCGGTCTTCGGGTACACTCAAAGCA
ATAGCTATCAATGTACAAGTACAATGCCATACGTCATGGAATACCCTTCAAGTACAACCAACAATGGGTACTACAATAGT
GAGGTGATTCAACAAGAGAATGATAATGGTACTACTGTTGCTTTAAGTACTGAATCAGCAATTCCAATGGGTACCCCTGT
AGGTTTGAATGGATCTAGTTATGGAAGCTGGATAGAACAGTCTTTTCACTCAAATCCACCAGCCAAGCAAAATCTCTCTG
TTTTTCAGACACCCATTTTTGGAATGGAATGA
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAGAAGAGAAGGGCAAAGTAGGAAAGGAAGACAAGTTTACTTGGGTGGATACGACAAAGAAGACAAAGCAGCTCGAGCTTATGATCTTGCAGCT
Microexon-tag Amino Acid seq WDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAA
Transcript ID OTG13977
Gene ID Ha.55155
Gene Name NA
Pfam domain motif AP2
Motif E-value 2.8e-12
Motif start 134
Motif end 192
Protein seq >OTG13977
MNMNPSDQPINLQDHNNWLTFSLSNTTTTILQPPHAAALHHHHHHDADVTPKLEDFLGGSSGSGHDVCQFSDESQQPHDT
TSIYDSELKTIATSFLHGFSSDNQQQLAVTPAPPQQENSPVRKAVDTFGQRTSIYRGVTRHRWTGRYEAHLWDNSCRREG
QSRKGRQVYLGGYDKEDKAARAYDLAALKYWGLTTTTNFPVCNYQKEIEEMKNMTRQEFVASLRRKSSGFSRGASIYRGV
TRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDMSRYDVNSIANKNLPIGGMCSKSKTS
MDQTVSDNNQISLNQDLATTSEQPHSNILSFALPVKQDPCPDYWNAVFGYTQSNSYQCTSTMPYVMEYPSSTTNNGYYNS
EVIQQENDNGTTVALSTESAIPMGTPVGLNGSSYGSWIEQSFHSNPPAKQNLSVFQTPIFGME*
CDS seq >OTG13977
ATGAACATGAATCCTTCTGATCAACCCATTAATCTCCAGGATCACAACAACTGGCTCACCTTCTCTCTCTCCAACACCAC
CACCACCATCTTACAACCACCACACGCCGCCGCCTTACACCACCACCACCACCATGATGCAGATGTTACTCCAAAACTGG
AAGACTTTCTCGGCGGCTCTAGCGGATCTGGTCATGATGTGTGTCAGTTTTCAGATGAATCTCAGCAACCCCATGACACA
ACATCAATATATGATTCTGAGTTGAAGACAATAGCGACTAGTTTTCTTCATGGGTTCTCATCGGACAACCAGCAGCAACT
GGCGGTGACACCGGCGCCGCCACAGCAAGAAAACTCTCCGGTGAGAAAGGCTGTTGATACCTTTGGCCAACGTACTTCTA
TTTACCGTGGTGTTACAAGGCATAGATGGACGGGGAGATATGAAGCTCATTTATGGGATAATAGTTGCAGAAGAGAAGGG
CAAAGTAGGAAAGGAAGACAAGTTTACTTGGGTGGATACGACAAAGAAGACAAAGCAGCTCGAGCTTATGATCTTGCAGC
TCTTAAGTACTGGGGTCTGACCACCACGACAAATTTTCCGGTTTGCAACTACCAGAAAGAGATTGAAGAAATGAAGAACA
TGACTAGGCAAGAATTTGTTGCTTCACTTAGAAGGAAAAGCAGTGGGTTTTCTAGAGGAGCCTCCATTTATAGGGGTGTC
ACAAGGCACCATCAACATGGACGTTGGCAAGCGAGAATAGGACGAGTAGCCGGAAACAAAGATCTCTACCTCGGAACCTT
TAGCACACAAGAAGAAGCAGCGGAGGCATACGACATTGCAGCCATCAAATTCCGAGGGTTAAACGCGGTCACAAATTTCG
ACATGAGCCGATACGACGTGAACAGCATCGCCAACAAAAACCTCCCAATCGGTGGCATGTGTAGCAAATCCAAAACCTCA
ATGGATCAGACTGTGTCCGACAACAACCAAATATCCCTCAACCAAGATCTCGCCACAACTTCCGAACAACCACACTCGAA
CATCCTTAGCTTCGCCTTGCCCGTGAAACAAGACCCGTGCCCGGATTACTGGAATGCGGTCTTCGGGTACACTCAAAGCA
ATAGCTATCAATGTACAAGTACAATGCCATACGTCATGGAATACCCTTCAAGTACAACCAACAATGGGTACTACAATAGT
GAGGTGATTCAACAAGAGAATGATAATGGTACTACTGTTGCTTTAAGTACTGAATCAGCAATTCCAATGGGTACCCCTGT
AGGTTTGAATGGATCTAGTTATGGAAGCTGGATAGAACAGTCTTTTCACTCAAATCCACCAGCCAAGCAAAATCTCTCTG
TTTTTCAGACACCCATTTTTGGAATGGAATGA