Microexon ID Ha_7:102950673-102950681:+
Species Helianthus annuus
Coordinates 7:102950673..102950681
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CAAGGGAAACCATCTTTCCACCTCCGGCCTCCACATAACTGGATAAACGATCCTAATGGTCCGGTGTATTACAAAGGATTCTATCATTTATTCTACCAGTACAATCCG
Microexon-tag Amino Acid Seq QGKPSFHLRPPHNWINDPNGPVYYKGFYHLFYQYNP
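The structure field (49,9,50) and the tag sequence above can be cross-checked against the microexon fields. The following is a minimal sketch, not part of the database record: it assumes the tag is in frame from its first base (which agrees with the 36-residue amino acid sequence listed for the 108-nt tag) and uses the standard genetic code. The helper names `split_tag` and `translate` are hypothetical.

```python
from itertools import product

# Values copied from the record above.
TAG_DNA = "CAAGGGAAACCATCTTTCCACCTCCGGCCTCCACATAACTGGATAAACGATCCTAATGGTCCGGTGTATTACAAAGGATTCTATCATTTATTCTACCAGTACAATCCG"
STRUCTURE = (49, 9, 50)  # flanking exon, microexon, flanking exon sizes

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def split_tag(tag, structure):
    """Split a tag into (upstream exon, microexon, downstream exon) pieces."""
    up_len, micro_len, down_len = structure
    if up_len + micro_len + down_len != len(tag):
        raise ValueError("structure sizes do not add up to the tag length")
    return (tag[:up_len],
            tag[up_len:up_len + micro_len],
            tag[up_len + micro_len:])


def translate(dna):
    """Translate complete codons of a DNA string, reading frame 0 (assumed)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


upstream, microexon, downstream = split_tag(TAG_DNA, STRUCTURE)

# The 9-nt microexon starts one base into a codon, so it touches 4 codons.
# Widen the slice to whole codons before translating.
codon_start = (len(upstream) // 3) * 3
codon_end = -(-(len(upstream) + len(microexon)) // 3) * 3  # ceiling to a multiple of 3
microexon_codons = TAG_DNA[codon_start:codon_end]

print(microexon)                    # microexon DNA
print(translate(microexon_codons))  # residues encoded by codons touching the microexon
print(translate(TAG_DNA))           # full tag translation
```

Because the microexon begins after the first base of a codon, its 9 nucleotides contribute to four residues rather than three, which is why the record lists a 4-residue amino acid sequence for a 9-nt exon.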
Microexon-tag spanning region 102950481-102950830
Microexon-tag prediction score 0.9224
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG21720x
Reference Transcript ID OTG21720
Gene ID HannXRQ_Chr07g0207201
Gene Name NA
Transcript ID OTG21720
Protein ID OTG21720
Pfam domain motif Glyco_hydro_32N
Motif E-value 1e-100
Motif start 47
Motif end 364
Protein seq
>OTG21720
MKTFVFCIIYLCCVLHSNGTDIHDTFYVASLSHDQIVKNQQGKPSFHLRPPHNWINDPNGPVYYKGFYHLFYQYNPKGAV
FKKPIVWGHSVSRDLTNWIHLNNALVPTDPFDINSCYSGSTTILPGNKPVVLYTGLDANNQQVQNLAKPKNLSDPFLKEW
IKYSGNPLMTPPKGVTPDNFRDPTTAWKGKDGNWRVAVGGLRGNLGVAILYRSKDFVHWILHDDPLYFGMNTGIWECPDF
YPVSVNSRHGLDNSVNGKNIKYVLKASFQSHDYYAIGSYDSNKDKFVPESKGLTGSTSDWRYDYGKFYASKTFYDSVKKR
RILWGWINESDSSSDDIKKGWAGIQSIPRQVYLSDAGRQLVQWPIEETEKLREKHVTYSNKKLKGKSLFEVTGITASQVD
IQLSFNLPKIEEAEVFDPKWDDPQTLCSTKTASASGRAGPFGLLVMASQDLTEQTAVFFRVFKDNHKFIVLMCSDQSRSS
LREGIDKTTYGAFIHMDHEDDMISLRSLVDHSVVESFGADGRACITARVYPHFHVKKEAHLYVFNNGSNDVVIKTLDAWN
MKKTKGN*
CDS seq
>OTG21720
ATGAAAACTTTTGTGTTTTGTATCATATACTTGTGTTGCGTTCTGCATAGCAATGGCACGGATATTCATGATACATTTTA
CGTCGCATCTCTATCTCACGATCAAATTGTGAAGAACCAACAAGGGAAACCATCTTTCCACCTCCGGCCTCCACATAACT
GGATAAACGATCCTAATGGTCCGGTGTATTACAAAGGATTCTATCATTTATTCTACCAGTACAATCCGAAAGGTGCCGTT
TTTAAGAAACCAATCGTATGGGGGCATTCTGTCTCCCGTGATTTAACCAACTGGATTCATCTCAACAACGCCCTCGTCCC
AACTGATCCATTCGACATCAACAGTTGCTATTCAGGTTCTACAACAATCCTTCCAGGAAACAAACCGGTTGTTTTGTACA
CTGGACTCGACGCTAATAATCAACAAGTTCAGAACCTAGCAAAGCCTAAAAACTTATCAGATCCTTTTCTTAAAGAATGG
ATAAAGTATTCTGGTAATCCTTTAATGACTCCGCCGAAGGGGGTTACACCAGATAACTTTCGCGATCCCACTACAGCTTG
GAAGGGCAAAGATGGGAATTGGAGGGTGGCTGTAGGTGGTCTGAGAGGTAATCTAGGAGTCGCGATTCTATATCGCAGTA
AAGATTTTGTTCACTGGATTTTGCATGACGATCCACTTTATTTTGGGATGAATACTGGGATTTGGGAGTGTCCAGACTTT
TATCCTGTGTCTGTCAACAGCAGACATGGGCTAGATAATTCAGTGAATGGGAAGAACATAAAGTATGTGCTGAAAGCGAG
CTTTCAATCACACGACTACTATGCAATCGGAAGCTATGATTCTAACAAGGACAAGTTTGTGCCAGAAAGTAAAGGTCTTA
CTGGAAGTACCTCAGACTGGAGATATGACTATGGAAAGTTTTATGCATCAAAGACATTTTACGACAGTGTGAAGAAGAGG
AGAATACTGTGGGGTTGGATTAACGAGTCGGATAGCTCATCTGATGATATTAAGAAAGGATGGGCTGGAATCCAGTCCAT
CCCTCGCCAAGTGTATCTTAGTGACGCTGGAAGGCAGCTAGTACAGTGGCCCATTGAGGAAACTGAAAAATTGCGCGAGA
AGCATGTTACTTACTCTAACAAGAAGCTCAAAGGAAAATCATTATTTGAAGTTACAGGCATCACAGCCTCACAGGTTGAC
ATACAACTGTCCTTTAACCTGCCCAAAATAGAAGAGGCCGAGGTTTTCGACCCGAAGTGGGATGATCCTCAAACTCTATG
TAGTACAAAGACAGCAAGCGCCAGTGGTCGAGCAGGGCCATTCGGCTTATTAGTTATGGCTTCTCAAGATTTAACCGAAC
AAACTGCAGTCTTCTTCCGTGTATTTAAAGACAACCACAAGTTTATTGTTTTAATGTGTAGTGATCAAAGCAGGTCTTCG
CTGAGAGAAGGGATTGATAAAACCACATATGGAGCTTTTATACACATGGATCATGAAGATGACATGATCTCACTTAGGAG
CTTGGTAGATCATTCGGTGGTTGAGAGTTTTGGAGCTGATGGAAGAGCATGCATTACGGCTAGAGTGTACCCGCACTTTC
ATGTTAAGAAAGAAGCTCATTTATATGTGTTCAATAATGGAAGTAATGACGTAGTTATCAAAACATTAGATGCTTGGAAT
ATGAAGAAGACCAAAGGTAACTAA
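As a rough consistency check, the microexon-tag can be located in the protein and compared with the Pfam motif range. This sketch is an illustration only: it assumes that the motif start and end (47 and 364) are 1-based, inclusive protein coordinates, which the record does not state explicitly. Only the first 80 residues of the protein are needed, and all variable names are hypothetical.

```python
# First line (80 residues) of the protein sequence from the record.
PROTEIN_N_TERM = "MKTFVFCIIYLCCVLHSNGTDIHDTFYVASLSHDQIVKNQQGKPSFHLRPPHNWINDPNGPVYYKGFYHLFYQYNPKGAV"
TAG_AA = "QGKPSFHLRPPHNWINDPNGPVYYKGFYHLFYQYNP"
MICROEXON_AA = "DPNG"
MOTIF_START, MOTIF_END = 47, 364  # assumed 1-based, inclusive

# Convert Python's 0-based offsets to 1-based protein positions.
tag_offset = PROTEIN_N_TERM.find(TAG_AA)
if tag_offset < 0:
    raise ValueError("tag not found in the protein N-terminus")
tag_start = tag_offset + 1
tag_end = tag_offset + len(TAG_AA)

micro_start = tag_start + TAG_AA.find(MICROEXON_AA)
micro_end = micro_start + len(MICROEXON_AA) - 1

# Does the microexon-encoded stretch lie inside the motif range?
micro_in_motif = MOTIF_START <= micro_start and micro_end <= MOTIF_END

print(f"tag residues {tag_start}-{tag_end}")
print(f"microexon residues {micro_start}-{micro_end}, inside motif: {micro_in_motif}")
```

Under that coordinate assumption, the tag begins a few residues upstream of the motif start, while the microexon-encoded residues fall within the motif range.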
Transcript ID OTG21720
Gene ID Ha.51355