Microexon ID Ha_10:36803874-36803879:-
Species Helianthus annuus
Coordinates 10:36803874..36803879
Microexon Cluster ID MEP12
Size 6
Phase 1
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 52,6,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GAAGAMCRTGTKCGKGAGGATKTKCARAAGTWYYCWAGRGGWTCYCCACAAGCWAGAGCTTATSGKAATGATGGMRCWMRRRGYCGWTCAASMCATTCAAAATCTCCM
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
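The Microexon ID above appears to encode a species prefix, chromosome, start, end, and strand. The following is a minimal Python sketch of parsing it, assuming the coordinates are 1-based and inclusive (under that assumption the computed length agrees with the listed Size of 6). The names `MicroexonLocus`, `ID_PATTERN`, and `parse_microexon_id` are hypothetical and not part of the database.

```python
import re
from typing import NamedTuple


class MicroexonLocus(NamedTuple):
    """Parsed fields of a microexon ID (hypothetical helper type)."""
    species_tag: str
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def size(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


# Assumed layout: <species>_<chrom>:<start>-<end>:<strand>
ID_PATTERN = re.compile(
    r"^(?P<sp>[^_]+)_(?P<chrom>[^:]+):(?P<start>\d+)-(?P<end>\d+):(?P<strand>[+-])$"
)


def parse_microexon_id(microexon_id: str) -> MicroexonLocus:
    """Split a microexon ID into its components, or raise ValueError."""
    m = ID_PATTERN.match(microexon_id)
    if m is None:
        raise ValueError(f"unrecognized microexon ID: {microexon_id!r}")
    return MicroexonLocus(
        m["sp"], m["chrom"], int(m["start"]), int(m["end"]), m["strand"]
    )


locus = parse_microexon_id("Ha_10:36803874-36803879:-")
print(locus, locus.size)
```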
Microexon DNA seq CTAGAG
Microexon Amino Acid seq ARA
Microexon-tag DNA Seq AAAGAACATGTGCGTGAGGAATTTCAGAAGTACCCAAGGGGATCCCCACAAGCTAGAGCTTATCGGAATGATGGCATTCAGATCCGATCAACACATTCAAAATCTCCC
Microexon-tag Amino Acid Seq KEHVREEFQKYPRGSPQARAYRNDGIQIRSTHSKSP
Microexon-tag spanning region 36803671-36804698
Microexon-tag prediction score 0.9585
Overlapped with the annotated transcript (%) 100
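The microexon-tag fields above can be cross-checked against each other. The sketch below splits the 108-nt tag using the listed structure (52,6,50), recovers the microexon, and translates the tag with the standard genetic code. The helper names `translate` and `split_tag` are hypothetical. Treating the phase as the upstream flank length modulo 3 is an assumption; it happens to agree with the listed Phase of 1.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag: str, sizes: tuple) -> tuple:
    """Split a microexon-tag into (upstream flank, microexon, downstream flank)."""
    left, micro, right = sizes
    if left + micro + right != len(tag):
        raise ValueError("segment sizes do not sum to the tag length")
    return tag[:left], tag[left:left + micro], tag[left + micro:]


# Microexon-tag DNA Seq and structure as listed in this record.
TAG = (
    "AAAGAACATGTGCGTGAGGAATTTCAGAAGTACCCAAGGGGATCCCCACAAGCTAGAGCTTATCGGAATG"
    "ATGGCATTCAGATCCGATCAACACATTCAAAATCTCCC"
)
upstream, microexon, downstream = split_tag(TAG, (52, 6, 50))

# Assumption: phase = number of bases the upstream flank leaves in an open codon.
phase = len(upstream) % 3

print(microexon, phase, translate(TAG))
```

Because the upstream flank ends one base into a codon, the 6-nt microexon contributes to three residues (ARA) rather than two.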
New Transcript ID OTG10177x
Reference Transcript ID OTG10177
Gene ID HannXRQ_Chr10g0284731
Gene Name NA
Transcript ID OTG10177
Protein ID OTG10177
Gene ID HannXRQ_Chr10g0284731
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq
>OTG10177
MSNREDSDIDDDFADLYKEYTGPVKSTTTSVPETTKTSKRSRADSDEEEQETLDPNAVPTDFTSREAKVWEAKSKATERN
WKKRTEEEMICKICGDSSHFTQGCPSTLGASRKSQDLFERVPARDPQVKALFTDRVVKNIEKDIGCKIKIEEKFIIVSAK
EGHILSKGVDAVHKIKNEGVKTGESNSNMVESDSRSPKGRISVAPKVGPADSPRSNPSPRNVSNYDQRSGRHDKVIKEHV
REEFQKYPRGSPQARAYRNDGIQIRSTHSKSPARRPPYSGGSYDLKDNHKCGPGLNLQPTHKEGRSMFSLALEELELEYT
RDAMDIAKFRDKEEDEENYRHRKAIEEIRGIYMKKLATIRDMHAKQWEEFLRFETHKRQQETRQEMPNVGFGGYRNNNHY
DYGDAAAAGNPYINNMQMESRSRHPEIDNYPSLRPGNNYNDFQRQRHEDYGEAYNRY*
CDS seq
>OTG10177
ATGTCTAATAGAGAAGATTCAGACATTGATGATGATTTTGCTGATCTCTACAAGGAATACACCGGTCCGGTAAAATCTAC
TACCACTAGTGTTCCTGAAACCACAAAAACGAGCAAAAGGTCTCGTGCTGATTCAGATGAAGAAGAACAAGAAACCCTAG
ACCCTAATGCGGTCCCCACTGATTTTACCAGCCGTGAAGCGAAGGTTTGGGAGGCAAAATCTAAAGCTACTGAAAGAAAC
TGGAAAAAAAGAACGGAAGAAGAAATGATTTGTAAAATATGTGGAGATTCCAGTCATTTTACACAGGGTTGTCCATCTAC
TCTTGGGGCTAGTCGCAAGTCACAAGATTTATTTGAACGGGTTCCCGCAAGAGATCCTCAAGTAAAAGCGCTTTTCACCG
ATAGGGTGGTTAAAAACATTGAGAAGGATATCGGGTGCAAAATCAAAATCGAGGAAAAATTTATAATCGTTAGTGCCAAA
GAAGGACATATTTTATCAAAGGGTGTGGATGCTGTTCACAAAATTAAGAACGAGGGTGTCAAAACGGGTGAATCCAATTC
AAACATGGTTGAATCTGATTCCAGGTCACCGAAAGGTAGAATTTCTGTTGCCCCTAAAGTAGGACCCGCTGATTCTCCGA
GGTCTAACCCTAGTCCAAGAAATGTATCAAATTACGACCAGAGGTCTGGCAGACATGATAAAGTGATTAAAGAACATGTG
CGTGAGGAATTTCAGAAGTACCCAAGGGGATCCCCACAAGCTAGAGCTTATCGGAATGATGGCATTCAGATCCGATCAAC
ACATTCAAAATCTCCCGCTCGCCGTCCACCGTACTCTGGTGGCTCATATGACTTGAAAGATAATCACAAATGCGGGCCCG
GGCTCAATTTGCAGCCTACTCATAAGGAAGGCCGTTCTATGTTTTCACTGGCATTGGAGGAATTAGAGTTGGAGTATACG
AGGGATGCAATGGATATTGCAAAATTTAGAGATAAGGAAGAAGATGAAGAGAATTATAGGCATCGTAAGGCGATTGAAGA
AATCAGGGGAATTTACATGAAAAAGTTGGCTACAATAAGAGACATGCATGCAAAACAATGGGAAGAATTTCTTCGATTTG
AAACACATAAAAGACAACAAGAAACTCGTCAAGAAATGCCTAATGTAGGGTTTGGTGGTTATAGAAATAATAATCATTAT
GATTATGGTGATGCTGCTGCTGCTGGCAATCCTTATATTAACAATATGCAAATGGAATCAAGATCAAGGCATCCAGAAAT
TGACAACTATCCTTCTTTGAGGCCCGGTAACAATTATAATGATTTTCAACGTCAAAGGCATGAGGATTACGGGGAAGCAT
ATAATCGTTATTAA
Microexon DNA seq CTAGAG
Microexon Amino Acid seq ARA
Microexon-tag DNA Seq AAAGAACATGTGCGTGAGGAATTTCAGAAGTACCCAAGGGGATCCCCACAAGCTAGAGCTTATCGGAATGATGGCATTCAGATCCGATCAACACATTCAAAATCTCCC
Microexon-tag Amino Acid Seq KEHVREEFQKYPRGSPQARAYRNDGIQIRSTHSKSP
Transcript ID Ha.3910.1
Gene ID Ha.3910
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq
>Ha.3910.1
MSNREDSDIDDDFADLYKEYTGPVKSTTTSVPETTKTSKRSRADSDEEEQETLDPNAVPTDFTSREAKVWEAKSKATERN
WKKRTEEEMICKICGDSSHFTQGCPSTLGASRKSQDLFERVPARDPQVKALFTDRVVKNIEKDIGCKIKIEEKFIIVSAK
EGHILSKGVDAVHKIKNEGVKTGESNSNMVESDSRSPKGRISVAPKVGPADSPRSNPSPRNVSNYDQRSGRHDKVIKEHV
REEFQKYPRGSPQARAYRNDGIQIRSTHSKSPARRPPYSGGSYDLKDNHKCGPGLNLQPTHKEGRSMFSLALEELELEYT
RDAMDIAKFRDKEEDEENYRHRKAIEEIRGIYMKKLATIRDMHAKQWEEFLRFETHKRQQETRQEMPNVGFGGYRNNNHY
DYGDAAAAGNPYINNMQMESRSRHPEIDNYPSLRPGNNYNDFQRQRHEDYGEAYNRY*
CDS seq
>Ha.3910.1
ATGTCTAATAGAGAAGATTCAGACATTGATGATGATTTTGCTGATCTCTACAAGGAATACACCGGTCCGGTAAAATCTAC
TACCACTAGTGTTCCTGAAACCACAAAAACGAGCAAAAGGTCTCGTGCTGATTCAGATGAAGAAGAACAAGAAACCCTAG
ACCCTAATGCGGTCCCCACTGATTTTACCAGCCGTGAAGCGAAGGTTTGGGAGGCAAAATCTAAAGCTACTGAAAGAAAC
TGGAAAAAAAGAACGGAAGAAGAAATGATTTGTAAAATATGTGGAGATTCCAGTCATTTTACACAGGGTTGTCCATCTAC
TCTTGGGGCTAGTCGCAAGTCACAAGATTTATTTGAACGGGTTCCCGCAAGAGATCCTCAAGTAAAAGCGCTTTTCACCG
ATAGGGTGGTTAAAAACATTGAGAAGGATATCGGGTGCAAAATCAAAATCGAGGAAAAATTTATAATCGTTAGTGCCAAA
GAAGGACATATTTTATCAAAGGGTGTGGATGCTGTTCACAAAATTAAGAACGAGGGTGTCAAAACGGGTGAATCCAATTC
AAACATGGTTGAATCTGATTCCAGGTCACCGAAAGGTAGAATTTCTGTTGCCCCTAAAGTAGGACCCGCTGATTCTCCGA
GGTCTAACCCTAGTCCAAGAAATGTATCAAATTACGACCAGAGGTCTGGCAGACATGATAAAGTGATTAAAGAACATGTG
CGTGAGGAATTTCAGAAGTACCCAAGGGGATCCCCACAAGCTAGAGCTTATCGGAATGATGGCATTCAGATCCGATCAAC
ACATTCAAAATCTCCCGCTCGCCGTCCACCGTACTCTGGTGGCTCATATGACTTGAAAGATAATCACAAATGCGGGCCCG
GGCTCAATTTGCAGCCTACTCATAAGGAAGGCCGTTCTATGTTTTCACTGGCATTGGAGGAATTAGAGTTGGAGTATACG
AGGGATGCAATGGATATTGCAAAATTTAGAGATAAGGAAGAAGATGAAGAGAATTATAGGCATCGTAAGGCGATTGAAGA
AATCAGGGGAATTTACATGAAAAAGTTGGCTACAATAAGAGACATGCATGCAAAACAATGGGAAGAATTTCTTCGATTTG
AAACACATAAAAGACAACAAGAAACTCGTCAAGAAATGCCTAATGTAGGGTTTGGTGGTTATAGAAATAATAATCATTAT
GATTATGGTGATGCTGCTGCTGCTGGCAATCCTTATATTAACAATATGCAAATGGAATCAAGATCAAGGCATCCAGAAAT
TGACAACTATCCTTCTTTGAGGCCCGGTAACAATTATAATGATTTTCAACGTCAAAGGCATGAGGATTACGGGGAAGCAT
ATAATCGTTATTAA