Microexon ID Ha_1:45066965-45066972:-
Species Helianthus annuus
Coordinates 1:45066965..45066972
Microexon Cluster ID MEP20
Size 8
Phase 2
Pfam Domain Motif VSP
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 16,34,8,50
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq TWTRTTCCKGGMARAGATSAAAATGGAARYTWYSYRCCHYTRMAGWYRAGHHMARRWGGWWTAKCWRGTGGWGYYATWGCTGGMATATCTRTWGSWGGAGTWRCHGGK
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq CAATAGGG
Microexon Amino Acid seq SNRG
Microexon-tag DNA Seq TTTATTCCTGGAAGAGATCAAAATGGAACTTTTCCGCCATTAAAGATCAGCAATAGGGGGTTATCTGGTGGTGCCACTGCTGGTGTATCTGTTGGAGCAGTTGCAGGT
Microexon-tag Amino Acid Seq FIPGRDQNGTFPPLKISNRGLSGGATAGVSVGAVAG
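The structure, location, phase, and sequence fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of any database tooling: the helper names `split_tag` and `translate` are hypothetical, the standard genetic code is assumed, and the tag is assumed to start on a codon boundary. It splits the microexon-tag DNA sequence by the listed sizes 16,34,8,50, picks the segment at location 3, and translates the tag in frame.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Values copied from this record.
TAG_DNA = ("TTTATTCCTGGAAGAGATCAAAATGGAACTTTTCCGCCATTAAAGATCAG"
           "CAATAGGGGGTTATCTGGTGGTGCCACTGCTGGTGTATCTGTTGGAGCAGTTGCAGGT")
SEGMENT_SIZES = [16, 34, 8, 50]
MICROEXON_LOCATION = 3  # 1-based position of the microexon among the segments


def split_tag(seq, sizes):
    """Cut seq into consecutive segments of the given sizes."""
    if sum(sizes) != len(seq):
        raise ValueError("segment sizes do not add up to the tag length")
    segments, start = [], 0
    for size in sizes:
        segments.append(seq[start:start + size])
        start += size
    return segments


def translate(seq):
    """Translate seq in frame 0, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


segments = split_tag(TAG_DNA, SEGMENT_SIZES)
microexon = segments[MICROEXON_LOCATION - 1]

# Number of bases of the split codon that lie upstream of the microexon.
# Under the usual intron-phase convention (an assumption here), this
# remainder corresponds to the phase of the upstream intron.
upstream_offset = sum(SEGMENT_SIZES[:MICROEXON_LOCATION - 1]) % 3

print(microexon)           # CAATAGGG
print(translate(TAG_DNA))  # FIPGRDQNGTFPPLKISNRGLSGGATAGVSVGAVAG
print(upstream_offset)     # 2
```

The three printed values agree with the Microexon DNA seq, Microexon-tag Amino Acid Seq, and Phase fields of this record.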
Microexon-tag spanning region 45066792-45070211
Microexon-tag prediction score 0.9494
Overlapped with the annotated transcript (%) 85.19
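The coordinate fields can be checked in the same spirit. The sketch below assumes that the microexon ID follows the layout `<species prefix>_<chromosome>:<start>-<end>:<strand>` and that coordinates are 1-based and inclusive; both are inferences from this record rather than documented conventions, and `parse_microexon_id` is a hypothetical helper.

```python
import re

# Values copied from this record.
MICROEXON_ID = "Ha_1:45066965-45066972:-"
TAG_REGION = (45066792, 45070211)

# Assumed ID layout: <species prefix>_<chromosome>:<start>-<end>:<strand>
ID_PATTERN = re.compile(
    r"^(?P<prefix>[^_]+)_(?P<chrom>[^:]+):"
    r"(?P<start>\d+)-(?P<end>\d+):(?P<strand>[+-])$"
)


def parse_microexon_id(identifier):
    """Split a microexon ID into its fields, with integer coordinates."""
    match = ID_PATTERN.match(identifier)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {identifier!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


exon = parse_microexon_id(MICROEXON_ID)

# Inclusive coordinates: length is end - start + 1.
length = exon["end"] - exon["start"] + 1

# The microexon should lie inside the region spanned by its tag.
inside = TAG_REGION[0] <= exon["start"] and exon["end"] <= TAG_REGION[1]

print(exon["chrom"], exon["strand"], length, inside)  # 1 - 8 True
```

Under these assumptions the computed length matches the Size field (8), and the microexon falls within the microexon-tag spanning region.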
New Transcript ID OTG36336x
Reference Transcript ID OTG36336
Gene ID HannXRQ_Chr01g0006491
Gene Name NA
Transcript ID OTG36336
Protein ID OTG36336
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >OTG36336
MNNNISNTKRHILQLGLCVLFFIVKIESKCSNGCDLAFASYYVTQGSNLTYISNIFGQSIPMILKYNPQIPRSDTIESGT
RINVPFSCLCLNGDFLGHTFIYHTQVGDTYSKVAKEVFANLTDEYWVQLVNIFDPAQIPDFADVNVTVNCTCGDKHVSKD
YGLFATYPLQPGEDLWSLSKDSGVPMMLLERFNPGSNFSAGSGPVFVPAKDQNGTFPPLKISNRGLSGGATAGVSVGAVA
GVLFFAICFYFVSYRRKRVGEGSLLLEDGPEHVHDVGLRRNMGGTTNSGTLTGGAHPGATGITVDKSVEFTYEELAMATD
DFSITNKIGQGGFGTVYYGELRGEKAAIKKMEMQASKEFLAELKVLTHVHHLNLVRLIGYCVEGSLFLVYEFIGNGNLSQ
HLRGSSGRESIPWATRVQIALDAARGLEYIHEHTVPVYIHRDIKSPNILIDEDFRAKVADFGLTKLTEVGSGSLQTRLVG
TFGYMPPEYAQYGEVSPKVDVYAFGVVLFELVSAKEAIVKTNEFGNESKGLVGLFDEVLSLSDPSEGLRKLVDPRLGADY
PLDSVRKVALLARACTHDNAQLRPSMRSIVVALMTLSSLTEDWDVGSFYENQDLAQLMSGR*
CDS seq >OTG36336
ATGAACAACAATATTTCAAATACAAAGAGACATATACTACAATTGGGGCTTTGTGTTCTTTTCTTCATTGTCAAGATTGA
ATCCAAGTGTAGCAATGGTTGTGATCTGGCCTTTGCATCATATTATGTCACACAAGGGTCAAATCTGACTTACATAAGCA
ACATTTTTGGCCAATCAATCCCTATGATCCTTAAGTACAATCCACAAATCCCGAGAAGCGACACCATCGAAAGTGGGACC
CGAATTAACGTTCCATTTTCTTGTCTATGTTTGAATGGTGATTTTTTAGGTCACACGTTTATATACCACACACAAGTTGG
CGATACTTATAGTAAGGTTGCAAAAGAGGTATTTGCAAATCTTACCGATGAGTATTGGGTCCAACTGGTTAACATATTTG
ACCCGGCTCAAATACCAGATTTTGCGGATGTAAATGTTACCGTGAATTGTACATGTGGCGACAAACACGTGTCGAAGGAT
TACGGGTTGTTCGCAACATACCCTCTTCAACCCGGAGAAGATTTGTGGTCCTTAAGTAAAGATTCGGGTGTCCCAATGAT
GTTGTTGGAGCGGTTCAACCCGGGATCCAATTTCAGTGCCGGGTCGGGACCAGTGTTTGTGCCGGCCAAAGATCAAAATG
GAACTTTTCCGCCATTAAAGATCAGCAATAGGGGGTTATCTGGTGGTGCCACTGCTGGTGTATCTGTTGGAGCAGTTGCA
GGTGTTCTTTTCTTTGCCATTTGCTTCTACTTTGTCTCCTATCGAAGGAAGAGGGTGGGCGAAGGATCACTGCTTCTTGA
AGATGGACCCGAACATGTTCATGATGTAGGTTTACGTAGAAACATGGGAGGAACCACAAACTCGGGCACTCTAACTGGGG
GTGCACATCCAGGGGCCACGGGTATAACCGTGGACAAGTCAGTGGAGTTCACATACGAGGAGCTTGCTATGGCTACAGAT
GACTTTAGCATCACCAATAAGATTGGTCAAGGCGGCTTTGGTACCGTTTACTATGGAGAGCTCAGAGGCGAAAAAGCTGC
AATAAAGAAGATGGAAATGCAAGCATCAAAAGAATTTCTTGCTGAATTAAAGGTGTTAACACATGTTCATCATTTGAACC
TGGTACGGTTGATAGGATATTGTGTTGAAGGATCGTTGTTTTTGGTCTATGAGTTTATCGGGAATGGCAATTTGAGTCAA
CATTTGCGCGGTTCATCGGGCAGAGAGTCAATTCCATGGGCTACAAGGGTTCAAATCGCCCTTGATGCGGCTAGAGGGCT
TGAATACATTCATGAGCATACAGTTCCTGTCTATATACATCGCGATATCAAGTCTCCTAACATTTTAATCGATGAAGATT
TCCGTGCAAAGGTTGCTGACTTTGGGCTGACAAAACTTACCGAAGTCGGGAGTGGTTCTTTACAAACCCGTTTAGTTGGC
ACATTCGGCTACATGCCTCCCGAATATGCTCAGTATGGGGAGGTTTCTCCAAAGGTTGATGTATATGCTTTCGGGGTCGT
ATTGTTTGAGCTAGTATCAGCGAAAGAAGCCATTGTCAAGACAAATGAATTTGGTAACGAGTCCAAAGGACTTGTTGGTT
TGTTTGATGAGGTCCTAAGTTTGTCTGACCCAAGTGAAGGTTTACGCAAACTAGTCGACCCAAGGCTTGGGGCCGACTAC
CCTCTCGACTCAGTGCGCAAGGTTGCACTGCTTGCTAGAGCTTGTACACATGATAATGCGCAGTTGAGGCCAAGCATGAG
GTCTATTGTCGTTGCGCTAATGACACTTTCATCATTGACCGAGGATTGGGACGTTGGATCTTTCTACGAAAATCAAGATT
TAGCTCAACTGATGTCAGGGAGGTAG
Microexon DNA seq CAATAGGG
Microexon Amino Acid seq SNRG
Microexon-tag DNA Seq TTTGTGCCGGCCAAAGATCAAAATGGAACTTTTCCGCCATTAAAGATCAGCAATAGGGGGTTATCTGGTGGTGCCACTGCTGGTGTATCTGTTGGAGCAGTTGCAGGT
Microexon-tag Amino Acid seq FVPAKDQNGTFPPLKISNRGLSGGATAGVSVGAVAG
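This record lists two microexon-tag DNA sequences of the same length that are not identical. A minimal sketch for locating where they differ is given below; the names `FIRST_TAG` and `SECOND_TAG` simply follow the order in which the sequences appear in this record, and the 16-nt boundary is taken from the first value of the structure field (16,34,8,50).

```python
# Both sequences copied from this record, in order of appearance.
FIRST_TAG = ("TTTATTCCTGGAAGAGATCAAAATGGAACTTTTCCGCCATTAAAGATCAG"
             "CAATAGGGGGTTATCTGGTGGTGCCACTGCTGGTGTATCTGTTGGAGCAGTTGCAGGT")
SECOND_TAG = ("TTTGTGCCGGCCAAAGATCAAAATGGAACTTTTCCGCCATTAAAGATCAG"
              "CAATAGGGGGTTATCTGGTGGTGCCACTGCTGGTGTATCTGTTGGAGCAGTTGCAGGT")
FIRST_SEGMENT_SIZE = 16  # first value of the structure field

if len(FIRST_TAG) != len(SECOND_TAG):
    raise ValueError("tags must have equal length for a positional comparison")

# 0-based positions at which the two tags carry different bases.
mismatches = [i for i, (a, b) in enumerate(zip(FIRST_TAG, SECOND_TAG)) if a != b]

# True when every difference falls inside the first flanking segment.
confined_to_first_segment = all(i < FIRST_SEGMENT_SIZE for i in mismatches)

for i in mismatches:
    print(f"position {i + 1}: {FIRST_TAG[i]} -> {SECOND_TAG[i]}")
print(len(mismatches), confined_to_first_segment)
```

With the sequences as listed, all differences fall in the first 16 nt, so the 34-nt segment, the microexon itself, and the final 50-nt segment are identical between the two tags.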
Transcript ID OTG36336
Gene ID Ha.702
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >OTG36336
MNNNISNTKRHILQLGLCVLFFIVKIESKCSNGCDLAFASYYVTQGSNLTYISNIFGQSIPMILKYNPQIPRSDTIESGT
RINVPFSCLCLNGDFLGHTFIYHTQVGDTYSKVAKEVFANLTDEYWVQLVNIFDPAQIPDFADVNVTVNCTCGDKHVSKD
YGLFATYPLQPGEDLWSLSKDSGVPMMLLERFNPGSNFSAGSGPVFVPAKDQNGTFPPLKISNRGLSGGATAGVSVGAVA
GVLFFAICFYFVSYRRKRVGEGSLLLEDGPEHVHDVGLRRNMGGTTNSGTLTGGAHPGATGITVDKSVEFTYEELAMATD
DFSITNKIGQGGFGTVYYGELRGEKAAIKKMEMQASKEFLAELKVLTHVHHLNLVRLIGYCVEGSLFLVYEFIGNGNLSQ
HLRGSSGRESIPWATRVQIALDAARGLEYIHEHTVPVYIHRDIKSPNILIDEDFRAKVADFGLTKLTEVGSGSLQTRLVG
TFGYMPPEYAQYGEVSPKVDVYAFGVVLFELVSAKEAIVKTNEFGNESKGLVGLFDEVLSLSDPSEGLRKLVDPRLGADY
PLDSVRKVALLARACTHDNAQLRPSMRSIVVALMTLSSLTEDWDVGSFYENQDLAQLMSGR*
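The protein sequence above is wrapped at 80 columns, and the microexon-tag peptide crosses a line break, so a per-line text search will not find it. The sketch below illustrates this with a two-line excerpt of the sequence; only those two lines are used to keep the example short, so the reported offsets are relative to the excerpt, not to the full protein.

```python
# Values copied from this record.
TAG_PEPTIDE = "FVPAKDQNGTFPPLKISNRGLSGGATAGVSVGAVAG"
MICROEXON_PEPTIDE = "SNRG"

# Two consecutive 80-column lines excerpted from the protein sequence.
WRAPPED_LINES = [
    "YGLFATYPLQPGEDLWSLSKDSGVPMMLLERFNPGSNFSAGSGPVFVPAKDQNGTFPPLKISNRGLSGGATAGVSVGAVA",
    "GVLFFAICFYFVSYRRKRVGEGSLLLEDGPEHVHDVGLRRNMGGTTNSGTLTGGAHPGATGITVDKSVEFTYEELAMATD",
]

# Searching line by line misses a peptide that spans the wrap.
found_per_line = [TAG_PEPTIDE in line for line in WRAPPED_LINES]

# Joining the lines first restores the contiguous sequence.
joined = "".join(line.strip() for line in WRAPPED_LINES)
tag_start = joined.find(TAG_PEPTIDE)

# Offset of the microexon-encoded residues within the tag peptide.
microexon_offset = joined.find(MICROEXON_PEPTIDE, tag_start) - tag_start

print(found_per_line)
print(tag_start, microexon_offset)
```

Joining wrapped lines before searching is the usual precaution with FASTA-style blocks; the same applies to the CDS sequence in this record.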
CDS seq >OTG36336
ATGAACAACAATATTTCAAATACAAAGAGACATATACTACAATTGGGGCTTTGTGTTCTTTTCTTCATTGTCAAGATTGA
ATCCAAGTGTAGCAATGGTTGTGATCTGGCCTTTGCATCATATTATGTCACACAAGGGTCAAATCTGACTTACATAAGCA
ACATTTTTGGCCAATCAATCCCTATGATCCTTAAGTACAATCCACAAATCCCGAGAAGCGACACCATCGAAAGTGGGACC
CGAATTAACGTTCCATTTTCTTGTCTATGTTTGAATGGTGATTTTTTAGGTCACACGTTTATATACCACACACAAGTTGG
CGATACTTATAGTAAGGTTGCAAAAGAGGTATTTGCAAATCTTACCGATGAGTATTGGGTCCAACTGGTTAACATATTTG
ACCCGGCTCAAATACCAGATTTTGCGGATGTAAATGTTACCGTGAATTGTACATGTGGCGACAAACACGTGTCGAAGGAT
TACGGGTTGTTCGCAACATACCCTCTTCAACCCGGAGAAGATTTGTGGTCCTTAAGTAAAGATTCGGGTGTCCCAATGAT
GTTGTTGGAGCGGTTCAACCCGGGATCCAATTTCAGTGCCGGGTCGGGACCAGTGTTTGTGCCGGCCAAAGATCAAAATG
GAACTTTTCCGCCATTAAAGATCAGCAATAGGGGGTTATCTGGTGGTGCCACTGCTGGTGTATCTGTTGGAGCAGTTGCA
GGTGTTCTTTTCTTTGCCATTTGCTTCTACTTTGTCTCCTATCGAAGGAAGAGGGTGGGCGAAGGATCACTGCTTCTTGA
AGATGGACCCGAACATGTTCATGATGTAGGTTTACGTAGAAACATGGGAGGAACCACAAACTCGGGCACTCTAACTGGGG
GTGCACATCCAGGGGCCACGGGTATAACCGTGGACAAGTCAGTGGAGTTCACATACGAGGAGCTTGCTATGGCTACAGAT
GACTTTAGCATCACCAATAAGATTGGTCAAGGCGGCTTTGGTACCGTTTACTATGGAGAGCTCAGAGGCGAAAAAGCTGC
AATAAAGAAGATGGAAATGCAAGCATCAAAAGAATTTCTTGCTGAATTAAAGGTGTTAACACATGTTCATCATTTGAACC
TGGTACGGTTGATAGGATATTGTGTTGAAGGATCGTTGTTTTTGGTCTATGAGTTTATCGGGAATGGCAATTTGAGTCAA
CATTTGCGCGGTTCATCGGGCAGAGAGTCAATTCCATGGGCTACAAGGGTTCAAATCGCCCTTGATGCGGCTAGAGGGCT
TGAATACATTCATGAGCATACAGTTCCTGTCTATATACATCGCGATATCAAGTCTCCTAACATTTTAATCGATGAAGATT
TCCGTGCAAAGGTTGCTGACTTTGGGCTGACAAAACTTACCGAAGTCGGGAGTGGTTCTTTACAAACCCGTTTAGTTGGC
ACATTCGGCTACATGCCTCCCGAATATGCTCAGTATGGGGAGGTTTCTCCAAAGGTTGATGTATATGCTTTCGGGGTCGT
ATTGTTTGAGCTAGTATCAGCGAAAGAAGCCATTGTCAAGACAAATGAATTTGGTAACGAGTCCAAAGGACTTGTTGGTT
TGTTTGATGAGGTCCTAAGTTTGTCTGACCCAAGTGAAGGTTTACGCAAACTAGTCGACCCAAGGCTTGGGGCCGACTAC
CCTCTCGACTCAGTGCGCAAGGTTGCACTGCTTGCTAGAGCTTGTACACATGATAATGCGCAGTTGAGGCCAAGCATGAG
GTCTATTGTCGTTGCGCTAATGACACTTTCATCATTGACCGAGGATTGGGACGTTGGATCTTTCTACGAAAATCAAGATT
TAGCTCAACTGATGTCAGGGAGGTAG