Microexon ID Ha_9:46802676-46802686:-
Species Helianthus annuus
Coordinates 9:46802676..46802686
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag consensus DNA Seq (IUPAC ambiguity codes) MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
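The degenerate microexon-tag sequence listed above is written with IUPAC nucleotide ambiguity codes (M, R, W, Y, K, D, H and so on). The following is a minimal Python sketch of how a concrete sequence could be compared against such a pattern position by position. The function name `iupac_mismatches` is hypothetical and not part of the database; the example uses only the first ten bases of the degenerate sequence and of this record's own microexon-tag.

```python
# IUPAC nucleotide ambiguity codes mapped to the bases each one allows.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}

def iupac_mismatches(pattern, seq):
    """Return the 0-based positions where seq is not allowed by pattern.

    Hypothetical helper for illustration only.
    """
    if len(pattern) != len(seq):
        raise ValueError("pattern and sequence must have equal length")
    return [i for i, (p, s) in enumerate(zip(pattern, seq))
            if s not in IUPAC[p]]

# First ten bases of the degenerate sequence and of this record's tag.
pattern_head = "MRRGADRYTG"
tag_head = "ACGGCTACGG"
print(iupac_mismatches(pattern_head, tag_head))  # [1, 4, 8]
```

A non-empty result is not necessarily an error: a degenerate sequence summarising many aligned members may not admit every base of each individual member.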
Microexon DNA seq GAGAACTTAAG
Microexon Amino Acid seq GELK
Microexon-tag DNA Seq ACGGCTACGGAAAGCGATCCACATTTGTTCTCATGCACATTTTCAAAAGGAGAACTTAAGGTGACTGAGATATACAACTTCAACCAGGATGATTTGATGACTGAAGAT
Microexon-tag Amino Acid Seq TATESDPHLFSCTFSKGELKVTEIYNFNQDDLMTED
Microexon-tag spanning region 46800678-46802865
Microexon-tag prediction score 0.9634
Overlapped with the annotated transcript (%) 100
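Several of the fields above are related arithmetically: the coordinates give the microexon size, the three sizes in the tag structure sum to the tag length, and the microexon residues follow from translating the tag in frame. The Python sketch below checks these relationships for this record. All variable and function names are illustrative, the standard genetic code is assumed, and reading "Phase 1" as "one codon base precedes the microexon" is an assumption that happens to be consistent with the sequences shown here.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate complete codons of dna, starting at its first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# Values copied from the record above.
start, end = 46802676, 46802686
up, micro, down = 49, 11, 48  # flanking exon, microexon, flanking exon
tag = ("ACGGCTACGGAAAGCGATCCACATTTGTTCTCATGCACATTTTCAAAAGG"
       "AGAACTTAAGGTGACTGAGATATACAACTTCAACCAGGATGATTTGATGACTGAAGAT")

size = end - start + 1               # inclusive coordinates
microexon = tag[up:up + micro]       # slice out the middle segment
protein = translate(tag)

# Residues whose codons overlap the microexon (0-based codon indices).
first_codon = up // 3
last_codon = (up + micro - 1) // 3
microexon_aa = protein[first_codon:last_codon + 1]

print(size)                          # 11
print(len(tag) == up + micro + down) # True
print(microexon)                     # GAGAACTTAAG
print(protein)                       # TATESDPHLFSCTFSKGELKVTEIYNFNQDDLMTED
print(microexon_aa)                  # GELK
print(up % 3)                        # 1
```

Under the stated reading of phase, the upstream flank of 49 nt leaves one base of the glycine codon (GGA) before the microexon, so the first microexon residue is encoded across the exon junction.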
New Transcript ID OTG13993x
Reference Transcript ID OTG13993
Gene ID HannXRQ_Chr09g0244421
Gene Name NA
Transcript ID OTG13993
Protein ID OTG13993
Gene ID HannXRQ_Chr09g0244421
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >OTG13993
MSVSMRDLDPAFQGVGQKAGIELWRIENFKPVPVPQSSYGKFYTGDSYVILKTNALKSGVLRHDIHYWLGNDTSQDEAGA
AAIKTIELDAALGGRAVQYREVQGHETERFLSYFKPCIIPQSGGVASGLNHAKPVEHKTRMFVCLGKYVVHVKEVPFARS
SLNHDDIFILDTAHKIFQFNGSNSSIQERAKALEVLQYIRDTYHDGKCDIATIEDGRLMSDAESGEFWGFFGGFAPLPRK
TQTNGVLSTNTLPTKLYCVVKGQAELVDAEPLTRGLLDTNTCYILDCGVEIYVWLGRSTSLNERKAASGATEEYVRSQDR
QKSHIICVIENFETVAFRSKFDSWPQSTAVTVSEDGRGKVAALLKRQGVNVKGLLKAAPTKEEPKPYIDCSGNLQVWRVN
GQQKILLPVSDQSKFYTGDCYIYQYTYSGEDQEECLIGTWFGKQSIEEDRISATSQANKIVESQKFLAAQLRVYEGREPM
LFFAIFQSFMVLKGGLSDGYKKCILESELPNGTGIGEEVALFRVQGSGPENMQAIQLEPVASSLNSSYCYILHSGSSVFT
WIGNLTTPEEQELVERQLDVIKPEMQSRLQKEGSESVEFWKMLGGKSEYPSQKTTTATESDPHLFSCTFSKGELKVTEIY
NFNQDDLMTEDIYILDCHSSIFVWVGQQVDPKTRTQALVIAEKFLERDFLLEKLPLQTPLYIIMEASEPQFFTRFFTWDS
TKSAMHGNSFQRKLSILKNGGRPTSNNKPKRRTPVSNGGRSGAVEQPQRSRSVTFSPERVRVRGRSPAFVALASTFENPT
VRHLSTPPPLETKIYPKSNGSDSSKPASRSTAIASLTATFEQPPRAKFIPRSIKVTSEPPTKTETISNGDLKSSRIETPT
IQKDAKDVETEDEEGLTLYPYERLTVSSTDPAPDINIFKRETYLSSAEFRAKFGMAKNAFYKLPKWKQNKLKMVLHLF*
CDS seq >OTG13993
ATGTCTGTTTCAATGCGAGATTTGGATCCAGCCTTTCAAGGAGTCGGACAGAAAGCAGGAATTGAGTTATGGCGCATTGA
AAATTTTAAGCCAGTTCCTGTCCCGCAATCTTCCTATGGTAAATTTTATACAGGGGACTCCTATGTTATTTTGAAGACGA
ATGCTTTGAAGAGTGGTGTCTTACGTCATGACATCCATTATTGGCTTGGTAACGATACAAGCCAGGACGAAGCTGGCGCT
GCAGCCATAAAGACAATTGAACTTGATGCAGCTCTTGGAGGGCGTGCTGTTCAGTATCGTGAAGTACAAGGACACGAAAC
AGAGAGATTTCTTTCTTACTTTAAACCATGTATTATACCTCAAAGTGGTGGTGTTGCATCTGGTTTGAATCATGCAAAAC
CTGTAGAACACAAGACCCGCATGTTCGTTTGCCTAGGAAAATATGTGGTTCATGTAAAAGAGGTTCCTTTTGCTCGATCC
TCACTCAATCATGATGACATCTTTATCTTGGATACTGCACACAAGATATTCCAGTTTAACGGGTCGAATTCATCCATTCA
AGAAAGGGCTAAAGCACTTGAAGTCTTACAGTATATAAGAGATACTTATCATGATGGGAAGTGTGACATAGCTACTATTG
AGGATGGAAGGCTGATGTCTGATGCGGAAAGCGGAGAATTTTGGGGCTTTTTTGGTGGCTTTGCACCACTTCCTAGGAAA
ACACAGACAAATGGTGTTTTAAGTACCAATACACTTCCTACTAAGCTATATTGTGTTGTGAAGGGTCAGGCAGAACTAGT
TGATGCTGAACCCTTGACAAGGGGGTTGCTAGACACAAATACGTGCTATATACTCGATTGTGGGGTAGAAATTTATGTAT
GGTTGGGGAGAAGTACTTCTCTTAATGAAAGGAAGGCTGCAAGTGGAGCCACAGAAGAATACGTACGTAGTCAGGATAGA
CAAAAATCTCATATAATTTGTGTGATTGAAAATTTTGAGACTGTTGCATTCCGGTCCAAGTTTGACTCATGGCCTCAGTC
AACTGCTGTCACAGTCTCAGAGGATGGTAGAGGCAAGGTTGCTGCACTTCTAAAACGCCAAGGGGTCAATGTGAAGGGTC
TACTAAAAGCTGCTCCAACTAAGGAGGAACCTAAACCCTATATCGACTGCAGTGGGAATTTGCAGGTTTGGCGTGTGAAT
GGTCAACAAAAGATTCTTCTTCCAGTCTCTGATCAGTCAAAATTTTACACTGGAGATTGCTATATATATCAGTATACGTA
TTCTGGAGAAGATCAAGAGGAATGCCTTATTGGGACTTGGTTTGGAAAGCAGAGTATCGAGGAAGACAGAATATCAGCTA
CCTCACAGGCGAACAAGATAGTAGAGTCCCAAAAGTTTTTGGCTGCACAGTTGCGAGTTTATGAAGGAAGGGAACCTATG
TTGTTCTTTGCTATCTTCCAGAGCTTTATGGTTCTAAAGGGTGGTCTTAGTGATGGATACAAGAAATGCATATTAGAGAG
CGAACTTCCTAACGGAACTGGCATAGGGGAAGAAGTTGCACTATTCAGAGTTCAAGGCTCCGGACCAGAAAACATGCAAG
CGATCCAACTTGAACCGGTGGCTTCTTCTTTGAATTCCTCCTACTGTTACATATTACACAGTGGTTCTTCTGTCTTCACA
TGGATTGGGAACCTCACAACTCCTGAGGAACAGGAACTTGTCGAGAGGCAACTTGATGTCATAAAGCCAGAGATGCAGTC
CAGACTGCAGAAAGAAGGCTCAGAATCTGTAGAGTTTTGGAAAATGTTAGGTGGAAAATCTGAATACCCAAGTCAGAAAA
CAACAACGGCTACGGAAAGCGATCCACATTTGTTCTCATGCACATTTTCAAAAGGAGAACTTAAGGTGACTGAGATATAC
AACTTCAACCAGGATGATTTGATGACTGAAGATATATATATTCTCGACTGTCACTCCAGCATCTTTGTTTGGGTAGGCCA
ACAGGTTGATCCCAAGACCAGAACTCAAGCTCTAGTTATCGCGGAGAAATTTCTGGAACGTGATTTTCTTCTTGAGAAAC
TACCACTTCAAACTCCACTATATATCATAATGGAAGCAAGTGAACCGCAATTCTTCACACGCTTCTTTACATGGGATTCG
ACCAAATCAGCTATGCATGGAAACTCATTCCAGAGGAAGCTATCAATACTGAAAAATGGAGGTCGTCCAACATCCAACAA
TAAGCCAAAACGAAGAACACCTGTCTCAAACGGAGGTCGATCTGGAGCAGTGGAACAACCACAACGTTCAAGGAGTGTAA
CGTTTAGTCCTGAACGAGTGCGTGTAAGAGGCAGATCTCCAGCCTTTGTTGCTCTTGCTTCCACATTTGAGAACCCAACC
GTAAGACATCTGTCAACGCCTCCACCACTTGAAACCAAGATCTATCCAAAATCCAACGGCTCAGATTCCTCTAAACCAGC
TTCAAGGTCAACAGCCATTGCGTCACTCACTGCCACCTTTGAACAACCTCCACGTGCAAAGTTTATACCCCGTTCTATAA
AAGTGACGTCTGAGCCACCAACAAAGACCGAAACGATATCCAATGGGGATTTAAAAAGCAGTAGAATAGAGACACCAACC
ATACAAAAAGATGCGAAAGATGTTGAAACCGAAGATGAAGAGGGGCTTACATTATACCCATATGAACGCCTTACTGTATC
ATCCACCGACCCTGCTCCTGATATTAATATCTTCAAGCGCGAGACATATCTGTCGTCAGCTGAGTTCAGAGCAAAATTTG
GAATGGCTAAGAATGCTTTCTACAAGCTGCCAAAATGGAAACAGAATAAGCTGAAAATGGTGCTTCATTTGTTTTGA
Transcript ID OTG13993
Gene ID Ha.55174
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA