Microexon ID Ha_6:12663655-12663662:-
Species Helianthus annuus
Coordinates 6:12663655..12663662
Microexon Cluster ID MEP16
Size 8
Phase 1
Pfam Domain Motif SNF2_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,8,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq BTWWMHTGGCTYATACGAHKRTATSAGMWYGGYRTMAAYGKMATTCTYGSWGATGARATGGGWCTKGGRAARACWYTKCAARCYATHTCHTTSYTGRGYTAYYTRMAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAGACGAG
Microexon Amino Acid seq GDE
Microexon-tag DNA Seq CTGTCATGGCTTATACGACGATACCTTATCGGTGCCAACGTTATTCTCGGAGACGAGATGGGCCTAGGGAAGACATTACAAGCCATCTCCTTCCTTAGTTACTTAAAG
Microexon-tag Amino Acid Seq MSWLIRRYLIGANVILGDEMGLGKTLQAISFLSYLK
Microexon-tag spanning region12663475-12665068
Microexon-tag prediction score0.9429
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG22192x
Reference Transcript ID OTG22192
Gene ID HannXRQ_Chr06g0168811
Gene Name NA
Transcript ID OTG22192
Protein ID OTG22192
Gene ID HannXRQ_Chr06g0168811
Gene Name NA
Pfam domain motif SNF2_N
Motif E-value 3.6e-50
Motif start 58
Motif end 340
Protein seq >OTG22192
MNYEQRLTAAAKYVFDGDARAATEGEVNSSDYGVTAVLKPHQIEGLSWLIRRYLIGANVILGDEMGLGKTLQAISFLSYL
KFCRGIHGPFLILCPLSVTDGWISEVTKFSPKLKVLRYVGDKQYRRALRREMYENVEKQSSSTNVQTLPFDVLLTTYDIA
LIDQDFLSQIPWHYAVIDEAQRLKNPSSVLYNVLRDHYIMPRRLLMTGTPIQNSLTELWALMHFCMPSVFGTLEQFSSTF
KEGKGAPIVKEKFKSLKYVLAAFMLRRTKSRLIETGTLSLPPLTEITLMAPLVTLQKKVYMSILRKELPKLLALSSGTSN
QQSLQNIVIQLRKACSHPYLFAGIEPEPYEEGEHLIEASGKLIVLDQLLQKLRNSGHRVLLFAQMTHTLDILQDYMELRK
YPYERLDGSVRAEERFAAIRSFSRQSVTGNSSSESEPDPDTAFVFMISTRAGGVGLNLVAADTVIFYEQDWNPQVDKQAL
QRAHRIGQMNHVISINLVTERTVEEVIMHRAERKLQLSHDVIGEDAIDKEGKDMVGAEAGDLKSVVLGLRMFDPTTESNE
SSGQLDTSKVNAIVEKVIAFRHGGKSEFEDQTVEFTSKDLLGEKSYDPVLDEASYNSWVQKLKETSETSGDLALEEANKR
LKLEEKHLKAESVRRKAEEKKLAKWEAQGYRSLSVKDVVLPHDNVLSDSGLVNLVYGDCTQPSKICPSEPCIIFSCVDDS
GTWGHGGMFDALARLSSQIPKAYERASEFGDLHLGDLHLLEISGEDEENKDGNVNLWVALAVVQTYNPRRKVPRSSISIP
ELEQCLSKAAFSAAQKSASIHMPRIGHQNGSDRSEWYSVERLIRKYAALYGIKIYVYYFRRSS*
CDS seq >OTG22192
ATGAACTACGAGCAGAGATTAACCGCCGCCGCGAAGTATGTGTTCGACGGAGACGCACGCGCCGCCACTGAGGGTGAAGT
CAACTCTTCCGATTACGGTGTAACGGCGGTGCTTAAGCCTCACCAAATTGAAGGACTGTCATGGCTTATACGACGATACC
TTATCGGTGCCAACGTTATTCTCGGAGACGAGATGGGCCTAGGGAAGACATTACAAGCCATCTCCTTCCTTAGTTACTTA
AAGTTTTGTCGAGGAATACACGGACCCTTCTTGATATTGTGTCCGCTCAGTGTGACAGATGGATGGATATCTGAAGTTAC
CAAATTCTCCCCAAAATTGAAGGTTTTGCGTTACGTTGGAGACAAACAATACAGACGCGCTTTGAGACGGGAAATGTATG
AAAATGTGGAAAAACAATCGTCATCAACCAATGTCCAGACCCTACCTTTTGATGTCCTCCTTACAACATATGATATAGCG
TTAATTGATCAAGACTTTCTTTCACAAATTCCATGGCATTATGCTGTTATTGATGAGGCGCAAAGACTTAAAAACCCTTC
CAGTGTTTTGTATAATGTTCTCAGAGATCACTATATTATGCCTAGAAGACTACTCATGACTGGAACACCTATTCAGAATA
GCCTTACTGAACTTTGGGCGCTCATGCATTTCTGCATGCCTTCGGTTTTTGGGACATTAGAGCAGTTTTCTTCTACATTC
AAGGAAGGTAAAGGTGCACCGATAGTTAAGGAGAAGTTCAAAAGCTTAAAATATGTTTTGGCGGCATTCATGCTTCGAAG
AACAAAATCTAGGCTTATTGAGACTGGAACCCTATCTTTGCCACCTCTTACTGAAATCACTTTGATGGCTCCCTTAGTGA
CATTGCAGAAGAAGGTGTATATGTCAATATTGAGAAAGGAACTTCCTAAGCTTCTAGCATTGTCTTCCGGAACCTCTAAT
CAGCAATCCTTGCAGAATATCGTAATACAGCTACGTAAAGCATGTAGCCATCCTTACCTCTTTGCTGGTATAGAGCCTGA
ACCGTATGAAGAAGGCGAACACTTAATCGAGGCTAGTGGCAAACTTATCGTGTTGGACCAGCTCCTCCAGAAGCTACGTA
ATTCTGGACATCGTGTTCTTCTGTTTGCACAGATGACACATACACTTGACATCTTACAGGACTATATGGAGTTGAGGAAA
TATCCTTACGAACGTCTTGACGGATCTGTTAGGGCTGAAGAACGGTTCGCTGCAATAAGGAGTTTTAGCCGCCAGTCTGT
CACCGGAAATTCCAGTTCTGAATCTGAACCCGATCCGGACACTGCGTTTGTCTTCATGATTTCCACCAGAGCAGGGGGTG
TTGGTTTGAATCTTGTAGCTGCAGACACTGTTATATTTTATGAACAAGATTGGAATCCACAGGTGGATAAGCAGGCTCTA
CAACGGGCTCATAGAATTGGTCAAATGAATCATGTTATTTCTATAAATCTAGTTACTGAACGGACTGTGGAGGAGGTTAT
CATGCATAGAGCAGAGAGAAAATTGCAACTAAGCCATGATGTAATAGGTGAAGACGCGATAGACAAAGAAGGCAAAGATA
TGGTAGGAGCCGAAGCTGGTGATCTGAAATCTGTTGTACTAGGGTTACGTATGTTCGATCCTACTACTGAAAGCAATGAG
AGCTCGGGTCAACTTGATACGTCAAAAGTCAACGCTATTGTTGAAAAGGTTATAGCATTCCGACACGGAGGGAAATCAGA
ATTTGAGGATCAAACGGTTGAGTTTACTTCAAAGGATTTATTAGGTGAAAAGTCATATGACCCTGTTCTTGATGAAGCTT
CGTATAATTCGTGGGTCCAGAAGCTTAAAGAAACATCAGAGACCAGTGGTGATTTGGCGTTAGAGGAGGCAAATAAGAGA
TTGAAACTCGAGGAGAAGCATCTTAAAGCGGAGTCTGTTAGGAGGAAAGCCGAAGAGAAGAAATTAGCAAAATGGGAAGC
TCAGGGATATCGATCTCTGTCGGTTAAAGATGTAGTGCTACCACATGATAATGTCCTGTCGGATTCGGGTTTGGTAAATT
TAGTTTATGGAGACTGCACTCAGCCGTCGAAAATTTGTCCATCAGAGCCATGTATAATATTTAGTTGTGTGGATGATTCC
GGAACTTGGGGCCACGGTGGTATGTTTGATGCACTCGCTAGACTTTCATCACAAATTCCAAAAGCATATGAGCGCGCTTC
TGAATTCGGGGACCTTCATCTTGGTGATCTCCATCTTTTAGAAATTTCTGGTGAAGATGAGGAGAACAAAGATGGTAATG
TTAATCTGTGGGTAGCTCTAGCTGTGGTTCAAACGTATAATCCCAGACGCAAAGTGCCTCGTAGCAGTATCTCTATTCCT
GAACTGGAACAGTGCTTGTCGAAAGCTGCGTTTTCTGCTGCTCAAAAGTCGGCTTCGATCCACATGCCGCGAATTGGTCA
TCAGAATGGGTCAGACCGGTCAGAATGGTACTCGGTTGAACGTCTTATTAGGAAATATGCTGCTTTATATGGCATAAAGA
TATATGTGTATTATTTCCGTCGATCCTCTTAA
Microexon DNA seq GAGACGAG
Microexon Amino Acid seq GDE
Microexon-tag DNA Seq CTGTCATGGCTTATACGACGATACCTTATCGGTGCCAACGTTATTCTCGGAGACGAGATGGGCCTAGGGAAGACATTACAAGCCATCTCCTTCCTTAGTTACTTAAAG
Microexon-tag Amino Acid seq MSWLIRRYLIGANVILGDEMGLGKTLQAISFLSYLK
Transcript ID OTG22192
Gene ID Ha.47433
Gene Name NA
Pfam domain motif SNF2_N
Motif E-value 3.6e-50
Motif start 58
Motif end 340
Protein seq >OTG22192
MNYEQRLTAAAKYVFDGDARAATEGEVNSSDYGVTAVLKPHQIEGLSWLIRRYLIGANVILGDEMGLGKTLQAISFLSYL
KFCRGIHGPFLILCPLSVTDGWISEVTKFSPKLKVLRYVGDKQYRRALRREMYENVEKQSSSTNVQTLPFDVLLTTYDIA
LIDQDFLSQIPWHYAVIDEAQRLKNPSSVLYNVLRDHYIMPRRLLMTGTPIQNSLTELWALMHFCMPSVFGTLEQFSSTF
KEGKGAPIVKEKFKSLKYVLAAFMLRRTKSRLIETGTLSLPPLTEITLMAPLVTLQKKVYMSILRKELPKLLALSSGTSN
QQSLQNIVIQLRKACSHPYLFAGIEPEPYEEGEHLIEASGKLIVLDQLLQKLRNSGHRVLLFAQMTHTLDILQDYMELRK
YPYERLDGSVRAEERFAAIRSFSRQSVTGNSSSESEPDPDTAFVFMISTRAGGVGLNLVAADTVIFYEQDWNPQVDKQAL
QRAHRIGQMNHVISINLVTERTVEEVIMHRAERKLQLSHDVIGEDAIDKEGKDMVGAEAGDLKSVVLGLRMFDPTTESNE
SSGQLDTSKVNAIVEKVIAFRHGGKSEFEDQTVEFTSKDLLGEKSYDPVLDEASYNSWVQKLKETSETSGDLALEEANKR
LKLEEKHLKAESVRRKAEEKKLAKWEAQGYRSLSVKDVVLPHDNVLSDSGLVNLVYGDCTQPSKICPSEPCIIFSCVDDS
GTWGHGGMFDALARLSSQIPKAYERASEFGDLHLGDLHLLEISGEDEENKDGNVNLWVALAVVQTYNPRRKVPRSSISIP
ELEQCLSKAAFSAAQKSASIHMPRIGHQNGSDRSEWYSVERLIRKYAALYGIKIYVYYFRRSS*
CDS seq >OTG22192
ATGAACTACGAGCAGAGATTAACCGCCGCCGCGAAGTATGTGTTCGACGGAGACGCACGCGCCGCCACTGAGGGTGAAGT
CAACTCTTCCGATTACGGTGTAACGGCGGTGCTTAAGCCTCACCAAATTGAAGGACTGTCATGGCTTATACGACGATACC
TTATCGGTGCCAACGTTATTCTCGGAGACGAGATGGGCCTAGGGAAGACATTACAAGCCATCTCCTTCCTTAGTTACTTA
AAGTTTTGTCGAGGAATACACGGACCCTTCTTGATATTGTGTCCGCTCAGTGTGACAGATGGATGGATATCTGAAGTTAC
CAAATTCTCCCCAAAATTGAAGGTTTTGCGTTACGTTGGAGACAAACAATACAGACGCGCTTTGAGACGGGAAATGTATG
AAAATGTGGAAAAACAATCGTCATCAACCAATGTCCAGACCCTACCTTTTGATGTCCTCCTTACAACATATGATATAGCG
TTAATTGATCAAGACTTTCTTTCACAAATTCCATGGCATTATGCTGTTATTGATGAGGCGCAAAGACTTAAAAACCCTTC
CAGTGTTTTGTATAATGTTCTCAGAGATCACTATATTATGCCTAGAAGACTACTCATGACTGGAACACCTATTCAGAATA
GCCTTACTGAACTTTGGGCGCTCATGCATTTCTGCATGCCTTCGGTTTTTGGGACATTAGAGCAGTTTTCTTCTACATTC
AAGGAAGGTAAAGGTGCACCGATAGTTAAGGAGAAGTTCAAAAGCTTAAAATATGTTTTGGCGGCATTCATGCTTCGAAG
AACAAAATCTAGGCTTATTGAGACTGGAACCCTATCTTTGCCACCTCTTACTGAAATCACTTTGATGGCTCCCTTAGTGA
CATTGCAGAAGAAGGTGTATATGTCAATATTGAGAAAGGAACTTCCTAAGCTTCTAGCATTGTCTTCCGGAACCTCTAAT
CAGCAATCCTTGCAGAATATCGTAATACAGCTACGTAAAGCATGTAGCCATCCTTACCTCTTTGCTGGTATAGAGCCTGA
ACCGTATGAAGAAGGCGAACACTTAATCGAGGCTAGTGGCAAACTTATCGTGTTGGACCAGCTCCTCCAGAAGCTACGTA
ATTCTGGACATCGTGTTCTTCTGTTTGCACAGATGACACATACACTTGACATCTTACAGGACTATATGGAGTTGAGGAAA
TATCCTTACGAACGTCTTGACGGATCTGTTAGGGCTGAAGAACGGTTCGCTGCAATAAGGAGTTTTAGCCGCCAGTCTGT
CACCGGAAATTCCAGTTCTGAATCTGAACCCGATCCGGACACTGCGTTTGTCTTCATGATTTCCACCAGAGCAGGGGGTG
TTGGTTTGAATCTTGTAGCTGCAGACACTGTTATATTTTATGAACAAGATTGGAATCCACAGGTGGATAAGCAGGCTCTA
CAACGGGCTCATAGAATTGGTCAAATGAATCATGTTATTTCTATAAATCTAGTTACTGAACGGACTGTGGAGGAGGTTAT
CATGCATAGAGCAGAGAGAAAATTGCAACTAAGCCATGATGTAATAGGTGAAGACGCGATAGACAAAGAAGGCAAAGATA
TGGTAGGAGCCGAAGCTGGTGATCTGAAATCTGTTGTACTAGGGTTACGTATGTTCGATCCTACTACTGAAAGCAATGAG
AGCTCGGGTCAACTTGATACGTCAAAAGTCAACGCTATTGTTGAAAAGGTTATAGCATTCCGACACGGAGGGAAATCAGA
ATTTGAGGATCAAACGGTTGAGTTTACTTCAAAGGATTTATTAGGTGAAAAGTCATATGACCCTGTTCTTGATGAAGCTT
CGTATAATTCGTGGGTCCAGAAGCTTAAAGAAACATCAGAGACCAGTGGTGATTTGGCGTTAGAGGAGGCAAATAAGAGA
TTGAAACTCGAGGAGAAGCATCTTAAAGCGGAGTCTGTTAGGAGGAAAGCCGAAGAGAAGAAATTAGCAAAATGGGAAGC
TCAGGGATATCGATCTCTGTCGGTTAAAGATGTAGTGCTACCACATGATAATGTCCTGTCGGATTCGGGTTTGGTAAATT
TAGTTTATGGAGACTGCACTCAGCCGTCGAAAATTTGTCCATCAGAGCCATGTATAATATTTAGTTGTGTGGATGATTCC
GGAACTTGGGGCCACGGTGGTATGTTTGATGCACTCGCTAGACTTTCATCACAAATTCCAAAAGCATATGAGCGCGCTTC
TGAATTCGGGGACCTTCATCTTGGTGATCTCCATCTTTTAGAAATTTCTGGTGAAGATGAGGAGAACAAAGATGGTAATG
TTAATCTGTGGGTAGCTCTAGCTGTGGTTCAAACGTATAATCCCAGACGCAAAGTGCCTCGTAGCAGTATCTCTATTCCT
GAACTGGAACAGTGCTTGTCGAAAGCTGCGTTTTCTGCTGCTCAAAAGTCGGCTTCGATCCACATGCCGCGAATTGGTCA
TCAGAATGGGTCAGACCGGTCAGAATGGTACTCGGTTGAACGTCTTATTAGGAAATATGCTGCTTTATATGGCATAAAGA
TATATGTGTATTATTTCCGTCGATCCTCTTAA