Microexon ID Ha_3:92364250-92364254:-
Species Helianthus annuus
Coordinates 3:92364250..92364254
Microexon Cluster ID MEP09
Size 5
Phase 2
Pfam Domain Motif SKG6
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 19,34,5,50
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq GTDTWYATTCCDGSMMRAGATSAAAATGGAARYTATCSRCCCYTRMMAWCMAGCWCAGGAWTATCARGTGGAGYYATWGSTGGMATAKYTRTAGSAGYAGTAGYWGKR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq CTCAG
Microexon Amino Acid seq SSG
Microexon-tag DNA Seq CTGTATATTCCTGGGAGAGATGAAAATGGAAACTATCCACCCTTACCAAGCAGCTCAGGATTATCAGGTGGGGCCATAGGTGGAATAGTCGTAGGAATAGTAGCTGTA
Microexon-tag Amino Acid Seq MYIPGRDENGNYPPLPSSSGLSGGAIGGIVVGIVAV
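The fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it assumes the four tag structure values (19,34,5,50) are consecutive exon lengths in nucleotides, that "Microexon location" is a 1-based index into that list, and that the tag is translated in frame 0 with the standard genetic code. The helper names `microexon_span`, `translate`, and `microexon_residues` are hypothetical. Under these assumptions the microexon falls at tag offsets 53 to 58, and the start offset modulo 3 equals the listed Phase of 2; whether the database defines Phase this way is itself an assumption. Note that standard-table translation of the first codon, CTG, gives L, while the record lists M as the first tag residue; the remaining residues agree.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Values copied from this record.
TAG_DNA = "CTGTATATTCCTGGGAGAGATGAAAATGGAAACTATCCACCCTTACCAAGCAGCTCAGGATTATCAGGTGGGGCCATAGGTGGAATAGTCGTAGGAATAGTAGCTGTA"
EXON_SIZES = [19, 34, 5, 50]   # assumed: consecutive exon lengths in nt
MICROEXON_LOCATION = 3         # assumed: 1-based index into EXON_SIZES


def microexon_span(sizes, location):
    """Return the 0-based, half-open (start, end) span of the microexon in the tag."""
    start = sum(sizes[:location - 1])
    return start, start + sizes[location - 1]


def translate(dna):
    """Translate in frame 0 with the standard table, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def microexon_residues(dna, start, end):
    """Return the residues whose codons overlap the nucleotide span [start, end)."""
    protein = translate(dna)
    return protein[start // 3:(end - 1) // 3 + 1]


start, end = microexon_span(EXON_SIZES, MICROEXON_LOCATION)
print("microexon span in tag:", start, end)
print("microexon DNA:", TAG_DNA[start:end])
print("start offset mod 3:", start % 3)
print("overlapping residues:", microexon_residues(TAG_DNA, start, end))
print("tag translation:", translate(TAG_DNA))
```

Because the 5 nt microexon starts in the third position of one codon and ends in the first position of another, it overlaps three codons, which is why the record lists three residues for a five-nucleotide exon.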
Microexon-tag spanning region 92364094-92365706
Microexon-tag prediction score 0.9727
Overlapped with the annotated transcript (%) 82.41
New Transcript ID OTG31250x
Reference Transcript ID OTG31250
Gene ID HannXRQ_Chr03g0073441
Gene Name CERK1
Transcript ID OTG31250
Protein ID OTG31250
Gene ID HannXRQ_Chr03g0073441
Gene Name CERK1
Pfam domain motif SKG6
Motif E-value 0.019
Motif start 226
Motif end 255
Protein seq >OTG31250
MLELNSGFRLATFLSLISTCFIPSAQSRCTRGCNLALGSYYVQSGDELAQISLRFNINNNNNILKYNPSIPNQDSLQAGQ
RINVPFSCGCINGDFLGHVFTYNIQSEDTYDKVAQKSYANLTTADWVQRFNSYDSFRIPDSGTLNVTVNCSCGDSLISKD
YGLFVTYPLQPGETLDSVSSAANLSSDLIRRYNSGANLTTGSGLLYIPGRDENGNYPPLPSSSGLSGGAIGGIVVGIVAV
LLLLAGCFYFGYYKKKKAGSSSALLKNAQVQLVPDQGRNGSLVRGSESSGGAVAGASPGLTGITVDKSVEFSYEELSTAT
DDFSLANKIGQGGFGAVYYAELRGEKAAIKKMDMQASREFLAELKVLTHVHHLNLVRLIGYCVEGSLFLVYEYIENGNLS
QHLHGSGRDPLPWSTRVQIALDSARGLEYIHEHTVPVYIHRDIKSANILIDKNFHGKVADFGLTKLTEVGSNSLPTRLVG
TFGYMPPEYAQYGDVSPKVDVYAFGVVLYELISAKEAIVKANGSVTESKGLVAMFEEVLSQPDPKDDLIKMIDPRLGENY
PLDSVRKMAQLAKACTHENPQLRPSMRSIVVALMTLSSSTEDWDVGSFYENQNLVSLMSGR*
CDS seq >OTG31250
ATGTTGGAGCTCAATTCAGGGTTCCGATTAGCAACCTTCTTATCCTTAATCTCAACCTGTTTCATTCCATCAGCTCAATC
CAGATGCACCCGAGGCTGTAATCTAGCACTCGGTTCCTACTACGTTCAATCAGGAGATGAACTCGCTCAAATTTCGTTGC
GCTTCAACATCAACAACAACAACAATATTCTCAAGTATAATCCGAGTATACCGAATCAGGATAGCCTCCAGGCCGGTCAG
AGGATTAACGTGCCGTTTTCTTGTGGGTGTATCAACGGTGACTTCTTAGGACATGTGTTCACCTACAACATTCAGTCTGA
GGATACTTATGACAAGGTTGCACAAAAGAGTTATGCGAATTTGACTACTGCGGATTGGGTTCAGAGGTTTAATTCGTATG
ATTCTTTTCGGATACCTGATAGTGGTACTCTTAATGTGACGGTGAATTGTTCGTGTGGGGATAGTTTGATTTCCAAGGAT
TATGGTTTGTTTGTGACATATCCGCTCCAGCCGGGGGAGACTTTGGATTCAGTTTCTTCAGCTGCGAATCTTAGTTCCGA
TTTGATTCGGAGGTATAATTCCGGTGCCAATTTAACTACAGGAAGTGGGTTGTTGTATATTCCTGGGCGAGATGAAAATG
GAAACTATCCACCCTTACCAAGCAGCTCAGGATTATCAGGTGGGGCCATAGGTGGAATAGTCGTAGGAATAGTAGCTGTA
CTGCTGTTACTTGCAGGATGCTTTTATTTCGGATATTACAAAAAGAAAAAGGCTGGATCGAGTTCAGCTTTATTAAAGAA
CGCACAGGTTCAACTTGTGCCCGATCAAGGGCGCAATGGTTCGTTGGTTAGAGGTTCAGAGTCCAGTGGTGGTGCGGTTG
CTGGTGCCTCACCTGGGCTTACAGGTATAACAGTTGACAAATCAGTAGAGTTCTCGTATGAAGAGCTTTCGACAGCTACA
GATGACTTCAGTCTTGCTAATAAGATTGGTCAAGGTGGTTTCGGTGCTGTTTACTATGCTGAGCTCCGAGGCGAGAAAGC
TGCTATCAAGAAGATGGATATGCAAGCATCACGTGAATTTCTAGCCGAACTAAAGGTTTTAACGCATGTTCATCACCTAA
ACCTGGTGCGTTTGATAGGATATTGTGTCGAGGGTTCCCTTTTCTTGGTCTATGAGTACATTGAGAACGGAAACTTGAGT
CAACATCTACATGGATCAGGACGGGACCCGCTACCGTGGTCTACCCGAGTCCAAATCGCCCTTGATTCAGCGCGCGGTCT
TGAGTATATCCATGAACATACTGTTCCCGTATATATACATCGTGATATTAAATCAGCAAATATACTCATCGACAAGAACT
TTCATGGAAAGGTTGCGGATTTCGGTTTAACAAAATTGACAGAAGTTGGAAGTAATTCTTTGCCTACACGTCTTGTGGGT
ACATTCGGATATATGCCACCAGAGTATGCACAGTATGGGGATGTTTCTCCGAAGGTAGACGTGTATGCATTTGGGGTTGT
ACTTTACGAACTTATTTCGGCCAAAGAAGCCATAGTCAAAGCAAATGGCTCTGTTACCGAATCAAAGGGACTAGTTGCCA
TGTTTGAAGAAGTTCTAAGTCAACCAGATCCAAAAGATGACCTGATCAAAATGATTGATCCTAGACTGGGGGAAAACTAC
CCTCTCGATTCAGTTCGTAAGATGGCTCAACTTGCGAAAGCGTGCACGCATGAGAATCCCCAACTAAGGCCGAGCATGAG
ATCTATCGTGGTCGCGTTGATGACTCTCTCGTCGTCCACTGAAGATTGGGATGTTGGCTCGTTTTATGAAAACCAGAATC
TTGTGAGCCTCATGTCTGGCAGATAG
Microexon DNA seq CTCAG
Microexon Amino Acid seq SSG
Microexon-tag DNA Seq CTGTATATTCCTGGGAGAGATGAAAATGGAAACTATCCACCCTTACCAAGCAGCTCAGGATTATCAGGTGGGGCCATAGGTGGAATAGTCGTAGGAATAGTAGCTGTA
Microexon-tag Amino Acid Seq MYIPGRDENGNYPPLPSSSGLSGGAIGGIVVGIVAV
Transcript ID Ha.37534.1
Gene ID Ha.37534
Gene Name CERK1
Pfam domain motif SKG6
Motif E-value 0.019
Motif start 224
Motif end 253
Protein seq >Ha.37534.1
MLNLKLGFRLITTFLFITSICFNSLAQSRCTRGCNLALGSYYVQQGDELTRISLRFNTNNDNILSYNPSIPNQDSVQAFT
RMNVPFSCDCINGEFLGHVFNYDVATGDTYVTIAQDKFANLTTADWIQRFNNFDPNRIPDTAFLNVTVNCSCGDSDISKD
YGLFVTYPLRPGETLDSVSSAANISSDLIRRYNPDANLTSRLLYIPGRDENGNYPPLPSSSGLSGGAIGGIVVGIVAVLL
LLAGCFYFGYYKKKKAGSSSALLKNAQVQLVPDQGRNGSLVRGSESSGGAVAGASPGLTGITVDKSVEFSYEELSTATDD
FSLANKIGQGGFGAVYYAELRGEKAAIKKMDMQASREFLAELKVLTHVHHLNLVRLIGYCVEGSLFLVYEYIENGNLSQH
LHGSGRDPLPWSTRVQIALDSARGLEYIHEHTVPVYIHRDIKSANILIDKNFHGKVADFGLTKLTEVGSNSLPTRLVGTF
GYMPPEYAQYGDVSPKVDVYAFGVVLYELISAKEAIVKANGSVTESKGLVAMFEEVLSQPDPKDDLIKMIDPRLGENYPL
DSVRKMAQLAKACTHENPQLRPSMRSIVVALMTLSSSTEDWDVGSFYENQNLVSLMSGR*
CDS seq >Ha.37534.1
ATGCTAAACCTCAAATTAGGGTTCCGATTAATCACCACCTTCTTATTCATAACCTCAATTTGTTTCAATTCACTAGCTCA
ATCCAGATGCACCAGAGGTTGCAACCTAGCCTTGGGTTCATACTACGTTCAACAAGGAGATGAACTCACTCGAATTTCTC
TACGCTTCAACACAAACAACGACAATATCCTCAGTTATAATCCATCTATCCCTAACCAGGATAGCGTTCAGGCCTTCACG
AGGATGAACGTTCCGTTTTCCTGCGATTGTATCAACGGTGAGTTTTTAGGACACGTGTTCAACTACGACGTTGCCACTGG
GGATACGTATGTGACAATTGCACAAGACAAGTTTGCGAATTTGACTACTGCTGATTGGATTCAGCGGTTTAATAATTTTG
ATCCGAATCGGATCCCCGATACTGCGTTTCTTAATGTGACGGTGAATTGTTCGTGTGGAGATAGTGATATTTCGAAGGAT
TATGGCTTGTTTGTGACGTATCCGCTTCGGCCCGGGGAGACGTTGGATTCGGTTTCGTCTGCTGCGAATATTAGTTCCGA
TTTGATTAGGAGATATAATCCGGATGCGAATCTGACTAGTAGATTGCTGTATATTCCTGGGAGAGATGAAAATGGAAACT
ATCCACCCTTACCAAGCAGCTCAGGATTATCAGGTGGGGCCATAGGTGGAATAGTCGTAGGAATAGTAGCTGTACTGCTG
TTACTTGCAGGATGCTTTTATTTCGGATATTACAAAAAGAAAAAGGCTGGATCGAGTTCAGCTTTATTAAAGAACGCACA
GGTTCAACTTGTGCCCGATCAAGGGCGCAATGGTTCGTTGGTTAGAGGTTCAGAGTCCAGTGGTGGTGCGGTTGCTGGTG
CCTCACCTGGGCTTACAGGTATAACAGTTGACAAATCAGTAGAGTTCTCGTATGAAGAGCTTTCGACAGCTACAGATGAC
TTCAGTCTTGCTAATAAGATTGGTCAAGGTGGTTTCGGTGCTGTTTACTATGCTGAGCTCCGAGGCGAGAAAGCTGCTAT
CAAGAAGATGGATATGCAAGCATCACGTGAATTTCTAGCCGAACTAAAGGTTTTAACGCATGTTCATCACCTAAACCTGG
TGCGTTTGATAGGATATTGTGTCGAGGGTTCCCTTTTCTTGGTCTATGAGTACATTGAGAACGGAAACTTGAGTCAACAT
CTACATGGATCAGGACGGGACCCGCTACCGTGGTCTACCCGAGTCCAAATCGCCCTTGATTCAGCGCGCGGTCTTGAGTA
TATCCATGAACATACTGTTCCCGTATATATACATCGTGATATTAAATCAGCAAATATACTCATCGACAAGAACTTTCATG
GAAAGGTTGCGGATTTCGGTTTAACAAAATTGACAGAAGTTGGAAGTAATTCTTTGCCTACACGTCTTGTGGGTACATTC
GGATATATGCCACCAGAGTATGCACAGTATGGGGATGTTTCTCCGAAGGTAGACGTGTATGCATTTGGGGTTGTACTTTA
CGAACTTATTTCGGCCAAAGAAGCCATAGTCAAAGCAAATGGCTCTGTTACCGAATCAAAGGGACTAGTTGCCATGTTTG
AAGAAGTTCTAAGTCAACCAGATCCAAAAGATGACCTGATCAAAATGATTGATCCTAGACTGGGGGAAAACTACCCTCTC
GATTCAGTTCGTAAGATGGCTCAACTTGCGAAAGCGTGCACGCATGAGAATCCCCAACTAAGGCCGAGCATGAGATCTAT
CGTGGTCGCGTTGATGACTCTCTCGTCGTCCACTGAAGATTGGGATGTTGGCTCGTTTTATGAAAACCAGAATCTTGTGA
GCCTCATGTCTGGCAGATAG