Microexon ID Ha_12:41928234-41928247:-
Species Helianthus annuus
Coordinates 12:41928234..41928247
Microexon Cluster ID MEP38
Size 14
Phase 1
Pfam Domain Motif Myosin_head
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,14,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CAYTTYARTRMAACTGGRAARATATSTGGTGCYAADATTCAAACWTTTYTRCTTGARAAGTCWAGAGTWGTYCARYKTGCWGAWGGWGARAGRTCATAYCATATWTTT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
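The Microexon-tag DNA Seq line above contains IUPAC ambiguity codes (Y, R, M, S, W, K, D), which suggests it is a cluster-level consensus rather than a single genomic sequence. As a rough illustration only, the Python sketch below compares the Helianthus annuus tag sequence given further down in this record against that consensus. The function name `mismatches` is a hypothetical helper, and the record does not say how the consensus was built, so a reported mismatch is not necessarily an error in either sequence.

```python
# IUPAC nucleotide ambiguity codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Degenerate tag sequence copied from this record (assumed to be a consensus).
CONSENSUS = (
    "CAYTTYARTRMAACTGGRAARATATSTGGTGCYAADATTCAAACWTTTYTRCTTGARAAG"
    "TCWAGAGTWGTYCARYKTGCWGAWGGWGARAGRTCATAYCATATWTTT"
)

# Species-specific tag sequence copied from this record.
TAG_DNA = (
    "CACTTTAGTGAAATCGGAAGAATAGCAGGTGCCAGTATTCAAACATTTCTTCTTGAAAAG"
    "TCAAGAGTTGTTCAGTGCACAGAAGGTGAAAGGTCATATCATTCGTTT"
)


def mismatches(consensus, seq):
    """Return 1-based positions where seq is not allowed by the consensus code."""
    if len(consensus) != len(seq):
        raise ValueError("sequences must have the same length")
    return [
        i + 1
        for i, (code, base) in enumerate(zip(consensus, seq))
        if base not in IUPAC[code]
    ]


mismatch_positions = mismatches(CONSENSUS, TAG_DNA)
print("tag length:", len(TAG_DNA))
print("positions outside the consensus:", mismatch_positions)
```

Positions are reported relative to the 108-nt tag, not to chromosome coordinates.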
Microexon DNA seq TTCTTCTTGAAAAG
Microexon Amino Acid seq FLLEK
Microexon-tag DNA Seq CACTTTAGTGAAATCGGAAGAATAGCAGGTGCCAGTATTCAAACATTTCTTCTTGAAAAGTCAAGAGTTGTTCAGTGCACAGAAGGTGAAAGGTCATATCATTCGTTT
Microexon-tag Amino Acid Seq HFSEIGRIAGASIQTFLLEKSRVVQCTEGERSYHSF
Microexon-tag spanning region 41928102-41928428
Microexon-tag prediction score 0.9532
Overlapped with the annotated transcript (%) 100
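The size, phase, and structure fields above can be cross-checked against the tag sequence. The following Python sketch is one way to do that and is not the database's own pipeline; `split_tag` and `translate` are hypothetical helper names. It assumes that phase 1 means the microexon begins one nucleotide into a codon, so its first codon borrows the last base of the upstream flanking exon. The record itself does not define the phase convention.

```python
# Values copied from the record above.
TAG_DNA = (
    "CACTTTAGTGAAATCGGAAGAATAGCAGGTGCCAGTATTCAAACATTTCTTCTTGAAAAG"
    "TCAAGAGTTGTTCAGTGCACAGAAGGTGAAAGGTCATATCATTCGTTT"
)
STRUCTURE = (46, 14, 48)          # flanking exon, microexon, flanking exon
START, END = 41928234, 41928247   # microexon coordinates on chromosome 12
PHASE = 1                         # assumed: bases borrowed from the upstream exon

# Standard genetic code, built from the canonical TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def split_tag(seq, sizes):
    """Cut seq into consecutive pieces with the given sizes."""
    parts, pos = [], 0
    for n in sizes:
        parts.append(seq[pos:pos + n])
        pos += n
    return parts


def translate(dna):
    """Translate complete codons only; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


upstream, microexon, downstream = split_tag(TAG_DNA, STRUCTURE)
size_from_coords = END - START + 1                 # inclusive coordinates
borrowed = upstream[len(upstream) - PHASE:]        # bases completing the first codon
microexon_peptide = translate(borrowed + microexon)
tag_peptide = translate(TAG_DNA)

print(size_from_coords, microexon, microexon_peptide)
print(tag_peptide)
```

Under these assumptions the upstream exon contributes a single T, giving the codons TTT CTT CTT GAA AAG, which translate to the FLLEK listed for the microexon.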
New Transcript ID OTG04781x
Reference Transcript ID OTG04781
Gene ID HannXRQ_Chr12g0366221
Gene Name NA
Transcript ID OTG04781
Protein ID OTG04781
Gene ID HannXRQ_Chr12g0366221
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 4.5e-240
Motif start 118
Motif end 775
Protein seq >OTG04781
MDSAEKSANKISSMGSGSIDQVDDDDSPYARKDNKGLKSSISHQDSDLHSRGWDDVSVYNAKKKNYAWFQAPDGKWELAK
IISVTGTESLIAFSETKVLKVQSDCLLPANPEILDGIDDLIQLSYLSEPSVLYNLQYRYERDMIYSKAGPVLVAINPFKT
IPLYGDDYMHAYKRKKIDSPHVYAIADTAIREMIRDEVNQSIVINGESGAGKTETAKIAMQYLAAVRGGSGIEYEILKTN
PILEGFGNAKTSRNDNSSRFGKLIEIHFSEIGRIAGASIQTFLLEKSRVVQCTEGERSYHSFYQLCAGAPPSLREKLNLK
SAHEYKYLQQSNCYNISGVNDAEEFRIVVEALDIVHVSKEDQENVFSMLAAVLWLGNVTFQAVDDNNHVEPMIDEALLTV
AKLLGCEAENLQTALSTRKMTVMNENIIKRLNLTQAMDSRDALAKSIYSCLFDWLVEQINKSLSAGKRLSGRSISILDIY
GFESFDVNSLEQFCINYANERLQQHFNHHLFKLEQEEYIQDGIDWAKVDFEDNQACLSLFEKKPLGLLSLLDEETTFPNA
TDMSFANKLKQHLSDNPCFRGERGKAFTVHHYAGEVKYDTTGFLEKNRDLLHSVAIQLLSSCTCKLPQIFASSMLSLTEK
PAVGSLNKSGGADSQKLSVTLKFKGQLFQLMQRLGNTRSHFIRCIKPNDSHASGIYDQQLVLQQLKCCGVLEVVRISRSG
FPTRMTHQKFARRYGFLLLDHVASQDPLSASVAILQQFNILPEMYQVGYTKLFFRTGQIGKLEDTRNRTLNGILRVQSCF
RGHKARLILREMKRRIFTLQSFVRGQKDRKEFAILLHRHRAAARIQKRIKAINVRKDYKKLYDASDVIQAVIRGWLVRKS
TEGTCLQRFKGNKDDVLVNKTYLADLQRRVLKAEIGLREKEEEKEMIQQRIQQYETRWSDYEQKMKSMEELWQKQMNSLQ
SSLTLAKQNLTVDDFSDPSVNLTSANNFSRRNSNDAAGASSNEMSAGLSLISRLAEEFEQRSHVFGDDAQFLVEVKSGQA
EASLNPEEELGRLKQSFEGWKKDFGARLRETKVILNKLSSEEGSSGRGGEKVKKNWWGRLNSSRVN*
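To relate the microexon to the Myosin_head motif, the tag peptide can be located in the protein sequence and compared with the motif start and end values. The Python sketch below is illustrative only: `locate` is a hypothetical helper, only the first 320 residues of the protein above are embedded (enough to contain the tag), and the motif coordinates are assumed to be 1-based inclusive residue positions, which the record does not state.

```python
# First four lines (residues 1-320) of the OTG04781 protein sequence above.
PROTEIN_PREFIX = (
    "MDSAEKSANKISSMGSGSIDQVDDDDSPYARKDNKGLKSSISHQDSDLHSRGWDDVSVYNAKKKNYAWFQAPDGKWELAK"
    "IISVTGTESLIAFSETKVLKVQSDCLLPANPEILDGIDDLIQLSYLSEPSVLYNLQYRYERDMIYSKAGPVLVAINPFKT"
    "IPLYGDDYMHAYKRKKIDSPHVYAIADTAIREMIRDEVNQSIVINGESGAGKTETAKIAMQYLAAVRGGSGIEYEILKTN"
    "PILEGFGNAKTSRNDNSSRFGKLIEIHFSEIGRIAGASIQTFLLEKSRVVQCTEGERSYHSFYQLCAGAPPSLREKLNLK"
)
TAG_PEPTIDE = "HFSEIGRIAGASIQTFLLEKSRVVQCTEGERSYHSF"
MICROEXON_PEPTIDE = "FLLEK"
MOTIF_START, MOTIF_END = 118, 775   # assumed 1-based, inclusive


def locate(protein, peptide):
    """Return the 1-based inclusive (start, end) of peptide within protein."""
    idx = protein.find(peptide)
    if idx < 0:
        raise ValueError("peptide not found in protein")
    return idx + 1, idx + len(peptide)


tag_start, tag_end = locate(PROTEIN_PREFIX, TAG_PEPTIDE)

# Position of the microexon-encoded residues, derived from their offset in the tag.
offset = TAG_PEPTIDE.find(MICROEXON_PEPTIDE)
mx_start = tag_start + offset
mx_end = mx_start + len(MICROEXON_PEPTIDE) - 1

inside_motif = MOTIF_START <= tag_start and tag_end <= MOTIF_END

print("tag peptide residues:", tag_start, tag_end)
print("microexon residues:", mx_start, mx_end)
print("inside Myosin_head motif:", inside_motif)
```

With the values in this record, the tag peptide spans residues 267 to 302 and the microexon-encoded FLLEK spans residues 282 to 286, both inside the 118 to 775 motif range.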
CDS seq >OTG04781
ATGGATTCAGCTGAAAAATCTGCAAATAAAATTAGTAGTATGGGATCTGGTAGCATTGATCAGGTTGATGATGATGATTC
GCCATATGCTCGTAAAGATAATAAAGGTCTCAAGTCATCAATTTCTCATCAAGATTCGGATTTACATTCGAGAGGCTGGG
ATGATGTCTCTGTCTATAATGCCAAGAAGAAGAATTATGCTTGGTTTCAAGCCCCTGATGGAAAGTGGGAACTGGCAAAG
ATTATATCTGTAACAGGCACTGAATCACTCATTGCATTTTCTGAAACAAAAGTATTAAAGGTGCAATCTGATTGTCTGTT
ACCTGCCAATCCAGAAATCCTTGACGGGATAGACGATCTAATTCAATTAAGTTATTTGAGCGAACCATCAGTCTTGTACA
ACCTTCAATATAGATATGAACGAGACATGATTTATTCAAAAGCGGGACCTGTGTTAGTTGCCATCAACCCTTTTAAAACG
ATTCCATTGTATGGAGATGACTACATGCATGCCTATAAGAGGAAAAAGATTGATAGCCCTCACGTATACGCCATTGCTGA
TACAGCTATCCGGGAAATGATTAGAGATGAAGTAAACCAATCGATTGTGATAAATGGTGAAAGCGGAGCTGGGAAAACTG
AAACAGCAAAGATAGCAATGCAATACTTAGCTGCAGTTAGAGGGGGTAGTGGAATAGAGTACGAGATACTGAAAACGAAT
CCAATTTTGGAAGGATTCGGGAATGCAAAAACATCGAGAAATGACAACTCGAGTCGTTTTGGAAAGCTAATTGAAATACA
CTTTAGTGAAATCGGAAGAATAGCAGGTGCCAGTATTCAAACATTTCTTCTTGAAAAGTCAAGAGTTGTTCAGTGCACAG
AAGGTGAAAGGTCATATCATTCGTTTTATCAGCTTTGTGCAGGAGCTCCACCTTCTCTTAGAGAGAAACTAAACTTGAAG
AGTGCGCATGAGTACAAATATTTGCAGCAAAGCAATTGCTATAATATTTCCGGGGTAAATGATGCTGAAGAATTTCGTAT
CGTAGTGGAAGCTCTGGATATTGTTCATGTTAGCAAAGAGGACCAAGAAAATGTTTTTTCAATGCTTGCAGCAGTGTTAT
GGCTCGGAAATGTAACATTTCAAGCTGTTGACGACAACAATCATGTGGAACCTATGATTGATGAAGCTCTTCTAACTGTT
GCTAAATTGCTTGGGTGTGAGGCTGAGAATCTACAGACCGCTTTATCCACTCGGAAAATGACTGTTATGAATGAAAACAT
AATCAAAAGGCTAAATCTAACTCAGGCAATGGATTCACGCGACGCGTTGGCAAAATCAATATATTCTTGTCTGTTTGATT
GGTTGGTGGAACAGATCAACAAATCACTTTCTGCGGGGAAACGTCTGTCTGGAAGATCCATCAGCATTCTTGATATTTAC
GGATTCGAATCATTTGACGTAAATAGTTTAGAGCAGTTCTGCATTAATTATGCAAATGAGAGATTACAGCAACACTTTAA
TCATCATTTATTCAAGCTAGAACAGGAGGAATATATCCAAGATGGCATTGACTGGGCAAAGGTTGACTTTGAGGACAATC
AAGCTTGTCTCAGTCTTTTTGAGAAGAAACCATTAGGATTACTATCCCTATTAGATGAAGAAACCACATTTCCAAACGCG
ACAGATATGTCATTTGCCAACAAGCTAAAGCAACACTTGAGTGATAATCCGTGTTTTAGAGGAGAACGAGGCAAAGCGTT
CACGGTTCATCATTATGCTGGGGAAGTAAAATATGACACAACCGGGTTTTTGGAGAAAAACCGCGATTTATTGCATTCGG
TTGCCATTCAACTTCTGTCTTCTTGCACATGCAAACTTCCTCAGATTTTTGCTTCCAGTATGCTTTCTTTGACTGAGAAG
CCTGCAGTTGGTTCACTAAATAAATCAGGTGGCGCAGATTCCCAGAAGCTTAGTGTCACGTTAAAGTTTAAGGGCCAATT
ATTCCAACTAATGCAACGTTTGGGGAACACAAGGTCACATTTCATACGTTGCATTAAGCCCAATGACTCACACGCGTCTG
GAATTTATGATCAACAACTTGTACTTCAGCAGCTAAAATGTTGTGGTGTTTTAGAAGTCGTTCGAATATCAAGATCTGGA
TTTCCAACAAGAATGACCCATCAAAAATTCGCAAGAAGGTATGGTTTTCTTCTTCTCGACCATGTTGCATCACAAGATCC
ACTAAGTGCATCTGTTGCCATCCTTCAACAATTCAATATTCTTCCCGAAATGTATCAAGTTGGCTATACAAAGTTGTTCT
TTCGAACCGGACAGATTGGTAAGCTTGAAGATACACGAAATCGTACTCTGAATGGCATATTACGCGTTCAAAGCTGCTTT
AGAGGTCACAAAGCGCGTCTGATTCTGAGAGAAATGAAACGAAGAATTTTCACTCTTCAATCATTTGTTCGAGGGCAAAA
AGATAGAAAGGAGTTTGCAATCTTACTACACAGACATAGAGCAGCAGCGCGTATACAAAAGCGGATTAAGGCAATCAATG
TTAGGAAAGACTACAAGAAACTTTATGATGCATCAGATGTCATACAAGCAGTTATTCGCGGATGGCTTGTCCGAAAAAGT
ACAGAGGGCACTTGCCTGCAACGATTTAAGGGTAACAAGGACGATGTGCTGGTGAACAAAACGTATCTGGCGGACCTACA
ACGACGAGTTCTTAAAGCTGAGATTGGTTTACGCGAAAAAGAAGAAGAAAAGGAAATGATCCAGCAACGCATCCAACAAT
ACGAAACCCGTTGGTCTGACTACGAGCAGAAAATGAAATCCATGGAAGAACTATGGCAGAAACAGATGAACTCACTTCAA
TCTAGCCTAACTTTAGCAAAACAGAACCTAACAGTAGACGATTTTTCTGATCCTTCGGTCAACCTAACCAGCGCGAATAA
CTTCAGCCGAAGAAACTCAAACGATGCAGCAGGGGCCAGTAGCAATGAAATGAGCGCTGGACTGAGTTTGATCAGCCGGT
TAGCCGAGGAGTTTGAGCAGAGAAGCCATGTGTTTGGAGATGATGCACAGTTTCTGGTGGAGGTTAAGTCGGGTCAAGCG
GAGGCAAGTTTGAACCCGGAAGAAGAACTTGGGAGGTTGAAACAGAGTTTTGAAGGGTGGAAGAAGGATTTTGGCGCCAG
ATTGAGGGAGACAAAGGTAATTTTGAACAAACTAAGTAGTGAAGAAGGGAGTAGTGGTCGTGGTGGTGAAAAGGTTAAAA
AGAATTGGTGGGGGAGGCTTAACAGCTCAAGGGTTAATTGA
Transcript ID OTG04781
Gene ID Ha.12162
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 4.5e-240
Motif start 118
Motif end 775