Microexon ID Ha_8:131019178-131019191:+
Species Helianthus annuus
Coordinates 8:131019178..131019191
Microexon Cluster ID MEP38
Size 14
Phase 1
Pfam Domain Motif Myosin_head
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,14,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CAYTTYARTRMAACTGGRAARATATSTGGTGCYAADATTCAAACWTTTYTRCTTGARAAGTCWAGAGTWGTYCARYKTGCWGAWGGWGARAGRTCATAYCATATWTTT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCTTTTTGAAAAG
Microexon Amino Acid seq YLFEK
Microexon-tag DNA Seq CATTATAGTGCAGAGGGGATGATAAGTGGTGCTTGTATCCAAACATATCTTTTTGAAAAGTCAAGAGTATCTCAAATATGTCGTGGAGAACGGTCGTACCATGTATTT
Microexon-tag Amino Acid Seq HYSAEGMISGACIQTYLFEKSRVSQICRGERSYHVF
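The fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it splits the microexon-tag using the 46,14,48 structure, translates it with the standard genetic code, and recovers the microexon peptide. It assumes that "Phase" means the number of codon bases contributed by the upstream flanking exon (an intron-phase style reading that the record does not state explicitly) and that the coordinates are 1-based and inclusive. All variable and function names are illustrative.

```python
# Values copied from the record; names are illustrative only.
TAG_DNA = (
    "CATTATAGTGCAGAGGGGATGATAAGTGGTGCTTGTATCCAAACAT"    # upstream flanking exon (46 nt)
    "ATCTTTTTGAAAAG"                                    # microexon (14 nt)
    "TCAAGAGTATCTCAAATATGTCGTGGAGAACGGTCGTACCATGTATTT"  # downstream flanking exon (48 nt)
)
STRUCTURE = (46, 14, 48)            # flanking exon, microexon, flanking exon sizes
START, END = 131019178, 131019191   # assumed 1-based, inclusive coordinates

# Standard genetic code, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


up_len, micro_len, down_len = STRUCTURE

# Slice the microexon out of the tag using the structure field.
microexon = TAG_DNA[up_len:up_len + micro_len]

# The tag is 108 nt, so it translates in frame 0 into 36 residues.
tag_protein = translate(TAG_DNA)

# Codon bases left over from the upstream exon (assumed meaning of "Phase").
phase = up_len % 3

# Every codon that overlaps the microexon, including the split first codon.
first_codon = up_len // 3
last_codon = (up_len + micro_len - 1) // 3
micro_peptide = tag_protein[first_codon:last_codon + 1]

# Size implied by inclusive coordinates.
size_from_coords = END - START + 1

print(microexon, tag_protein, phase, micro_peptide, size_from_coords)
```

Under these assumptions the first residue of the microexon peptide (Y) is encoded by a codon split across the splice junction: one base comes from the upstream exon and two from the microexon, which is consistent with a 14 nt exon yielding five residues.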
Microexon-tag spanning region 131019009-131019323
Microexon-tag prediction score 0.9097
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG19489x
Reference Transcript ID OTG19489
Gene ID HannXRQ_Chr08g0234661
Gene Name NA
Transcript ID OTG19489
Protein ID OTG19489
Pfam domain motif Myosin_head
Motif E-value 1.7e-197
Motif start 167
Motif end 807
Protein seq >OTG19489
MLLSSRSSLEVMLDSLRLRDNSGESSPDLPPALPSRPISKARLPKRLIRSTIAVQHVVSCDTSNNNEKLDVKCSSRSSGG
GGGCGFGRKKMIGAAAAATSSVSVDSIGYFIRNKLGVWCKSRNDRWELGKIESAVGEDVTVRVSNGNMITVSRRELLPAN
TDVLDGVDDLVELSYLNEPSVLHSLQYRYNRDIIYSKAGPVLLAFNPFKDVNIYGDDFVAVYKEKILDNPHVYAVADAAY
NDMMKDGVNQSIIISGESGAGKTETAKFAMQYLASVKSQNYEMKSKLIQTSCILEAFGNAKTSRNWNSSRFGKLIDIHYS
AEGMISGACIQTYLFEKSRVSQICRGERSYHVFYQMCAGAPPVLKDKLNLKMSSEYKFLNQSGCLKINGVDDAHNFIKLV
DAFDTLGIHGLDQENIFELLAAILWLGNISFAAIDEELVEPVADEASRSAARLMGCKMDDLMMVLSTNRAHNMTEPLTLQ
QATDKRNTLANFVYESLFNWLIEEVNTSLKGNTQHTRHAISILDTYGFESLQRNSLQQLFINYADERLQQHFIRHLCKLE
QEEYELEGIHWKKVEFEDNQECLDLFEKKPMGIISMLNECSNSSIATDTTFTEKIKQHLSYNLCISCEEGAFRVRHYARE
VQYDASGLLEKDSDKLQFDTIQLLSSCKKPLNLSGSASGVMNQVQAAGQSVGSKFMDHLSKLINQMENSKQHFIRCIKPN
TKKLPGIYESDIVWEQLKCSQVMEVMQISKSRYPLRFTHQEFASRFGCLLSTNVMCMDPLSTSVAILQQHRVPTQMYQVG
FTKLFFRGQVDALENLRQEVLGSTRELDNRFLGGRVLVDFHELKFGIVTLQSFIRGENARREFNVLKKQNHGIALSSLDE
HMTTVVHIQSVVRGWLARKHFNHMQSWKKSGLDRSRSQRKSSRKHVELRDLLQENVHVLPQNIEELQKRVVKAESLLSER
ELENTALREQIRQFEIRWSEYESKMKAVEEMWQSQMASLQMNLAAAKKTLGSGISDVQIGRPVDSLSPNFYDSEDTISGI
QTPSQMTPVRIGNSRRGSNVVISDTIDNLSKEFEQRKQNFDSDAKAVTNVNRGRPPSKQIEDYNNLKKKFEIWKKEYRNR
LREAKTRLVKGVHAENGGVGGGGDKQTRNWWGKLSKRGKERVV*
CDS seq >OTG19489
ATGTTACTTTCATCTCGGAGTTCGTTGGAAGTAATGCTCGATTCTCTCCGCCTGAGAGACAACTCCGGCGAGAGTTCACC
GGACTTGCCGCCGGCGTTACCGTCACGACCGATCTCCAAAGCTCGCCTCCCGAAGCGGCTGATCCGGTCCACAATTGCAG
TACAGCATGTTGTTTCTTGTGATACAAGTAACAACAACGAAAAACTAGATGTTAAATGCAGCAGCAGAAGCAGTGGTGGT
GGTGGTGGTTGTGGTTTTGGACGGAAGAAGATGATCGGAGCTGCGGCTGCCGCCACGTCATCAGTTTCGGTTGACAGTAT
TGGTTATTTCATTAGAAATAAACTTGGAGTGTGGTGTAAGTCGCGAAACGATCGGTGGGAGTTAGGGAAAATCGAATCAG
CGGTTGGTGAAGATGTGACTGTTAGGGTTTCGAATGGGAATATGATTACGGTGTCTAGACGAGAGTTGTTACCTGCGAAT
ACGGATGTTCTTGATGGAGTTGATGATCTTGTTGAACTTAGTTATTTAAACGAACCATCGGTTCTTCATAGTCTTCAATA
TAGATACAATCGCGATATAATTTATAGTAAGGCGGGCCCTGTATTATTAGCGTTTAATCCCTTCAAGGATGTAAATATTT
ACGGAGATGATTTTGTGGCGGTGTATAAAGAGAAGATTTTGGATAATCCTCATGTATATGCTGTGGCTGATGCTGCATAC
AATGATATGATGAAAGATGGTGTGAATCAATCTATTATTATCAGTGGCGAAAGTGGAGCTGGGAAGACAGAAACTGCAAA
GTTTGCAATGCAATATCTGGCATCGGTTAAAAGCCAGAACTATGAGATGAAATCTAAACTAATTCAAACCAGTTGTATCC
TGGAAGCTTTCGGGAATGCAAAGACTTCCAGAAATTGGAATTCTAGCCGATTTGGGAAATTAATAGATATACATTATAGT
GCAGAGGGGATGATAAGTGGTGCTTGTATCCAAACATATCTTTTTGAAAAGTCAAGAGTATCTCAAATATGTCGTGGAGA
ACGGTCGTACCATGTATTTTATCAGATGTGTGCTGGGGCCCCACCTGTTCTTAAAGATAAGCTGAATTTAAAAATGTCAA
GCGAGTACAAGTTTCTTAATCAGAGTGGATGCTTGAAAATTAATGGTGTTGATGATGCCCACAACTTTATAAAGCTAGTG
GATGCCTTTGATACACTTGGAATTCATGGTCTGGATCAAGAAAACATATTTGAATTGCTTGCTGCAATTCTATGGCTTGG
GAATATTTCATTTGCAGCAATCGATGAAGAACTTGTTGAGCCTGTGGCTGATGAAGCGAGTCGAAGTGCTGCTAGGTTAA
TGGGCTGCAAGATGGATGACCTCATGATGGTGTTATCTACCAACAGAGCTCATAATATGACCGAACCATTGACATTGCAG
CAGGCAACTGACAAAAGAAACACATTGGCAAATTTTGTTTATGAGAGCCTGTTTAATTGGCTCATTGAAGAAGTTAATAC
ATCACTTAAAGGAAACACACAACATACTCGACACGCCATAAGCATATTAGACACGTACGGATTTGAGTCATTGCAGAGAA
ATAGCTTACAGCAGTTGTTTATAAACTACGCTGATGAGAGACTGCAGCAGCACTTCATTCGCCATCTTTGTAAGCTTGAA
CAAGAGGAGTACGAATTAGAAGGAATCCACTGGAAAAAAGTAGAGTTTGAAGACAACCAAGAGTGTTTGGATCTGTTTGA
GAAGAAACCAATGGGGATAATATCAATGCTCAATGAGTGTTCAAATTCCTCCATAGCCACAGATACGACATTCACCGAAA
AGATTAAACAACACCTAAGTTATAATCTTTGTATTAGCTGTGAAGAAGGAGCTTTCAGGGTTCGCCACTATGCGCGAGAG
GTTCAATATGATGCTTCAGGGTTGTTGGAAAAGGACAGTGATAAATTACAGTTTGATACCATCCAACTTTTGTCATCTTG
TAAGAAACCCTTGAACCTTTCGGGTTCGGCCTCTGGTGTGATGAACCAGGTCCAAGCAGCAGGTCAAAGTGTTGGGTCAA
AGTTCATGGATCACTTGTCCAAATTAATCAACCAAATGGAGAATTCAAAGCAACACTTCATTCGATGCATAAAACCAAAT
ACCAAAAAACTTCCTGGAATCTACGAAAGCGACATCGTATGGGAACAACTTAAATGTAGCCAAGTTATGGAGGTAATGCA
AATATCAAAATCAAGATACCCGTTACGCTTCACACATCAAGAATTTGCTAGCCGGTTTGGCTGCCTTTTATCGACGAATG
TTATGTGTATGGATCCATTGAGTACATCGGTTGCTATTCTGCAGCAGCATCGAGTACCCACGCAAATGTACCAAGTTGGA
TTTACAAAATTGTTTTTCCGAGGACAGGTTGATGCATTGGAGAATTTGAGACAAGAAGTTCTAGGAAGTACTCGTGAACT
CGATAACCGTTTCCTTGGTGGTCGAGTTCTTGTTGATTTTCATGAGTTGAAGTTTGGAATTGTGACATTGCAGTCATTTA
TTCGTGGTGAAAATGCAAGAAGGGAGTTTAATGTTTTGAAGAAACAGAACCATGGGATTGCACTAAGTTCACTTGATGAA
CACATGACAACAGTTGTACATATACAATCAGTTGTTCGTGGGTGGTTGGCTCGGAAGCATTTCAATCACATGCAAAGTTG
GAAAAAATCAGGCCTTGATAGATCAAGAAGCCAGCGAAAGTCAAGCAGGAAACATGTAGAATTGAGGGATTTGTTACAGG
AAAATGTACATGTTTTACCACAAAATATTGAAGAGCTGCAAAAACGAGTAGTGAAGGCGGAATCGTTGTTGAGTGAAAGG
GAGCTTGAAAATACTGCTTTGCGGGAACAAATACGACAGTTTGAAATACGTTGGTCGGAATACGAAAGCAAAATGAAGGC
CGTTGAGGAGATGTGGCAGAGCCAAATGGCATCTTTACAAATGAATCTTGCTGCAGCCAAGAAGACTCTTGGTTCCGGCA
TTTCCGATGTGCAAATTGGAAGACCCGTTGATTCACTGTCACCCAATTTTTACGATTCTGAGGATACCATATCAGGAATA
CAAACTCCTTCGCAAATGACACCTGTCAGAATCGGAAACAGTAGACGCGGAAGCAATGTTGTTATTTCCGACACGATTGA
TAACTTATCCAAAGAATTCGAGCAAAGAAAACAGAATTTTGATAGTGATGCTAAAGCTGTTACTAATGTGAATCGTGGAC
GTCCTCCTTCAAAACAGATTGAAGATTATAATAACTTAAAAAAGAAATTCGAGATTTGGAAAAAAGAGTACAGGAATCGG
TTACGCGAAGCCAAAACAAGACTTGTGAAGGGTGTACATGCCGAAAATGGCGGTGTTGGTGGTGGTGGTGATAAGCAGAC
GAGAAACTGGTGGGGGAAGTTAAGCAAGAGGGGGAAAGAAAGGGTCGTGTGA
Microexon DNA seq ATCTTTTTGAAAAG
Microexon Amino Acid seq YLFEK
Microexon-tag DNA Seq CATTATAGTGCAGAGGGGATGATAAGTGGTGCTTGTATCCAAACATATCTTTTTGAAAAGTCAAGAGTATCTCAAATATGTCGTGGAGAACGGTCGTACCATGTATTT
Microexon-tag Amino Acid seq HYSAEGMISGACIQTYLFEKSRVSQICRGERSYHVF
Transcript ID OTG19489
Gene ID Ha.54184
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 1.7e-197
Motif start 167
Motif end 807
Protein seq >OTG19489
MLLSSRSSLEVMLDSLRLRDNSGESSPDLPPALPSRPISKARLPKRLIRSTIAVQHVVSCDTSNNNEKLDVKCSSRSSGG
GGGCGFGRKKMIGAAAAATSSVSVDSIGYFIRNKLGVWCKSRNDRWELGKIESAVGEDVTVRVSNGNMITVSRRELLPAN
TDVLDGVDDLVELSYLNEPSVLHSLQYRYNRDIIYSKAGPVLLAFNPFKDVNIYGDDFVAVYKEKILDNPHVYAVADAAY
NDMMKDGVNQSIIISGESGAGKTETAKFAMQYLASVKSQNYEMKSKLIQTSCILEAFGNAKTSRNWNSSRFGKLIDIHYS
AEGMISGACIQTYLFEKSRVSQICRGERSYHVFYQMCAGAPPVLKDKLNLKMSSEYKFLNQSGCLKINGVDDAHNFIKLV
DAFDTLGIHGLDQENIFELLAAILWLGNISFAAIDEELVEPVADEASRSAARLMGCKMDDLMMVLSTNRAHNMTEPLTLQ
QATDKRNTLANFVYESLFNWLIEEVNTSLKGNTQHTRHAISILDTYGFESLQRNSLQQLFINYADERLQQHFIRHLCKLE
QEEYELEGIHWKKVEFEDNQECLDLFEKKPMGIISMLNECSNSSIATDTTFTEKIKQHLSYNLCISCEEGAFRVRHYARE
VQYDASGLLEKDSDKLQFDTIQLLSSCKKPLNLSGSASGVMNQVQAAGQSVGSKFMDHLSKLINQMENSKQHFIRCIKPN
TKKLPGIYESDIVWEQLKCSQVMEVMQISKSRYPLRFTHQEFASRFGCLLSTNVMCMDPLSTSVAILQQHRVPTQMYQVG
FTKLFFRGQVDALENLRQEVLGSTRELDNRFLGGRVLVDFHELKFGIVTLQSFIRGENARREFNVLKKQNHGIALSSLDE
HMTTVVHIQSVVRGWLARKHFNHMQSWKKSGLDRSRSQRKSSRKHVELRDLLQENVHVLPQNIEELQKRVVKAESLLSER
ELENTALREQIRQFEIRWSEYESKMKAVEEMWQSQMASLQMNLAAAKKTLGSGISDVQIGRPVDSLSPNFYDSEDTISGI
QTPSQMTPVRIGNSRRGSNVVISDTIDNLSKEFEQRKQNFDSDAKAVTNVNRGRPPSKQIEDYNNLKKKFEIWKKEYRNR
LREAKTRLVKGVHAENGGVGGGGDKQTRNWWGKLSKRGKERVV*
CDS seq >OTG19489
ATGTTACTTTCATCTCGGAGTTCGTTGGAAGTAATGCTCGATTCTCTCCGCCTGAGAGACAACTCCGGCGAGAGTTCACC
GGACTTGCCGCCGGCGTTACCGTCACGACCGATCTCCAAAGCTCGCCTCCCGAAGCGGCTGATCCGGTCCACAATTGCAG
TACAGCATGTTGTTTCTTGTGATACAAGTAACAACAACGAAAAACTAGATGTTAAATGCAGCAGCAGAAGCAGTGGTGGT
GGTGGTGGTTGTGGTTTTGGACGGAAGAAGATGATCGGAGCTGCGGCTGCCGCCACGTCATCAGTTTCGGTTGACAGTAT
TGGTTATTTCATTAGAAATAAACTTGGAGTGTGGTGTAAGTCGCGAAACGATCGGTGGGAGTTAGGGAAAATCGAATCAG
CGGTTGGTGAAGATGTGACTGTTAGGGTTTCGAATGGGAATATGATTACGGTGTCTAGACGAGAGTTGTTACCTGCGAAT
ACGGATGTTCTTGATGGAGTTGATGATCTTGTTGAACTTAGTTATTTAAACGAACCATCGGTTCTTCATAGTCTTCAATA
TAGATACAATCGCGATATAATTTATAGTAAGGCGGGCCCTGTATTATTAGCGTTTAATCCCTTCAAGGATGTAAATATTT
ACGGAGATGATTTTGTGGCGGTGTATAAAGAGAAGATTTTGGATAATCCTCATGTATATGCTGTGGCTGATGCTGCATAC
AATGATATGATGAAAGATGGTGTGAATCAATCTATTATTATCAGTGGCGAAAGTGGAGCTGGGAAGACAGAAACTGCAAA
GTTTGCAATGCAATATCTGGCATCGGTTAAAAGCCAGAACTATGAGATGAAATCTAAACTAATTCAAACCAGTTGTATCC
TGGAAGCTTTCGGGAATGCAAAGACTTCCAGAAATTGGAATTCTAGCCGATTTGGGAAATTAATAGATATACATTATAGT
GCAGAGGGGATGATAAGTGGTGCTTGTATCCAAACATATCTTTTTGAAAAGTCAAGAGTATCTCAAATATGTCGTGGAGA
ACGGTCGTACCATGTATTTTATCAGATGTGTGCTGGGGCCCCACCTGTTCTTAAAGATAAGCTGAATTTAAAAATGTCAA
GCGAGTACAAGTTTCTTAATCAGAGTGGATGCTTGAAAATTAATGGTGTTGATGATGCCCACAACTTTATAAAGCTAGTG
GATGCCTTTGATACACTTGGAATTCATGGTCTGGATCAAGAAAACATATTTGAATTGCTTGCTGCAATTCTATGGCTTGG
GAATATTTCATTTGCAGCAATCGATGAAGAACTTGTTGAGCCTGTGGCTGATGAAGCGAGTCGAAGTGCTGCTAGGTTAA
TGGGCTGCAAGATGGATGACCTCATGATGGTGTTATCTACCAACAGAGCTCATAATATGACCGAACCATTGACATTGCAG
CAGGCAACTGACAAAAGAAACACATTGGCAAATTTTGTTTATGAGAGCCTGTTTAATTGGCTCATTGAAGAAGTTAATAC
ATCACTTAAAGGAAACACACAACATACTCGACACGCCATAAGCATATTAGACACGTACGGATTTGAGTCATTGCAGAGAA
ATAGCTTACAGCAGTTGTTTATAAACTACGCTGATGAGAGACTGCAGCAGCACTTCATTCGCCATCTTTGTAAGCTTGAA
CAAGAGGAGTACGAATTAGAAGGAATCCACTGGAAAAAAGTAGAGTTTGAAGACAACCAAGAGTGTTTGGATCTGTTTGA
GAAGAAACCAATGGGGATAATATCAATGCTCAATGAGTGTTCAAATTCCTCCATAGCCACAGATACGACATTCACCGAAA
AGATTAAACAACACCTAAGTTATAATCTTTGTATTAGCTGTGAAGAAGGAGCTTTCAGGGTTCGCCACTATGCGCGAGAG
GTTCAATATGATGCTTCAGGGTTGTTGGAAAAGGACAGTGATAAATTACAGTTTGATACCATCCAACTTTTGTCATCTTG
TAAGAAACCCTTGAACCTTTCGGGTTCGGCCTCTGGTGTGATGAACCAGGTCCAAGCAGCAGGTCAAAGTGTTGGGTCAA
AGTTCATGGATCACTTGTCCAAATTAATCAACCAAATGGAGAATTCAAAGCAACACTTCATTCGATGCATAAAACCAAAT
ACCAAAAAACTTCCTGGAATCTACGAAAGCGACATCGTATGGGAACAACTTAAATGTAGCCAAGTTATGGAGGTAATGCA
AATATCAAAATCAAGATACCCGTTACGCTTCACACATCAAGAATTTGCTAGCCGGTTTGGCTGCCTTTTATCGACGAATG
TTATGTGTATGGATCCATTGAGTACATCGGTTGCTATTCTGCAGCAGCATCGAGTACCCACGCAAATGTACCAAGTTGGA
TTTACAAAATTGTTTTTCCGAGGACAGGTTGATGCATTGGAGAATTTGAGACAAGAAGTTCTAGGAAGTACTCGTGAACT
CGATAACCGTTTCCTTGGTGGTCGAGTTCTTGTTGATTTTCATGAGTTGAAGTTTGGAATTGTGACATTGCAGTCATTTA
TTCGTGGTGAAAATGCAAGAAGGGAGTTTAATGTTTTGAAGAAACAGAACCATGGGATTGCACTAAGTTCACTTGATGAA
CACATGACAACAGTTGTACATATACAATCAGTTGTTCGTGGGTGGTTGGCTCGGAAGCATTTCAATCACATGCAAAGTTG
GAAAAAATCAGGCCTTGATAGATCAAGAAGCCAGCGAAAGTCAAGCAGGAAACATGTAGAATTGAGGGATTTGTTACAGG
AAAATGTACATGTTTTACCACAAAATATTGAAGAGCTGCAAAAACGAGTAGTGAAGGCGGAATCGTTGTTGAGTGAAAGG
GAGCTTGAAAATACTGCTTTGCGGGAACAAATACGACAGTTTGAAATACGTTGGTCGGAATACGAAAGCAAAATGAAGGC
CGTTGAGGAGATGTGGCAGAGCCAAATGGCATCTTTACAAATGAATCTTGCTGCAGCCAAGAAGACTCTTGGTTCCGGCA
TTTCCGATGTGCAAATTGGAAGACCCGTTGATTCACTGTCACCCAATTTTTACGATTCTGAGGATACCATATCAGGAATA
CAAACTCCTTCGCAAATGACACCTGTCAGAATCGGAAACAGTAGACGCGGAAGCAATGTTGTTATTTCCGACACGATTGA
TAACTTATCCAAAGAATTCGAGCAAAGAAAACAGAATTTTGATAGTGATGCTAAAGCTGTTACTAATGTGAATCGTGGAC
GTCCTCCTTCAAAACAGATTGAAGATTATAATAACTTAAAAAAGAAATTCGAGATTTGGAAAAAAGAGTACAGGAATCGG
TTACGCGAAGCCAAAACAAGACTTGTGAAGGGTGTACATGCCGAAAATGGCGGTGTTGGTGGTGGTGGTGATAAGCAGAC
GAGAAACTGGTGGGGGAAGTTAAGCAAGAGGGGGAAAGAAAGGGTCGTGTGA