Microexon ID Ha_7:91471026-91471034:+
Species Helianthus annuus
Coordinates 7:91471026..91471034
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTGATG
Microexon Amino Acid seq DPDG
Microexon-tag DNA Seq TGGCAACGATCCGCTTATCATTTTCAACCCGACAAAAATTTCATTAGTGATCCTGATGGCCCAATGTATCACATGGGATGGTACCATCTATTCTATCAGTACAACCCT
Microexon-tag Amino Acid Seq WQRSAYHFQPDKNFISDPDGPMYHMGWYHLFYQYNP
Microexon-tag spanning region91470531-91471403
Microexon-tag prediction score0.953
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG21392x
Reference Transcript ID OTG21392
Gene ID HannXRQ_Chr07g0203641
Gene Name NA
Transcript ID OTG21392
Protein ID OTG21392
Gene ID HannXRQ_Chr07g0203641
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 3.2e-96
Motif start 98
Motif end 417
Protein seq >OTG21392
MMASSTTTTPLILHDDPENLPELTGSPTTRRLSIAKVLSGILVSVLVTCALVALINNQTYEPPAATTFATQLPNIDLKRV
PGKLDSSAEVEWQRSAYHFQPDKNFISDPDGPMYHMGWYHLFYQYNPESAIWGNITWGHSVSKDMINWFHLPFAMVPDHW
YDIEGVMTGSATVLPNGQIIMLYTGNAYDLSQVQCLAYAVNSSDPLLIEWKKYEGNPVLFPPPGVGYKDFRDPSTLWLGP
DGEYRMVMGSKHNETIGCALIYHTTNFTHFELKEEVLHAVPHTGMWECVDLYPVSTVHTNGLDMVDNGPNVKYVLKQSGD
EDRHDWYAIGSYDVVNDKWYPDDPENDVGIGLRYDFGKFYASKTFYDQHKKRRVLWGYVGETDPQKYDISKGWANILNIP
RTVVLDTKTKTNLIQWPIEETENLRSKTYDEFKDVELRPGSLVPLEIGTATQLDIVATFEIDQKMLESTLEADVLFNCTT
SEGSVARGALGPFGVVVLADAQRSEQLPVYFYIAKDIDGTSRTYFCADETRSSKDVSVGKWVYGSSVPVLPGEKYNMRLL
VDHSIVEGFAQNGRTVVTSRVYPTKAIYNAAKVFLFNNATGISVKASIKIWKMAKAELNPFPLPGWTFEL*
CDS seq >OTG21392
ATGATGGCTTCATCCACCACCACCACCCCTCTCATTCTCCATGATGACCCTGAAAACCTCCCAGAACTCACCGGATCTCC
GACAACTCGTCGTCTATCCATCGCAAAAGTGCTTTCGGGGATCCTTGTTTCGGTTCTAGTTACATGTGCTCTTGTTGCTT
TAATCAACAACCAAACATATGAACCACCCGCGGCCACCACATTCGCAACTCAGTTGCCAAATATTGATCTGAAGCGGGTT
CCAGGAAAGTTGGATTCGAGTGCTGAGGTTGAATGGCAACGATCCGCTTATCATTTTCAACCCGACAAAAATTTCATTAG
TGATCCTGATGGCCCAATGTATCACATGGGATGGTACCATCTATTCTATCAGTACAACCCTGAATCTGCCATCTGGGGCA
ACATCACATGGGGCCACTCGGTATCGAAAGACATGATCAACTGGTTCCATCTCCCTTTCGCCATGGTTCCTGACCATTGG
TACGACATCGAAGGTGTCATGACGGGTTCGGCTACAGTCCTCCCTAATGGTCAAATCATCATGCTTTACACGGGCAACGC
GTACGATCTCTCCCAAGTACAATGCTTGGCATACGCTGTCAACTCGTCGGATCCCCTTCTTATAGAGTGGAAAAAATATG
AAGGTAACCCTGTCTTGTTCCCACCACCAGGAGTGGGCTACAAGGACTTTCGGGACCCATCCACATTGTGGTTGGGCCCT
GATGGTGAATATAGAATGGTAATGGGGTCCAAGCACAACGAGACTATTGGATGTGCTTTGATTTACCATACCACTAATTT
TACGCATTTTGAATTGAAAGAGGAGGTGCTTCATGCAGTCCCACATACTGGTATGTGGGAATGTGTTGATCTTTACCCAG
TGTCCACCGTACACACAAACGGGTTGGACATGGTGGATAACGGGCCAAATGTTAAATACGTGTTGAAACAAAGTGGGGAT
GAAGATCGCCATGATTGGTATGCAATTGGAAGTTATGATGTGGTGAATGATAAGTGGTACCCGGATGACCCGGAAAATGA
TGTGGGTATTGGATTAAGATATGATTTTGGAAAATTTTATGCGTCCAAGACTTTTTATGACCAACATAAGAAGAGGAGGG
TCCTTTGGGGCTATGTTGGAGAAACCGATCCCCAAAAGTATGACATTTCAAAGGGATGGGCTAACATTTTGAATATTCCA
AGAACCGTCGTTTTGGACACAAAAACCAAAACCAATTTGATTCAATGGCCAATCGAGGAAACCGAAAACCTTAGGTCAAA
AACGTACGATGAATTTAAAGACGTGGAGCTTCGACCCGGGTCACTCGTTCCCCTTGAGATAGGCACAGCCACACAGTTGG
ATATAGTTGCGACATTCGAAATCGACCAAAAGATGTTGGAATCAACGCTAGAGGCCGATGTTCTATTCAATTGCACGACT
AGTGAAGGCTCGGTTGCAAGGGGTGCGTTGGGACCGTTTGGTGTGGTGGTTCTAGCCGATGCCCAACGCTCCGAACAACT
TCCTGTATACTTCTATATCGCAAAAGATATCGATGGAACCTCACGAACTTACTTTTGTGCCGATGAAACAAGATCATCCA
AGGATGTAAGCGTAGGGAAATGGGTGTACGGAAGCAGTGTTCCTGTCCTCCCAGGCGAAAAGTACAATATGAGGTTATTG
GTGGATCATTCGATAGTGGAGGGATTTGCACAAAACGGGAGAACCGTGGTGACATCAAGAGTGTATCCAACAAAGGCGAT
CTACAACGCTGCGAAGGTGTTTTTGTTCAACAACGCGACTGGGATCAGTGTGAAGGCGTCGATCAAGATCTGGAAGATGG
CGAAAGCAGAACTCAATCCTTTCCCTCTTCCTGGGTGGACTTTTGAACTTTGA
Microexon DNA seq ATCCTGATG
Microexon Amino Acid seq DPDG
Microexon-tag DNA Seq TGGCAACGATCCGCTTATCATTTTCAACCCGACAAAAATTTCATTAGTGATCCTGATGGCCCAATGTATCACATGGGATGGTACCATCTATTCTATCAGTACAACCCT
Microexon-tag Amino Acid seq WQRSAYHFQPDKNFISDPDGPMYHMGWYHLFYQYNP
Transcript ID OTG21392
Gene ID Ha.51036
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 3.2e-96
Motif start 98
Motif end 417
Protein seq >OTG21392
MMASSTTTTPLILHDDPENLPELTGSPTTRRLSIAKVLSGILVSVLVTCALVALINNQTYEPPAATTFATQLPNIDLKRV
PGKLDSSAEVEWQRSAYHFQPDKNFISDPDGPMYHMGWYHLFYQYNPESAIWGNITWGHSVSKDMINWFHLPFAMVPDHW
YDIEGVMTGSATVLPNGQIIMLYTGNAYDLSQVQCLAYAVNSSDPLLIEWKKYEGNPVLFPPPGVGYKDFRDPSTLWLGP
DGEYRMVMGSKHNETIGCALIYHTTNFTHFELKEEVLHAVPHTGMWECVDLYPVSTVHTNGLDMVDNGPNVKYVLKQSGD
EDRHDWYAIGSYDVVNDKWYPDDPENDVGIGLRYDFGKFYASKTFYDQHKKRRVLWGYVGETDPQKYDISKGWANILNIP
RTVVLDTKTKTNLIQWPIEETENLRSKTYDEFKDVELRPGSLVPLEIGTATQLDIVATFEIDQKMLESTLEADVLFNCTT
SEGSVARGALGPFGVVVLADAQRSEQLPVYFYIAKDIDGTSRTYFCADETRSSKDVSVGKWVYGSSVPVLPGEKYNMRLL
VDHSIVEGFAQNGRTVVTSRVYPTKAIYNAAKVFLFNNATGISVKASIKIWKMAKAELNPFPLPGWTFEL*
CDS seq >OTG21392
ATGATGGCTTCATCCACCACCACCACCCCTCTCATTCTCCATGATGACCCTGAAAACCTCCCAGAACTCACCGGATCTCC
GACAACTCGTCGTCTATCCATCGCAAAAGTGCTTTCGGGGATCCTTGTTTCGGTTCTAGTTACATGTGCTCTTGTTGCTT
TAATCAACAACCAAACATATGAACCACCCGCGGCCACCACATTCGCAACTCAGTTGCCAAATATTGATCTGAAGCGGGTT
CCAGGAAAGTTGGATTCGAGTGCTGAGGTTGAATGGCAACGATCCGCTTATCATTTTCAACCCGACAAAAATTTCATTAG
TGATCCTGATGGCCCAATGTATCACATGGGATGGTACCATCTATTCTATCAGTACAACCCTGAATCTGCCATCTGGGGCA
ACATCACATGGGGCCACTCGGTATCGAAAGACATGATCAACTGGTTCCATCTCCCTTTCGCCATGGTTCCTGACCATTGG
TACGACATCGAAGGTGTCATGACGGGTTCGGCTACAGTCCTCCCTAATGGTCAAATCATCATGCTTTACACGGGCAACGC
GTACGATCTCTCCCAAGTACAATGCTTGGCATACGCTGTCAACTCGTCGGATCCCCTTCTTATAGAGTGGAAAAAATATG
AAGGTAACCCTGTCTTGTTCCCACCACCAGGAGTGGGCTACAAGGACTTTCGGGACCCATCCACATTGTGGTTGGGCCCT
GATGGTGAATATAGAATGGTAATGGGGTCCAAGCACAACGAGACTATTGGATGTGCTTTGATTTACCATACCACTAATTT
TACGCATTTTGAATTGAAAGAGGAGGTGCTTCATGCAGTCCCACATACTGGTATGTGGGAATGTGTTGATCTTTACCCAG
TGTCCACCGTACACACAAACGGGTTGGACATGGTGGATAACGGGCCAAATGTTAAATACGTGTTGAAACAAAGTGGGGAT
GAAGATCGCCATGATTGGTATGCAATTGGAAGTTATGATGTGGTGAATGATAAGTGGTACCCGGATGACCCGGAAAATGA
TGTGGGTATTGGATTAAGATATGATTTTGGAAAATTTTATGCGTCCAAGACTTTTTATGACCAACATAAGAAGAGGAGGG
TCCTTTGGGGCTATGTTGGAGAAACCGATCCCCAAAAGTATGACATTTCAAAGGGATGGGCTAACATTTTGAATATTCCA
AGAACCGTCGTTTTGGACACAAAAACCAAAACCAATTTGATTCAATGGCCAATCGAGGAAACCGAAAACCTTAGGTCAAA
AACGTACGATGAATTTAAAGACGTGGAGCTTCGACCCGGGTCACTCGTTCCCCTTGAGATAGGCACAGCCACACAGTTGG
ATATAGTTGCGACATTCGAAATCGACCAAAAGATGTTGGAATCAACGCTAGAGGCCGATGTTCTATTCAATTGCACGACT
AGTGAAGGCTCGGTTGCAAGGGGTGCGTTGGGACCGTTTGGTGTGGTGGTTCTAGCCGATGCCCAACGCTCCGAACAACT
TCCTGTATACTTCTATATCGCAAAAGATATCGATGGAACCTCACGAACTTACTTTTGTGCCGATGAAACAAGATCATCCA
AGGATGTAAGCGTAGGGAAATGGGTGTACGGAAGCAGTGTTCCTGTCCTCCCAGGCGAAAAGTACAATATGAGGTTATTG
GTGGATCATTCGATAGTGGAGGGATTTGCACAAAACGGGAGAACCGTGGTGACATCAAGAGTGTATCCAACAAAGGCGAT
CTACAACGCTGCGAAGGTGTTTTTGTTCAACAACGCGACTGGGATCAGTGTGAAGGCGTCGATCAAGATCTGGAAGATGG
CGAAAGCAGAACTCAATCCTTTCCCTCTTCCTGGGTGGACTTTTGAACTTTGA