Microexon ID Ha_10:80465879-80465887:-
Species Helianthus annuus
Coordinates 10:80465879..80465887
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (IUPAC degenerate codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
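The Size field can be cross-checked against the Microexon ID. The following is a minimal sketch, assuming the ID follows the layout `<species prefix>_<chromosome>:<start>-<end>:<strand>` and that the coordinates are 1-based and inclusive; the function name `parse_microexon_id` is hypothetical and not part of the database.

```python
import re

def parse_microexon_id(microexon_id):
    """Split an ID such as 'Ha_10:80465879-80465887:-' into its parts.

    Assumed layout: <prefix>_<chromosome>:<start>-<end>:<strand>.
    """
    match = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chromosome, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "species_prefix": prefix,
        "chromosome": chromosome,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive 1-based coordinates are an assumption of this sketch.
        "size": end - start + 1,
    }

record = parse_microexon_id("Ha_10:80465879-80465887:-")
print(record["chromosome"], record["strand"], record["size"])
```

Under these assumptions the computed size agrees with the Size field of this entry (9 nt).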
Microexon DNA seq ATCCTGATG
Microexon Amino Acid seq DPDG
Microexon-tag DNA Seq TGGCAACGATCTGCTTACCACTTTCAACCCGATAAAAATTTCATTAGTGATCCTGATGGTCCACTATATTACAAGGGATGGTACCATTTATTCTACCAATACAATCCG
Microexon-tag Amino Acid Seq WQRSAYHFQPDKNFISDPDGPLYYKGWYHLFYQYNP
Microexon-tag spanning region 80463224-80466076
Microexon-tag prediction score 0.9498
Overlapped with the annotated transcript (%) 100
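The relationship between the tag structure (49,9,50), the microexon location (2), the phase, and the listed sequences can be checked with a short script. This is a sketch under stated assumptions: the standard genetic code (NCBI translation table 1) is used, the tag is taken to start in reading frame, and phase is interpreted as the upstream flanking length modulo 3. The helper names `split_tag` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

TAG_DNA = (
    "TGGCAACGATCTGCTTACCACTTTCAACCCGATAAAAATTTCATTAGTG"  # upstream flank
    "ATCCTGATG"                                          # microexon
    "GTCCACTATATTACAAGGGATGGTACCATTTATTCTACCAATACAATCCG" # downstream flank
)
TAG_STRUCTURE = (49, 9, 50)   # flanking exon, microexon, flanking exon sizes
MICROEXON_LOCATION = 2        # 1-based position of the microexon in the tag

def split_tag(tag, sizes):
    """Cut the tag into consecutive exon segments of the given sizes."""
    parts, offset = [], 0
    for size in sizes:
        parts.append(tag[offset:offset + size])
        offset += size
    if offset != len(tag):
        raise ValueError("segment sizes do not add up to the tag length")
    return parts

def translate(dna):
    """Translate in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

exons = split_tag(TAG_DNA, TAG_STRUCTURE)
microexon = exons[MICROEXON_LOCATION - 1]

# Assumed phase definition: bases upstream of the microexon, modulo 3.
upstream_len = sum(TAG_STRUCTURE[:MICROEXON_LOCATION - 1])
phase = upstream_len % 3

# Residues whose codons overlap the microexon (split codons included).
tag_aa = translate(TAG_DNA)
first_codon = upstream_len // 3
last_codon = (upstream_len + len(microexon) - 1) // 3
microexon_aa = tag_aa[first_codon:last_codon + 1]

print(microexon, phase, microexon_aa)
print(tag_aa)
```

Because the microexon begins one base into a codon, its 9 nt touch four codons, which is why the amino acid field lists four residues rather than three.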
New Transcript ID OTG10728x
Reference Transcript ID OTG10728
Gene ID HannXRQ_Chr10g0290911
Gene Name NA
Transcript ID OTG10728
Protein ID OTG10728
Gene ID HannXRQ_Chr10g0290911
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.3e-96
Motif start 93
Motif end 412
Protein seq >OTG10728
MASTPTTPLITHNDLEQRPESTESPPGRSSIVKILTGLFVSILVLSSLAAIPHRKTHLQSTTTTTVDIEPSTSSPKEVVG
ADDSIEWQRSAYHFQPDKNFISDPDGPLYYKGWYHLFYQYNPDSAIWGNITWGHAVSKDLINWFHLPLAMVPDHWYDIQG
VMTGSATILPDGQIFMLYSGNAYDLSQLQCLAYPKNASDPLLIEWVKYEGNPILFPPPGVGLKDFRDPSSLWIGPDGKYR
MVMGSKHNNTIGCALIYHTTNFTHFELLDEVLHSVQGTGMWECVDLYPVSTTETNGLDMSNHESGAKYVLKQSGDEDRHD
WYAIGTYDVVHDKWYPDDPEMDLGIGLRYDYGKFYASKTFYDPSKKRRVLWGYVGETDPQKDDLEKGWANILNVPRTVVL
DTKTQSNLIQWPVEETETLRSEECDEFKDVELRPGSLVPLDVGSATQLDISASFEVDEALLGATLEADVLFNCTTSEGST
MRGVLGPFGLVVLADSALSEQTPVYFYIAKNLDGTSRTYFCADESRSSKLLDVGKMVYGSSVPVLHGENYNMRLLVDHSI
VESFAQGGRTVITSRVYPTMAIYDAAKVFVFNNATGITVKASLKIWKMGGAQLNPFPF*
CDS seq >OTG10728
ATGGCTTCCACCCCCACCACCCCTCTTATTACTCACAATGACCTTGAACAACGCCCGGAATCGACCGAGTCTCCACCCGG
TCGATCATCCATCGTAAAGATCCTCACTGGATTATTTGTGTCCATTCTTGTTCTTTCATCATTGGCTGCAATTCCACACC
GGAAAACTCACTTGCAGTCCACCACTACCACCACAGTTGATATTGAACCATCGACAAGCAGTCCGAAGGAGGTTGTGGGA
GCTGACGATAGCATTGAATGGCAACGATCTGCTTACCACTTTCAACCCGATAAAAATTTCATTAGTGATCCTGATGGTCC
ACTATATTACAAGGGATGGTACCATTTATTCTACCAATACAATCCGGATTCAGCCATTTGGGGCAACATAACATGGGGTC
ATGCAGTCTCGAAAGACCTCATCAATTGGTTCCACCTCCCTTTAGCCATGGTTCCGGATCACTGGTACGACATCCAAGGT
GTCATGACTGGGTCCGCCACCATCCTCCCCGATGGCCAAATCTTTATGCTCTATAGCGGCAACGCCTACGACCTCTCTCA
GCTTCAATGCCTCGCGTACCCCAAAAATGCTTCTGATCCCCTTCTTATCGAATGGGTCAAATACGAAGGCAACCCAATTC
TCTTCCCTCCTCCGGGCGTTGGTCTTAAAGACTTTAGGGACCCGTCATCTCTTTGGATTGGGCCCGATGGGAAGTACCGA
ATGGTTATGGGGTCCAAGCACAATAATACAATTGGTTGTGCTTTAATTTACCACACCACTAATTTCACCCATTTTGAATT
GTTGGATGAGGTGCTCCATTCGGTTCAGGGTACGGGTATGTGGGAATGTGTTGATCTTTACCCGGTATCCACGACCGAGA
CAAACGGGTTGGATATGTCGAATCATGAGTCGGGTGCTAAGTATGTGTTGAAGCAAAGTGGGGATGAGGATAGACATGAT
TGGTATGCAATTGGGACATATGACGTGGTTCATGATAAATGGTATCCGGATGATCCGGAAATGGATTTGGGTATCGGGTT
GAGATATGATTATGGAAAGTTTTATGCCTCGAAGACGTTTTATGACCCGAGTAAGAAGAGGAGGGTCTTATGGGGCTATG
TTGGTGAAACGGATCCTCAAAAAGATGACCTCGAGAAAGGATGGGCCAATATTTTGAATGTTCCTAGAACCGTGGTGTTG
GACACGAAGACTCAAAGTAACTTGATTCAATGGCCGGTCGAGGAAACAGAGACATTGAGATCTGAAGAGTGCGATGAGTT
CAAAGATGTCGAGTTGCGGCCTGGATCACTTGTCCCGCTTGATGTAGGCTCAGCCACACAGTTGGACATAAGTGCCTCAT
TCGAGGTTGATGAAGCTTTGCTTGGTGCAACCTTAGAAGCCGATGTGTTGTTCAACTGCACCACGAGCGAGGGTTCAACC
ATGAGGGGTGTTTTGGGACCGTTTGGGCTTGTGGTTCTTGCAGATTCGGCACTTTCAGAACAAACTCCTGTTTACTTCTA
CATTGCAAAAAACTTGGATGGCACTTCAAGAACTTATTTCTGTGCTGATGAATCAAGATCATCAAAGCTTTTAGATGTGG
GCAAGATGGTATATGGAAGTAGTGTTCCTGTACTCCATGGTGAAAACTACAACATGAGGTTATTGGTGGACCATTCAATA
GTCGAAAGCTTTGCACAAGGAGGAAGAACGGTGATTACATCAAGAGTGTATCCTACAATGGCAATCTATGATGCAGCCAA
AGTGTTTGTGTTCAACAATGCAACTGGAATCACTGTTAAGGCATCTCTCAAGATTTGGAAGATGGGTGGAGCACAACTTA
ACCCTTTTCCTTTCTAA
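The position of the microexon relative to the Glyco_hydro_32N motif can be worked out from the protein sequence above. The sketch below assumes that Motif start and Motif end are 1-based, inclusive protein coordinates, and it uses only the first 160 residues of the protein (enough to contain the tag). The names `locate`, `PROTEIN_N_TERM`, and the other constants are hypothetical.

```python
# First two lines (160 residues) of the OTG10728 protein sequence.
PROTEIN_N_TERM = (
    "MASTPTTPLITHNDLEQRPESTESPPGRSSIVKILTGLFVSILVLSSLAAIPHRKTHLQSTTTTTVDIEPSTSSPKEVVG"
    "ADDSIEWQRSAYHFQPDKNFISDPDGPLYYKGWYHLFYQYNPDSAIWGNITWGHAVSKDLINWFHLPLAMVPDHWYDIQG"
)
TAG_AA = "WQRSAYHFQPDKNFISDPDGPLYYKGWYHLFYQYNP"
MICROEXON_AA = "DPDG"
UPSTREAM_FLANK_NT = 49            # from the tag structure 49,9,50
MICROEXON_NT = 9
MOTIF_START, MOTIF_END = 93, 412  # assumed 1-based, inclusive

def locate(sub, seq):
    """Return the 1-based inclusive (start, end) of the first match of sub in seq."""
    index = seq.find(sub)
    if index < 0:
        raise ValueError("subsequence not found")
    return index + 1, index + len(sub)

tag_start, tag_end = locate(TAG_AA, PROTEIN_N_TERM)

# Microexon residues, located through their offset inside the tag peptide.
offset_in_tag = TAG_AA.find(MICROEXON_AA)
micro_start = tag_start + offset_in_tag
micro_end = micro_start + len(MICROEXON_AA) - 1

# Corresponding 1-based CDS coordinates of the 9-nt microexon.
cds_start = (tag_start - 1) * 3 + UPSTREAM_FLANK_NT + 1
cds_end = cds_start + MICROEXON_NT - 1

inside_motif = MOTIF_START <= micro_start and micro_end <= MOTIF_END

print("tag residues:", tag_start, tag_end)
print("microexon residues:", micro_start, micro_end)
print("microexon CDS positions:", cds_start, cds_end)
print("microexon inside motif:", inside_motif)
```

Under these assumptions the tag peptide starts a few residues before the motif start, while the residues encoded by the microexon fall inside the annotated motif range.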
Transcript ID OTG10728
Gene ID Ha.4574
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.3e-96
Motif start 93
Motif end 412
Protein seq >OTG10728
MASTPTTPLITHNDLEQRPESTESPPGRSSIVKILTGLFVSILVLSSLAAIPHRKTHLQSTTTTTVDIEPSTSSPKEVVG
ADDSIEWQRSAYHFQPDKNFISDPDGPLYYKGWYHLFYQYNPDSAIWGNITWGHAVSKDLINWFHLPLAMVPDHWYDIQG
VMTGSATILPDGQIFMLYSGNAYDLSQLQCLAYPKNASDPLLIEWVKYEGNPILFPPPGVGLKDFRDPSSLWIGPDGKYR
MVMGSKHNNTIGCALIYHTTNFTHFELLDEVLHSVQGTGMWECVDLYPVSTTETNGLDMSNHESGAKYVLKQSGDEDRHD
WYAIGTYDVVHDKWYPDDPEMDLGIGLRYDYGKFYASKTFYDPSKKRRVLWGYVGETDPQKDDLEKGWANILNVPRTVVL
DTKTQSNLIQWPVEETETLRSEECDEFKDVELRPGSLVPLDVGSATQLDISASFEVDEALLGATLEADVLFNCTTSEGST
MRGVLGPFGLVVLADSALSEQTPVYFYIAKNLDGTSRTYFCADESRSSKLLDVGKMVYGSSVPVLHGENYNMRLLVDHSI
VESFAQGGRTVITSRVYPTMAIYDAAKVFVFNNATGITVKASLKIWKMGGAQLNPFPF*
CDS seq >OTG10728
ATGGCTTCCACCCCCACCACCCCTCTTATTACTCACAATGACCTTGAACAACGCCCGGAATCGACCGAGTCTCCACCCGG
TCGATCATCCATCGTAAAGATCCTCACTGGATTATTTGTGTCCATTCTTGTTCTTTCATCATTGGCTGCAATTCCACACC
GGAAAACTCACTTGCAGTCCACCACTACCACCACAGTTGATATTGAACCATCGACAAGCAGTCCGAAGGAGGTTGTGGGA
GCTGACGATAGCATTGAATGGCAACGATCTGCTTACCACTTTCAACCCGATAAAAATTTCATTAGTGATCCTGATGGTCC
ACTATATTACAAGGGATGGTACCATTTATTCTACCAATACAATCCGGATTCAGCCATTTGGGGCAACATAACATGGGGTC
ATGCAGTCTCGAAAGACCTCATCAATTGGTTCCACCTCCCTTTAGCCATGGTTCCGGATCACTGGTACGACATCCAAGGT
GTCATGACTGGGTCCGCCACCATCCTCCCCGATGGCCAAATCTTTATGCTCTATAGCGGCAACGCCTACGACCTCTCTCA
GCTTCAATGCCTCGCGTACCCCAAAAATGCTTCTGATCCCCTTCTTATCGAATGGGTCAAATACGAAGGCAACCCAATTC
TCTTCCCTCCTCCGGGCGTTGGTCTTAAAGACTTTAGGGACCCGTCATCTCTTTGGATTGGGCCCGATGGGAAGTACCGA
ATGGTTATGGGGTCCAAGCACAATAATACAATTGGTTGTGCTTTAATTTACCACACCACTAATTTCACCCATTTTGAATT
GTTGGATGAGGTGCTCCATTCGGTTCAGGGTACGGGTATGTGGGAATGTGTTGATCTTTACCCGGTATCCACGACCGAGA
CAAACGGGTTGGATATGTCGAATCATGAGTCGGGTGCTAAGTATGTGTTGAAGCAAAGTGGGGATGAGGATAGACATGAT
TGGTATGCAATTGGGACATATGACGTGGTTCATGATAAATGGTATCCGGATGATCCGGAAATGGATTTGGGTATCGGGTT
GAGATATGATTATGGAAAGTTTTATGCCTCGAAGACGTTTTATGACCCGAGTAAGAAGAGGAGGGTCTTATGGGGCTATG
TTGGTGAAACGGATCCTCAAAAAGATGACCTCGAGAAAGGATGGGCCAATATTTTGAATGTTCCTAGAACCGTGGTGTTG
GACACGAAGACTCAAAGTAACTTGATTCAATGGCCGGTCGAGGAAACAGAGACATTGAGATCTGAAGAGTGCGATGAGTT
CAAAGATGTCGAGTTGCGGCCTGGATCACTTGTCCCGCTTGATGTAGGCTCAGCCACACAGTTGGACATAAGTGCCTCAT
TCGAGGTTGATGAAGCTTTGCTTGGTGCAACCTTAGAAGCCGATGTGTTGTTCAACTGCACCACGAGCGAGGGTTCAACC
ATGAGGGGTGTTTTGGGACCGTTTGGGCTTGTGGTTCTTGCAGATTCGGCACTTTCAGAACAAACTCCTGTTTACTTCTA
CATTGCAAAAAACTTGGATGGCACTTCAAGAACTTATTTCTGTGCTGATGAATCAAGATCATCAAAGCTTTTAGATGTGG
GCAAGATGGTATATGGAAGTAGTGTTCCTGTACTCCATGGTGAAAACTACAACATGAGGTTATTGGTGGACCATTCAATA
GTCGAAAGCTTTGCACAAGGAGGAAGAACGGTGATTACATCAAGAGTGTATCCTACAATGGCAATCTATGATGCAGCCAA
AGTGTTTGTGTTCAACAATGCAACTGGAATCACTGTTAAGGCATCTCTCAAGATTTGGAAGATGGGTGGAGCACAACTTA
ACCCTTTTCCTTTCTAA