Microexon ID Ha_3:132311206-132311214:+
Species Helianthus annuus
Coordinates 3:132311206..132311214
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCCAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCGTATCGAACCGGTTATCATTTTCAACCACCAAGTAACTGGATGAATGATCCCAATGGACCAATGTTATACCAAGGCGTCTACCACTTCTTCTACCAATACAACCCA
Microexon-tag Amino Acid Seq PYRTGYHFQPPSNWMNDPNGPMLYQGVYHFFYQYNP
Microexon-tag spanning region132311030-132312506
Microexon-tag prediction score0.9544
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG32010x
Reference Transcript ID OTG32010
Gene ID HannXRQ_Chr03g0081911
Gene Name NA
Transcript ID OTG32010
Protein ID OTG32010
Gene ID HannXRQ_Chr03g0081911
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 3.6e-97
Motif start 73
Motif end 391
Protein seq >OTG32010
MAHPHIHPFLLTTLKLTDTKTSIMNKLLSSFLTLCFLVLVFDTPTSNATGRNMVDGIFLPSQKNEQPYRTGYHFQPPSNW
MNDPNGPMLYQGVYHFFYQYNPLAPTFGTIVWGHAVSHDLVNWIHLDPAIYPTDEPDISSCWSGSATILPGNLPAMLYTG
SDSTSRQVQDLAWPKNRSDPFLREWVKSTHNPIITPPEGVKDDCFRDPSTAWLGPDGLWRIVVGGDRDNNGMAFLYQSPD
FVTWTRYENPLAAADSTGTWECPDFFPVPLNSTNGLDTSVVSSGSVLHVMKAGFEGHDWYTIGTYSPDRENFLPQNGLSL
SGSTLDLRYDYGNFYASKSFFDESKNRRVLWAWVPEKDSEEDDIEKGWAGLQSFPRALWIDRSGKQLIQWPVEEIETLRE
NEVKLKNKRLKSGSVVEIEGVTASQADVTISFKLEDLKEAEVVDTCSVDPQALCTDKGASSKGVIGPFGVLAMASKDLKE
QTAIFFRVFQNQKGRYSVLMCSDLSRSTIRSTIDKTSFGAFVDIDPRYDEISLRNLIDHSIIESFGAGGKTCITSRVYPQ
FLNYEDAHLFAFNNGTQSVKISQMSAWSMKNAEFTIDQTVKSAT*
CDS seq >OTG32010
ATGGCTCATCCTCATATTCACCCCTTTCTCTTGACAACCTTAAAACTCACGGATACCAAAACATCCATTATGAACAAACT
TCTTTCTTCTTTTCTTACTTTATGTTTTCTTGTTCTCGTCTTTGATACTCCGACCAGTAATGCCACTGGTCGGAATATGG
TAGACGGGATATTTCTGCCGAGCCAGAAAAATGAGCAACCGTATCGAACCGGTTATCATTTTCAACCACCAAGTAACTGG
ATGAATGATCCCAATGGACCAATGTTATACCAAGGCGTCTACCACTTCTTCTACCAATACAACCCACTGGCCCCCACTTT
CGGCACCATCGTGTGGGGCCACGCCGTATCCCACGACCTCGTCAACTGGATCCACCTAGACCCGGCAATCTACCCGACCG
ACGAACCCGACATCAGCAGCTGCTGGTCCGGATCCGCCACCATCCTCCCGGGTAACCTTCCAGCCATGCTCTACACCGGT
AGCGACTCCACTTCCCGCCAAGTCCAAGACCTCGCCTGGCCTAAAAATCGCTCCGACCCGTTTCTCCGCGAATGGGTCAA
ATCCACCCACAACCCGATAATAACCCCACCCGAAGGCGTCAAAGACGATTGTTTTCGAGACCCGAGCACCGCGTGGCTCG
GGCCCGATGGCTTATGGCGAATTGTTGTCGGTGGTGATCGTGACAACAACGGTATGGCGTTTTTGTACCAGAGTCCGGAT
TTTGTAACCTGGACTCGGTATGAGAATCCGCTTGCGGCAGCGGATTCTACGGGTACATGGGAGTGTCCCGACTTTTTTCC
TGTCCCGTTGAATAGTACCAACGGGCTGGATACGTCTGTTGTGAGTAGTGGGAGTGTTTTGCATGTGATGAAAGCGGGAT
TTGAAGGGCATGATTGGTACACGATCGGGACGTATAGTCCTGACCGCGAGAACTTTTTGCCGCAAAACGGGTTGAGTTTG
AGTGGGAGCACGTTGGATTTGAGATACGACTATGGAAACTTTTATGCGTCAAAGTCATTCTTTGACGAGTCGAAGAACAG
GAGGGTTTTGTGGGCGTGGGTTCCTGAAAAAGATTCGGAAGAAGATGATATTGAAAAAGGATGGGCTGGGCTTCAGTCGT
TTCCAAGGGCCCTTTGGATTGACAGAAGTGGGAAGCAGTTGATCCAGTGGCCGGTGGAGGAGATAGAAACACTTCGTGAA
AATGAAGTTAAGCTTAAAAACAAGAGGCTCAAATCCGGGTCTGTTGTCGAAATCGAGGGTGTTACTGCTTCTCAGGCGGA
TGTTACAATTTCGTTTAAATTGGAGGATTTGAAAGAGGCGGAGGTTGTGGATACGTGTTCGGTTGATCCGCAAGCACTTT
GTACCGACAAGGGTGCATCAAGCAAGGGTGTAATCGGGCCTTTTGGTGTTTTGGCTATGGCATCTAAAGACTTGAAAGAA
CAAACCGCGATTTTCTTTAGGGTTTTCCAAAACCAAAAAGGACGCTATTCTGTGCTCATGTGTAGCGACCTTAGCAGGTC
TACAATCAGAAGCACCATCGACAAGACAAGTTTTGGCGCGTTTGTTGACATAGATCCTCGATACGATGAGATCTCACTAA
GAAACTTGATCGACCACTCAATCATCGAGAGTTTCGGAGCAGGCGGAAAAACATGCATCACAAGTCGGGTCTATCCACAA
TTTCTTAACTATGAGGATGCTCATCTTTTTGCATTTAACAACGGCACTCAAAGCGTCAAGATTTCTCAAATGAGTGCTTG
GAGTATGAAAAATGCGGAATTTACCATCGACCAAACCGTAAAAAGTGCAACTTAA
Microexon DNA seq ATCCCAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCGTATCGAACCGGTTATCATTTTCAACCACCAAGTAACTGGATGAATGATCCCAATGGACCAATGTTATACCAAGGCGTCTACCACTTCTTCTACCAATACAACCCA
Microexon-tag Amino Acid seq PYRTGYHFQPPSNWMNDPNGPMLYQGVYHFFYQYNP
Transcript ID Ha.38384.1
Gene ID Ha.38384
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 3.6e-97
Motif start 73
Motif end 391
Protein seq >Ha.38384.1
MAHPHIHPFLLTTLKLTDTKTSIMNKLLSSFLTLCFLVLVFDTPTSNATGRNMVDGIFLPSQKNEQPYRTGYHFQPPSNW
MNDPNGPMLYQGVYHFFYQYNPLAPTFGTIVWGHAVSHDLVNWIHLDPAIYPTDEPDISSCWSGSATILPGNLPAMLYTG
SDSTSRQVQDLAWPKNRSDPFLREWVKSTHNPIITPPEGVKDDCFRDPSTAWLGPDGLWRIVVGGDRDNNGMAFLYQSPD
FVTWTRYENPLAAADSTGTWECPDFFPVPLNSTNGLDTSVVSSGSVLHVMKAGFEGHDWYTIGTYSPDRENFLPQNGLSL
SGSTLDLRYDYGNFYASKSFFDESKNRRVLWAWVPEKDSEEDDIEKGWAGLQSFPRALWIDRSGKQLIQWPVEEIETLRE
NEVKLKNKRLKSGSVVEIEGVTASQADVTISFKLEDLKEAEVVDTCSVDPQALCTDKGASSKGVIGPFGVLAMASKDLKE
QTAIFFRVFQNQKGRYSVLMCSDLSRSTIRSTIDKTSFGAFVDIDPRYDEISLRNLIDHSIIESFGAGGKTCITSRVYPQ
FLNYEDAHLFAFNNGTQSVKISQMSAWSMKNAEFTIDQTVKSAT*
CDS seq >Ha.38384.1
ATGGCTCATCCTCATATTCACCCCTTTCTCTTGACAACCTTAAAACTCACGGATACCAAAACATCCATTATGAACAAACT
TCTTTCTTCTTTTCTTACTTTATGTTTTCTTGTTCTCGTCTTTGATACTCCGACCAGTAATGCCACTGGTCGGAATATGG
TAGACGGGATATTTCTGCCGAGCCAGAAAAATGAGCAACCGTATCGAACCGGTTATCATTTTCAACCACCAAGTAACTGG
ATGAATGATCCCAATGGACCAATGTTATACCAAGGCGTCTACCACTTCTTCTACCAATACAACCCACTGGCCCCCACTTT
CGGCACCATCGTGTGGGGCCACGCCGTATCCCACGACCTCGTCAACTGGATCCACCTAGACCCGGCAATCTACCCGACCG
ACGAACCCGACATCAGCAGCTGCTGGTCCGGATCCGCCACCATCCTCCCGGGTAACCTTCCAGCCATGCTCTACACCGGT
AGCGACTCCACTTCCCGCCAAGTCCAAGACCTCGCCTGGCCTAAAAATCGCTCCGACCCGTTTCTCCGCGAATGGGTCAA
ATCCACCCACAACCCGATAATAACCCCACCCGAAGGCGTCAAAGACGATTGTTTTCGAGACCCGAGCACCGCGTGGCTCG
GGCCCGATGGCTTATGGCGAATTGTTGTCGGTGGTGATCGTGACAACAACGGTATGGCGTTTTTGTACCAGAGTCCGGAT
TTTGTAACCTGGACTCGGTATGAGAATCCGCTTGCGGCAGCGGATTCTACGGGTACATGGGAGTGTCCCGACTTTTTTCC
TGTCCCGTTGAATAGTACCAACGGGCTGGATACGTCTGTTGTGAGTAGTGGGAGTGTTTTGCATGTGATGAAAGCGGGAT
TTGAAGGGCATGATTGGTACACGATCGGGACGTATAGTCCTGACCGCGAGAACTTTTTGCCGCAAAACGGGTTGAGTTTG
AGTGGGAGCACGTTGGATTTGAGATACGACTATGGAAACTTTTATGCGTCAAAGTCATTCTTTGACGAGTCGAAGAACAG
GAGGGTTTTGTGGGCGTGGGTTCCTGAAAAAGATTCGGAAGAAGATGATATTGAAAAAGGATGGGCTGGGCTTCAGTCGT
TTCCAAGGGCCCTTTGGATTGACAGAAGTGGGAAGCAGTTGATCCAGTGGCCGGTGGAGGAGATAGAAACACTTCGTGAA
AATGAAGTTAAGCTTAAAAACAAGAGGCTCAAATCCGGGTCTGTTGTCGAAATCGAGGGTGTTACTGCTTCTCAGGCGGA
TGTTACAATTTCGTTTAAATTGGAGGATTTGAAAGAGGCGGAGGTTGTGGATACGTGTTCGGTTGATCCGCAAGCACTTT
GTACCGACAAGGGTGCATCAAGCAAGGGTGTAATCGGGCCTTTTGGTGTTTTGGCTATGGCATCTAAAGACTTGAAAGAA
CAAACCGCGATTTTCTTTAGGGTTTTCCAAAACCAAAAAGGACGCTATTCTGTGCTCATGTGTAGCGACCTTAGCAGGTC
TACAATCAGAAGCACCATCGACAAGACAAGTTTTGGCGCGTTTGTTGACATAGATCCTCGATACGATGAGATCTCACTAA
GAAACTTGATCGACCACTCAATCATCGAGAGTTTCGGAGCAGGCGGAAAAACATGCATCACAAGTCGGGTCTATCCACAA
TTTCTTAACTATGAGGATGCTCATCTTTTTGCATTTAACAACGGCACTCAAAGCGTCAAGATTTCTCAAATGAGTGCTTG
GAGTATGAAAAATGCGGAATTTACCATCGACCAAACCGTAAAAAGTGCAACTTAA