Microexon ID Ps_NC_039364.1:11180157-11180165:+
Species Papaver somniferum
Coordinates NC_039364.1:11180157..11180165
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq CCTTACCGAACTGGTTATCATTTCCAACCTGCTAAAAATTGGATGAATGATCCTAATGCGCCTATGATTTACAATGGAATATACCATCTATTCTATCAATGGAGCACT
Microexon-tag Amino Acid Seq PYRTGYHFQPAKNWMNDPNAPMIYNGIYHLFYQWST
Microexon-tag spanning region11179644-11180344
Microexon-tag prediction score0.9286
Overlapped with the annotated transcript (%) 100
New Transcript ID XM_026543850.1x
Reference Transcript ID XM_026543850.1
Gene ID NA
Gene Name NA
Transcript ID XM_026543850.1
Protein ID XP_026399635.1
Gene ID LOC113295521
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.3e-99
Motif start 51
Motif end 371
Protein seq >XP_026399635.1
MAMNTVPWLIGLCCSVILSHGVQASHHLYRNFQSLQSNGTPIYQPYRTGYHFQPAKNWMNDPNAPMIYNGIYHLFYQWST
SGAVWGDIVWAHATSTDLVNWNHHKIAMVPSEPFDIEGVWSGSATFIQGKPIIQYTGIIKSNQWNYQVQDMAVPKDLSDP
YLVEWNKPIEYNPVIKPTDGINASAFRDPTTGWKGPDGIWRTTIGSKVGNNGVSFLFRSSDFIHWTKAEQPLHWAKDTGM
WECPDFFPVAVDDKEGLDTSVLGPSVKHVFKVSLDDTKHDYYTIGTYNPETDKYTPDEGSIDNDSGLRFDYGKFYASKTF
FDHGKKRRVLWGWANESSSVADDVKKGWAGLQAVPRTVWLEENGKQLKQWPVEEIEQLRTKRVSLRSQVLKGGSVLEVPG
LTASQADVDVKFKLPKLKDVEIMDPSWVNPQLLCSSKPASVQGKSGPFGLLALASKNLEEQTAIFFRVFNHQNKYVVLMC
SDQSRSSLEESNDKTTYGAFIDVDPHHDQTLSLRSLIDHSVVESYGVDGKAVILARVYPTLATDREAHLYVFNNGTGNVK
MSSLKAWSMKKAEIKYLKDLKHSDS*
CDS seq >XM_026543850.1
ATGGCCATGAATACTGTCCCATGGTTGATCGGATTGTGTTGTTCAGTTATCCTTAGTCATGGAGTTCAAGCTTCTCATCA
TCTATACAGAAACTTTCAATCTCTTCAATCTAATGGTACCCCCATATATCAGCCTTACCGAACTGGTTATCATTTCCAAC
CTGCTAAAAATTGGATGAATGATCCTAATGCGCCTATGATTTACAATGGAATATACCATCTATTCTATCAATGGAGCACT
TCCGGTGCAGTTTGGGGTGACATAGTGTGGGCTCATGCGACATCCACCGATCTCGTAAACTGGAATCACCATAAAATCGC
GATGGTCCCATCGGAACCATTTGATATCGAAGGTGTTTGGTCAGGCTCAGCAACATTTATACAAGGAAAGCCGATTATTC
AATACACGGGAATTATTAAGTCGAATCAATGGAACTACCAAGTTCAGGACATGGCAGTGCCGAAGGACTTATCTGATCCT
TACCTCGTAGAGTGGAATAAACCAATTGAATACAATCCAGTAATTAAACCCACTGACGGTATCAATGCTAGTGCTTTCAG
AGATCCGACTACTGGTTGGAAGGGTCCTGACGGAATCTGGAGAACGACAATAGGAAGCAAAGTGGGCAACAATGGAGTAT
CATTTTTATTTCGAAGTAGTGATTTTATTCATTGGACTAAAGCAGAACAACCTCTACATTGGGCGAAGGACACCGGAATG
TGGGAATGCCCAGATTTCTTTCCCGTGGCTGTTGATGATAAAGAGGGTCTGGATACTTCAGTTCTCGGTCCTAGTGTTAA
ACACGTGTTCAAGGTGAGCTTGGATGATACCAAGCATGACTACTATACTATTGGAACTTACAACCCTGAAACGGATAAGT
ATACTCCTGACGAAGGATCCATCGATAATGATTCTGGTTTGAGGTTTGATTACGGAAAATTTTACGCTTCGAAAACATTC
TTTGATCATGGAAAGAAAAGGAGAGTACTGTGGGGTTGGGCAAATGAATCTAGTAGTGTAGCTGATGATGTCAAGAAAGG
TTGGGCGGGATTGCAGGCAGTTCCTCGAACAGTATGGCTTGAGGAAAATGGTAAGCAATTGAAACAATGGCCTGTTGAAG
AAATAGAGCAACTCCGCACCAAACGAGTTTCTTTACGAAGTCAAGTTCTTAAAGGGGGTTCAGTATTAGAAGTTCCTGGT
CTAACAGCTTCACAGGCAGATGTAGATGTTAAATTTAAACTGCCGAAGCTGAAAGATGTAGAAATCATGGACCCTAGTTG
GGTTAACCCACAACTACTTTGTAGTTCAAAGCCAGCATCAGTGCAAGGGAAATCAGGACCATTTGGCTTGCTGGCTTTGG
CTTCTAAGAACCTAGAAGAACAGACAGCTATTTTCTTCAGAGTATTTAACCATCAAAACAAATATGTTGTGCTCATGTGT
AGTGATCAGAGCAGGTCATCTTTGGAAGAAAGTAACGACAAAACCACATATGGAGCTTTTATTGATGTTGATCCTCATCA
TGATCAGACGCTATCTCTAAGGAGCTTGATTGATCATTCAGTAGTGGAGAGTTATGGTGTTGACGGCAAAGCTGTTATAT
TAGCTAGAGTATATCCAACACTAGCTACTGACAGAGAAGCTCACCTTTACGTATTCAACAACGGAACTGGAAATGTAAAG
ATGTCTAGTCTTAAGGCATGGAGTATGAAGAAAGCTGAAATCAAGTATCTAAAAGATTTGAAGCATTCAGATAGCTAA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq CCTTACCGAACTGGTTATCATTTCCAACCTGCTAAAAATTGGATGAATGATCCTAATGCGCCTATGATTTACAATGGAATATACCATCTATTCTATCAATGGAGCACT
Microexon-tag Amino Acid seq PYRTGYHFQPAKNWMNDPNAPMIYNGIYHLFYQWST
Transcript ID XM_026543850.1
Gene ID Ps.40434
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.3e-99
Motif start 51
Motif end 371
Protein seq >XM_026543850.1
MAMNTVPWLIGLCCSVILSHGVQASHHLYRNFQSLQSNGTPIYQPYRTGYHFQPAKNWMNDPNAPMIYNGIYHLFYQWST
SGAVWGDIVWAHATSTDLVNWNHHKIAMVPSEPFDIEGVWSGSATFIQGKPIIQYTGIIKSNQWNYQVQDMAVPKDLSDP
YLVEWNKPIEYNPVIKPTDGINASAFRDPTTGWKGPDGIWRTTIGSKVGNNGVSFLFRSSDFIHWTKAEQPLHWAKDTGM
WECPDFFPVAVDDKEGLDTSVLGPSVKHVFKVSLDDTKHDYYTIGTYNPETDKYTPDEGSIDNDSGLRFDYGKFYASKTF
FDHGKKRRVLWGWANESSSVADDVKKGWAGLQAVPRTVWLEENGKQLKQWPVEEIEQLRTKRVSLRSQVLKGGSVLEVPG
LTASQADVDVKFKLPKLKDVEIMDPSWVNPQLLCSSKPASVQGKSGPFGLLALASKNLEEQTAIFFRVFNHQNKYVVLMC
SDQSRSSLEESNDKTTYGAFIDVDPHHDQTLSLRSLIDHSVVESYGVDGKAVILARVYPTLATDREAHLYVFNNGTGNVK
MSSLKAWSMKKAEIKYLKDLKHSDS*
CDS seq >XM_026543850.1
ATGGCCATGAATACTGTCCCATGGTTGATCGGATTGTGTTGTTCAGTTATCCTTAGTCATGGAGTTCAAGCTTCTCATCA
TCTATACAGAAACTTTCAATCTCTTCAATCTAATGGTACCCCCATATATCAGCCTTACCGAACTGGTTATCATTTCCAAC
CTGCTAAAAATTGGATGAATGATCCTAATGCGCCTATGATTTACAATGGAATATACCATCTATTCTATCAATGGAGCACT
TCCGGTGCAGTTTGGGGTGACATAGTGTGGGCTCATGCGACATCCACCGATCTCGTAAACTGGAATCACCATAAAATCGC
GATGGTCCCATCGGAACCATTTGATATCGAAGGTGTTTGGTCAGGCTCAGCAACATTTATACAAGGAAAGCCGATTATTC
AATACACGGGAATTATTAAGTCGAATCAATGGAACTACCAAGTTCAGGACATGGCAGTGCCGAAGGACTTATCTGATCCT
TACCTCGTAGAGTGGAATAAACCAATTGAATACAATCCAGTAATTAAACCCACTGACGGTATCAATGCTAGTGCTTTCAG
AGATCCGACTACTGGTTGGAAGGGTCCTGACGGAATCTGGAGAACGACAATAGGAAGCAAAGTGGGCAACAATGGAGTAT
CATTTTTATTTCGAAGTAGTGATTTTATTCATTGGACTAAAGCAGAACAACCTCTACATTGGGCGAAGGACACCGGAATG
TGGGAATGCCCAGATTTCTTTCCCGTGGCTGTTGATGATAAAGAGGGTCTGGATACTTCAGTTCTCGGTCCTAGTGTTAA
ACACGTGTTCAAGGTGAGCTTGGATGATACCAAGCATGACTACTATACTATTGGAACTTACAACCCTGAAACGGATAAGT
ATACTCCTGACGAAGGATCCATCGATAATGATTCTGGTTTGAGGTTTGATTACGGAAAATTTTACGCTTCGAAAACATTC
TTTGATCATGGAAAGAAAAGGAGAGTACTGTGGGGTTGGGCAAATGAATCTAGTAGTGTAGCTGATGATGTCAAGAAAGG
TTGGGCGGGATTGCAGGCAGTTCCTCGAACAGTATGGCTTGAGGAAAATGGTAAGCAATTGAAACAATGGCCTGTTGAAG
AAATAGAGCAACTCCGCACCAAACGAGTTTCTTTACGAAGTCAAGTTCTTAAAGGGGGTTCAGTATTAGAAGTTCCTGGT
CTAACAGCTTCACAGGCAGATGTAGATGTTAAATTTAAACTGCCGAAGCTGAAAGATGTAGAAATCATGGACCCTAGTTG
GGTTAACCCACAACTACTTTGTAGTTCAAAGCCAGCATCAGTGCAAGGGAAATCAGGACCATTTGGCTTGCTGGCTTTGG
CTTCTAAGAACCTAGAAGAACAGACAGCTATTTTCTTCAGAGTATTTAACCATCAAAACAAATATGTTGTGCTCATGTGT
AGTGATCAGAGCAGGTCATCTTTGGAAGAAAGTAACGACAAAACCACATATGGAGCTTTTATTGATGTTGATCCTCATCA
TGATCAGACGCTATCTCTAAGGAGCTTGATTGATCATTCAGTAGTGGAGAGTTATGGTGTTGACGGCAAAGCTGTTATAT
TAGCTAGAGTATATCCAACACTAGCTACTGACAGAGAAGCTCACCTTTACGTATTCAACAACGGAACTGGAAATGTAAAG
ATGTCTAGTCTTAAGGCATGGAGTATGAAGAAAGCTGAAATCAAGTATCTAAAAGATTTGAAGCATTCAGATAGCTAA