Microexon ID Ps_NC_039367.1:158343936-158343944:-
Species Papaver somniferum
Coordinates NC_039367.1:158343936..158343944
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTATAGAACTGGTTATCATTTCCAACCTTCCAAGAATTGGATGAATGATCCTAATGGACCTATGGTTGACAATGGAATATACCATCTAATGTACCAATGGAACCCT
Microexon-tag Amino Acid Seq PYRTGYHFQPSKNWMNDPNGPMVDNGIYHLMYQWNP
Microexon-tag spanning region158343679-158344181
Microexon-tag prediction score0.9186
Overlapped with the annotated transcript (%) 100
New Transcript ID XM_026567147.1x
Reference Transcript ID XM_026567147.1
Gene ID NA
Gene Name NA
Transcript ID XM_026567147.1
Protein ID XP_026422932.1
Gene ID LOC113318869
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.9e-103
Motif start 46
Motif end 366
Protein seq >XP_026422932.1
MKSVLWFIGLCSVLISHGVEASHNIYLHLQSYQTNTVHQPYRTGYHFQPSKNWMNDPNGPMVDNGIYHLMYQWNPLGPVW
GNIVWAHATSADLINWQHHKHAIYPSESFDIKGCWSGSATFVQGKPVIQYTGIIESNQWNYQVQDMAVPKNLSDPYLTEW
IKPAEYNPVMKPIDGINASSFRDPTTGWKGPDGKWRVVIGSKIDREGMAILFRSDDFIHWTKAKEPLHSGKEIGMWECPD
FFPVAVDGKKGLETSVLGPNVKHVFKVSLDDTKHDYYTVGTYNPATDKYTPDEGSVDNDSGLRYDYGKFYASKTFFDSEK
HRRILWGWINESSSVENDAKKGWSGVQAIPRVLLLDKSGKQLKQWPVKELNELRGEKVYLHSTVLKGGSVVEVTGVTASQ
ADVFVKFKLPKLEDAETFDSSWSNPQLLCSSKPASVQGKSGPFGLIALASKNFEEQTAIFFRIFNNQNKYVVLMCSDQSR
SSTDDTNDKTAYGAFVNMEPNHDKLNIRSLIDHSIIESYGVDGKAVITARVYPTIAIDTGARLYVFNNGTQDVKMSSLKA
WSMMKADIRYLEDSKTSNT*
CDS seq >XM_026567147.1
ATGAAGTCTGTTTTATGGTTTATTGGATTATGTTCTGTTTTGATTAGCCATGGAGTTGAAGCTTCTCATAATATTTACTT
GCATCTTCAATCATATCAAACCAACACCGTTCACCAACCTTATAGAACTGGTTATCATTTCCAACCTTCCAAGAATTGGA
TGAATGATCCTAATGGACCTATGGTTGACAATGGAATATACCATCTAATGTACCAATGGAACCCTTTGGGTCCAGTATGG
GGTAACATAGTATGGGCTCATGCAACATCGGCCGATCTCATAAACTGGCAACACCATAAACATGCGATTTACCCTTCCGA
ATCATTTGATATCAAAGGTTGCTGGTCAGGTTCAGCAACATTTGTACAAGGAAAACCAGTTATTCAATACACAGGAATCA
TCGAGTCAAATCAATGGAACTACCAAGTGCAAGACATGGCAGTGCCTAAGAATTTATCCGATCCTTACCTTACAGAATGG
ATTAAGCCAGCTGAGTACAATCCGGTAATGAAACCCATTGATGGTATCAATGCTAGTTCCTTCAGGGATCCAACTACTGG
TTGGAAAGGTCCTGATGGAAAATGGAGAGTGGTAATTGGAAGCAAAATCGACCGTGAAGGAATGGCGATTTTATTTCGAA
GTGATGATTTTATTCACTGGACTAAAGCAAAAGAACCTCTTCATTCAGGGAAAGAAATAGGAATGTGGGAATGCCCAGAT
TTCTTTCCAGTGGCTGTTGATGGGAAAAAGGGTCTTGAAACTTCGGTTCTCGGTCCTAATGTTAAGCATGTTTTTAAGGT
GAGTTTGGATGACACCAAACATGACTATTACACAGTCGGTACTTACAATCCTGCAACAGATAAGTATACTCCAGATGAGG
GATCGGTCGATAATGATTCCGGTTTGAGATACGATTACGGGAAGTTTTACGCATCAAAAACGTTCTTCGATAGTGAAAAG
CACAGGAGAATATTGTGGGGTTGGATAAATGAATCTAGCAGTGTAGAGAATGATGCTAAAAAGGGTTGGTCTGGAGTACA
GGCAATTCCTCGAGTTCTCTTGCTCGATAAAAGTGGTAAGCAGTTGAAGCAATGGCCAGTTAAAGAGCTAAATGAACTCC
GAGGTGAAAAGGTTTATTTACACAGTACAGTTCTTAAAGGAGGTTCAGTGGTGGAAGTTACGGGCGTAACAGCTTCACAG
GCAGACGTATTCGTCAAATTTAAACTCCCCAAATTAGAAGATGCGGAAACTTTTGACTCAAGTTGGAGTAACCCACAACT
GCTTTGTAGTTCAAAACCAGCATCAGTACAAGGGAAATCAGGGCCATTTGGTTTAATTGCTTTGGCTTCTAAGAATTTTG
AAGAACAGACAGCAATCTTTTTCAGGATATTCAATAACCAAAACAAATATGTTGTGCTCATGTGCAGTGATCAGAGCAGG
TCATCCACGGACGACACTAACGACAAAACCGCTTATGGAGCTTTTGTTAACATGGAACCAAATCATGATAAGCTAAATAT
CAGGAGCTTGATTGATCACTCTATCATCGAGAGTTATGGAGTTGACGGAAAAGCTGTTATAACAGCTAGAGTTTATCCAA
CGATTGCTATTGATACTGGAGCTCGGCTTTATGTATTCAACAATGGAACTCAAGATGTGAAGATGTCTAGTCTTAAGGCA
TGGAGTATGATGAAAGCTGATATCAGATATTTGGAGGATTCAAAAACTTCAAACACCTAA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTATAGAACTGGTTATCATTTCCAACCTTCCAAGAATTGGATGAATGATCCTAATGGACCTATGGTTGACAATGGAATATACCATCTAATGTACCAATGGAACCCT
Microexon-tag Amino Acid seq PYRTGYHFQPSKNWMNDPNGPMVDNGIYHLMYQWNP
Transcript ID XM_026567147.1
Gene ID Ps.63702
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2e-103
Motif start 46
Motif end 366
Protein seq >XM_026567147.1
MKSVLWFIGLCSVLISHGVEASHNIYLHLQSYQTNTVHQPYRTGYHFQPSKNWMNDPNGPMVDNGIYHLMYQWNPLGPVW
GNIVWAHATSADLINWQHHKHAIYPSESFDIKGCWSGSATFVQGKPVIQYTGIIESNQWNYQVQDMAVPKNLSDPYLTEW
IKPAEYNPVMKPIDGINASSFRDPTTGWKGPDGKWRVVIGSKIDREGMAILFRSDDFIHWTKAKEPLHSGKEIGMWECPD
FFPVAVDGKKGLETSVLGPNVKHVFKVSLDDTKHDYYTVGTYNPATDKYTPDEGSVDNDSGLRYDYGKFYASKTFFDSEK
HRRILWGWINESSSVENDAKKGWSGVQAIPRVLLLDKSGKQLKQWPVKELNELRGEKVYLHSTVLKGGSVVEVTGVTASQ
ADVFVKFKLPKLEDAETFDSSWSNPQLLCSSKPASVQGKSGPFGLIALASKNFEEQTAIFFRIFNNQNKYVVLMCSDQSR
SSTDDTNDKTAYGAFVNMEPNHDKLNIRSLIDHSIIESYGVDGKAVITARVYPTIAIDTGARLYVFNNGTQDVKMSSLKA
WSMMKADIRYLEDSKTSNT*
CDS seq >XM_026567147.1
ATGAAGTCTGTTTTATGGTTTATTGGATTATGTTCTGTTTTGATTAGCCATGGAGTTGAAGCTTCTCATAATATTTACTT
GCATCTTCAATCATATCAAACCAACACCGTTCACCAACCTTATAGAACTGGTTATCATTTCCAACCTTCCAAGAATTGGA
TGAATGATCCTAATGGACCTATGGTTGACAATGGAATATACCATCTAATGTACCAATGGAACCCTTTGGGTCCAGTATGG
GGTAACATAGTATGGGCTCATGCAACATCGGCCGATCTCATAAACTGGCAACACCATAAACATGCGATTTACCCTTCCGA
ATCATTTGATATCAAAGGTTGCTGGTCAGGTTCAGCAACATTTGTACAAGGAAAACCAGTTATTCAATACACAGGAATCA
TCGAGTCAAATCAATGGAACTACCAAGTGCAAGACATGGCAGTGCCTAAGAATTTATCCGATCCTTACCTTACAGAATGG
ATTAAGCCAGCTGAGTACAATCCGGTAATGAAACCCATTGATGGTATCAATGCTAGTTCCTTCAGGGATCCAACTACTGG
TTGGAAAGGTCCTGATGGAAAATGGAGAGTGGTAATTGGAAGCAAAATCGACCGTGAAGGAATGGCGATTTTATTTCGAA
GTGATGATTTTATTCACTGGACTAAAGCAAAAGAACCTCTTCATTCAGGGAAAGAAATAGGAATGTGGGAATGCCCAGAT
TTCTTTCCAGTGGCTGTTGATGGGAAAAAGGGTCTTGAAACTTCGGTTCTCGGTCCTAATGTTAAGCATGTTTTTAAGGT
GAGTTTGGATGACACCAAACATGACTATTACACAGTCGGTACTTACAATCCTGCAACAGATAAGTATACTCCAGATGAGG
GATCGGTCGATAATGATTCCGGTTTGAGATACGATTACGGGAAGTTTTACGCATCAAAAACGTTCTTCGATAGTGAAAAG
CACAGGAGAATATTGTGGGGTTGGATAAATGAATCTAGCAGTGTAGAGAATGATGCTAAAAAGGGTTGGTCTGGAGTACA
GGCAATTCCTCGAGTTCTCTTGCTCGATAAAAGTGGTAAGCAGTTGAAGCAATGGCCAGTTAAAGAGCTAAATGAACTCC
GAGGTGAAAAGGTTTATTTACACAGTACAGTTCTTAAAGGAGGTTCAGTGGTGGAAGTTACGGGCGTAACAGCTTCACAG
GCAGACGTATTCGTCAAATTTAAACTCCCCAAATTAGAAGATGCGGAAACTTTTGACTCAAGTTGGAGTAACCCACAACT
GCTTTGTAGTTCAAAACCAGCATCAGTACAAGGGAAATCAGGGCCATTTGGTTTAATTGCTTTGGCTTCTAAGAATTTTG
AAGAACAGACAGCAATCTTTTTCAGGATATTCAATAACCAAAACAAATATGTTGTGCTCATGTGCAGTGATCAGAGCAGG
TCATCCACGGACGACACTAACGACAAAACCGCTTATGGAGCTTTTGTTAACATGGAACCAAATCATGATAAGCTAAATAT
CAGGAGCTTGATTGATCACTCTATCATCGAGAGTTATGGAGTTGACGGAAAAGCTGTTATAACAGCTAGAGTTTATCCAA
CGATTGCTATTGATACTGGAGCTCGGCTTTATGTATTCAACAATGGAACTCAAGATGTGAAGATGTCTAGTCTTAAGGCA
TGGAGTATGATGAAAGCTGATATCAGATATTTGGAGGATTCAAAAACTTCAAACACCTAA