Microexon ID Ps_NC_039364.1:10974020-10974028:+
Species Papaver somniferum
Coordinates NC_039364.1:10974020..10974028
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACAGAACTGGTTATCATTTCCAACCTTCCAAGAATTGGATGAATGATCCTAATGGACCTATGGTTTACAACGGAATCTACCATCTAATGTATCAATGGAACCCT
Microexon-tag Amino Acid Seq PYRTGYHFQPSKNWMNDPNGPMVYNGIYHLMYQWNP
Microexon-tag spanning region10973794-10974285
Microexon-tag prediction score0.9295
Overlapped with the annotated transcript (%) 100
New Transcript ID XM_026543834.1x
Reference Transcript ID XM_026543834.1
Gene ID NA
Gene Name NA
Transcript ID XM_026543834.1
Protein ID XP_026399619.1
Gene ID LOC113295504
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.9e-103
Motif start 46
Motif end 366
Protein seq >XP_026399619.1
MKSVLWFIGLCYVLISHGVEASHNIYLHLQSYQTNTVHQPYRTGYHFQPSKNWMNDPNGPMVYNGIYHLMYQWNPLGPVW
GNIVWAHATSTDLINWQHHKHAIYPSESFDIKGCWSGSATFVQGKPVIQYTGIIESTQWNYQVQDMAVPKNLSDPYLTEW
VKPAEYNPVMKPIDGINASSFRDPTTGWKGPDGKWRVVIGSKIDREGMAILFRSDDFIHWTKAEQPLHSGKEIGMWECPD
FFPVAVDGKKGLDTSVLGPNVKHVFKVSLDDTKHDYYTVGTYNPVTDKYTPDEGSVDNDSGLRYDYGKFYASKTFFDSEK
HRRILWGWINESSSVENDAVKGWSGVQAIPRVLLLDKSGKQLKQWPVKELNELRGEKVYLHSTFLKGSSVVEVTGVTASQ
ADVFVKFKLPKLEDAETFDSSWSNPQLLCSSKPASVQGKSGPFGLIALASKNFEEQTAIFFRIFNNQNKYVVLMCSDQSR
SSTDDTNDKTAYGAFVNIEPNHDKLNIRSLIDHSIIESYGVDGKAVITARVYPTIAIDTGARLYVFNNGTQDVKMSSLKA
WSMKKADIRYLEDSKTSNT*
CDS seq >XM_026543834.1
ATGAAGTCTGTTTTATGGTTTATTGGATTATGTTATGTTTTGATTAGCCATGGAGTTGAAGCTTCTCATAATATTTACTT
GCATCTTCAATCATATCAAACCAACACCGTTCATCAACCTTACAGAACTGGTTATCATTTCCAACCTTCCAAGAATTGGA
TGAATGATCCTAATGGACCTATGGTTTACAACGGAATCTACCATCTAATGTATCAATGGAACCCTTTGGGTCCAGTATGG
GGTAACATAGTATGGGCTCATGCAACATCGACCGATCTCATAAACTGGCAACACCATAAACATGCGATTTACCCTTCCGA
ATCATTTGATATCAAAGGTTGCTGGTCAGGTTCAGCAACATTTGTACAAGGAAAACCGGTTATTCAGTACACAGGAATCA
TCGAGTCAACTCAATGGAACTACCAAGTGCAAGACATGGCAGTGCCTAAGAATTTATCCGATCCTTACCTTACAGAATGG
GTTAAGCCAGCTGAGTACAATCCTGTAATGAAACCCATTGATGGTATCAATGCTAGTTCCTTCAGGGATCCAACTACTGG
CTGGAAAGGTCCTGATGGAAAATGGAGAGTGGTAATTGGAAGCAAAATCGACCGTGAAGGAATGGCGATTTTATTCCGAA
GTGACGATTTTATTCACTGGACTAAAGCAGAACAACCTCTCCATTCAGGAAAAGAAATAGGAATGTGGGAATGCCCAGAT
TTCTTTCCAGTGGCTGTTGATGGGAAAAAGGGTCTTGATACTTCGGTTCTCGGTCCTAATGTTAAACATGTTTTTAAGGT
GAGCTTGGATGACACCAAACATGACTATTACACAGTCGGTACTTACAATCCTGTAACAGATAAGTATACTCCAGATGAGG
GATCTGTTGATAATGATTCCGGTTTGAGATACGATTACGGGAAGTTCTACGCTTCAAAAACGTTTTTCGATAGTGAAAAA
CACAGGAGAATATTGTGGGGTTGGATAAATGAATCTAGTAGTGTAGAAAATGATGCTGTAAAGGGTTGGTCTGGAGTGCA
GGCGATTCCTCGAGTTCTCTTGCTCGATAAAAGCGGGAAGCAGTTGAAGCAATGGCCAGTAAAAGAGCTAAATGAACTCC
GAGGTGAAAAAGTTTATTTACATAGTACATTTCTTAAAGGAAGTTCAGTAGTGGAAGTTACTGGCGTAACAGCTTCACAG
GCAGACGTATTCGTCAAATTTAAACTACCGAAGTTAGAAGATGCAGAAACTTTTGACTCAAGTTGGAGTAACCCACAACT
GCTTTGTAGTTCAAAACCAGCATCAGTTCAAGGGAAGTCAGGGCCATTTGGTTTAATTGCTTTGGCTTCTAAGAATTTCG
AAGAACAGACGGCAATCTTTTTCAGGATATTCAATAACCAAAACAAATATGTTGTGCTCATGTGCAGTGATCAGAGCAGG
TCATCCACGGACGACACTAATGACAAAACCGCTTATGGAGCTTTTGTTAACATCGAACCTAATCATGATAAGCTAAATAT
AAGGAGCTTGATTGATCACTCTATCATCGAGAGTTATGGAGTTGACGGAAAAGCTGTTATAACAGCTAGAGTTTATCCAA
CAATTGCCATTGACACTGGAGCTCGGCTTTATGTATTCAACAACGGAACTCAAGATGTGAAGATGTCTAGTCTAAAGGCA
TGGAGTATGAAGAAAGCTGATATCAGATATTTAGAGGATTCAAAAACTTCAAACACTTAA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACAGAACTGGTTATCATTTCCAACCTTCCAAGAATTGGATGAATGATCCTAATGGACCTATGGTTTACAACGGAATCTACCATCTAATGTATCAATGGAACCCT
Microexon-tag Amino Acid seq PYRTGYHFQPSKNWMNDPNGPMVYNGIYHLMYQWNP
Transcript ID XM_026543834.1
Gene ID Ps.40428
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.9e-103
Motif start 46
Motif end 366
Protein seq >XM_026543834.1
MKSVLWFIGLCYVLISHGVEASHNIYLHLQSYQTNTVHQPYRTGYHFQPSKNWMNDPNGPMVYNGIYHLMYQWNPLGPVW
GNIVWAHATSTDLINWQHHKHAIYPSESFDIKGCWSGSATFVQGKPVIQYTGIIESTQWNYQVQDMAVPKNLSDPYLTEW
VKPAEYNPVMKPIDGINASSFRDPTTGWKGPDGKWRVVIGSKIDREGMAILFRSDDFIHWTKAEQPLHSGKEIGMWECPD
FFPVAVDGKKGLDTSVLGPNVKHVFKVSLDDTKHDYYTVGTYNPVTDKYTPDEGSVDNDSGLRYDYGKFYASKTFFDSEK
HRRILWGWINESSSVENDAVKGWSGVQAIPRVLLLDKSGKQLKQWPVKELNELRGEKVYLHSTFLKGSSVVEVTGVTASQ
ADVFVKFKLPKLEDAETFDSSWSNPQLLCSSKPASVQGKSGPFGLIALASKNFEEQTAIFFRIFNNQNKYVVLMCSDQSR
SSTDDTNDKTAYGAFVNIEPNHDKLNIRSLIDHSIIESYGVDGKAVITARVYPTIAIDTGARLYVFNNGTQDVKMSSLKA
WSMKKADIRYLEDSKTSNT*
CDS seq >XM_026543834.1
ATGAAGTCTGTTTTATGGTTTATTGGATTATGTTATGTTTTGATTAGCCATGGAGTTGAAGCTTCTCATAATATTTACTT
GCATCTTCAATCATATCAAACCAACACCGTTCATCAACCTTACAGAACTGGTTATCATTTCCAACCTTCCAAGAATTGGA
TGAATGATCCTAATGGACCTATGGTTTACAACGGAATCTACCATCTAATGTATCAATGGAACCCTTTGGGTCCAGTATGG
GGTAACATAGTATGGGCTCATGCAACATCGACCGATCTCATAAACTGGCAACACCATAAACATGCGATTTACCCTTCCGA
ATCATTTGATATCAAAGGTTGCTGGTCAGGTTCAGCAACATTTGTACAAGGAAAACCGGTTATTCAGTACACAGGAATCA
TCGAGTCAACTCAATGGAACTACCAAGTGCAAGACATGGCAGTGCCTAAGAATTTATCCGATCCTTACCTTACAGAATGG
GTTAAGCCAGCTGAGTACAATCCTGTAATGAAACCCATTGATGGTATCAATGCTAGTTCCTTCAGGGATCCAACTACTGG
CTGGAAAGGTCCTGATGGAAAATGGAGAGTGGTAATTGGAAGCAAAATCGACCGTGAAGGAATGGCGATTTTATTCCGAA
GTGACGATTTTATTCACTGGACTAAAGCAGAACAACCTCTCCATTCAGGAAAAGAAATAGGAATGTGGGAATGCCCAGAT
TTCTTTCCAGTGGCTGTTGATGGGAAAAAGGGTCTTGATACTTCGGTTCTCGGTCCTAATGTTAAACATGTTTTTAAGGT
GAGCTTGGATGACACCAAACATGACTATTACACAGTCGGTACTTACAATCCTGTAACAGATAAGTATACTCCAGATGAGG
GATCTGTTGATAATGATTCCGGTTTGAGATACGATTACGGGAAGTTCTACGCTTCAAAAACGTTTTTCGATAGTGAAAAA
CACAGGAGAATATTGTGGGGTTGGATAAATGAATCTAGTAGTGTAGAAAATGATGCTGTAAAGGGTTGGTCTGGAGTGCA
GGCGATTCCTCGAGTTCTCTTGCTCGATAAAAGCGGGAAGCAGTTGAAGCAATGGCCAGTAAAAGAGCTAAATGAACTCC
GAGGTGAAAAAGTTTATTTACATAGTACATTTCTTAAAGGAAGTTCAGTAGTGGAAGTTACTGGCGTAACAGCTTCACAG
GCAGACGTATTCGTCAAATTTAAACTACCGAAGTTAGAAGATGCAGAAACTTTTGACTCAAGTTGGAGTAACCCACAACT
GCTTTGTAGTTCAAAACCAGCATCAGTTCAAGGGAAGTCAGGGCCATTTGGTTTAATTGCTTTGGCTTCTAAGAATTTCG
AAGAACAGACGGCAATCTTTTTCAGGATATTCAATAACCAAAACAAATATGTTGTGCTCATGTGCAGTGATCAGAGCAGG
TCATCCACGGACGACACTAATGACAAAACCGCTTATGGAGCTTTTGTTAACATCGAACCTAATCATGATAAGCTAAATAT
AAGGAGCTTGATTGATCACTCTATCATCGAGAGTTATGGAGTTGACGGAAAAGCTGTTATAACAGCTAGAGTTTATCCAA
CAATTGCCATTGACACTGGAGCTCGGCTTTATGTATTCAACAACGGAACTCAAGATGTGAAGATGTCTAGTCTAAAGGCA
TGGAGTATGAAGAAAGCTGATATCAGATATTTAGAGGATTCAAAAACTTCAAACACTTAA