Microexon ID Ps_NC_039367.1:158246648-158246656:-
Species Papaver somniferum
Coordinates NC_039367.1:158246648..158246656
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCATATAGAACTGGTTATCATTTCCAACCTTCCAGGAATTGGATGAATGATCCTAATGGGCCTATGATTTACAATGGAATATACCATCTATTCTATCAATGGAGCACT
Microexon-tag Amino Acid Seq PYRTGYHFQPSRNWMNDPNGPMIYNGIYHLFYQWST
Microexon-tag spanning region158246493-158247162
Microexon-tag prediction score0.9237
Overlapped with the annotated transcript (%) 100
New Transcript ID XM_026567564.1x
Reference Transcript ID XM_026567564.1
Gene ID NA
Gene Name NA
Transcript ID XM_026567564.1
Protein ID XP_026423349.1
Gene ID LOC113319295
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.4e-100
Motif start 43
Motif end 363
Protein seq >XP_026423349.1
MAKNAVSWFTTFCSYFLIGRGVEASQPICKSLQNDQPYRTGYHFQPSRNWMNDPNGPMIYNGIYHLFYQWSTSGSVWGDI
VWAHATSTDLVNWIHHKHAIVPSEAFDIEGVWSGSVTFVERKPIIQYTGIIKSNQWNYQVQDMAVPKDLSDPYLIEWVKP
VEHNPVIAHIEGINASAFRDPTTAWQGPDGIWRVTIGSKVENDGVAFLFRSDDFITWTKAEQPLHSAKETGMWECPDFFP
VAVDGMEGVDTSVMGPSVKHVFKVSLDDTKHDYYTIGTYDHEKDKYTPDEGSIDKDSGLKFDYGKFYASKTFFDNEKNRR
VLWGWVTESSSVENDVKKGWAGLQAVPRTLWLAENGKQVKQWPVEEIKQLRTGRVSLPSKVLAGGSVTEVPGLTASQADV
DVIFELPKLEDVDIMDPSWVDNPQLLCSSMPASVKGKSGPFGLLALASKNFEEQTAIFFRVFNHQNKYFVFMCSDQSRSS
LEESNEKTTYGAFIDFDPEHDKLSLRSLIDHSIVESFGVEGKVVITARVYPKLAIDNEAQLYVFNNGTANVKMSSLEAWS
MKKAEIKYLEELKTSDS*
CDS seq >XM_026567564.1
ATGGCTAAGAATGCTGTTTCATGGTTTACTACATTTTGTTCTTATTTTTTGATTGGTCGTGGAGTTGAAGCTTCTCAACC
AATTTGTAAGAGCCTGCAAAACGATCAACCATATAGAACTGGTTATCATTTCCAACCTTCCAGGAATTGGATGAATGATC
CTAATGGGCCTATGATTTACAATGGAATATACCATCTATTCTATCAATGGAGCACTTCTGGTTCAGTTTGGGGTGATATA
GTGTGGGCTCATGCGACATCAACCGATCTCGTCAACTGGATTCACCATAAACACGCGATTGTCCCATCCGAAGCATTTGA
TATCGAAGGTGTTTGGTCAGGTTCAGTAACATTCGTAGAACGGAAGCCGATTATTCAATACACGGGTATTATCAAGTCGA
ATCAATGGAACTACCAAGTTCAGGATATGGCAGTGCCGAAGGACTTGTCGGATCCTTACCTTATTGAATGGGTTAAACCA
GTGGAACACAATCCTGTAATTGCACACATTGAAGGTATCAATGCTAGTGCATTTAGGGATCCAACCACTGCTTGGCAAGG
TCCTGATGGAATCTGGAGAGTGACAATAGGAAGCAAAGTAGAAAACGATGGAGTCGCGTTTTTATTCCGAAGTGATGATT
TTATTACATGGACGAAAGCAGAACAACCACTACATTCGGCTAAAGAAACTGGAATGTGGGAGTGCCCAGATTTCTTTCCA
GTAGCGGTTGACGGAATGGAGGGTGTGGATACTTCAGTTATGGGTCCTAGTGTCAAACATGTGTTCAAGGTGAGCTTGGA
TGATACCAAACATGATTACTATACAATTGGGACTTACGACCATGAAAAGGATAAGTATACTCCAGATGAGGGATCCATCG
ATAAGGATTCCGGTTTGAAGTTTGATTACGGTAAGTTTTACGCTTCGAAAACATTCTTTGACAATGAAAAGAACAGGAGA
GTATTGTGGGGTTGGGTAACAGAATCTAGTAGTGTAGAAAATGATGTGAAGAAAGGTTGGGCCGGACTACAGGCAGTTCC
TCGAACACTATGGCTAGCAGAAAATGGCAAACAAGTGAAGCAATGGCCTGTTGAAGAGATAAAACAACTACGCACTGGAC
GAGTTTCTTTACCTAGCAAAGTTCTTGCAGGGGGTTCCGTGACGGAAGTTCCTGGTCTAACAGCATCACAGGCAGATGTG
GATGTAATATTTGAACTTCCAAAGTTGGAAGATGTTGATATTATGGACCCTAGTTGGGTCGATAACCCACAACTACTTTG
TAGTTCAATGCCAGCATCAGTGAAAGGGAAGTCTGGGCCGTTTGGTTTGTTGGCCTTGGCTTCTAAGAACTTTGAAGAAC
AGACAGCCATTTTCTTCAGGGTATTCAATCATCAAAACAAATATTTTGTGTTCATGTGCAGTGACCAGAGCCGGTCATCT
TTGGAAGAAAGCAACGAAAAAACCACTTATGGAGCTTTTATTGATTTTGACCCTGAGCATGATAAGCTATCTCTAAGGAG
CTTGATTGACCACTCAATAGTGGAGAGCTTTGGCGTCGAAGGAAAAGTTGTTATAACTGCTAGAGTGTATCCAAAATTGG
CTATTGATAACGAAGCTCAGCTTTATGTATTCAATAATGGAACTGCAAATGTTAAGATGTCAAGTCTGGAGGCATGGAGT
ATGAAGAAAGCTGAAATCAAGTATCTTGAAGAATTGAAAACTTCTGATAGCTAA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCATATAGAACTGGTTATCATTTCCAACCTTCCAGGAATTGGATGAATGATCCTAATGGGCCTATGATTTACAATGGAATATACCATCTATTCTATCAATGGAGCACT
Microexon-tag Amino Acid seq PYRTGYHFQPSRNWMNDPNGPMIYNGIYHLFYQWST
Transcript ID XM_026567564.1
Gene ID LOC113319295
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.4e-100
Motif start 43
Motif end 363
Protein seq >XM_026567564.1
MAKNAVSWFTTFCSYFLIGRGVEASQPICKSLQNDQPYRTGYHFQPSRNWMNDPNGPMIYNGIYHLFYQWSTSGSVWGDI
VWAHATSTDLVNWIHHKHAIVPSEAFDIEGVWSGSVTFVERKPIIQYTGIIKSNQWNYQVQDMAVPKDLSDPYLIEWVKP
VEHNPVIAHIEGINASAFRDPTTAWQGPDGIWRVTIGSKVENDGVAFLFRSDDFITWTKAEQPLHSAKETGMWECPDFFP
VAVDGMEGVDTSVMGPSVKHVFKVSLDDTKHDYYTIGTYDHEKDKYTPDEGSIDKDSGLKFDYGKFYASKTFFDNEKNRR
VLWGWVTESSSVENDVKKGWAGLQAVPRTLWLAENGKQVKQWPVEEIKQLRTGRVSLPSKVLAGGSVTEVPGLTASQADV
DVIFELPKLEDVDIMDPSWVDNPQLLCSSMPASVKGKSGPFGLLALASKNFEEQTAIFFRVFNHQNKYFVFMCSDQSRSS
LEESNEKTTYGAFIDFDPEHDKLSLRSLIDHSIVESFGVEGKVVITARVYPKLAIDNEAQLYVFNNGTANVKMSSLEAWS
MKKAEIKYLEELKTSDS*
CDS seq >XM_026567564.1
ATGGCTAAGAATGCTGTTTCATGGTTTACTACATTTTGTTCTTATTTTTTGATTGGTCGTGGAGTTGAAGCTTCTCAACC
AATTTGTAAGAGCCTGCAAAACGATCAACCATATAGAACTGGTTATCATTTCCAACCTTCCAGGAATTGGATGAATGATC
CTAATGGGCCTATGATTTACAATGGAATATACCATCTATTCTATCAATGGAGCACTTCTGGTTCAGTTTGGGGTGATATA
GTGTGGGCTCATGCGACATCAACCGATCTCGTCAACTGGATTCACCATAAACACGCGATTGTCCCATCCGAAGCATTTGA
TATCGAAGGTGTTTGGTCAGGTTCAGTAACATTCGTAGAACGGAAGCCGATTATTCAATACACGGGTATTATCAAGTCGA
ATCAATGGAACTACCAAGTTCAGGATATGGCAGTGCCGAAGGACTTGTCGGATCCTTACCTTATTGAATGGGTTAAACCA
GTGGAACACAATCCTGTAATTGCACACATTGAAGGTATCAATGCTAGTGCATTTAGGGATCCAACCACTGCTTGGCAAGG
TCCTGATGGAATCTGGAGAGTGACAATAGGAAGCAAAGTAGAAAACGATGGAGTCGCGTTTTTATTCCGAAGTGATGATT
TTATTACATGGACGAAAGCAGAACAACCACTACATTCGGCTAAAGAAACTGGAATGTGGGAGTGCCCAGATTTCTTTCCA
GTAGCGGTTGACGGAATGGAGGGTGTGGATACTTCAGTTATGGGTCCTAGTGTCAAACATGTGTTCAAGGTGAGCTTGGA
TGATACCAAACATGATTACTATACAATTGGGACTTACGACCATGAAAAGGATAAGTATACTCCAGATGAGGGATCCATCG
ATAAGGATTCCGGTTTGAAGTTTGATTACGGTAAGTTTTACGCTTCGAAAACATTCTTTGACAATGAAAAGAACAGGAGA
GTATTGTGGGGTTGGGTAACAGAATCTAGTAGTGTAGAAAATGATGTGAAGAAAGGTTGGGCCGGACTACAGGCAGTTCC
TCGAACACTATGGCTAGCAGAAAATGGCAAACAAGTGAAGCAATGGCCTGTTGAAGAGATAAAACAACTACGCACTGGAC
GAGTTTCTTTACCTAGCAAAGTTCTTGCAGGGGGTTCCGTGACGGAAGTTCCTGGTCTAACAGCATCACAGGCAGATGTG
GATGTAATATTTGAACTTCCAAAGTTGGAAGATGTTGATATTATGGACCCTAGTTGGGTCGATAACCCACAACTACTTTG
TAGTTCAATGCCAGCATCAGTGAAAGGGAAGTCTGGGCCGTTTGGTTTGTTGGCCTTGGCTTCTAAGAACTTTGAAGAAC
AGACAGCCATTTTCTTCAGGGTATTCAATCATCAAAACAAATATTTTGTGTTCATGTGCAGTGACCAGAGCCGGTCATCT
TTGGAAGAAAGCAACGAAAAAACCACTTATGGAGCTTTTATTGATTTTGACCCTGAGCATGATAAGCTATCTCTAAGGAG
CTTGATTGACCACTCAATAGTGGAGAGCTTTGGCGTCGAAGGAAAAGTTGTTATAACTGCTAGAGTGTATCCAAAATTGG
CTATTGATAACGAAGCTCAGCTTTATGTATTCAATAATGGAACTGCAAATGTTAAGATGTCAAGTCTGGAGGCATGGAGT
ATGAAGAAAGCTGAAATCAAGTATCTTGAAGAATTGAAAACTTCTGATAGCTAA