Microexon ID Ps_NC_039358.1:216750126-216750134:-
Species Papaver somniferum
Coordinates NC_039358.1:216750126..216750134
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAGAAGAGAAGGCCAGACTAGAAAGGGAAGACAAGTTTATTTGGGTGGTTATGACAAAGAAGATAAGGCTGCTAGGGCTTATGATTTAGCTGCA
Microexon-tag Amino Acid Seq WDNSCRREGQTRKGRQVYLGGYDKEDKAARAYDLAA
Microexon-tag spanning region216749978-216750336
Microexon-tag prediction score0.9784
Overlapped with the annotated transcript (%) 100
New Transcript ID XM_026582548.1x
Reference Transcript ID XM_026582548.1
Gene ID NA
Gene Name NA
Transcript ID XM_026582548.1
Protein ID XP_026438333.1
Gene ID LOC113336857
Gene Name NA
Pfam domain motif AP2
Motif E-value 8.3e-12
Motif start 305
Motif end 363
Protein seq >XP_026438333.1
MASSMNNWLGFSLSPQEVVPTQSQTHHHASNVSRHGFNSDEVSGSEVVSSDCFDLGSHDSSGPTGAYGILEAFNRNHQQQ
HDWSDNNFKSSSELSMLMGGSSNNQHQHHLSHQEEPKLEDFLGGHSFSDHDQQKLHGYDHSGDYMFSNCSLQLPAATLSN
SNGYGGSVANNNNNNNNGSLGLSMIKTWLRNQPAPPQQENKDEGGNSTCGGGGGGDGVVGSNGQTLSLSMSTGSQSSSPA
LPLVTTASTVVTGGGGSGGAESSSSENNKQKSNGTTTALDSPTAGAIVEAVPRKSIDTFGQRTSIYRGVTRHRWTGRYEA
HLWDNSCRREGQTRKGRQVYLGGYDKEDKAARAYDLAALKYWGSTTTTNFPIANYEQELEDMKNMTRQEYVASLRRKSSG
FSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDMSRYDVNSIMESSTL
PIGGAAKRLKEAEQIAEASVDARRAESDNLTSQLTNGMNNYGWPTIAFQQAQPINMLYPSYNQQRSGWCKQEQEAAVAAH
SFQDLHQLQLGSTHHNFFQPSVLHNLMSQDSSSMEHSSGSNSVIYNGGGGTGSGASNGSYQMGGSNGYIPMGMVMSSDHH
QTNPTGSSFGENDLKQQMGYDSILSSNSTDPYSQGRNAYYMSQQSPPACNNWMPTGVQALAPRSNNIAVCHGAPSTFTVW
NDA*
CDS seq >XM_026582548.1
ATGGCTTCTTCAATGAACAACTGGTTAGGGTTTTCATTATCTCCACAAGAAGTAGTACCAACACAATCACAAACTCATCA
TCATGCATCAAATGTCTCGAGACATGGGTTTAACTCAGATGAAGTTTCCGGTTCTGAAGTTGTATCTAGTGATTGTTTCG
ATCTTGGTTCTCACGATTCTTCAGGTCCAACCGGTGCTTATGGCATCTTAGAAGCTTTTAACAGAAACCATCAACAGCAA
CATGATTGGAGCGACAATAACTTCAAGTCAAGTTCTGAACTTTCTATGTTAATGGGTGGATCTAGTAATAATCAGCATCA
GCATCATTTGAGTCATCAAGAAGAACCAAAGTTAGAAGATTTCTTAGGAGGACATTCTTTTTCTGACCATGATCAGCAGA
AACTTCATGGGTATGATCATAGTGGTGATTACATGTTTTCAAACTGTTCATTGCAGTTACCGGCCGCGACTTTAAGTAAC
TCAAATGGTTATGGTGGCAGTGTTGCCAATAACAACAACAATAATAACAATGGTTCTCTGGGGTTGTCAATGATCAAAAC
CTGGTTGAGAAATCAACCTGCACCGCCCCAACAAGAAAACAAAGATGAGGGTGGAAATAGTACTTGTGGTGGCGGTGGTG
GTGGTGATGGTGTGGTGGGAAGTAATGGACAAACACTATCCTTATCAATGAGTACAGGATCACAATCTAGTTCTCCAGCT
TTGCCGCTTGTAACGACCGCCTCAACGGTTGTAACAGGTGGTGGTGGTAGTGGTGGTGCTGAAAGTTCTTCGTCGGAGAA
TAACAAGCAAAAGAGTAATGGTACTACTACGGCTCTGGATAGTCCTACTGCTGGTGCAATAGTAGAAGCAGTTCCAAGGA
AATCAATTGATACTTTCGGACAGAGAACTTCTATCTACAGAGGGGTAACTAGGCATAGATGGACAGGCAGATATGAAGCC
CATCTTTGGGATAATAGTTGCAGAAGAGAAGGCCAGACTAGAAAGGGAAGACAAGTTTATTTGGGTGGTTATGACAAAGA
AGATAAGGCTGCTAGGGCTTATGATTTAGCTGCATTGAAATACTGGGGATCAACCACCACTACAAACTTTCCGATTGCTA
ATTACGAGCAAGAGTTGGAAGATATGAAGAACATGACTAGACAGGAATATGTAGCATCACTTAGAAGGAAAAGCAGTGGG
TTTTCTCGTGGAGCATCAATTTATCGCGGTGTCACCAGGCACCATCAGCATGGAAGATGGCAAGCAAGAATTGGAAGGGT
TGCAGGGAACAAGGATTTGTACTTGGGCACATTCAGCACCCAAGAAGAAGCAGCGGAGGCCTATGACATAGCTGCGATTA
AATTCCGTGGTTTGAATGCAGTAACAAATTTCGATATGAGCCGATACGATGTAAATAGCATCATGGAAAGTAGTACATTG
CCGATCGGAGGCGCTGCAAAACGTTTAAAAGAAGCTGAGCAAATTGCCGAAGCAAGTGTTGATGCTCGAAGAGCAGAAAG
CGATAACCTTACTTCACAGTTAACCAACGGAATGAACAACTATGGATGGCCCACTATTGCTTTCCAACAAGCTCAGCCTA
TTAACATGCTGTATCCATCGTACAATCAACAAAGATCAGGGTGGTGTAAACAAGAACAAGAAGCGGCGGTTGCTGCTCAT
AGCTTTCAAGATCTTCATCAGCTTCAATTGGGCAGTACCCATCATAACTTTTTTCAACCTTCTGTTCTTCATAATCTCAT
GAGCCAAGATTCCTCTTCAATGGAACATAGTTCTGGTTCCAATTCTGTTATTTATAACGGCGGAGGTGGAACTGGAAGTG
GTGCAAGTAATGGGAGTTATCAAATGGGTGGAAGTAATGGTTATATCCCAATGGGGATGGTTATGAGTAGTGATCATCAT
CAGACTAATCCAACTGGAAGTAGTTTCGGAGAAAATGATTTGAAACAACAAATGGGTTATGATAGTATCTTGTCTTCCAA
TAGTACAGATCCTTACAGTCAAGGAAGAAATGCTTATTACATGTCGCAACAATCACCTCCGGCTTGCAATAACTGGATGC
CTACTGGAGTTCAAGCTCTTGCACCAAGGTCTAATAATATTGCTGTTTGTCATGGTGCTCCGTCGACATTTACAGTTTGG
AATGATGCATAA
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAGAAGAGAAGGCCAGACTAGAAAGGGAAGACAAGTTTATTTGGGTGGTTATGACAAAGAAGATAAGGCTGCTAGGGCTTATGATTTAGCTGCA
Microexon-tag Amino Acid seq WDNSCRREGQTRKGRQVYLGGYDKEDKAARAYDLAA
Transcript ID XM_026582548.1
Gene ID Ps.6715
Gene Name NA
Pfam domain motif AP2
Motif E-value 8.3e-12
Motif start 305
Motif end 363
Protein seq >XM_026582548.1
MASSMNNWLGFSLSPQEVVPTQSQTHHHASNVSRHGFNSDEVSGSEVVSSDCFDLGSHDSSGPTGAYGILEAFNRNHQQQ
HDWSDNNFKSSSELSMLMGGSSNNQHQHHLSHQEEPKLEDFLGGHSFSDHDQQKLHGYDHSGDYMFSNCSLQLPAATLSN
SNGYGGSVANNNNNNNNGSLGLSMIKTWLRNQPAPPQQENKDEGGNSTCGGGGGGDGVVGSNGQTLSLSMSTGSQSSSPA
LPLVTTASTVVTGGGGSGGAESSSSENNKQKSNGTTTALDSPTAGAIVEAVPRKSIDTFGQRTSIYRGVTRHRWTGRYEA
HLWDNSCRREGQTRKGRQVYLGGYDKEDKAARAYDLAALKYWGSTTTTNFPIANYEQELEDMKNMTRQEYVASLRRKSSG
FSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDMSRYDVNSIMESSTL
PIGGAAKRLKEAEQIAEASVDARRAESDNLTSQLTNGMNNYGWPTIAFQQAQPINMLYPSYNQQRSGWCKQEQEAAVAAH
SFQDLHQLQLGSTHHNFFQPSVLHNLMSQDSSSMEHSSGSNSVIYNGGGGTGSGASNGSYQMGGSNGYIPMGMVMSSDHH
QTNPTGSSFGENDLKQQMGYDSILSSNSTDPYSQGRNAYYMSQQSPPACNNWMPTGVQALAPRSNNIAVCHGAPSTFTVW
NDA*
CDS seq >XM_026582548.1
ATGGCTTCTTCAATGAACAACTGGTTAGGGTTTTCATTATCTCCACAAGAAGTAGTACCAACACAATCACAAACTCATCA
TCATGCATCAAATGTCTCGAGACATGGGTTTAACTCAGATGAAGTTTCCGGTTCTGAAGTTGTATCTAGTGATTGTTTCG
ATCTTGGTTCTCACGATTCTTCAGGTCCAACCGGTGCTTATGGCATCTTAGAAGCTTTTAACAGAAACCATCAACAGCAA
CATGATTGGAGCGACAATAACTTCAAGTCAAGTTCTGAACTTTCTATGTTAATGGGTGGATCTAGTAATAATCAGCATCA
GCATCATTTGAGTCATCAAGAAGAACCAAAGTTAGAAGATTTCTTAGGAGGACATTCTTTTTCTGACCATGATCAGCAGA
AACTTCATGGGTATGATCATAGTGGTGATTACATGTTTTCAAACTGTTCATTGCAGTTACCGGCCGCGACTTTAAGTAAC
TCAAATGGTTATGGTGGCAGTGTTGCCAATAACAACAACAATAATAACAATGGTTCTCTGGGGTTGTCAATGATCAAAAC
CTGGTTGAGAAATCAACCTGCACCGCCCCAACAAGAAAACAAAGATGAGGGTGGAAATAGTACTTGTGGTGGCGGTGGTG
GTGGTGATGGTGTGGTGGGAAGTAATGGACAAACACTATCCTTATCAATGAGTACAGGATCACAATCTAGTTCTCCAGCT
TTGCCGCTTGTAACGACCGCCTCAACGGTTGTAACAGGTGGTGGTGGTAGTGGTGGTGCTGAAAGTTCTTCGTCGGAGAA
TAACAAGCAAAAGAGTAATGGTACTACTACGGCTCTGGATAGTCCTACTGCTGGTGCAATAGTAGAAGCAGTTCCAAGGA
AATCAATTGATACTTTCGGACAGAGAACTTCTATCTACAGAGGGGTAACTAGGCATAGATGGACAGGCAGATATGAAGCC
CATCTTTGGGATAATAGTTGCAGAAGAGAAGGCCAGACTAGAAAGGGAAGACAAGTTTATTTGGGTGGTTATGACAAAGA
AGATAAGGCTGCTAGGGCTTATGATTTAGCTGCATTGAAATACTGGGGATCAACCACCACTACAAACTTTCCGATTGCTA
ATTACGAGCAAGAGTTGGAAGATATGAAGAACATGACTAGACAGGAATATGTAGCATCACTTAGAAGGAAAAGCAGTGGG
TTTTCTCGTGGAGCATCAATTTATCGCGGTGTCACCAGGCACCATCAGCATGGAAGATGGCAAGCAAGAATTGGAAGGGT
TGCAGGGAACAAGGATTTGTACTTGGGCACATTCAGCACCCAAGAAGAAGCAGCGGAGGCCTATGACATAGCTGCGATTA
AATTCCGTGGTTTGAATGCAGTAACAAATTTCGATATGAGCCGATACGATGTAAATAGCATCATGGAAAGTAGTACATTG
CCGATCGGAGGCGCTGCAAAACGTTTAAAAGAAGCTGAGCAAATTGCCGAAGCAAGTGTTGATGCTCGAAGAGCAGAAAG
CGATAACCTTACTTCACAGTTAACCAACGGAATGAACAACTATGGATGGCCCACTATTGCTTTCCAACAAGCTCAGCCTA
TTAACATGCTGTATCCATCGTACAATCAACAAAGATCAGGGTGGTGTAAACAAGAACAAGAAGCGGCGGTTGCTGCTCAT
AGCTTTCAAGATCTTCATCAGCTTCAATTGGGCAGTACCCATCATAACTTTTTTCAACCTTCTGTTCTTCATAATCTCAT
GAGCCAAGATTCCTCTTCAATGGAACATAGTTCTGGTTCCAATTCTGTTATTTATAACGGCGGAGGTGGAACTGGAAGTG
GTGCAAGTAATGGGAGTTATCAAATGGGTGGAAGTAATGGTTATATCCCAATGGGGATGGTTATGAGTAGTGATCATCAT
CAGACTAATCCAACTGGAAGTAGTTTCGGAGAAAATGATTTGAAACAACAAATGGGTTATGATAGTATCTTGTCTTCCAA
TAGTACAGATCCTTACAGTCAAGGAAGAAATGCTTATTACATGTCGCAACAATCACCTCCGGCTTGCAATAACTGGATGC
CTACTGGAGTTCAAGCTCTTGCACCAAGGTCTAATAATATTGCTGTTTGTCATGGTGCTCCGTCGACATTTACAGTTTGG
AATGATGCATAA