Microexon ID Ps_NC_039362.1:189356012-189356020:-
Species Papaver somniferum
Coordinates NC_039362.1:189356012..189356020
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAGAAGAGAAGGTCAGACTAGAAAGGGTAGACAAGTTTATTTGGGTGGTTATGACAAAGAAGATAAGGCTGCTAGGGCTTATGATTTGGCTGCC
Microexon-tag Amino Acid Seq WDNSCRREGQTRKGRQVYLGGYDKEDKAARAYDLAA
Microexon-tag spanning region189355865-189356202
Microexon-tag prediction score0.9733
Overlapped with the annotated transcript (%) 100
New Transcript ID XM_026527218.1x
Reference Transcript ID XM_026527218.1
Gene ID NA
Gene Name NA
Transcript ID XM_026527218.1
Protein ID XP_026383003.1
Gene ID LOC113278357
Gene Name NA
Pfam domain motif AP2
Motif E-value 8.3e-12
Motif start 308
Motif end 366
Protein seq >XP_026383003.1
MASSMNNWLGFSLSPQEVVPTQSQTHHHASNVSRHGFNSDEVSGSEVVSSDCFDLGSHDSSGPTGAYGILEAFNRNHQQQ
HDWSGNNFKSSSELSMLMGGSNNNQHQHHLSHQEEPKLEDFLGGHSFSDHDQQKLHGYDHSGDYMFSNCSLQLPAATVSN
SNGYGGSVANNNNNNNNNGSLGLSMIKTWLRNQPAPPQQENKDEVGNSTCGGGGGVGDGVVGSNGQTLSLSMSTGSQSSS
PALPLVTTASTVVTGGGGGSGGAESSSSENNKQKTNGTTTALDSPTAGAIVEAVPRKSIDTFGQRTSIYRGVTRHRWTGR
YEAHLWDNSCRREGQTRKGRQVYLGGYDKEDKAARAYDLAALKYWGSTTTTNFPIANYEQELEDMKNMTRQEYVASLRRK
SSGFSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDMSRYDVSSIMES
STLPIGGAAKRLKEAEQIAEASVDARRAESDNLTSQLTNGINNYGWPTIAFQQAQPISMLYPYNQQRSGWCKQEQEAAAA
AHSFQDLHQLQLGSTHHNFFQPSVLHNLMSQDSSSLEHSSGSNSVIYNGGGGTGSGASNGSYQMGGSNGYLPMGMVMSSE
HQNNPTGSSFVENDLKQQMGYDSIMSSNSTDPYSQGRNAYYMSQQSPPACNNWMPTGVQALAPRSNNLAVCHGAPSTFTV
WNDA*
CDS seq >XM_026527218.1
ATGGCTTCTTCAATGAACAACTGGTTAGGGTTTTCATTATCTCCGCAAGAAGTAGTACCAACACAATCACAAACTCATCA
TCATGCATCAAATGTCTCGAGACATGGGTTTAACTCTGATGAAGTTTCTGGTTCTGAAGTTGTATCTAGTGATTGTTTCG
ATCTTGGTTCTCACGATTCTTCAGGTCCAACCGGTGCTTATGGCATCTTAGAAGCTTTTAACAGAAACCATCAGCAACAA
CATGATTGGAGCGGCAATAACTTCAAGTCAAGTTCTGAACTTTCTATGTTAATGGGTGGATCTAATAATAATCAGCATCA
GCATCATTTGAGTCATCAAGAAGAACCAAAGTTAGAAGATTTCTTAGGAGGACATTCTTTTTCTGACCATGATCAACAGA
AACTTCACGGGTATGATCATAGTGGTGATTACATGTTTTCAAACTGTTCATTGCAGTTACCGGCCGCGACCGTAAGTAAC
TCAAATGGTTATGGTGGCAGTGTTGCCAATAACAACAACAACAATAATAACAATGGTTCCCTGGGGTTGTCAATGATCAA
AACCTGGTTAAGAAATCAACCTGCACCGCCCCAACAAGAAAACAAAGATGAGGTTGGAAATAGTACTTGTGGTGGCGGTG
GTGGTGTTGGTGATGGTGTGGTGGGAAGTAATGGACAAACACTATCCTTATCAATGAGTACAGGATCACAATCTAGTTCT
CCAGCTTTGCCGCTTGTAACGACCGCGTCAACGGTTGTTACAGGTGGTGGTGGTGGTAGTGGTGGTGCTGAAAGTTCTTC
GTCGGAGAATAACAAGCAAAAGACTAATGGTACTACTACGGCTCTGGATAGTCCTACTGCTGGTGCAATAGTAGAAGCAG
TTCCAAGGAAATCAATTGATACTTTCGGACAGAGAACTTCTATCTACAGAGGTGTAACTAGGCATAGATGGACAGGCAGA
TATGAAGCCCATCTTTGGGATAATAGTTGCAGAAGAGAAGGTCAGACTAGAAAGGGTAGACAAGTTTATTTGGGTGGTTA
TGACAAAGAAGATAAGGCTGCTAGGGCTTATGATTTGGCTGCCTTGAAATATTGGGGATCAACCACCACTACAAACTTTC
CGATTGCTAATTACGAGCAAGAGTTGGAAGATATGAAGAACATGACTAGACAGGAATATGTAGCATCACTTAGAAGGAAA
AGCAGTGGTTTTTCTCGTGGAGCATCAATTTATCGCGGTGTCACCAGGCACCATCAGCACGGTAGATGGCAAGCAAGAAT
TGGAAGGGTTGCAGGGAACAAGGATTTGTACTTGGGCACATTCAGTACCCAAGAAGAAGCAGCTGAGGCCTACGACATAG
CTGCGATTAAGTTCCGTGGTTTGAATGCAGTAACAAATTTCGATATGAGCAGATACGATGTAAGTAGCATCATGGAAAGT
AGTACATTGCCGATCGGAGGCGCTGCAAAACGTTTAAAAGAAGCTGAGCAAATTGCAGAAGCAAGTGTTGATGCTCGAAG
AGCAGAGAGTGATAACCTTACTTCACAGTTAACCAACGGAATCAACAACTACGGATGGCCAACTATTGCTTTCCAACAAG
CTCAGCCTATTAGCATGCTGTATCCATACAATCAACAAAGATCTGGGTGGTGTAAACAGGAACAAGAAGCGGCTGCTGCT
GCTCATAGCTTTCAAGATCTTCATCAGCTTCAATTGGGTAGTACCCATCATAACTTTTTTCAACCTTCTGTTCTTCATAA
TCTCATGAGCCAAGATTCCTCTTCACTGGAACATAGTTCTGGTTCCAATTCTGTTATTTATAATGGCGGAGGTGGAACTG
GAAGTGGTGCAAGTAATGGGAGTTATCAAATGGGTGGAAGTAATGGCTATCTGCCAATGGGAATGGTTATGAGTAGTGAG
CATCAGAACAATCCAACTGGAAGTAGTTTCGTGGAGAATGACTTGAAACAACAAATGGGATATGATAGTATTATGTCTTC
CAACAGTACAGATCCTTATAGTCAAGGAAGAAATGCTTATTACATGTCGCAACAATCACCTCCGGCTTGCAATAACTGGA
TGCCTACTGGAGTTCAAGCTTTAGCACCGAGGTCTAATAATCTCGCTGTTTGTCATGGTGCTCCGTCGACATTTACGGTT
TGGAATGATGCATAA
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAGAAGAGAAGGTCAGACTAGAAAGGGTAGACAAGTTTATTTGGGTGGTTATGACAAAGAAGATAAGGCTGCTAGGGCTTATGATTTGGCTGCC
Microexon-tag Amino Acid seq WDNSCRREGQTRKGRQVYLGGYDKEDKAARAYDLAA
Transcript ID XM_026527218.1
Gene ID Ps.32771
Gene Name NA
Pfam domain motif AP2
Motif E-value 8.3e-12
Motif start 308
Motif end 366
Protein seq >XM_026527218.1
MASSMNNWLGFSLSPQEVVPTQSQTHHHASNVSRHGFNSDEVSGSEVVSSDCFDLGSHDSSGPTGAYGILEAFNRNHQQQ
HDWSGNNFKSSSELSMLMGGSNNNQHQHHLSHQEEPKLEDFLGGHSFSDHDQQKLHGYDHSGDYMFSNCSLQLPAATVSN
SNGYGGSVANNNNNNNNNGSLGLSMIKTWLRNQPAPPQQENKDEVGNSTCGGGGGVGDGVVGSNGQTLSLSMSTGSQSSS
PALPLVTTASTVVTGGGGGSGGAESSSSENNKQKTNGTTTALDSPTAGAIVEAVPRKSIDTFGQRTSIYRGVTRHRWTGR
YEAHLWDNSCRREGQTRKGRQVYLGGYDKEDKAARAYDLAALKYWGSTTTTNFPIANYEQELEDMKNMTRQEYVASLRRK
SSGFSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDMSRYDVSSIMES
STLPIGGAAKRLKEAEQIAEASVDARRAESDNLTSQLTNGINNYGWPTIAFQQAQPISMLYPYNQQRSGWCKQEQEAAAA
AHSFQDLHQLQLGSTHHNFFQPSVLHNLMSQDSSSLEHSSGSNSVIYNGGGGTGSGASNGSYQMGGSNGYLPMGMVMSSE
HQNNPTGSSFVENDLKQQMGYDSIMSSNSTDPYSQGRNAYYMSQQSPPACNNWMPTGVQALAPRSNNLAVCHGAPSTFTV
WNDA*
CDS seq >XM_026527218.1
ATGGCTTCTTCAATGAACAACTGGTTAGGGTTTTCATTATCTCCGCAAGAAGTAGTACCAACACAATCACAAACTCATCA
TCATGCATCAAATGTCTCGAGACATGGGTTTAACTCTGATGAAGTTTCTGGTTCTGAAGTTGTATCTAGTGATTGTTTCG
ATCTTGGTTCTCACGATTCTTCAGGTCCAACCGGTGCTTATGGCATCTTAGAAGCTTTTAACAGAAACCATCAGCAACAA
CATGATTGGAGCGGCAATAACTTCAAGTCAAGTTCTGAACTTTCTATGTTAATGGGTGGATCTAATAATAATCAGCATCA
GCATCATTTGAGTCATCAAGAAGAACCAAAGTTAGAAGATTTCTTAGGAGGACATTCTTTTTCTGACCATGATCAACAGA
AACTTCACGGGTATGATCATAGTGGTGATTACATGTTTTCAAACTGTTCATTGCAGTTACCGGCCGCGACCGTAAGTAAC
TCAAATGGTTATGGTGGCAGTGTTGCCAATAACAACAACAACAATAATAACAATGGTTCCCTGGGGTTGTCAATGATCAA
AACCTGGTTAAGAAATCAACCTGCACCGCCCCAACAAGAAAACAAAGATGAGGTTGGAAATAGTACTTGTGGTGGCGGTG
GTGGTGTTGGTGATGGTGTGGTGGGAAGTAATGGACAAACACTATCCTTATCAATGAGTACAGGATCACAATCTAGTTCT
CCAGCTTTGCCGCTTGTAACGACCGCGTCAACGGTTGTTACAGGTGGTGGTGGTGGTAGTGGTGGTGCTGAAAGTTCTTC
GTCGGAGAATAACAAGCAAAAGACTAATGGTACTACTACGGCTCTGGATAGTCCTACTGCTGGTGCAATAGTAGAAGCAG
TTCCAAGGAAATCAATTGATACTTTCGGACAGAGAACTTCTATCTACAGAGGTGTAACTAGGCATAGATGGACAGGCAGA
TATGAAGCCCATCTTTGGGATAATAGTTGCAGAAGAGAAGGTCAGACTAGAAAGGGTAGACAAGTTTATTTGGGTGGTTA
TGACAAAGAAGATAAGGCTGCTAGGGCTTATGATTTGGCTGCCTTGAAATATTGGGGATCAACCACCACTACAAACTTTC
CGATTGCTAATTACGAGCAAGAGTTGGAAGATATGAAGAACATGACTAGACAGGAATATGTAGCATCACTTAGAAGGAAA
AGCAGTGGTTTTTCTCGTGGAGCATCAATTTATCGCGGTGTCACCAGGCACCATCAGCACGGTAGATGGCAAGCAAGAAT
TGGAAGGGTTGCAGGGAACAAGGATTTGTACTTGGGCACATTCAGTACCCAAGAAGAAGCAGCTGAGGCCTACGACATAG
CTGCGATTAAGTTCCGTGGTTTGAATGCAGTAACAAATTTCGATATGAGCAGATACGATGTAAGTAGCATCATGGAAAGT
AGTACATTGCCGATCGGAGGCGCTGCAAAACGTTTAAAAGAAGCTGAGCAAATTGCAGAAGCAAGTGTTGATGCTCGAAG
AGCAGAGAGTGATAACCTTACTTCACAGTTAACCAACGGAATCAACAACTACGGATGGCCAACTATTGCTTTCCAACAAG
CTCAGCCTATTAGCATGCTGTATCCATACAATCAACAAAGATCTGGGTGGTGTAAACAGGAACAAGAAGCGGCTGCTGCT
GCTCATAGCTTTCAAGATCTTCATCAGCTTCAATTGGGTAGTACCCATCATAACTTTTTTCAACCTTCTGTTCTTCATAA
TCTCATGAGCCAAGATTCCTCTTCACTGGAACATAGTTCTGGTTCCAATTCTGTTATTTATAATGGCGGAGGTGGAACTG
GAAGTGGTGCAAGTAATGGGAGTTATCAAATGGGTGGAAGTAATGGCTATCTGCCAATGGGAATGGTTATGAGTAGTGAG
CATCAGAACAATCCAACTGGAAGTAGTTTCGTGGAGAATGACTTGAAACAACAAATGGGATATGATAGTATTATGTCTTC
CAACAGTACAGATCCTTATAGTCAAGGAAGAAATGCTTATTACATGTCGCAACAATCACCTCCGGCTTGCAATAACTGGA
TGCCTACTGGAGTTCAAGCTTTAGCACCGAGGTCTAATAATCTCGCTGTTTGTCATGGTGCTCCGTCGACATTTACGGTT
TGGAATGATGCATAA