Microexon ID Ps_NC_039367.1:104066826-104066830:-
Species Papaver somniferum
Coordinates NC_039367.1:104066826..104066830
Microexon Cluster ID MEP07
Size 5
Phase 1
Pfam Domain Motif Peptidase_M1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 33,19,5,12,39
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq GCWCAYCCTGTTCGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
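The cluster-level Microexon-tag DNA Seq above uses IUPAC ambiguity codes (W, Y, R, K, M), while the record's own microexon-tag sequence further down is fully resolved. The following is a minimal sketch of how the two could be compared position by position. The function name `mismatches` and the constant names are hypothetical, and treating the consensus as a strict IUPAC pattern is an assumption made for illustration, not something the record states.

```python
# IUPAC nucleotide codes mapped to the bases each code permits.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Cluster consensus (first "Microexon-tag DNA Seq" field of the record).
CONSENSUS = "GCWCAYCCTGTTCGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGR"
# Resolved sequence for this microexon (second "Microexon-tag DNA Seq" field).
INSTANCE = "GCTCATCCAGTACGACCACATTCTTATATTAAGATGGACAACTTCTACACAGTTACGGTTTATGAAAAGGGAGCTGAAGTTGTCAGGATGTACAAAACTTTGTTGGGA"


def mismatches(consensus, seq):
    """Return 1-based positions where seq's base is not permitted by the consensus code."""
    if len(consensus) != len(seq):
        raise ValueError("sequences must have the same length")
    return [
        pos
        for pos, (code, base) in enumerate(zip(consensus, seq), start=1)
        if base not in IUPAC[code]
    ]


positions = mismatches(CONSENSUS, INSTANCE)
print(f"{len(positions)} of {len(INSTANCE)} positions fall outside the consensus: {positions}")
```

Positions that fall outside the consensus are not necessarily errors; a consensus built from many cluster members may simply not list every base seen in each member.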
Microexon DNA seq TTACG
Microexon Amino Acid seq VT
Microexon-tag DNA Seq GCTCATCCAGTACGACCACATTCTTATATTAAGATGGACAACTTCTACACAGTTACGGTTTATGAAAAGGGAGCTGAAGTTGTCAGGATGTACAAAACTTTGTTGGGA
Microexon-tag Amino Acid Seq AHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLG
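The structure, location, size, and phase fields can be cross-checked against the tag sequence. The sketch below splits the tag by the listed exon sizes, extracts the exon at the stated location, and translates the tag with the standard genetic code. All function and variable names are hypothetical. It assumes that "Phase" means the number of codon bases already supplied by the upstream exons (the intron-phase convention), which the record does not define.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons of dna, starting at the first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut the tag into consecutive exons of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("exon sizes do not add up to the tag length")
    exons, pos = [], 0
    for size in sizes:
        exons.append(tag[pos:pos + size])
        pos += size
    return exons


TAG = "GCTCATCCAGTACGACCACATTCTTATATTAAGATGGACAACTTCTACACAGTTACGGTTTATGAAAAGGGAGCTGAAGTTGTCAGGATGTACAAAACTTTGTTGGGA"
SIZES = [33, 19, 5, 12, 39]   # "Structure of Microexon-tag"
LOCATION = 3                  # "Microexon location in the Microexon-tag" (1-based)

exons = split_tag(TAG, SIZES)
microexon = exons[LOCATION - 1]

# Bases contributed by the exons upstream of the microexon.
offset = sum(SIZES[:LOCATION - 1])
phase = offset % 3            # assumed intron-phase convention

# Residues whose codons overlap the microexon.
peptide = translate(TAG)
first = offset // 3
last = (offset + len(microexon) - 1) // 3
microexon_residues = peptide[first:last + 1]

print(microexon, phase, microexon_residues)
print(peptide)
```

Under these assumptions the microexon begins one base into a codon, so both residues it contributes to are completed with bases from the neighbouring exons.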
Microexon-tag spanning region 104066570-104067134
Microexon-tag prediction score 0.985
Overlapped with the annotated transcript (%) 100
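The Microexon ID encodes the sequence accession, coordinates, and strand, so the Size and spanning-region fields can be checked from it. This is a small sketch under the assumption that coordinates are 1-based and inclusive (which is consistent with the listed Size of 5) and that the ID follows the pattern `<prefix>_<accession>:<start>-<end>:<strand>`. The helper name `parse_microexon_id` is hypothetical.

```python
import re

ID_PATTERN = re.compile(
    r"(?P<prefix>[^_]+)_(?P<seqid>.+):(?P<start>\d+)-(?P<end>\d+):(?P<strand>[+-])"
)


def parse_microexon_id(microexon_id):
    """Split a microexon ID into prefix, accession, coordinates, and strand."""
    match = ID_PATTERN.fullmatch(microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


rec = parse_microexon_id("Ps_NC_039367.1:104066826-104066830:-")

# Assumed 1-based inclusive coordinates.
size = rec["end"] - rec["start"] + 1

# "Microexon-tag spanning region" from the record.
span_start, span_end = 104066570, 104067134
inside_span = span_start <= rec["start"] and rec["end"] <= span_end
span_length = span_end - span_start + 1

print(rec["seqid"], rec["strand"], size, inside_span, span_length)
```

The spanning region is longer than the 108 nt tag because it is a genomic interval that also covers the introns between the tag's exons.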
New Transcript ID XM_026565444.1x
Reference Transcript ID XM_026565444.1
Gene ID NA
Gene Name NA
Transcript ID XM_026565444.1
Protein ID XP_026421229.1
Gene ID LOC113317316
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 1e-49
Motif start 328
Motif end 537
Protein seq >XP_026421229.1
MARLVIQPYKSCPGLAKTSLLGYFTSSSLFQASRRGCCLQQSFKGVYTSQKYTSSEEVSRWSIHPLLFSSSFRIKQSSKR
LICSVATQPQPSQAEEFKMDTPKEIFLKDYKTPNYYFDTVDLTFSLGEEHTIVCSKITVYPRVEGVSSPLVLDGADLKLL
SIKIDGKELKKEEYHLDSRHLTLSSAPSARFTLEIVTEIYPQKNTSLEGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMA
KYTCRVEGDKTLYPVLLSNGNLIEQGDLEGGKHYALWEDPHKKPCYLFALVAGQLQSRDDSFITRSGREVSLRIWTPAED
LPKTVHAMYSLKAAMKWDEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKLVLASPETASDADYAAILGVIGHEY
FHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGSRSVKRIEDVSRLRNYQFPQDAGPIAHPVRPHSYIKMDNFYTVT
VYEKGAEVVRMYKTLLGSQGFRRGMDLYFKRHDGQAVTCEDFYSAMRDANDADFANFLLWYSQAGTPSVKVTSVYNSEAK
TYTLKFSQEVPATPGQAVKEPMLIPVAVGLLDSSGKDMPLKSVYHDGMLQVVSTDGQPAYTTVLQIKKKEEEFVFSEISE
RPVPSLLRGYSAPIRLDSDLTDNHLFFLLAHDSDEFNRWEAGQVLARNLMLSLVADFQQNKPLALKADYVHGLRSILSDT
SLDKEFIAKAITLPGAGEIMDMMEIADPDAVHAVRSFIRKHLASELKAEFLRTVENNRSSDPYVFDHSNMSRRALKNVAL
AYLASFEDTQITELALNEYRTATNMTDQFAALAALVQNPGKTRDDVLADFYGKWKHDYLVVNKWFALQAMSDIPGNVENV
RKLLQHPGFDLRNPNKVYSLIGGFCGSPVNFHAKDGSGYKFLGEIVVQLDKINPQVASRMVSAFSRWRRYDETRQAFAKA
QLEMIMSTNGLSENVFEIASKSLAA*
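To relate the microexon-tag to the Pfam motif range, the tag peptide can be located in the protein sequence above and its coordinates compared with Motif start and Motif end. The sketch below does this with a plain substring search. The names are hypothetical, and it assumes motif coordinates are 1-based, inclusive residue positions.

```python
# Protein sequence XP_026421229.1 as listed in the record (stop symbol omitted).
PROTEIN = (
    "MARLVIQPYKSCPGLAKTSLLGYFTSSSLFQASRRGCCLQQSFKGVYTSQKYTSSEEVSRWSIHPLLFSSSFRIKQSSKR"
    "LICSVATQPQPSQAEEFKMDTPKEIFLKDYKTPNYYFDTVDLTFSLGEEHTIVCSKITVYPRVEGVSSPLVLDGADLKLL"
    "SIKIDGKELKKEEYHLDSRHLTLSSAPSARFTLEIVTEIYPQKNTSLEGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMA"
    "KYTCRVEGDKTLYPVLLSNGNLIEQGDLEGGKHYALWEDPHKKPCYLFALVAGQLQSRDDSFITRSGREVSLRIWTPAED"
    "LPKTVHAMYSLKAAMKWDEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKLVLASPETASDADYAAILGVIGHEY"
    "FHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGSRSVKRIEDVSRLRNYQFPQDAGPIAHPVRPHSYIKMDNFYTVT"
    "VYEKGAEVVRMYKTLLGSQGFRRGMDLYFKRHDGQAVTCEDFYSAMRDANDADFANFLLWYSQAGTPSVKVTSVYNSEAK"
    "TYTLKFSQEVPATPGQAVKEPMLIPVAVGLLDSSGKDMPLKSVYHDGMLQVVSTDGQPAYTTVLQIKKKEEEFVFSEISE"
    "RPVPSLLRGYSAPIRLDSDLTDNHLFFLLAHDSDEFNRWEAGQVLARNLMLSLVADFQQNKPLALKADYVHGLRSILSDT"
    "SLDKEFIAKAITLPGAGEIMDMMEIADPDAVHAVRSFIRKHLASELKAEFLRTVENNRSSDPYVFDHSNMSRRALKNVAL"
    "AYLASFEDTQITELALNEYRTATNMTDQFAALAALVQNPGKTRDDVLADFYGKWKHDYLVVNKWFALQAMSDIPGNVENV"
    "RKLLQHPGFDLRNPNKVYSLIGGFCGSPVNFHAKDGSGYKFLGEIVVQLDKINPQVASRMVSAFSRWRRYDETRQAFAKA"
    "QLEMIMSTNGLSENVFEIASKSLAA"
)

TAG_PEPTIDE = "AHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLG"
MOTIF_START, MOTIF_END = 328, 537   # assumed 1-based inclusive residue positions


def locate(protein, peptide):
    """Return the 1-based inclusive (start, end) of the first occurrence of peptide."""
    index = protein.find(peptide)
    if index < 0:
        raise ValueError("peptide not found in protein")
    return index + 1, index + len(peptide)


tag_start, tag_end = locate(PROTEIN, TAG_PEPTIDE)
inside_motif = MOTIF_START <= tag_start and tag_end <= MOTIF_END

print(tag_start, tag_end, inside_motif)
```

A substring search only finds an exact match, so this check would need an alignment step for tags that differ from the annotated protein.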
CDS seq >XM_026565444.1
ATGGCTAGATTGGTGATTCAACCTTACAAAAGTTGTCCGGGTTTGGCAAAGACCAGTCTATTGGGATACTTCACCTCTTC
ATCTCTCTTTCAAGCATCACGACGTGGCTGTTGTTTGCAGCAGTCATTCAAGGGTGTTTATACAAGTCAAAAATACACAA
GTTCTGAGGAGGTTTCCCGTTGGAGTATACATCCACTTCTGTTCTCCTCATCTTTCAGGATTAAGCAGTCTAGTAAAAGA
CTGATCTGCTCCGTTGCAACACAACCCCAACCAAGCCAGGCAGAAGAATTCAAAATGGACACCCCTAAGGAGATTTTTTT
GAAGGATTACAAGACGCCTAATTACTACTTTGATACGGTTGATCTGACGTTCTCATTGGGAGAGGAGCACACAATTGTTT
GTTCAAAAATAACTGTTTACCCTAGAGTTGAAGGTGTCTCTTCTCCACTGGTCTTAGATGGGGCGGATCTTAAGTTACTA
TCAATCAAAATTGATGGCAAGGAACTGAAGAAAGAAGAGTACCACCTGGATTCACGCCATCTGACACTCTCATCAGCCCC
AAGTGCAAGATTTACTTTAGAGATTGTTACCGAGATATATCCTCAGAAAAATACATCTTTGGAGGGGCTCTATAAATCCT
CTGGAAATTTTTGTACACAGTGTGAAGCTGAGGGTTTTCGGAAGATAACATTTTACCAGGATCGACCTGATATTATGGCT
AAATACACATGCCGGGTCGAAGGCGATAAGACTCTATATCCAGTATTGCTTTCAAATGGAAACCTCATAGAGCAAGGAGA
CCTCGAGGGTGGTAAACATTATGCCCTTTGGGAGGATCCACACAAGAAACCATGCTACTTGTTTGCATTGGTTGCTGGAC
AGTTGCAGAGTAGGGATGACTCTTTCATCACTCGGTCGGGACGGGAGGTCTCTCTTAGGATCTGGACCCCTGCAGAGGAT
CTACCAAAAACTGTTCATGCCATGTATTCTCTCAAGGCAGCTATGAAGTGGGACGAGGATGTTTTCGGTCTTGAGTATGA
CTTGGATCTTTTTAATATTGTGGCGGTTCCAGATTTTAACATGGGAGCCATGGAGAACAAGAGTTTGAATATATTTAATT
CGAAGCTTGTTTTGGCATCACCTGAGACTGCTTCAGATGCTGATTATGCTGCAATTCTAGGGGTTATTGGTCATGAGTAT
TTTCATAACTGGACTGGAAACAGGGTGACATGTCGTGACTGGTTCCAGCTGAGTCTGAAGGAAGGTCTAACTGTCTTTCG
GGATCAGGAATTTTCTTCTGACATGGGAAGCCGTTCAGTAAAGAGAATAGAAGATGTTTCAAGGCTGCGTAACTATCAGT
TTCCACAGGATGCTGGTCCTATTGCTCATCCAGTACGACCACATTCTTATATTAAGATGGACAACTTCTACACAGTTACG
GTTTATGAAAAGGGAGCTGAAGTTGTCAGGATGTACAAAACTTTGTTGGGAAGTCAAGGTTTCCGAAGAGGGATGGATCT
TTATTTTAAGAGGCATGATGGCCAAGCTGTAACGTGTGAAGATTTCTATTCTGCAATGCGAGATGCAAATGATGCGGACT
TTGCTAATTTCTTGTTGTGGTACTCTCAGGCAGGAACACCATCCGTGAAGGTTACATCAGTCTACAATTCTGAAGCCAAG
ACATACACTTTGAAGTTCAGTCAAGAGGTGCCTGCTACTCCAGGCCAGGCAGTGAAAGAACCAATGCTCATACCTGTGGC
AGTTGGCCTGCTTGACTCAAGCGGGAAGGATATGCCTCTCAAATCCGTGTATCATGATGGAATGCTTCAGGTGGTTTCCA
CAGATGGGCAACCAGCCTATACTACAGTTCTTCAGATAAAGAAGAAGGAAGAAGAATTCGTGTTCTCTGAAATATCTGAA
CGCCCAGTTCCATCTCTCTTGCGGGGATACAGCGCTCCTATTCGCCTTGACTCTGATCTTACTGATAATCATCTTTTTTT
TCTACTTGCTCATGATTCAGATGAGTTTAATCGTTGGGAGGCGGGGCAAGTGTTGGCTAGAAATTTGATGCTTAGCTTGG
TTGCTGATTTCCAGCAAAACAAACCATTAGCTCTGAAAGCAGACTATGTGCATGGTCTCAGAAGCATACTATCTGACACT
AGCTTGGATAAAGAATTCATTGCTAAGGCAATTACTCTGCCCGGGGCAGGGGAGATCATGGACATGATGGAGATTGCTGA
TCCTGATGCTGTTCACGCTGTCCGATCTTTCATCAGGAAGCATCTTGCTTCTGAACTCAAAGCAGAATTTCTCAGGACGG
TTGAAAACAATAGAAGCTCTGATCCATATGTCTTCGATCATTCCAATATGTCAAGACGCGCTTTGAAGAACGTCGCCCTT
GCCTATCTAGCCTCTTTTGAGGATACCCAGATCACCGAACTTGCTCTGAATGAGTACAGGACTGCTACAAATATGACTGA
CCAGTTTGCAGCTCTAGCAGCCTTGGTCCAAAATCCTGGCAAAACCCGCGATGATGTTCTTGCTGATTTCTATGGCAAGT
GGAAACATGACTATTTGGTTGTGAATAAATGGTTTGCTCTTCAAGCCATGTCAGACATTCCTGGAAATGTTGAAAATGTC
CGCAAGCTGCTGCAGCATCCTGGCTTTGACTTACGCAATCCAAACAAGGTGTACTCACTCATTGGAGGATTCTGCGGGTC
ACCAGTTAATTTCCATGCAAAGGATGGTTCGGGCTATAAATTCTTGGGAGAGATTGTAGTGCAACTTGACAAAATAAACC
CCCAGGTTGCCTCCCGTATGGTGTCTGCCTTCTCGAGGTGGAGGCGATATGATGAGACCAGACAGGCGTTTGCTAAGGCA
CAATTAGAGATGATAATGTCAACCAATGGACTCTCTGAGAATGTGTTTGAAATTGCGTCAAAAAGCTTAGCTGCTTAG
Microexon DNA seq TTACG
Microexon Amino Acid seq VT
Microexon-tag DNA Seq GCTCATCCAGTACGACCACATTCTTATATTAAGATGGACAACTTCTACACAGTTACGGTTTATGAAAAGGGAGCTGAAGTTGTCAGGATGTACAAAACTTTGTTGGGA
Microexon-tag Amino Acid seq AHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYKTLLG
Transcript ID XM_026565444.1
Gene ID Ps.62326
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 1e-49
Motif start 328
Motif end 537
Protein seq >XM_026565444.1
MARLVIQPYKSCPGLAKTSLLGYFTSSSLFQASRRGCCLQQSFKGVYTSQKYTSSEEVSRWSIHPLLFSSSFRIKQSSKR
LICSVATQPQPSQAEEFKMDTPKEIFLKDYKTPNYYFDTVDLTFSLGEEHTIVCSKITVYPRVEGVSSPLVLDGADLKLL
SIKIDGKELKKEEYHLDSRHLTLSSAPSARFTLEIVTEIYPQKNTSLEGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMA
KYTCRVEGDKTLYPVLLSNGNLIEQGDLEGGKHYALWEDPHKKPCYLFALVAGQLQSRDDSFITRSGREVSLRIWTPAED
LPKTVHAMYSLKAAMKWDEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKLVLASPETASDADYAAILGVIGHEY
FHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGSRSVKRIEDVSRLRNYQFPQDAGPIAHPVRPHSYIKMDNFYTVT
VYEKGAEVVRMYKTLLGSQGFRRGMDLYFKRHDGQAVTCEDFYSAMRDANDADFANFLLWYSQAGTPSVKVTSVYNSEAK
TYTLKFSQEVPATPGQAVKEPMLIPVAVGLLDSSGKDMPLKSVYHDGMLQVVSTDGQPAYTTVLQIKKKEEEFVFSEISE
RPVPSLLRGYSAPIRLDSDLTDNHLFFLLAHDSDEFNRWEAGQVLARNLMLSLVADFQQNKPLALKADYVHGLRSILSDT
SLDKEFIAKAITLPGAGEIMDMMEIADPDAVHAVRSFIRKHLASELKAEFLRTVENNRSSDPYVFDHSNMSRRALKNVAL
AYLASFEDTQITELALNEYRTATNMTDQFAALAALVQNPGKTRDDVLADFYGKWKHDYLVVNKWFALQAMSDIPGNVENV
RKLLQHPGFDLRNPNKVYSLIGGFCGSPVNFHAKDGSGYKFLGEIVVQLDKINPQVASRMVSAFSRWRRYDETRQAFAKA
QLEMIMSTNGLSENVFEIASKSLAA*
CDS seq >XM_026565444.1
ATGGCTAGATTGGTGATTCAACCTTACAAAAGTTGTCCGGGTTTGGCAAAGACCAGTCTATTGGGATACTTCACCTCTTC
ATCTCTCTTTCAAGCATCACGACGTGGCTGTTGTTTGCAGCAGTCATTCAAGGGTGTTTATACAAGTCAAAAATACACAA
GTTCTGAGGAGGTTTCCCGTTGGAGTATACATCCACTTCTGTTCTCCTCATCTTTCAGGATTAAGCAGTCTAGTAAAAGA
CTGATCTGCTCCGTTGCAACACAACCCCAACCAAGCCAGGCAGAAGAATTCAAAATGGACACCCCTAAGGAGATTTTTTT
GAAGGATTACAAGACGCCTAATTACTACTTTGATACGGTTGATCTGACGTTCTCATTGGGAGAGGAGCACACAATTGTTT
GTTCAAAAATAACTGTTTACCCTAGAGTTGAAGGTGTCTCTTCTCCACTGGTCTTAGATGGGGCGGATCTTAAGTTACTA
TCAATCAAAATTGATGGCAAGGAACTGAAGAAAGAAGAGTACCACCTGGATTCACGCCATCTGACACTCTCATCAGCCCC
AAGTGCAAGATTTACTTTAGAGATTGTTACCGAGATATATCCTCAGAAAAATACATCTTTGGAGGGGCTCTATAAATCCT
CTGGAAATTTTTGTACACAGTGTGAAGCTGAGGGTTTTCGGAAGATAACATTTTACCAGGATCGACCTGATATTATGGCT
AAATACACATGCCGGGTCGAAGGCGATAAGACTCTATATCCAGTATTGCTTTCAAATGGAAACCTCATAGAGCAAGGAGA
CCTCGAGGGTGGTAAACATTATGCCCTTTGGGAGGATCCACACAAGAAACCATGCTACTTGTTTGCATTGGTTGCTGGAC
AGTTGCAGAGTAGGGATGACTCTTTCATCACTCGGTCGGGACGGGAGGTCTCTCTTAGGATCTGGACCCCTGCAGAGGAT
CTACCAAAAACTGTTCATGCCATGTATTCTCTCAAGGCAGCTATGAAGTGGGACGAGGATGTTTTCGGTCTTGAGTATGA
CTTGGATCTTTTTAATATTGTGGCGGTTCCAGATTTTAACATGGGAGCCATGGAGAACAAGAGTTTGAATATATTTAATT
CGAAGCTTGTTTTGGCATCACCTGAGACTGCTTCAGATGCTGATTATGCTGCAATTCTAGGGGTTATTGGTCATGAGTAT
TTTCATAACTGGACTGGAAACAGGGTGACATGTCGTGACTGGTTCCAGCTGAGTCTGAAGGAAGGTCTAACTGTCTTTCG
GGATCAGGAATTTTCTTCTGACATGGGAAGCCGTTCAGTAAAGAGAATAGAAGATGTTTCAAGGCTGCGTAACTATCAGT
TTCCACAGGATGCTGGTCCTATTGCTCATCCAGTACGACCACATTCTTATATTAAGATGGACAACTTCTACACAGTTACG
GTTTATGAAAAGGGAGCTGAAGTTGTCAGGATGTACAAAACTTTGTTGGGAAGTCAAGGTTTCCGAAGAGGGATGGATCT
TTATTTTAAGAGGCATGATGGCCAAGCTGTAACGTGTGAAGATTTCTATTCTGCAATGCGAGATGCAAATGATGCGGACT
TTGCTAATTTCTTGTTGTGGTACTCTCAGGCAGGAACACCATCCGTGAAGGTTACATCAGTCTACAATTCTGAAGCCAAG
ACATACACTTTGAAGTTCAGTCAAGAGGTGCCTGCTACTCCAGGCCAGGCAGTGAAAGAACCAATGCTCATACCTGTGGC
AGTTGGCCTGCTTGACTCAAGCGGGAAGGATATGCCTCTCAAATCCGTGTATCATGATGGAATGCTTCAGGTGGTTTCCA
CAGATGGGCAACCAGCCTATACTACAGTTCTTCAGATAAAGAAGAAGGAAGAAGAATTCGTGTTCTCTGAAATATCTGAA
CGCCCAGTTCCATCTCTCTTGCGGGGATACAGCGCTCCTATTCGCCTTGACTCTGATCTTACTGATAATCATCTTTTTTT
TCTACTTGCTCATGATTCAGATGAGTTTAATCGTTGGGAGGCGGGGCAAGTGTTGGCTAGAAATTTGATGCTTAGCTTGG
TTGCTGATTTCCAGCAAAACAAACCATTAGCTCTGAAAGCAGACTATGTGCATGGTCTCAGAAGCATACTATCTGACACT
AGCTTGGATAAAGAATTCATTGCTAAGGCAATTACTCTGCCCGGGGCAGGGGAGATCATGGACATGATGGAGATTGCTGA
TCCTGATGCTGTTCACGCTGTCCGATCTTTCATCAGGAAGCATCTTGCTTCTGAACTCAAAGCAGAATTTCTCAGGACGG
TTGAAAACAATAGAAGCTCTGATCCATATGTCTTCGATCATTCCAATATGTCAAGACGCGCTTTGAAGAACGTCGCCCTT
GCCTATCTAGCCTCTTTTGAGGATACCCAGATCACCGAACTTGCTCTGAATGAGTACAGGACTGCTACAAATATGACTGA
CCAGTTTGCAGCTCTAGCAGCCTTGGTCCAAAATCCTGGCAAAACCCGCGATGATGTTCTTGCTGATTTCTATGGCAAGT
GGAAACATGACTATTTGGTTGTGAATAAATGGTTTGCTCTTCAAGCCATGTCAGACATTCCTGGAAATGTTGAAAATGTC
CGCAAGCTGCTGCAGCATCCTGGCTTTGACTTACGCAATCCAAACAAGGTGTACTCACTCATTGGAGGATTCTGCGGGTC
ACCAGTTAATTTCCATGCAAAGGATGGTTCGGGCTATAAATTCTTGGGAGAGATTGTAGTGCAACTTGACAAAATAAACC
CCCAGGTTGCCTCCCGTATGGTGTCTGCCTTCTCGAGGTGGAGGCGATATGATGAGACCAGACAGGCGTTTGCTAAGGCA
CAATTAGAGATGATAATGTCAACCAATGGACTCTCTGAGAATGTGTTTGAAATTGCGTCAAAAAGCTTAGCTGCTTAG