Microexon ID Ps_NC_039360.1:143793584-143793592:-
Species Papaver somniferum
Coordinates NC_039360.1:143793584..143793592
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TGTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGCTGTAGAAGAGAAGGGCAAAGTAGGAAAGGACGTCAAGTGTACTTGGGTGGGTATGATAAGGAAGATAAAGCTGCAAGAGCGTATGATTTAGCGGCT
Microexon-tag Amino Acid Seq WDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAA
Microexon-tag spanning region143793421-143794427
Microexon-tag prediction score0.9691
Overlapped with the annotated transcript (%) 100
New Transcript ID XM_026601031.1x
Reference Transcript ID XM_026601031.1
Gene ID NA
Gene Name NA
Transcript ID XM_026601031.1
Protein ID XP_026456816.1
Gene ID LOC113357587
Gene Name NA
Pfam domain motif AP2
Motif E-value 2.9e-12
Motif start 279
Motif end 337
Protein seq >XP_026456816.1
MTPTNWLSFSLSPLDLYQSTNTTHHHQDAAPTTTATTPKYMPYESLSSDSSHQYYNFDSLYANNNNDYWTNSMRAATNQD
SHGYTAVEEIQDIKEIKTGDFSFLTSLNHQQIVPKLEDFLGGDSVSASSMHHQQHSQTETQEDSALSNVYHEQQSTCYYG
GDQQDLKAITGFHQGTFSSANSGSEVDDSSVSVTGSMGFHPHHHQQALSLESSRNDLLYSDTTTPTTNNNNNNSNQRLSL
AVSSNKTQNSEKAIVAVGDANSESCKKITPDTFGQRTSIYRGVTRHRWTGRYEAHLWDNSCRREGQSRKGRQVYLGGYDK
EDKAARAYDLAALKYWGPTATTNFPVTNYSKEMEDMKTMTKQEFIASLRRKSSGFSRGASMYRGVTRHHQQGRWQARIGR
VAGNKDLYLGTFGTEEEAAEAYDIAAIKFRGLNAVTNFEMNRYNVEAITNNELPIGGAAKRLKLSLEAESETQSPQVHRE
QPPTGSCCSSISTVPGVYQDSATTLYHQHLFHQLSYTNGTSSHPVIAPIMSSPSEFYGRGMIYGSTTPFNISSHFNNSIN
NTEANLNQPWNMTASYNNN*
CDS seq >XM_026601031.1
ATGACGCCTACAAACTGGCTATCGTTTTCTCTATCACCGTTAGACCTCTACCAATCAACCAATACTACTCATCATCATCA
AGATGCTGCTCCTACTACTACTGCTACGACACCAAAATACATGCCTTACGAAAGCTTATCTTCCGATTCTTCCCACCAGT
ACTACAACTTTGATTCTTTGTATGCCAACAACAACAATGATTACTGGACGAATAGTATGAGGGCAGCAACGAATCAAGAT
TCTCATGGCTACACAGCAGTTGAAGAAATTCAAGATATAAAAGAAATCAAAACAGGGGATTTCTCATTTTTAACAAGTTT
GAATCATCAACAGATAGTACCAAAACTTGAGGATTTTCTCGGTGGTGATTCTGTATCTGCTTCATCCATGCATCATCAAC
AACATAGTCAAACAGAAACTCAGGAGGATTCAGCATTAAGTAATGTTTATCATGAACAACAAAGTACTTGTTATTATGGT
GGAGATCAACAAGATCTTAAAGCAATCACTGGTTTCCATCAGGGGACTTTTTCTTCGGCTAATTCTGGTTCTGAAGTTGA
TGATTCATCTGTTTCTGTAACTGGTTCAATGGGTTTCCATCCTCATCATCACCAGCAAGCTTTATCTCTTGAATCCTCCA
GAAATGATTTACTTTACTCAGACACCACTACACCCACTACTAACAATAATAACAACAACAGCAACCAGAGATTGTCATTA
GCGGTTTCCAGTAATAAAACTCAGAATTCAGAAAAAGCTATTGTGGCCGTGGGTGATGCTAATTCCGAGTCATGTAAGAA
AATTACTCCTGATACTTTCGGTCAGAGAACTTCCATTTACAGAGGTGTCACAAGACATAGATGGACAGGTAGATATGAAG
CTCATCTATGGGATAATAGCTGTAGAAGAGAAGGGCAAAGTAGGAAAGGACGTCAAGTGTACTTGGGTGGGTATGATAAG
GAAGATAAAGCTGCAAGAGCGTATGATTTAGCGGCTTTGAAATACTGGGGTCCAACTGCAACAACTAATTTTCCGGTTAC
AAATTACTCCAAGGAAATGGAAGATATGAAGACAATGACCAAGCAAGAGTTCATTGCTTCCCTTAGAAGGAAGAGTAGTG
GATTCTCAAGAGGAGCTTCTATGTACAGAGGAGTAACAAGGCATCACCAGCAAGGCCGATGGCAAGCCAGGATAGGCCGA
GTTGCCGGGAATAAAGATCTCTACCTTGGTACATTTGGAACTGAAGAGGAAGCAGCGGAGGCATATGACATAGCAGCAAT
AAAGTTCAGAGGCTTGAATGCTGTAACCAATTTTGAGATGAACAGATACAATGTTGAAGCCATCACCAATAATGAACTTC
CCATTGGTGGGGCTGCAAAGCGTCTAAAGCTCTCCCTCGAAGCCGAATCAGAAACACAATCACCACAGGTACACCGTGAA
CAACCCCCAACCGGTAGTTGTTGTAGCAGCATTTCCACTGTTCCTGGTGTGTACCAAGATTCTGCAACCACCTTATATCA
TCAGCATCTCTTCCACCAGCTTAGTTACACAAATGGCACCTCCTCCCATCCGGTCATTGCTCCAATAATGTCTTCACCAT
CTGAGTTTTACGGGCGGGGTATGATTTATGGAAGCACCACACCGTTTAACATCAGCAGTCACTTTAACAACAGCATCAAC
AATACTGAAGCTAACCTTAACCAACCCTGGAATATGACGGCATCTTACAACAATAACTGA
Microexon DNA seq TGTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGCTGTAGAAGAGAAGGGCAAAGTAGGAAAGGACGTCAAGTGTACTTGGGTGGGTATGATAAGGAAGATAAAGCTGCAAGAGCGTATGATTTAGCGGCT
Microexon-tag Amino Acid seq WDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAA
Transcript ID XM_026601031.1
Gene ID Ps.19728
Gene Name NA
Pfam domain motif AP2
Motif E-value 2.9e-12
Motif start 279
Motif end 337
Protein seq >XM_026601031.1
MTPTNWLSFSLSPLDLYQSTNTTHHHQDAAPTTTATTPKYMPYESLSSDSSHQYYNFDSLYANNNNDYWTNSMRAATNQD
SHGYTAVEEIQDIKEIKTGDFSFLTSLNHQQIVPKLEDFLGGDSVSASSMHHQQHSQTETQEDSALSNVYHEQQSTCYYG
GDQQDLKAITGFHQGTFSSANSGSEVDDSSVSVTGSMGFHPHHHQQALSLESSRNDLLYSDTTTPTTNNNNNNSNQRLSL
AVSSNKTQNSEKAIVAVGDANSESCKKITPDTFGQRTSIYRGVTRHRWTGRYEAHLWDNSCRREGQSRKGRQVYLGGYDK
EDKAARAYDLAALKYWGPTATTNFPVTNYSKEMEDMKTMTKQEFIASLRRKSSGFSRGASMYRGVTRHHQQGRWQARIGR
VAGNKDLYLGTFGTEEEAAEAYDIAAIKFRGLNAVTNFEMNRYNVEAITNNELPIGGAAKRLKLSLEAESETQSPQVHRE
QPPTGSCCSSISTVPGVYQDSATTLYHQHLFHQLSYTNGTSSHPVIAPIMSSPSEFYGRGMIYGSTTPFNISSHFNNSIN
NTEANLNQPWNMTASYNNN*
CDS seq >XM_026601031.1
ATGACGCCTACAAACTGGCTATCGTTTTCTCTATCACCGTTAGACCTCTACCAATCAACCAATACTACTCATCATCATCA
AGATGCTGCTCCTACTACTACTGCTACGACACCAAAATACATGCCTTACGAAAGCTTATCTTCCGATTCTTCCCACCAGT
ACTACAACTTTGATTCTTTGTATGCCAACAACAACAATGATTACTGGACGAATAGTATGAGGGCAGCAACGAATCAAGAT
TCTCATGGCTACACAGCAGTTGAAGAAATTCAAGATATAAAAGAAATCAAAACAGGGGATTTCTCATTTTTAACAAGTTT
GAATCATCAACAGATAGTACCAAAACTTGAGGATTTTCTCGGTGGTGATTCTGTATCTGCTTCATCCATGCATCATCAAC
AACATAGTCAAACAGAAACTCAGGAGGATTCAGCATTAAGTAATGTTTATCATGAACAACAAAGTACTTGTTATTATGGT
GGAGATCAACAAGATCTTAAAGCAATCACTGGTTTCCATCAGGGGACTTTTTCTTCGGCTAATTCTGGTTCTGAAGTTGA
TGATTCATCTGTTTCTGTAACTGGTTCAATGGGTTTCCATCCTCATCATCACCAGCAAGCTTTATCTCTTGAATCCTCCA
GAAATGATTTACTTTACTCAGACACCACTACACCCACTACTAACAATAATAACAACAACAGCAACCAGAGATTGTCATTA
GCGGTTTCCAGTAATAAAACTCAGAATTCAGAAAAAGCTATTGTGGCCGTGGGTGATGCTAATTCCGAGTCATGTAAGAA
AATTACTCCTGATACTTTCGGTCAGAGAACTTCCATTTACAGAGGTGTCACAAGACATAGATGGACAGGTAGATATGAAG
CTCATCTATGGGATAATAGCTGTAGAAGAGAAGGGCAAAGTAGGAAAGGACGTCAAGTGTACTTGGGTGGGTATGATAAG
GAAGATAAAGCTGCAAGAGCGTATGATTTAGCGGCTTTGAAATACTGGGGTCCAACTGCAACAACTAATTTTCCGGTTAC
AAATTACTCCAAGGAAATGGAAGATATGAAGACAATGACCAAGCAAGAGTTCATTGCTTCCCTTAGAAGGAAGAGTAGTG
GATTCTCAAGAGGAGCTTCTATGTACAGAGGAGTAACAAGGCATCACCAGCAAGGCCGATGGCAAGCCAGGATAGGCCGA
GTTGCCGGGAATAAAGATCTCTACCTTGGTACATTTGGAACTGAAGAGGAAGCAGCGGAGGCATATGACATAGCAGCAAT
AAAGTTCAGAGGCTTGAATGCTGTAACCAATTTTGAGATGAACAGATACAATGTTGAAGCCATCACCAATAATGAACTTC
CCATTGGTGGGGCTGCAAAGCGTCTAAAGCTCTCCCTCGAAGCCGAATCAGAAACACAATCACCACAGGTACACCGTGAA
CAACCCCCAACCGGTAGTTGTTGTAGCAGCATTTCCACTGTTCCTGGTGTGTACCAAGATTCTGCAACCACCTTATATCA
TCAGCATCTCTTCCACCAGCTTAGTTACACAAATGGCACCTCCTCCCATCCGGTCATTGCTCCAATAATGTCTTCACCAT
CTGAGTTTTACGGGCGGGGTATGATTTATGGAAGCACCACACCGTTTAACATCAGCAGTCACTTTAACAACAGCATCAAC
AATACTGAAGCTAACCTTAACCAACCCTGGAATATGACGGCATCTTACAACAATAACTGA