Microexon ID Pp_10:16614304-16614312:+
Species Physcomitrium patens
Coordinates 10:16614304..16614312
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTATCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAAAAATTGTTGGAATCACAAGCAGAACAAGAAGGGACGTCAAGTTTATCTTGGGGCTTACGACGAGGAAGAAGCAGCAGCTCGGGCTTACGACCTCGCTGCT
Microexon-tag Amino Acid Seq WDKNCWNHKQNKKGRQVYLGAYDEEEAAARAYDLAA
Microexon-tag spanning region16614030-16614460
Microexon-tag prediction score0.9605
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c10_24550V3.1x
Reference Transcript ID Pp3c10_24550V3.1
Gene ID Pp3c10_24550
Gene Name NA
Transcript ID Pp3c10_24550V3.1
Protein ID Pp3c10_24550V3.1
Gene ID Pp3c10_24550
Gene Name NA
Pfam domain motif AP2
Motif E-value 3.2e-13
Motif start 55
Motif end 114
Protein seq >Pp3c10_24550V3.1
MELHGRVARAGHSGVMATTFKRTVRQYPARNKTARKPVVEKDSGDARSGPVKRSSGFRGVTRHRWTGRFEAHLWDKNCWN
HKQNKKGRQVYLGAYDEEEAAARAYDLAALKYWGPGTIINFKLEDYERQIQEMAVISPEEYLASLRRKSSGFSRGVSKYR
GVARHHHNGRWEARIGRVDGNKYLYLGTFATQEEAARAYDLAAIEYRGAAAVTNFDLTYYSQNLSAKQGQAKFIIPEVQH
GTSSPSGGATSSCNEMEILGRINRTLESSLDDELQMLRTLGGVTGSNEDLQILEKKSLIPDGASSTDLQFQGNNSQVVNS
GGPRRNSFMFTKVKYPSDDGSSNDLLELKHSSGAPSVHLSSTPVNFGANFTYSSQQLTDNTPQDVTDTVNLCMSVTSNLD
TQIVNMKRKLESGEDSEEFTLDEISDMETTSTIDLLLATEEDDWHLNQYLNEFSWDSHADSESQLLDSSDTTFCDDGVFP
GTCIGLTDYEVISNVLNELPTIPSRHAYTVSAFAA*
CDS seq >Pp3c10_24550V3.1
ATGGAGCTTCATGGGCGTGTGGCCCGTGCGGGGCACTCCGGGGTGATGGCAACAACTTTCAAGAGAACAGTGAGGCAGTA
TCCAGCCAGGAACAAGACCGCCAGAAAGCCTGTGGTGGAGAAGGATTCTGGTGATGCTCGTAGTGGTCCTGTGAAGCGGA
GTTCAGGTTTTCGAGGAGTGACGAGGCACAGGTGGACGGGGAGATTTGAGGCTCACCTCTGGGATAAAAATTGTTGGAAT
CACAAGCAGAACAAGAAGGGACGTCAAGTTTATCTTGGGGCTTACGACGAGGAAGAAGCAGCAGCTCGGGCTTACGACCT
CGCTGCTCTTAAATATTGGGGACCAGGAACTATAATCAACTTTAAGTTGGAGGACTACGAGCGCCAAATTCAGGAAATGG
CTGTAATTTCTCCAGAGGAGTATCTCGCCTCACTGAGAAGGAAGAGCAGTGGTTTCTCAAGGGGCGTTTCGAAATACCGA
GGTGTTGCCAGACACCATCATAATGGACGCTGGGAAGCCCGCATTGGACGCGTCGATGGAAACAAGTACCTCTACCTCGG
CACCTTTGCCACCCAGGAAGAAGCTGCCCGGGCTTATGATTTGGCAGCAATCGAGTACCGAGGTGCGGCTGCAGTTACAA
ACTTTGATCTAACCTACTACAGCCAAAATCTTTCGGCAAAGCAAGGGCAAGCCAAATTCATAATCCCAGAGGTTCAGCAT
GGCACAAGCTCCCCTTCAGGTGGGGCAACCAGCTCATGCAATGAAATGGAGATTTTGGGAAGAATCAATCGAACTCTTGA
GTCCTCACTTGACGATGAACTGCAGATGCTACGAACCCTCGGTGGAGTCACAGGCTCTAATGAGGACCTTCAGATCCTGG
AGAAGAAGAGCCTTATCCCTGATGGAGCCTCCTCAACGGACTTGCAGTTCCAGGGGAACAATAGCCAGGTTGTGAATTCG
GGGGGCCCTAGAAGAAACTCGTTCATGTTCACGAAAGTGAAATACCCAAGCGATGATGGCTCAAGTAACGATTTACTCGA
GTTGAAACACAGCAGTGGTGCTCCTTCTGTACATTTGTCCTCCACACCTGTCAATTTCGGTGCAAATTTTACCTATTCGT
CCCAACAGCTCACGGACAATACACCACAAGATGTCACTGATACCGTGAATCTCTGCATGTCGGTCACATCCAACCTGGAC
ACTCAAATTGTGAATATGAAGAGAAAACTTGAAAGTGGAGAGGACTCCGAAGAGTTCACGCTAGATGAGATCTCAGATAT
GGAGACAACAAGCACCATAGATCTGTTGCTGGCAACGGAGGAAGATGACTGGCATCTCAATCAGTATCTCAATGAGTTTT
CGTGGGACTCACATGCGGATAGCGAATCTCAGCTCTTGGATTCAAGCGATACTACCTTCTGTGACGATGGAGTGTTCCCT
GGCACATGCATCGGCTTGACGGATTACGAAGTTATTTCGAATGTGCTGAATGAGCTGCCGACAATACCGTCGAGACATGC
CTATACAGTGAGCGCATTCGCGGCCTGA
Microexon DNA seq TTTATCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAAAAATTGTTGGAATCACAAGCAGAACAAGAAGGGACGTCAAGTTTATCTTGGGGCTTACGACGAGGAAGAAGCAGCAGCTCGGGCTTACGACCTCGCTGCT
Microexon-tag Amino Acid seq WDKNCWNHKQNKKGRQVYLGAYDEEEAAARAYDLAA
Transcript ID Pp3c10_24550V3.2
Gene ID Pp.2545
Gene Name NA
Pfam domain motif AP2
Motif E-value 3.2e-13
Motif start 55
Motif end 114
Protein seq >Pp3c10_24550V3.2
MELHGRVARAGHSGVMATTFKRTVRQYPARNKTARKPVVEKDSGDARSGPVKRSSGFRGVTRHRWTGRFEAHLWDKNCWN
HKQNKKGRQVYLGAYDEEEAAARAYDLAALKYWGPGTIINFKLEDYERQIQEMAVISPEEYLASLRRKSSGFSRGVSKYR
GVARHHHNGRWEARIGRVDGNKYLYLGTFATQEEAARAYDLAAIEYRGAAAVTNFDLTYYSQNLSAKQGQAKFIIPEVQH
GTSSPSGGATSSCNEMEILGRINRTLESSLDDELQMLRTLGGVTGSNEDLQILEKKSLIPDGASSTDLQFQGNNSQVVNS
GGPRRNSFMFTKVKYPSDDGSSNDLLELKHSSGAPSVHLSSTPVNFGANFTYSSQQLTDNTPQDVTDTVNLCMSVTSNLD
TQIVNMKRKLESGEDSEEFTLDEISDMETTSTIDLLLATEEDDWHLNQYLNEFSWDSHADSESQLLDSSDTTFCDDGVFP
GTCIGLTDYEVISNVLNELPTIPSRHAYTVSAFAA*
CDS seq >Pp3c10_24550V3.2
ATGGAGCTTCATGGGCGTGTGGCCCGTGCGGGGCACTCCGGGGTGATGGCAACAACTTTCAAGAGAACAGTGAGGCAGTA
TCCAGCCAGGAACAAGACCGCCAGAAAGCCTGTGGTGGAGAAGGATTCTGGTGATGCTCGTAGTGGTCCTGTGAAGCGGA
GTTCAGGTTTTCGAGGAGTGACGAGGCACAGGTGGACGGGGAGATTTGAGGCTCACCTCTGGGATAAAAATTGTTGGAAT
CACAAGCAGAACAAGAAGGGACGTCAAGTTTATCTTGGGGCTTACGACGAGGAAGAAGCAGCAGCTCGGGCTTACGACCT
CGCTGCTCTTAAATATTGGGGACCAGGAACTATAATCAACTTTAAGTTGGAGGACTACGAGCGCCAAATTCAGGAAATGG
CTGTAATTTCTCCAGAGGAGTATCTCGCCTCACTGAGAAGGAAGAGCAGTGGTTTCTCAAGGGGCGTTTCGAAATACCGA
GGTGTTGCCAGACACCATCATAATGGACGCTGGGAAGCCCGCATTGGACGCGTCGATGGAAACAAGTACCTCTACCTCGG
CACCTTTGCCACCCAGGAAGAAGCTGCCCGGGCTTATGATTTGGCAGCAATCGAGTACCGAGGTGCGGCTGCAGTTACAA
ACTTTGATCTAACCTACTACAGCCAAAATCTTTCGGCAAAGCAAGGGCAAGCCAAATTCATAATCCCAGAGGTTCAGCAT
GGCACAAGCTCCCCTTCAGGTGGGGCAACCAGCTCATGCAATGAAATGGAGATTTTGGGAAGAATCAATCGAACTCTTGA
GTCCTCACTTGACGATGAACTGCAGATGCTACGAACCCTCGGTGGAGTCACAGGCTCTAATGAGGACCTTCAGATCCTGG
AGAAGAAGAGCCTTATCCCTGATGGAGCCTCCTCAACGGACTTGCAGTTCCAGGGGAACAATAGCCAGGTTGTGAATTCG
GGGGGCCCTAGAAGAAACTCGTTCATGTTCACGAAAGTGAAATACCCAAGCGATGATGGCTCAAGTAACGATTTACTCGA
GTTGAAACACAGCAGTGGTGCTCCTTCTGTACATTTGTCCTCCACACCTGTCAATTTCGGTGCAAATTTTACCTATTCGT
CCCAACAGCTCACGGACAATACACCACAAGATGTCACTGATACCGTGAATCTCTGCATGTCGGTCACATCCAACCTGGAC
ACTCAAATTGTGAATATGAAGAGAAAACTTGAAAGTGGAGAGGACTCCGAAGAGTTCACGCTAGATGAGATCTCAGATAT
GGAGACAACAAGCACCATAGATCTGTTGCTGGCAACGGAGGAAGATGACTGGCATCTCAATCAGTATCTCAATGAGTTTT
CGTGGGACTCACATGCGGATAGCGAATCTCAGCTCTTGGATTCAAGCGATACTACCTTCTGTGACGATGGAGTGTTCCCT
GGCACATGCATCGGCTTGACGGATTACGAAGTTATTTCGAATGTGCTGAATGAGCTGCCGACAATACCGTCGAGACATGC
CTATACAGTGAGCGCATTCGCGGCCTGA