Microexon ID Pp_18:12953518-12953532:+
Species Physcomitrium patens
Coordinates 18:12953518..12953532
Microexon Cluster ID MEP45
Size 15
Phase 2
Pfam Domain Motif RPE65
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 47,15,46
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq AGRGTTGGKCCTAAYCCMAAGTTTGYYCCWGTKGCTGGATAYCAYTGGTTTGATGGAGATGGMATGATTCATGSYWTGCGYATYAAAGATGGAAAAGCWACWTATGTY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTTCGACGGTGACGG
Microexon Amino Acid seq WFDGDG
Microexon-tag DNA Seq CGCGTGGGCCCGAACCCAAGATTCGAGGCTCTTGGTGGATACCACTGGTTCGACGGTGACGGGATGATACATGGATTACATTTGAGTGAAGGCAAGGCCACTTACGTG
Microexon-tag Amino Acid Seq RVGPNPRFEALGGYHWFDGDGMIHGLHLSEGKATYV
Microexon-tag spanning region12953156-12953744
Microexon-tag prediction score0.8832
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c18_17950V3.1x
Reference Transcript ID Pp3c18_17950V3.1
Gene ID Pp3c18_17950
Gene Name NA
Transcript ID Pp3c18_17950V3.1
Protein ID Pp3c18_17950V3.1
Gene ID Pp3c18_17950
Gene Name NA
Pfam domain motif RPE65
Motif E-value 9e-115
Motif start 158
Motif end 644
Protein seq >Pp3c18_17950V3.1
MAESLSPSRHIGAVGAQELPQLVGGSRDSLASSSGHGLRLQRRVSVSRPSLLPCLRRKSGVDHRHKHTTITAVGTSKWKD
PVSDPEIRLPEGITSIPSLIPSWRKKDENRSNKQKVVVRRSLINWAALFCDFLEKMIYETAKGSTSKQEWNYYLSASFAP
VSERAPTTALRIIGTIPECMFGEFFRVGPNPRFEALGGYHWFDGDGMIHGLHLSEGKATYVVRYVRTSRLQQEERYGAPK
FWKVGDMKGVKGYFCIALENLRRSLGVLNVSSGYAGTGNTSLVFHNKKLLALHERDKPYRIKVLEDGDLVTVGLEDFDKR
LQHSFAAHPKIDPVTGEMFFFGYHSESSDLIYRTVSKEGVLRDPVLIKLPTTTITHDFAITENYAILMDLPLVLDPLGMA
QGGFIFRFDPNKESRLGVLPRYATDDSQIRWFTIPTCYILHTVAAWEEEDEIILICCRTDGIDLNPDFGVERKNKAYGDS
YGSPTLYEYRMNFKNGHVHQRQLSNLSIEFPTINPHYVGRKTRYTYCGVVDNELDLMSGIVKYDLYLDPSTTDTAVDDKT
KGNNCTVFHFGPNCYASDTIFVPNNRGMANDAAEDDGFLISFVQDLNTGKSEAIIIDAKTMGPTPVAVVELPGRIPKGFH
AYFVTQEQLQQQA*
CDS seq >Pp3c18_17950V3.1
ATGGCTGAGTCGCTGAGTCCCTCGAGACACATTGGTGCAGTGGGGGCACAAGAATTGCCGCAGCTTGTTGGGGGTTCAAG
AGACAGCCTTGCTTCGTCTTCAGGGCATGGATTGAGGCTCCAGCGGCGAGTTTCTGTCTCAAGGCCAAGTCTACTACCTT
GTCTACGGCGGAAGAGTGGGGTGGATCACCGGCATAAACACACTACGATTACTGCGGTGGGGACATCAAAGTGGAAAGAT
CCAGTGAGTGATCCAGAGATTCGTCTGCCGGAGGGGATTACATCAATACCATCGCTGATTCCGAGCTGGAGGAAGAAGGA
TGAGAACAGGAGCAACAAGCAGAAGGTCGTGGTGCGCAGGTCGCTAATCAACTGGGCAGCGTTGTTCTGTGATTTTCTCG
AAAAAATGATATATGAAACCGCGAAGGGATCGACATCCAAGCAGGAATGGAACTACTATCTGTCGGCCAGCTTTGCTCCT
GTGTCAGAGAGGGCACCAACAACAGCCCTTCGCATCATCGGTACCATACCTGAGTGCATGTTCGGTGAGTTTTTTCGCGT
GGGCCCGAACCCAAGATTCGAGGCTCTTGGTGGATACCACTGGTTCGACGGTGACGGGATGATACATGGATTACATTTGA
GTGAAGGCAAGGCCACTTACGTGGTGCGTTATGTTCGAACATCTCGACTGCAACAAGAGGAGCGTTACGGAGCACCAAAA
TTCTGGAAGGTTGGAGACATGAAGGGCGTAAAAGGATATTTCTGTATTGCATTAGAGAACCTGCGGAGGAGCTTGGGTGT
CTTGAACGTATCCAGCGGCTATGCAGGCACCGGCAACACGTCTCTCGTCTTCCATAATAAGAAACTTTTGGCTCTACATG
AACGAGACAAACCTTATAGAATCAAGGTGTTGGAGGACGGTGACCTCGTAACAGTGGGTCTCGAAGATTTCGACAAGAGA
TTACAACACTCCTTTGCAGCCCATCCAAAGATTGATCCCGTCACAGGTGAGATGTTCTTTTTCGGGTACCATTCCGAGTC
TTCAGACTTGATCTACCGCACTGTATCCAAAGAAGGCGTGCTCAGAGACCCAGTGCTTATAAAATTGCCAACCACCACCA
TTACGCATGACTTCGCAATTACAGAGAACTATGCCATCCTCATGGACCTCCCTCTTGTCTTAGATCCTCTGGGCATGGCG
CAAGGAGGATTCATCTTCAGATTTGATCCCAATAAAGAATCACGACTAGGAGTCTTACCACGATATGCAACTGACGATTC
ACAGATCCGTTGGTTCACAATCCCAACATGTTATATCTTGCACACTGTTGCTGCATGGGAAGAGGAGGATGAGATTATTT
TGATTTGCTGTAGAACGGATGGTATCGACCTAAATCCAGACTTCGGAGTGGAGAGAAAGAACAAGGCCTACGGAGATTCC
TATGGGTCTCCTACACTGTATGAGTACAGAATGAACTTCAAAAACGGTCATGTCCACCAAAGGCAGCTCTCAAATCTATC
CATTGAATTTCCTACAATTAATCCACATTATGTAGGAAGGAAAACCCGCTACACTTATTGTGGTGTAGTTGATAATGAAT
TGGATCTAATGTCAGGTATAGTGAAGTATGACCTATACTTGGATCCATCAACAACTGATACAGCAGTGGATGACAAAACC
AAAGGTAACAATTGCACAGTGTTTCACTTTGGACCCAATTGCTATGCAAGTGACACAATTTTTGTACCCAACAACAGAGG
TATGGCCAATGATGCAGCAGAAGACGATGGCTTCTTGATATCATTTGTACAAGACTTAAACACAGGGAAGTCTGAAGCTA
TCATCATTGATGCAAAGACAATGGGGCCAACACCAGTGGCAGTTGTAGAATTGCCTGGACGGATTCCTAAAGGTTTCCAT
GCTTATTTTGTGACACAGGAGCAGCTTCAACAACAAGCATAA
Microexon DNA seq GTTCGACGGTGACGG
Microexon Amino Acid seq WFDGDG
Microexon-tag DNA Seq CGCGTGGGCCCGAACCCAAGATTCGAGGCTCTTGGTGGATACCACTGGTTCGACGGTGACGGGATGATACATGGATTACATTTGAGTGAAGGCAAGGCCACTTACGTG
Microexon-tag Amino Acid seq RVGPNPRFEALGGYHWFDGDGMIHGLHLSEGKATYV
Transcript ID Pp.9789.1
Gene ID Pp.9789
Gene Name NA
Pfam domain motif RPE65
Motif E-value 1.3e-101
Motif start 158
Motif end 615
Protein seq >Pp.9789.1
MAESLSPSRHIGAVGAQELPQLVGGSRDSLASSSGHGLRLQRRVSVSRPSLLPCLRRKSGVDHRHKHTTITAVGTSKWKD
PVSDPEIRLPEGITSIPSLIPSWRKKDENRSNKQKVVVRRSLINWAALFCDFLEKMIYETAKGSTSKQEWNYYLSASFAP
VSERAPTTALRIIGTIPECMFGEFFRVGPNPRFEALGGYHWFDGDGMIHGLHLSEGKATYVVRYVRTSRLQQEERYGAPK
FWKVGDMKGVKGYFCIALENLRRSLGVLNVSSGYAGTGNTSLVFHNKKLLALHERDKPYRIKVLEDGDLVTVGLEDFDKR
LQHSFAAHPKIDPVTGEMFFFGYHSESSDLIYRTVSKEGVLRDPVLIKLPTTTITHDFAITENYAILMDLPLVLDPLGMA
QGGFIFRFDPNKESRLGVLPRYATDDSQIRWFTIPTCYILHTVAAWEEEDEIILICCRTDGIDLNPDFGVERKNKAYGDS
YGSPTLYEYRMNFKNGHVHQRQLSNLSIEFPTINPHYVGRKTRYTYCGVVDNELDLMSGIVKYDLYLDPSTTDTAVDDKT
KGNNCTVFHFGPNCYASDTIFVPNNRGMANDAAEDDGFLISFVQDLNTGYGLLIIITKLITTMGLQEKKNVYLYFCLFLS
F*
CDS seq >Pp.9789.1
ATGGCTGAGTCGCTGAGTCCCTCGAGACACATTGGTGCAGTGGGGGCACAAGAATTGCCGCAGCTTGTTGGGGGTTCAAG
AGACAGCCTTGCTTCGTCTTCAGGGCATGGATTGAGGCTCCAGCGGCGAGTTTCTGTCTCAAGGCCAAGTCTACTACCTT
GTCTACGGCGGAAGAGTGGGGTGGATCACCGGCATAAACACACTACGATTACTGCGGTGGGGACATCAAAGTGGAAAGAT
CCAGTGAGTGATCCAGAGATTCGTCTGCCGGAGGGGATTACATCAATACCATCGCTGATTCCGAGCTGGAGGAAGAAGGA
TGAGAACAGGAGCAACAAGCAGAAGGTCGTGGTGCGCAGGTCGCTAATCAACTGGGCAGCGTTGTTCTGTGATTTTCTCG
AAAAAATGATATATGAAACCGCGAAGGGATCGACATCCAAGCAGGAATGGAACTACTATCTGTCGGCCAGCTTTGCTCCT
GTGTCAGAGAGGGCACCAACAACAGCCCTTCGCATCATCGGTACCATACCTGAGTGCATGTTCGGTGAGTTTTTTCGCGT
GGGCCCGAACCCAAGATTCGAGGCTCTTGGTGGATACCACTGGTTCGACGGTGACGGGATGATACATGGATTACATTTGA
GTGAAGGCAAGGCCACTTACGTGGTGCGTTATGTTCGAACATCTCGACTGCAACAAGAGGAGCGTTACGGAGCACCAAAA
TTCTGGAAGGTTGGAGACATGAAGGGCGTAAAAGGATATTTCTGTATTGCATTAGAGAACCTGCGGAGGAGCTTGGGTGT
CTTGAACGTATCCAGCGGCTATGCAGGCACCGGCAACACGTCTCTCGTCTTCCATAATAAGAAACTTTTGGCTCTACATG
AACGAGACAAACCTTATAGAATCAAGGTGTTGGAGGACGGTGACCTCGTAACAGTGGGTCTCGAAGATTTCGACAAGAGA
TTACAACACTCCTTTGCAGCCCATCCAAAGATTGATCCCGTCACAGGTGAGATGTTCTTTTTCGGGTACCATTCCGAGTC
TTCAGACTTGATCTACCGCACTGTATCCAAAGAAGGCGTGCTCAGAGACCCAGTGCTTATAAAATTGCCAACCACCACCA
TTACGCATGACTTCGCAATTACAGAGAACTATGCCATCCTCATGGACCTCCCTCTTGTCTTAGATCCTCTGGGCATGGCG
CAAGGAGGATTCATCTTCAGATTTGATCCCAATAAAGAATCACGACTAGGAGTCTTACCACGATATGCAACTGACGATTC
ACAGATCCGTTGGTTCACAATCCCAACATGTTATATCTTGCACACTGTTGCTGCATGGGAAGAGGAGGATGAGATTATTT
TGATTTGCTGTAGAACGGATGGTATCGACCTAAATCCAGACTTCGGAGTGGAGAGAAAGAACAAGGCCTACGGAGATTCC
TATGGGTCTCCTACACTGTATGAGTACAGAATGAACTTCAAAAACGGTCATGTCCACCAAAGGCAGCTCTCAAATCTATC
CATTGAATTTCCTACAATTAATCCACATTATGTAGGAAGGAAAACCCGCTACACTTATTGTGGTGTAGTTGATAATGAAT
TGGATCTAATGTCAGGTATAGTGAAGTATGACCTATACTTGGATCCATCAACAACTGATACAGCAGTGGATGACAAAACC
AAAGGTAACAATTGCACAGTGTTTCACTTTGGACCCAATTGCTATGCAAGTGACACAATTTTTGTACCCAACAACAGAGG
TATGGCCAATGATGCAGCAGAAGACGATGGCTTCTTGATATCATTTGTACAAGACTTAAACACAGGGTATGGGCTTCTCA
TTATAATCACAAAATTAATTACTACAATGGGTCTTCAAGAGAAAAAAAATGTTTATCTCTACTTTTGTTTGTTTCTTTCT
TTTTAA