Microexon ID Pp_21:8329911-8329925:-
Species Physcomitrium patens
Coordinates 21:8329911..8329925
Microexon Cluster ID MEP45
Size 15
Phase 2
Pfam Domain Motif RPE65
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 47,15,46
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq AGRGTTGGKCCTAAYCCMAAGTTTGYYCCWGTKGCTGGATAYCAYTGGTTTGATGGAGATGGMATGATTCATGSYWTGCGYATYAAAGATGGAAAAGCWACWTATGTY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTTTGATGGGGACGG
Microexon Amino Acid seq WFDGDG
Microexon-tag DNA Seq CGTGTTGGGCCCAACCCAAGATTGAAACCTATCTCTGGATATCACTGGTTTGATGGGGACGGGATGATGCATGGTTTAAACATTAAGGATGGGAAAGCCACATATGTG
Microexon-tag Amino Acid Seq RVGPNPRLKPISGYHWFDGDGMMHGLNIKDGKATYV
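The microexon fields above can be cross-checked against one another with a short script. The following is a minimal sketch, assuming the standard genetic code and assuming that the three sizes in the structure field (47,15,46) are consecutive segment lengths of the microexon-tag sequence. All variable and function names are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string codon by codon (unambiguous bases only)."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Values copied from the record above.
tag = "CGTGTTGGGCCCAACCCAAGATTGAAACCTATCTCTGGATATCACTGGTTTGATGGGGACGGGATGATGCATGGTTTAAACATTAAGGATGGGAAAGCCACATATGTG"
sizes = (47, 15, 46)            # flanking exon, microexon, flanking exon
start, end = 8329911, 8329925   # microexon coordinates on chromosome 21

left_len, micro_len, right_len = sizes
assert len(tag) == sum(sizes)          # the tag is exactly the three segments
assert end - start + 1 == micro_len    # inclusive coordinates give the size

# Assumption: the segments are laid out left to right in the tag.
microexon = tag[left_len:left_len + micro_len]

# Translate the whole tag, then keep every codon that overlaps the microexon.
tag_protein = translate(tag)
first_codon = left_len // 3
last_codon = (left_len + micro_len - 1) // 3
microexon_aa = tag_protein[first_codon:last_codon + 1]

# Number of nucleotides of the first overlapping codon supplied by the
# upstream flanking exon.
codon_offset = left_len % 3

print(microexon)
print(tag_protein)
print(microexon_aa)
print(codon_offset)
```

Under these assumptions the slice reproduces the listed microexon DNA sequence, and the six codons that overlap it give the listed amino acid sequence. The codon offset of 2 is consistent with the Phase value of the record, although how the database defines phase is not stated here.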
Microexon-tag spanning region 8329753-8330244
Microexon-tag prediction score 0.9125
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c21_12920V3.1x
Reference Transcript ID Pp3c21_12920V3.1
Gene ID Pp3c21_12920
Gene Name NA
Transcript ID Pp3c21_12920V3.1
Protein ID Pp3c21_12920V3.1
Pfam domain motif RPE65
Motif E-value 4.5e-118
Motif start 145
Motif end 614
Protein seq >Pp3c21_12920V3.1
MDTGMASSYLTTSLFPGSNIPSLSKSTVSLPFPLQLFGAHGMAEVHWRRGLSVAASTQNPSPWVAEPDVKEGDRERDDGY
GAGSLNHVYYNPKKVEFRRPKVGSWAASVCDFVERAVVWAFDAGLSDKKAYLLFDNYAPVEELGPVGDLPIIGTIPECLN
GEFVRVGPNPRLKPISGYHWFDGDGMMHGLNIKDGKATYVARFVKTSRLQQEEYYGAAKFLKIGQLKGLRGLFYIAVEKL
RLSLGVLDESNGLYGGNTAFVYHNNRLLALHEIDKPYAIRVLDDGDLQTMGLQDYEQRLGHSFTAHPKIDSTTGELFMFA
YQEKVPYLIYRVISKNGTLREPVPITIPECVMMHDFAITENYALFMDLPLRFNPTGLPKGEFIFKFDPSKESRFGILPRY
ATDESQIRWCKIPTCFIFHNANAWEEGDEVVLITCRMPGIELDLELEFKKEKAWSVFSKLFEFRMNLKTGEVKQRQLSSL
STDFPRINEEYTGRKTRFVYCGVFDELNRVIGVVKYDLDEEPCLTTGDLQKGGNVAGVFSLGPGRSGSEAIFVPLKRGME
GPEDDGYLILFVYDENKGTSEAVIIDAKTMAADPVATVKLPRRVPYGFHAHFVSQEQLMQQA*
CDS seq >Pp3c21_12920V3.1
ATGGATACTGGAATGGCATCCTCGTACCTCACGACAAGCTTGTTTCCGGGGAGCAACATTCCAAGTCTCTCAAAGTCGAC
AGTGTCACTTCCATTCCCGCTGCAGCTCTTTGGAGCGCATGGAATGGCTGAGGTTCACTGGAGGAGAGGGCTATCAGTGG
CTGCGTCCACGCAGAACCCGTCGCCATGGGTTGCGGAACCGGATGTGAAAGAAGGGGATAGGGAGCGGGATGATGGTTAT
GGGGCTGGCAGCTTAAATCATGTTTACTACAATCCGAAGAAGGTTGAGTTTAGGAGGCCGAAGGTTGGTAGCTGGGCGGC
GTCGGTGTGTGATTTTGTTGAGAGGGCGGTAGTGTGGGCGTTTGATGCCGGTCTCAGCGATAAGAAGGCGTACTTGTTGT
TCGATAACTATGCGCCCGTGGAGGAATTGGGTCCGGTGGGGGATTTGCCCATTATCGGCACCATTCCGGAATGTTTGAAT
GGCGAGTTCGTGCGTGTTGGGCCCAACCCAAGATTGAAACCTATCTCTGGATATCACTGGTTTGATGGGGACGGGATGAT
GCATGGTTTAAACATTAAGGATGGGAAAGCCACATATGTGGCCCGATTCGTGAAAACTTCACGGTTACAGCAAGAGGAAT
ATTACGGTGCAGCCAAATTTTTGAAGATTGGACAACTGAAAGGGCTTCGGGGGCTATTTTACATTGCTGTGGAAAAACTA
CGACTTAGCTTGGGCGTCCTTGACGAATCCAATGGCTTATATGGAGGCAACACAGCCTTCGTGTATCACAATAATAGGCT
TCTTGCGTTACACGAGATCGACAAACCTTATGCTATCAGGGTGTTGGACGATGGGGACCTGCAAACAATGGGCCTCCAAG
ATTACGAGCAGAGACTGGGACATTCATTCACAGCCCACCCAAAAATAGATTCCACCACAGGAGAATTGTTTATGTTTGCA
TACCAAGAGAAGGTCCCATATCTGATCTATCGTGTAATATCCAAGAATGGTACTCTTCGGGAGCCGGTGCCCATTACCAT
CCCCGAATGCGTCATGATGCACGACTTCGCCATCACCGAGAACTATGCCCTGTTCATGGACCTTCCTCTTCGATTCAACC
CCACGGGTTTGCCAAAAGGGGAGTTCATTTTCAAGTTTGATCCCAGCAAAGAATCACGGTTTGGAATACTGCCCCGGTAC
GCCACCGATGAGTCGCAGATTCGCTGGTGTAAGATCCCCACTTGTTTCATCTTTCACAATGCGAATGCATGGGAAGAGGG
GGATGAAGTTGTGCTTATAACGTGCCGAATGCCCGGCATCGAACTTGACTTGGAATTGGAATTCAAGAAAGAAAAGGCTT
GGAGTGTGTTCTCCAAATTATTCGAGTTTAGGATGAACCTTAAGACCGGAGAGGTGAAGCAACGACAACTCTCAAGTTTA
AGCACAGATTTTCCCAGAATCAACGAGGAATACACTGGCAGAAAGACCCGGTTTGTCTACTGTGGAGTATTTGACGAGCT
AAACAGGGTAATAGGTGTTGTCAAGTATGACCTTGACGAGGAGCCATGCCTCACGACAGGTGACCTGCAAAAGGGTGGCA
ACGTGGCGGGAGTGTTCAGCCTTGGACCAGGTCGCTCAGGAAGCGAGGCAATTTTTGTGCCACTCAAGCGTGGTATGGAG
GGACCCGAAGACGACGGCTACTTGATCCTGTTTGTGTACGATGAGAACAAAGGGACATCAGAAGCGGTGATCATTGATGC
CAAGACGATGGCTGCAGATCCCGTGGCAACCGTGAAATTGCCCAGACGGGTTCCTTACGGATTTCATGCCCATTTTGTTA
GCCAGGAGCAACTGATGCAACAGGCATGA
Transcript ID Pp3c21_12920V3.1
Gene ID Pp.13592
Gene Name NA