Microexon ID Pp_23:10744713-10744721:-
Species Physcomitrium patens
Coordinates 23:10744713..10744721
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ACCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACCGGTCATCATATCATTTCCAACCTCCCAAAAACTGGATGAATGACCCAAATGGACCAATGTACTACGAAGGCTTTTATCACTTATTCTACCAGTACAATCCC
Microexon-tag Amino Acid Seq PYRSSYHFQPPKNWMNDPNGPMYYEGFYHLFYQYNP
Microexon-tag spanning region10744502-10745057
Microexon-tag prediction score0.9456
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c23_16290V3.2x
Reference Transcript ID Pp3c23_16290V3.2
Gene ID Pp3c23_16290
Gene Name NA
Transcript ID Pp3c23_16290V3.2
Protein ID Pp3c23_16290V3.2
Gene ID Pp3c23_16290
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.3e-103
Motif start 114
Motif end 432
Protein seq >Pp3c23_16290V3.2
MYLPCSTKSQFNAVHYHAQREDNSCNLVSVPMLDETSQEGYGLYREVFASRNKKKSAIAFLTIMLIAVVLYTSKIVAPKW
LELFQREKQYLVRDIVRNSAITSCHSQPYRSSYHFQPPKNWMNDPNGPMYYEGFYHLFYQYNPGGAVWGNLTWGHAVSTD
LIHWRDLEPALKPDEWYDNGGVWSGSVTICPDGSPLILYTGVADDLEQSQNLAVPEDLADPLLRKWVKSRENPILRHPVG
IDKEDFRDPTTAWQVNDGTWRILVGAKMGRDGMALLYKSEDLRHWELDENVLHTVPGSGMWECLDFFPIAPFGREGLDTS
VNGPHVKHVLKASMYDDQHDHYAVGTYNLSTESFTPINHALDIQHGLHYDYGKFYASKSFYDPVKKRRIVWGWSNESDSA
AQDIARGWASLQAIPRVLWLDTALGDSLIQAPIEEVDDLRVGKVSKTDVDLEAGSVIKIEGSSGGQVVLLDIEVIFEYPN
VSNVIVQDYGFLNGPFDCGNGGSAQRGVYGPFGLLVLTDDAYQEQTAVFFYIAQDANQRWVTHFCSDQSRSSLLHDIDTT
AFWSDVRVLPTENFLSLRVLVDHSIVESFVQGGRMAITSRVYPKEAVDEKAHVFLFNNSTTQITVRSINVWQMRSITVLP
LA*
CDS seq >Pp3c23_16290V3.2
ATGTACTTACCATGTAGCACGAAGTCGCAATTCAACGCGGTGCATTATCATGCTCAGAGGGAAGATAATTCTTGCAATTT
GGTCTCCGTTCCGATGTTAGATGAAACATCGCAGGAGGGCTATGGCTTGTACCGGGAAGTTTTTGCATCCAGGAACAAGA
AAAAGAGTGCAATTGCTTTCCTCACAATTATGTTGATAGCAGTAGTATTGTACACGAGCAAAATTGTTGCTCCCAAATGG
CTGGAGCTCTTCCAAAGGGAGAAACAGTATCTTGTACGTGATATTGTCAGAAACAGTGCTATTACGAGCTGTCATTCTCA
GCCTTACCGGTCATCATATCATTTCCAACCTCCCAAAAACTGGATGAATGACCCAAATGGACCAATGTACTACGAAGGCT
TTTATCACTTATTCTACCAGTACAATCCCGGAGGAGCTGTGTGGGGCAACCTTACTTGGGGTCATGCCGTCTCCACAGAC
CTTATCCATTGGCGTGATCTTGAGCCTGCCTTGAAACCTGACGAATGGTATGACAATGGAGGCGTATGGTCTGGTTCAGT
CACTATCTGTCCAGACGGATCACCGTTGATTCTATACACAGGTGTAGCAGATGATCTTGAGCAGTCACAGAACTTGGCGG
TACCCGAAGATCTTGCTGATCCTCTTCTTCGCAAGTGGGTGAAAAGTCGAGAGAATCCAATTCTTCGACACCCTGTTGGA
ATTGATAAAGAGGACTTCAGGGATCCAACAACAGCTTGGCAGGTAAATGATGGCACGTGGAGAATACTAGTTGGAGCCAA
AATGGGGAGAGATGGAATGGCGTTGCTTTATAAGAGCGAAGACCTACGTCATTGGGAGCTAGATGAAAATGTGCTGCACA
CAGTTCCTGGCTCGGGAATGTGGGAATGCTTGGACTTCTTTCCTATTGCACCGTTTGGACGAGAAGGGTTGGACACGTCA
GTCAATGGGCCTCATGTCAAACATGTCTTGAAAGCGAGCATGTATGATGATCAGCACGATCATTATGCAGTGGGTACTTA
TAATTTGTCTACAGAATCGTTCACACCTATCAATCATGCTTTGGACATCCAGCACGGCTTGCATTATGACTACGGGAAGT
TTTATGCTTCGAAGTCTTTCTATGACCCAGTGAAGAAGCGTCGCATCGTGTGGGGTTGGTCTAATGAATCCGACAGTGCA
GCTCAAGATATTGCTAGAGGATGGGCATCACTCCAGGCAATACCGAGGGTGTTGTGGTTGGACACTGCATTAGGAGACAG
CTTAATACAGGCACCTATCGAGGAGGTCGATGATTTGAGGGTCGGCAAGGTTTCCAAAACAGATGTGGATTTGGAGGCAG
GCAGTGTCATCAAAATCGAGGGATCCTCTGGAGGCCAGGTGGTTTTGCTGGATATTGAAGTCATATTTGAGTACCCCAAT
GTCTCCAACGTGATAGTTCAAGATTATGGTTTCCTGAATGGACCATTTGACTGTGGTAATGGAGGATCAGCCCAGCGAGG
TGTATATGGCCCATTTGGATTGCTGGTGCTCACAGATGACGCTTACCAAGAGCAAACTGCTGTATTTTTCTACATCGCCC
AGGACGCCAATCAGCGTTGGGTCACGCACTTTTGCAGTGATCAGAGCAGATCTTCTCTTTTGCATGATATTGACACAACT
GCGTTTTGGAGTGATGTTCGCGTCTTGCCTACTGAGAACTTTCTGTCTCTGCGTGTCCTGGTGGATCACTCTATCGTTGA
AAGTTTTGTTCAAGGTGGACGAATGGCTATAACATCGCGTGTTTATCCTAAAGAAGCGGTGGACGAGAAAGCTCATGTAT
TCCTCTTCAACAACAGCACAACGCAGATTACAGTTCGCAGCATCAATGTCTGGCAGATGCGGAGTATTACCGTACTCCCC
CTTGCATGA
Microexon DNA seq ACCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACCGGTCATCATATCATTTCCAACCTCCCAAAAACTGGATGAATGACCCAAATGGACCAATGTACTACGAAGGCTTTTATCACTTATTCTACCAGTACAATCCC
Microexon-tag Amino Acid seq PYRSSYHFQPPKNWMNDPNGPMYYEGFYHLFYQYNP
Transcript ID Pp.15272.1
Gene ID Pp.15272
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.3e-103
Motif start 114
Motif end 432
Protein seq >Pp.15272.1
MYLPCSTKSQFNAVHYHAQREDNSCNLVSVPMLDETSQEGYGLYREVFASRNKKKSAIAFLTIMLIAVVLYTSKIVAPKW
LELFQREKQYLVRDIVRNSAITSCHSQPYRSSYHFQPPKNWMNDPNGPMYYEGFYHLFYQYNPGGAVWGNLTWGHAVSTD
LIHWRDLEPALKPDEWYDNGGVWSGSVTICPDGSPLILYTGVADDLEQSQNLAVPEDLADPLLRKWVKSRENPILRHPVG
IDKEDFRDPTTAWQVNDGTWRILVGAKMGRDGMALLYKSEDLRHWELDENVLHTVPGSGMWECLDFFPIAPFGREGLDTS
VNGPHVKHVLKASMYDDQHDHYAVGTYNLSTESFTPINHALDIQHGLHYDYGKFYASKSFYDPVKKRRIVWGWSNESDSA
AQDIARGWASLQAIPRVLWLDTALGDSLIQAPIEEVDDLRVGKVSKTDVDLEAGSVIKIEGSSGGQLDIEVIFEYPNVSN
VIVQDYGFLNGPFDCGNGGSAQRGVYGPFGLLVLTDDAYQEQTAVFFYIAQDANQRWVTHFCSDQSRSSLLHDIDTTAFW
SDVRVLPTENFLSLRVLVDHSIVESFVQGGRMAITSRVYPKEAVDEKAHVFLFNNSTTQITVRSINVWQMRSITVLPLA*
CDS seq >Pp.15272.1
ATGTACTTACCATGTAGCACGAAGTCGCAATTCAACGCGGTGCATTATCATGCTCAGAGGGAAGATAATTCTTGCAATTT
GGTCTCCGTTCCGATGTTAGATGAAACATCGCAGGAGGGCTATGGCTTGTACCGGGAAGTTTTTGCATCCAGGAACAAGA
AAAAGAGTGCAATTGCTTTCCTCACAATTATGTTGATAGCAGTAGTATTGTACACGAGCAAAATTGTTGCTCCCAAATGG
CTGGAGCTCTTCCAAAGGGAGAAACAGTATCTTGTACGTGATATTGTCAGAAACAGTGCTATTACGAGCTGTCATTCTCA
GCCTTACCGGTCATCATATCATTTCCAACCTCCCAAAAACTGGATGAATGACCCAAATGGACCAATGTACTACGAAGGCT
TTTATCACTTATTCTACCAGTACAATCCCGGAGGAGCTGTGTGGGGCAACCTTACTTGGGGTCATGCCGTCTCCACAGAC
CTTATCCATTGGCGTGATCTTGAGCCTGCCTTGAAACCTGACGAATGGTATGACAATGGAGGCGTATGGTCTGGTTCAGT
CACTATCTGTCCAGACGGATCACCGTTGATTCTATACACAGGTGTAGCAGATGATCTTGAGCAGTCACAGAACTTGGCGG
TACCCGAAGATCTTGCTGATCCTCTTCTTCGCAAGTGGGTGAAAAGTCGAGAGAATCCAATTCTTCGACACCCTGTTGGA
ATTGATAAAGAGGACTTCAGGGATCCAACAACAGCTTGGCAGGTAAATGATGGCACGTGGAGAATACTAGTTGGAGCCAA
AATGGGGAGAGATGGAATGGCGTTGCTTTATAAGAGCGAAGACCTACGTCATTGGGAGCTAGATGAAAATGTGCTGCACA
CAGTTCCTGGCTCGGGAATGTGGGAATGCTTGGACTTCTTTCCTATTGCACCGTTTGGACGAGAAGGGTTGGACACGTCA
GTCAATGGGCCTCATGTCAAACATGTCTTGAAAGCGAGCATGTATGATGATCAGCACGATCATTATGCAGTGGGTACTTA
TAATTTGTCTACAGAATCGTTCACACCTATCAATCATGCTTTGGACATCCAGCACGGCTTGCATTATGACTACGGGAAGT
TTTATGCTTCGAAGTCTTTCTATGACCCAGTGAAGAAGCGTCGCATCGTGTGGGGTTGGTCTAATGAATCCGACAGTGCA
GCTCAAGATATTGCTAGAGGATGGGCATCACTCCAGGCAATACCGAGGGTGTTGTGGTTGGACACTGCATTAGGAGACAG
CTTAATACAGGCACCTATCGAGGAGGTCGATGATTTGAGGGTCGGCAAGGTTTCCAAAACAGATGTGGATTTGGAGGCAG
GCAGTGTCATCAAAATCGAGGGATCCTCTGGAGGCCAGCTGGATATTGAAGTCATATTTGAGTACCCCAATGTCTCCAAC
GTGATAGTTCAAGATTATGGTTTCCTGAATGGACCATTTGACTGTGGTAATGGAGGATCAGCCCAGCGAGGTGTATATGG
CCCATTTGGATTGCTGGTGCTCACAGATGACGCTTACCAAGAGCAAACTGCTGTATTTTTCTACATCGCCCAGGACGCCA
ATCAGCGTTGGGTCACGCACTTTTGCAGTGATCAGAGCAGATCTTCTCTTTTGCATGATATTGACACAACTGCGTTTTGG
AGTGATGTTCGCGTCTTGCCTACTGAGAACTTTCTGTCTCTGCGTGTCCTGGTGGATCACTCTATCGTTGAAAGTTTTGT
TCAAGGTGGACGAATGGCTATAACATCGCGTGTTTATCCTAAAGAAGCGGTGGACGAGAAAGCTCATGTATTCCTCTTCA
ACAACAGCACAACGCAGATTACAGTTCGCAGCATCAATGTCTGGCAGATGCGGAGTATTACCGTACTCCCCCTTGCATGA