Microexon ID Pp_9:733462-733473:+
Species Physcomitrium patens
Coordinates 9:733462..733473
Microexon Cluster ID MEP28
Size 12
Phase 0
Pfam Domain Motif Peptidase_M1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 24,19,5,12,48
Microexon location in the Microexon-tag 4
Microexon-tag DNA Seq GTTMGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGRAGTYMAGGR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTTTATGAGAAG
Microexon Amino Acid seq VYEK
Microexon-tag DNA Seq GTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGAAATGCCGGC
Microexon-tag Amino Acid Seq VRPHSYIKMDNFYTVTVYEKGAEVVRMYQTLLGNAG
Microexon-tag spanning region733006-733632
Microexon-tag prediction score0.9323
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c9_1240V3.3x
Reference Transcript ID Pp3c9_1240V3.3
Gene ID Pp3c9_1240
Gene Name NA
Transcript ID Pp3c9_1240V3.3
Protein ID Pp3c9_1240V3.3
Gene ID Pp3c9_1240
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 3.6e-49
Motif start 235
Motif end 439
Protein seq >Pp3c9_1240V3.3
MKMEESVKETPKEIFLKDYKAPHYAFEKVNLKFVLGEEKTLVTSNIRVLPRSSDGAPLILDGERLNLVTLNINGSPVPEG
SYKFNSRQLNITSLPTSPFDMEIVTEIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYFQDRPDVMSKFTTRIEADKTL
YPVLLSNGNLVDEGDLADGKHFAVWDDPWTKPCYLFALVAGQLVSRDDSFTTMSGRKVALRIWTPPQDIPKTVHAMKSLQ
LAMKWDEEVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSRLVLASPETATDADYAAIEGVIGHEYFHNWTGNRVTCR
DWFQLSLKEGLTVFRDQEFSSDMGSRGVKRIADVGRLRTAQFSQDAGPMAHPVRPHSYIKMDNFYTVTVYEKGAEVVRMY
QTLLGNAGFRKGMDLYFQRHDGQAVTCEEFLAAMFDANNVKFPTFPLWYAQAGTPTLTVTTSYDAGAQTFTIKCKQEVPP
TPGQLKKEPMLLPLAVGLLDSHGHDMRLTSVKDGTSLHNLTSVDGDYTTTAVLHVDKAEQEFTFVNITEKPVPSLLRNFS
APVRLVSDVTNDDLFFLLAHDSDQFNRWEAGQTLSRKLMLDLISAQQKGEDLSVDSAFIEGMRSTVMDSSLDKEFVAKCL
TLPTESEIADLMDVADPDAIHNVRRFIIMEIATNMRDVLLKMVEANRSSATYDPSHVHRAQRALKNVALGYLAMLNEPEI
IDLAVNENKNATNMTDQVSALAAICQNSGDARSKALAEFYEQWKDETLVMNKWLALQAMSNIPGNVENVRGLMEHPAFDI
RNPNKVYSLIGGFCASAVNFHAKDGSGYTFLADVVLQLDKLNPQVASRMVSAFSRWRRFDEGRQALAKAQLERITSQDGL
SDNVFEIASKSLAS*
CDS seq >Pp3c9_1240V3.3
ATGAAGATGGAGGAGTCCGTTAAGGAGACTCCTAAGGAGATCTTTCTGAAGGATTACAAGGCACCCCATTATGCTTTCGA
AAAGGTAAATTTAAAGTTTGTGTTGGGAGAAGAGAAGACTCTAGTGACGTCCAACATACGTGTCCTTCCCAGGTCTTCTG
ACGGTGCACCATTGATACTCGATGGAGAGAGGTTGAACTTGGTTACTCTAAATATCAATGGTTCGCCTGTCCCGGAGGGT
AGTTACAAGTTTAATTCTCGGCAATTGAATATTACATCGTTGCCCACCAGCCCTTTCGACATGGAAATTGTCACGGAAAT
TGAGCCACAAAATAATACTTCTCTGGAAGGATTGTATAAGTCATCTGGCAATTTCTGCACTCAATGTGAAGCAGAGGGCT
TCCGTAAGATCACTTACTTCCAGGATCGCCCAGATGTGATGTCTAAGTTCACAACACGTATTGAGGCAGATAAAACTCTT
TACCCAGTGTTGTTAAGCAACGGCAACTTGGTTGATGAGGGTGATTTGGCTGATGGCAAGCATTTCGCAGTGTGGGATGA
TCCCTGGACAAAGCCTTGCTACTTGTTTGCACTTGTAGCTGGCCAGCTTGTGAGTCGTGATGACTCATTTACCACAATGT
CGGGGCGCAAAGTGGCTCTTAGAATCTGGACTCCTCCGCAAGATATTCCTAAGACCGTTCACGCAATGAAATCTTTGCAA
CTGGCTATGAAGTGGGATGAAGAGGTATTTGGTTTGGAATATGACCTAGACCTCTTCAACATTGTTGCTGTGCCGGATTT
CAACATGGGTGCTATGGAGAATAAGAGCTTAAATATTTTTAATTCTAGGTTGGTATTAGCATCTCCTGAGACAGCTACAG
ATGCAGATTACGCTGCTATTGAGGGAGTTATTGGCCACGAGTATTTTCACAATTGGACTGGCAACCGAGTGACATGTAGG
GATTGGTTTCAGCTTAGTTTGAAGGAGGGACTTACTGTTTTCAGAGATCAGGAATTTTCATCTGATATGGGTAGTCGTGG
AGTAAAACGCATTGCTGATGTGGGTCGTTTAAGGACTGCGCAATTTTCTCAGGATGCTGGTCCAATGGCACATCCAGTAC
GGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTAC
CAGACGCTACTAGGAAATGCCGGCTTTAGAAAGGGTATGGACTTGTATTTTCAAAGGCACGACGGTCAGGCTGTCACTTG
TGAGGAGTTCCTGGCTGCTATGTTCGATGCCAATAATGTTAAATTCCCGACATTTCCCCTTTGGTACGCCCAAGCGGGAA
CTCCAACTCTGACTGTGACCACCTCATACGACGCTGGTGCACAAACATTCACCATCAAGTGCAAGCAAGAAGTTCCCCCC
ACCCCAGGACAATTAAAGAAGGAACCAATGCTTCTACCTTTGGCTGTAGGGTTGCTTGATTCACACGGGCACGACATGCG
TCTTACTTCTGTGAAAGATGGAACCTCCCTGCACAACTTGACTAGTGTAGATGGTGACTACACGACCACTGCAGTACTGC
ATGTTGACAAGGCTGAGCAAGAATTTACATTTGTGAATATCACGGAGAAGCCAGTTCCGTCACTGCTGCGTAATTTCAGT
GCACCAGTTCGCCTTGTGTCTGATGTTACAAACGATGACCTTTTCTTTCTCCTGGCTCATGATTCCGATCAATTCAACAG
GTGGGAAGCTGGCCAGACTCTAAGTCGCAAGCTTATGCTGGACCTGATCTCTGCGCAGCAGAAAGGAGAGGATCTTAGCG
TAGACAGTGCTTTTATTGAAGGCATGCGAAGCACTGTTATGGATTCGTCACTTGACAAGGAATTCGTGGCAAAATGCCTT
ACATTACCGACAGAAAGTGAGATTGCGGACCTGATGGATGTGGCTGATCCTGATGCCATACATAATGTGAGACGGTTTAT
CATTATGGAGATTGCTACTAATATGCGCGATGTCCTACTTAAAATGGTGGAGGCAAACAGAAGTTCAGCTACTTACGATC
CAAGCCATGTGCACAGGGCACAGCGTGCTTTAAAGAATGTTGCTTTGGGATACTTGGCGATGTTGAACGAACCTGAGATT
ATAGACCTGGCTGTGAACGAAAACAAGAATGCCACAAACATGACTGATCAAGTATCTGCCTTGGCTGCTATTTGCCAGAA
TTCTGGTGATGCACGGTCTAAGGCACTGGCAGAGTTCTACGAACAATGGAAGGATGAGACTTTGGTCATGAACAAATGGC
TAGCTTTGCAAGCCATGTCAAATATCCCAGGAAATGTAGAGAATGTGCGGGGTTTGATGGAGCACCCTGCTTTTGACATC
CGGAATCCAAACAAGGTCTATTCCTTGATTGGTGGTTTCTGCGCGTCCGCTGTAAATTTTCATGCTAAAGACGGTTCAGG
GTACACGTTTTTGGCTGACGTTGTCCTTCAGTTGGATAAGCTGAATCCCCAGGTTGCGTCACGCATGGTGTCGGCTTTCT
CCAGATGGCGGAGGTTTGATGAAGGCAGGCAGGCATTGGCCAAAGCACAATTGGAGCGGATTACGAGCCAGGATGGTCTG
TCAGACAATGTTTTCGAAATTGCCTCAAAGAGTTTGGCGTCGTAA
Microexon DNA seq GTTTATGAGAAG
Microexon Amino Acid seq VYEK
Microexon-tag DNA Seq GTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGAAATGCCGGC
Microexon-tag Amino Acid seq VRPHSYIKMDNFYTVTVYEKGAEVVRMYQTLLGNAG
Transcript ID Pp.24480.1
Gene ID Pp.24480
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 3.6e-49
Motif start 235
Motif end 439
Protein seq >Pp.24480.1
MKMEESVKETPKEIFLKDYKAPHYAFEKVNLKFVLGEEKTLVTSNIRVLPRSSDGAPLILDGERLNLVTLNINGSPVPEG
SYKFNSRQLNITSLPTSPFDMEIVTEIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYFQDRPDVMSKFTTRIEADKTL
YPVLLSNGNLVDEGDLADGKHFAVWDDPWTKPCYLFALVAGQLVSRDDSFTTMSGRKVALRIWTPPQDIPKTVHAMKSLQ
LAMKWDEEVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSRLVLASPETATDADYAAIEGVIGHEYFHNWTGNRVTCR
DWFQLSLKEGLTVFRDQEFSSDMGSRGVKRIADVGRLRTAQFSQDAGPMAHPVRPHSYIKMDNFYTVTVYEKGAEVVRMY
QTLLGNAGFRKGMDLYFQRHDGQAVTCEEFLAAMFDANNVKFPTFPLWYAQAGTPTLTVTTSYDAGAQTFTIKCKQEVPP
TPGQLKKEPMLLPLAVGLLDSHGHDMRLTSVKDGTSLHNLTSVDGDYTTTAVLHVDKAEQEFTFVNITEKPVPSLLRNFS
APVRLVSDVTNDDLFFLLAHDSDQFNRWEAGQTLSRKLMLDLISAQQKGEDLSVDSAFIEGMRSTVMDSSLDKEFVAKCL
TLPTESEIADLMDVADPDAIHNVRRFIIMEIATNMRDVLLKMVEANRSSATYDPSHVHRAQRALKNVALGYLAMLNEPEI
IDLAVNENKNATNMTDQVSALAAICQNSGDARSKALAEFYEQWKDETLVMNKWLALQAMSNIPGNVENVRGLMEHPAFDI
RNPNKVYSLIGGFCASAVNFHAKDGSGYTFLADVVLQLDKLNPQVASRMVSAFSRWRRFDEGRQALAKAQLERITSQDGL
SDNVFEIASKSLAS*
CDS seq >Pp.24480.1
ATGAAGATGGAGGAGTCCGTTAAGGAGACTCCTAAGGAGATCTTTCTGAAGGATTACAAGGCACCCCATTATGCTTTCGA
AAAGGTAAATTTAAAGTTTGTGTTGGGAGAAGAGAAGACTCTAGTGACGTCCAACATACGTGTCCTTCCCAGGTCTTCTG
ACGGTGCACCATTGATACTCGATGGAGAGAGGTTGAACTTGGTTACTCTAAATATCAATGGTTCGCCTGTCCCGGAGGGT
AGTTACAAGTTTAATTCTCGGCAATTGAATATTACATCGTTGCCCACCAGCCCTTTCGACATGGAAATTGTCACGGAAAT
TGAGCCACAAAATAATACTTCTCTGGAAGGATTGTATAAGTCATCTGGCAATTTCTGCACTCAATGTGAAGCAGAGGGCT
TCCGTAAGATCACTTACTTCCAGGATCGCCCAGATGTGATGTCTAAGTTCACAACACGTATTGAGGCAGATAAAACTCTT
TACCCAGTGTTGTTAAGCAACGGCAACTTGGTTGATGAGGGTGATTTGGCTGATGGCAAGCATTTCGCAGTGTGGGATGA
TCCCTGGACAAAGCCTTGCTACTTGTTTGCACTTGTAGCTGGCCAGCTTGTGAGTCGTGATGACTCATTTACCACAATGT
CGGGGCGCAAAGTGGCTCTTAGAATCTGGACTCCTCCGCAAGATATTCCTAAGACCGTTCACGCAATGAAATCTTTGCAA
CTGGCTATGAAGTGGGATGAAGAGGTATTTGGTTTGGAATATGACCTAGACCTCTTCAACATTGTTGCTGTGCCGGATTT
CAACATGGGTGCTATGGAGAATAAGAGCTTAAATATTTTTAATTCTAGGTTGGTATTAGCATCTCCTGAGACAGCTACAG
ATGCAGATTACGCTGCTATTGAGGGAGTTATTGGCCACGAGTATTTTCACAATTGGACTGGCAACCGAGTGACATGTAGG
GATTGGTTTCAGCTTAGTTTGAAGGAGGGACTTACTGTTTTCAGAGATCAGGAATTTTCATCTGATATGGGTAGTCGTGG
AGTAAAACGCATTGCTGATGTGGGTCGTTTAAGGACTGCGCAATTTTCTCAGGATGCTGGTCCAATGGCACATCCAGTAC
GGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTAC
CAGACGCTACTAGGAAATGCCGGCTTTAGAAAGGGTATGGACTTGTATTTTCAAAGGCACGACGGTCAGGCTGTCACTTG
TGAGGAGTTCCTGGCTGCTATGTTCGATGCCAATAATGTTAAATTCCCGACATTTCCCCTTTGGTACGCCCAAGCGGGAA
CTCCAACTCTGACTGTGACCACCTCATACGACGCTGGTGCACAAACATTCACCATCAAGTGCAAGCAAGAAGTTCCCCCC
ACCCCAGGACAATTAAAGAAGGAACCAATGCTTCTACCTTTGGCTGTAGGGTTGCTTGATTCACACGGGCACGACATGCG
TCTTACTTCTGTGAAAGATGGAACCTCCCTGCACAACTTGACTAGTGTAGATGGTGACTACACGACCACTGCAGTACTGC
ATGTTGACAAGGCTGAGCAAGAATTTACATTTGTGAATATCACGGAGAAGCCAGTTCCGTCACTGCTGCGTAATTTCAGT
GCACCAGTTCGCCTTGTGTCTGATGTTACAAACGATGACCTTTTCTTTCTCCTGGCTCATGATTCCGATCAATTCAACAG
GTGGGAAGCTGGCCAGACTCTAAGTCGCAAGCTTATGCTGGACCTGATCTCTGCGCAGCAGAAAGGAGAGGATCTTAGCG
TAGACAGTGCTTTTATTGAAGGCATGCGAAGCACTGTTATGGATTCGTCACTTGACAAGGAATTCGTGGCAAAATGCCTT
ACATTACCGACAGAAAGTGAGATTGCGGACCTGATGGATGTGGCTGATCCTGATGCCATACATAATGTGAGACGGTTTAT
CATTATGGAGATTGCTACTAATATGCGCGATGTCCTACTTAAAATGGTGGAGGCAAACAGAAGTTCAGCTACTTACGATC
CAAGCCATGTGCACAGGGCACAGCGTGCTTTAAAGAATGTTGCTTTGGGATACTTGGCGATGTTGAACGAACCTGAGATT
ATAGACCTGGCTGTGAACGAAAACAAGAATGCCACAAACATGACTGATCAAGTATCTGCCTTGGCTGCTATTTGCCAGAA
TTCTGGTGATGCACGGTCTAAGGCACTGGCAGAGTTCTACGAACAATGGAAGGATGAGACTTTGGTCATGAACAAATGGC
TAGCTTTGCAAGCCATGTCAAATATCCCAGGAAATGTAGAGAATGTGCGGGGTTTGATGGAGCACCCTGCTTTTGACATC
CGGAATCCAAACAAGGTCTATTCCTTGATTGGTGGTTTCTGCGCGTCCGCTGTAAATTTTCATGCTAAAGACGGTTCAGG
GTACACGTTTTTGGCTGACGTTGTCCTTCAGTTGGATAAGCTGAATCCCCAGGTTGCGTCACGCATGGTGTCGGCTTTCT
CCAGATGGCGGAGGTTTGATGAAGGCAGGCAGGCATTGGCCAAAGCACAATTGGAGCGGATTACGAGCCAGGATGGTCTG
TCAGACAATGTTTTCGAAATTGCCTCAAAGAGTTTGGCGTCGTAA