| Microexon ID | Pp_9:733462-733473:+ |
| Species | Physcomitrium patens | Coordinates | 9:733462..733473 |
| Microexon Cluster ID | MEP28 |
| Size | 12 |
| Phase | 0 |
| Pfam Domain Motif | Peptidase_M1 |
| Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 24,19,5,12,48 |
| Microexon location in the Microexon-tag | 4 |
| Microexon-tag DNA Seq | GTTMGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGRAGTYMAGGR |
| Logo of Microexon-tag DNA Seq | ![]() |
| Alignment of exons | ![]() |
| Microexon DNA seq | GTTTATGAGAAG |
| Microexon Amino Acid seq | VYEK |
| Microexon-tag DNA Seq | GTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGAAATGCCGGC |
| Microexon-tag Amino Acid Seq | VRPHSYIKMDNFYTVTVYEKGAEVVRMYQTLLGNAG |
| Microexon-tag spanning region | 733006-733632 |
| Microexon-tag prediction score | 0.9323 |
| Overlapped with the annotated transcript (%) | 100 |
| New Transcript ID | Pp3c9_1240V3.3x |
| Reference Transcript ID | Pp3c9_1240V3.3 |
| Gene ID | Pp3c9_1240 |
| Gene Name | NA |
| Transcript ID | Pp3c9_1240V3.3 |
| Protein ID | Pp3c9_1240V3.3 |
| Gene ID | Pp3c9_1240 |
| Gene Name | NA |
| Pfam domain motif | Peptidase_M1 |
| Motif E-value | 3.6e-49 |
| Motif start | 235 |
| Motif end | 439 |
| Protein seq | >Pp3c9_1240V3.3 MKMEESVKETPKEIFLKDYKAPHYAFEKVNLKFVLGEEKTLVTSNIRVLPRSSDGAPLILDGERLNLVTLNINGSPVPEG SYKFNSRQLNITSLPTSPFDMEIVTEIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYFQDRPDVMSKFTTRIEADKTL YPVLLSNGNLVDEGDLADGKHFAVWDDPWTKPCYLFALVAGQLVSRDDSFTTMSGRKVALRIWTPPQDIPKTVHAMKSLQ LAMKWDEEVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSRLVLASPETATDADYAAIEGVIGHEYFHNWTGNRVTCR DWFQLSLKEGLTVFRDQEFSSDMGSRGVKRIADVGRLRTAQFSQDAGPMAHPVRPHSYIKMDNFYTVTVYEKGAEVVRMY QTLLGNAGFRKGMDLYFQRHDGQAVTCEEFLAAMFDANNVKFPTFPLWYAQAGTPTLTVTTSYDAGAQTFTIKCKQEVPP TPGQLKKEPMLLPLAVGLLDSHGHDMRLTSVKDGTSLHNLTSVDGDYTTTAVLHVDKAEQEFTFVNITEKPVPSLLRNFS APVRLVSDVTNDDLFFLLAHDSDQFNRWEAGQTLSRKLMLDLISAQQKGEDLSVDSAFIEGMRSTVMDSSLDKEFVAKCL TLPTESEIADLMDVADPDAIHNVRRFIIMEIATNMRDVLLKMVEANRSSATYDPSHVHRAQRALKNVALGYLAMLNEPEI IDLAVNENKNATNMTDQVSALAAICQNSGDARSKALAEFYEQWKDETLVMNKWLALQAMSNIPGNVENVRGLMEHPAFDI RNPNKVYSLIGGFCASAVNFHAKDGSGYTFLADVVLQLDKLNPQVASRMVSAFSRWRRFDEGRQALAKAQLERITSQDGL SDNVFEIASKSLAS* |
| CDS seq | >Pp3c9_1240V3.3 ATGAAGATGGAGGAGTCCGTTAAGGAGACTCCTAAGGAGATCTTTCTGAAGGATTACAAGGCACCCCATTATGCTTTCGA AAAGGTAAATTTAAAGTTTGTGTTGGGAGAAGAGAAGACTCTAGTGACGTCCAACATACGTGTCCTTCCCAGGTCTTCTG ACGGTGCACCATTGATACTCGATGGAGAGAGGTTGAACTTGGTTACTCTAAATATCAATGGTTCGCCTGTCCCGGAGGGT AGTTACAAGTTTAATTCTCGGCAATTGAATATTACATCGTTGCCCACCAGCCCTTTCGACATGGAAATTGTCACGGAAAT TGAGCCACAAAATAATACTTCTCTGGAAGGATTGTATAAGTCATCTGGCAATTTCTGCACTCAATGTGAAGCAGAGGGCT TCCGTAAGATCACTTACTTCCAGGATCGCCCAGATGTGATGTCTAAGTTCACAACACGTATTGAGGCAGATAAAACTCTT TACCCAGTGTTGTTAAGCAACGGCAACTTGGTTGATGAGGGTGATTTGGCTGATGGCAAGCATTTCGCAGTGTGGGATGA TCCCTGGACAAAGCCTTGCTACTTGTTTGCACTTGTAGCTGGCCAGCTTGTGAGTCGTGATGACTCATTTACCACAATGT CGGGGCGCAAAGTGGCTCTTAGAATCTGGACTCCTCCGCAAGATATTCCTAAGACCGTTCACGCAATGAAATCTTTGCAA CTGGCTATGAAGTGGGATGAAGAGGTATTTGGTTTGGAATATGACCTAGACCTCTTCAACATTGTTGCTGTGCCGGATTT CAACATGGGTGCTATGGAGAATAAGAGCTTAAATATTTTTAATTCTAGGTTGGTATTAGCATCTCCTGAGACAGCTACAG ATGCAGATTACGCTGCTATTGAGGGAGTTATTGGCCACGAGTATTTTCACAATTGGACTGGCAACCGAGTGACATGTAGG GATTGGTTTCAGCTTAGTTTGAAGGAGGGACTTACTGTTTTCAGAGATCAGGAATTTTCATCTGATATGGGTAGTCGTGG AGTAAAACGCATTGCTGATGTGGGTCGTTTAAGGACTGCGCAATTTTCTCAGGATGCTGGTCCAATGGCACATCCAGTAC GGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTAC CAGACGCTACTAGGAAATGCCGGCTTTAGAAAGGGTATGGACTTGTATTTTCAAAGGCACGACGGTCAGGCTGTCACTTG TGAGGAGTTCCTGGCTGCTATGTTCGATGCCAATAATGTTAAATTCCCGACATTTCCCCTTTGGTACGCCCAAGCGGGAA CTCCAACTCTGACTGTGACCACCTCATACGACGCTGGTGCACAAACATTCACCATCAAGTGCAAGCAAGAAGTTCCCCCC ACCCCAGGACAATTAAAGAAGGAACCAATGCTTCTACCTTTGGCTGTAGGGTTGCTTGATTCACACGGGCACGACATGCG TCTTACTTCTGTGAAAGATGGAACCTCCCTGCACAACTTGACTAGTGTAGATGGTGACTACACGACCACTGCAGTACTGC ATGTTGACAAGGCTGAGCAAGAATTTACATTTGTGAATATCACGGAGAAGCCAGTTCCGTCACTGCTGCGTAATTTCAGT GCACCAGTTCGCCTTGTGTCTGATGTTACAAACGATGACCTTTTCTTTCTCCTGGCTCATGATTCCGATCAATTCAACAG GTGGGAAGCTGGCCAGACTCTAAGTCGCAAGCTTATGCTGGACCTGATCTCTGCGCAGCAGAAAGGAGAGGATCTTAGCG TAGACAGTGCTTTTATTGAAGGCATGCGAAGCACTGTTATGGATTCGTCACTTGACAAGGAATTCGTGGCAAAATGCCTT ACATTACCGACAGAAAGTGAGATTGCGGACCTGATGGATGTGGCTGATCCTGATGCCATACATAATGTGAGACGGTTTAT CATTATGGAGATTGCTACTAATATGCGCGATGTCCTACTTAAAATGGTGGAGGCAAACAGAAGTTCAGCTACTTACGATC CAAGCCATGTGCACAGGGCACAGCGTGCTTTAAAGAATGTTGCTTTGGGATACTTGGCGATGTTGAACGAACCTGAGATT ATAGACCTGGCTGTGAACGAAAACAAGAATGCCACAAACATGACTGATCAAGTATCTGCCTTGGCTGCTATTTGCCAGAA TTCTGGTGATGCACGGTCTAAGGCACTGGCAGAGTTCTACGAACAATGGAAGGATGAGACTTTGGTCATGAACAAATGGC TAGCTTTGCAAGCCATGTCAAATATCCCAGGAAATGTAGAGAATGTGCGGGGTTTGATGGAGCACCCTGCTTTTGACATC CGGAATCCAAACAAGGTCTATTCCTTGATTGGTGGTTTCTGCGCGTCCGCTGTAAATTTTCATGCTAAAGACGGTTCAGG GTACACGTTTTTGGCTGACGTTGTCCTTCAGTTGGATAAGCTGAATCCCCAGGTTGCGTCACGCATGGTGTCGGCTTTCT CCAGATGGCGGAGGTTTGATGAAGGCAGGCAGGCATTGGCCAAAGCACAATTGGAGCGGATTACGAGCCAGGATGGTCTG TCAGACAATGTTTTCGAAATTGCCTCAAAGAGTTTGGCGTCGTAA |
| Microexon DNA seq | GTTTATGAGAAG |
| Microexon Amino Acid seq | VYEK |
| Microexon-tag DNA Seq | GTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGAAATGCCGGC |
| Microexon-tag Amino Acid seq | VRPHSYIKMDNFYTVTVYEKGAEVVRMYQTLLGNAG |
| Transcript ID | Pp.24480.1 |
| Gene ID | Pp.24480 |
| Gene Name | NA |
| Pfam domain motif | Peptidase_M1 |
| Motif E-value | 3.6e-49 |
| Motif start | 235 |
| Motif end | 439 |
| Protein seq | >Pp.24480.1 MKMEESVKETPKEIFLKDYKAPHYAFEKVNLKFVLGEEKTLVTSNIRVLPRSSDGAPLILDGERLNLVTLNINGSPVPEG SYKFNSRQLNITSLPTSPFDMEIVTEIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYFQDRPDVMSKFTTRIEADKTL YPVLLSNGNLVDEGDLADGKHFAVWDDPWTKPCYLFALVAGQLVSRDDSFTTMSGRKVALRIWTPPQDIPKTVHAMKSLQ LAMKWDEEVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSRLVLASPETATDADYAAIEGVIGHEYFHNWTGNRVTCR DWFQLSLKEGLTVFRDQEFSSDMGSRGVKRIADVGRLRTAQFSQDAGPMAHPVRPHSYIKMDNFYTVTVYEKGAEVVRMY QTLLGNAGFRKGMDLYFQRHDGQAVTCEEFLAAMFDANNVKFPTFPLWYAQAGTPTLTVTTSYDAGAQTFTIKCKQEVPP TPGQLKKEPMLLPLAVGLLDSHGHDMRLTSVKDGTSLHNLTSVDGDYTTTAVLHVDKAEQEFTFVNITEKPVPSLLRNFS APVRLVSDVTNDDLFFLLAHDSDQFNRWEAGQTLSRKLMLDLISAQQKGEDLSVDSAFIEGMRSTVMDSSLDKEFVAKCL TLPTESEIADLMDVADPDAIHNVRRFIIMEIATNMRDVLLKMVEANRSSATYDPSHVHRAQRALKNVALGYLAMLNEPEI IDLAVNENKNATNMTDQVSALAAICQNSGDARSKALAEFYEQWKDETLVMNKWLALQAMSNIPGNVENVRGLMEHPAFDI RNPNKVYSLIGGFCASAVNFHAKDGSGYTFLADVVLQLDKLNPQVASRMVSAFSRWRRFDEGRQALAKAQLERITSQDGL SDNVFEIASKSLAS* |
| CDS seq | >Pp.24480.1 ATGAAGATGGAGGAGTCCGTTAAGGAGACTCCTAAGGAGATCTTTCTGAAGGATTACAAGGCACCCCATTATGCTTTCGA AAAGGTAAATTTAAAGTTTGTGTTGGGAGAAGAGAAGACTCTAGTGACGTCCAACATACGTGTCCTTCCCAGGTCTTCTG ACGGTGCACCATTGATACTCGATGGAGAGAGGTTGAACTTGGTTACTCTAAATATCAATGGTTCGCCTGTCCCGGAGGGT AGTTACAAGTTTAATTCTCGGCAATTGAATATTACATCGTTGCCCACCAGCCCTTTCGACATGGAAATTGTCACGGAAAT TGAGCCACAAAATAATACTTCTCTGGAAGGATTGTATAAGTCATCTGGCAATTTCTGCACTCAATGTGAAGCAGAGGGCT TCCGTAAGATCACTTACTTCCAGGATCGCCCAGATGTGATGTCTAAGTTCACAACACGTATTGAGGCAGATAAAACTCTT TACCCAGTGTTGTTAAGCAACGGCAACTTGGTTGATGAGGGTGATTTGGCTGATGGCAAGCATTTCGCAGTGTGGGATGA TCCCTGGACAAAGCCTTGCTACTTGTTTGCACTTGTAGCTGGCCAGCTTGTGAGTCGTGATGACTCATTTACCACAATGT CGGGGCGCAAAGTGGCTCTTAGAATCTGGACTCCTCCGCAAGATATTCCTAAGACCGTTCACGCAATGAAATCTTTGCAA CTGGCTATGAAGTGGGATGAAGAGGTATTTGGTTTGGAATATGACCTAGACCTCTTCAACATTGTTGCTGTGCCGGATTT CAACATGGGTGCTATGGAGAATAAGAGCTTAAATATTTTTAATTCTAGGTTGGTATTAGCATCTCCTGAGACAGCTACAG ATGCAGATTACGCTGCTATTGAGGGAGTTATTGGCCACGAGTATTTTCACAATTGGACTGGCAACCGAGTGACATGTAGG GATTGGTTTCAGCTTAGTTTGAAGGAGGGACTTACTGTTTTCAGAGATCAGGAATTTTCATCTGATATGGGTAGTCGTGG AGTAAAACGCATTGCTGATGTGGGTCGTTTAAGGACTGCGCAATTTTCTCAGGATGCTGGTCCAATGGCACATCCAGTAC GGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTAC CAGACGCTACTAGGAAATGCCGGCTTTAGAAAGGGTATGGACTTGTATTTTCAAAGGCACGACGGTCAGGCTGTCACTTG TGAGGAGTTCCTGGCTGCTATGTTCGATGCCAATAATGTTAAATTCCCGACATTTCCCCTTTGGTACGCCCAAGCGGGAA CTCCAACTCTGACTGTGACCACCTCATACGACGCTGGTGCACAAACATTCACCATCAAGTGCAAGCAAGAAGTTCCCCCC ACCCCAGGACAATTAAAGAAGGAACCAATGCTTCTACCTTTGGCTGTAGGGTTGCTTGATTCACACGGGCACGACATGCG TCTTACTTCTGTGAAAGATGGAACCTCCCTGCACAACTTGACTAGTGTAGATGGTGACTACACGACCACTGCAGTACTGC ATGTTGACAAGGCTGAGCAAGAATTTACATTTGTGAATATCACGGAGAAGCCAGTTCCGTCACTGCTGCGTAATTTCAGT GCACCAGTTCGCCTTGTGTCTGATGTTACAAACGATGACCTTTTCTTTCTCCTGGCTCATGATTCCGATCAATTCAACAG GTGGGAAGCTGGCCAGACTCTAAGTCGCAAGCTTATGCTGGACCTGATCTCTGCGCAGCAGAAAGGAGAGGATCTTAGCG TAGACAGTGCTTTTATTGAAGGCATGCGAAGCACTGTTATGGATTCGTCACTTGACAAGGAATTCGTGGCAAAATGCCTT ACATTACCGACAGAAAGTGAGATTGCGGACCTGATGGATGTGGCTGATCCTGATGCCATACATAATGTGAGACGGTTTAT CATTATGGAGATTGCTACTAATATGCGCGATGTCCTACTTAAAATGGTGGAGGCAAACAGAAGTTCAGCTACTTACGATC CAAGCCATGTGCACAGGGCACAGCGTGCTTTAAAGAATGTTGCTTTGGGATACTTGGCGATGTTGAACGAACCTGAGATT ATAGACCTGGCTGTGAACGAAAACAAGAATGCCACAAACATGACTGATCAAGTATCTGCCTTGGCTGCTATTTGCCAGAA TTCTGGTGATGCACGGTCTAAGGCACTGGCAGAGTTCTACGAACAATGGAAGGATGAGACTTTGGTCATGAACAAATGGC TAGCTTTGCAAGCCATGTCAAATATCCCAGGAAATGTAGAGAATGTGCGGGGTTTGATGGAGCACCCTGCTTTTGACATC CGGAATCCAAACAAGGTCTATTCCTTGATTGGTGGTTTCTGCGCGTCCGCTGTAAATTTTCATGCTAAAGACGGTTCAGG GTACACGTTTTTGGCTGACGTTGTCCTTCAGTTGGATAAGCTGAATCCCCAGGTTGCGTCACGCATGGTGTCGGCTTTCT CCAGATGGCGGAGGTTTGATGAAGGCAGGCAGGCATTGGCCAAAGCACAATTGGAGCGGATTACGAGCCAGGATGGTCTG TCAGACAATGTTTTCGAAATTGCCTCAAAGAGTTTGGCGTCGTAA |

