
Microexon ID | Pp_9:733216-733220:+ |
Species | Physcomitrium patens | Coordinates | 9:733216..733220 |
Microexon Cluster ID | MEP07 |
Size | 5 |
Phase | 1 |
Pfam Domain Motif | Peptidase_M1 |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 33,19,5,12,39 |
Microexon location in the Microexon-tag | 3 |
Microexon-tag DNA Seq (degenerate, IUPAC ambiguity codes) | GCWCAYCCTGTTCGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGR |
Logo of Microexon-tag DNA Seq | [image: sequence logo, not included] |
Alignment of exons | [image: exon alignment, not included] |
Microexon DNA seq | TCACT |
Microexon Amino Acid seq | VT |
Microexon-tag DNA Seq | GCACATCCAGTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGA |
Microexon-tag Amino Acid Seq | AHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYQTLLG |
Microexon-tag spanning region | 732997-733623 |
Microexon-tag prediction score | 0.9449 |
Overlapped with the annotated transcript (%) | 100 |
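The exon sizes and the microexon position listed in this record can be cross-checked against the tag sequence. The following is a minimal sketch, not the database's own pipeline: it splits the Physcomitrium patens Microexon-tag DNA sequence by the sizes 33,19,5,12,39, extracts the third exon, and translates the tag with the standard genetic code. The function names (`split_exons`, `translate`) are hypothetical, and the sketch assumes the tag starts at the first base of a codon and that the Phase field counts the codon bases lying upstream of the microexon.

```python
from itertools import accumulate

# Values copied from this record.
TAG_DNA = "GCACATCCAGTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGA"
EXON_SIZES = [33, 19, 5, 12, 39]   # flanking exons, microexon, flanking exons
MICROEXON_INDEX = 3                # 1-based position within the tag

# Standard genetic code, built with bases in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def split_exons(seq, sizes):
    """Cut seq into consecutive pieces whose lengths are given by sizes."""
    if sum(sizes) != len(seq):
        raise ValueError("exon sizes do not add up to the sequence length")
    bounds = [0, *accumulate(sizes)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


def translate(seq):
    """Translate complete codons of seq, reading from its first base."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


exons = split_exons(TAG_DNA, EXON_SIZES)
microexon = exons[MICROEXON_INDEX - 1]

# 0-based offset of the microexon inside the tag, and how many bases of
# its first codon lie in the upstream exon (assumed meaning of "Phase").
start = sum(EXON_SIZES[:MICROEXON_INDEX - 1])
upstream_codon_bases = start % 3

# Amino acids of every codon that overlaps the microexon.
first_codon = start // 3
last_codon = (start + len(microexon) - 1) // 3
tag_protein = translate(TAG_DNA)
microexon_aa = tag_protein[first_codon:last_codon + 1]

print(microexon, start, upstream_codon_bases, microexon_aa)
print(tag_protein)
```

Under these assumptions the sketch recovers `TCACT` at offset 52 with one upstream codon base, the overlapping residues `VT`, and a tag translation identical to the Microexon-tag Amino Acid Seq given in this record.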
New Transcript ID | Pp3c9_1240V3.3x |
Reference Transcript ID | Pp3c9_1240V3.3 |
Gene ID | Pp3c9_1240 |
Gene Name | NA |
Transcript ID | Pp3c9_1240V3.1 |
Protein ID | Pp3c9_1240V3.1 |
Gene ID | Pp3c9_1240 |
Gene Name | NA |
Pfam domain motif | Peptidase_M1 |
Motif E-value | 6.7e-43 |
Motif start | 235 |
Motif end | 435 |
Protein seq | >Pp3c9_1240V3.1 MKMEESVKETPKEIFLKDYKAPHYAFEKVNLKFVLGEEKTLVTSNIRVLPRSSDGAPLILDGERLNLVTLNINGSPVPEG SYKFNSRQLNITSLPTSPFDMEIVTEIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYFQDRPDVMSKFTTRIEADKTL YPVLLSNGNLVDEGDLADGKHFAVWDDPWTKPCYLFALVAGQLVSRDDSFTTMSGRKVALRIWTPPQDIPKTVHAMKSLQ LAMKWDEEVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSRLVLASPETATDADYAAIEGVIGHEYFHNWTGNRVTCR DWFQLSLKEGLTVFRDQEFSSDMGSRGVKRIADVGRLRTAQFSQDAGPMAHPVRPHSYIKMDNFYTVTGAEVVRMYQTLL GNAGFRKGMDLYFQRHDGQAVTCEEFLAAMFDANNVKFPTFPLWYAQAGTPTLTVTTSYDAGAQTFTIKCKQEVPPTPGQ LKKEPMLLPLAVGLLDSHGHDMRLTSVKDGTSLHNLTSVDGDYTTTAVLHVDKAEQEFTFVNITEKPVPSLLRNFSAPVR LVSDVTNDDLFFLLAHDSDQFNRWEAGQTLSRKLMLDLISAQQKGEDLSVDSAFIEGMRSTVMDSSLDKEFVAKCLTLPT ESEIADLMDVADPDAIHNVRRFIIMEIATNMRDVLLKMVEANRSSATYDPSHVHRAQRALKNVALGYLAMLNEPEIIDLA VNENKNATNMTDQVSALAAICQNSGDARSKALAEFYEQWKDETLVMNKWLALQAMSNIPGNVENVRGLMEHPAFDIRNPN KVYSLIGGFCASAVNFHAKDGSGYTFLADVVLQLDKLNPQVASRMVSAFSRWRRFDEGRQALAKAQLERITSQDGLSDNV FEIASKSLAS* |
CDS seq | >Pp3c9_1240V3.1 ATGAAGATGGAGGAGTCCGTTAAGGAGACTCCTAAGGAGATCTTTCTGAAGGATTACAAGGCACCCCATTATGCTTTCGA AAAGGTAAATTTAAAGTTTGTGTTGGGAGAAGAGAAGACTCTAGTGACGTCCAACATACGTGTCCTTCCCAGGTCTTCTG ACGGTGCACCATTGATACTCGATGGAGAGAGGTTGAACTTGGTTACTCTAAATATCAATGGTTCGCCTGTCCCGGAGGGT AGTTACAAGTTTAATTCTCGGCAATTGAATATTACATCGTTGCCCACCAGCCCTTTCGACATGGAAATTGTCACGGAAAT TGAGCCACAAAATAATACTTCTCTGGAAGGATTGTATAAGTCATCTGGCAATTTCTGCACTCAATGTGAAGCAGAGGGCT TCCGTAAGATCACTTACTTCCAGGATCGCCCAGATGTGATGTCTAAGTTCACAACACGTATTGAGGCAGATAAAACTCTT TACCCAGTGTTGTTAAGCAACGGCAACTTGGTTGATGAGGGTGATTTGGCTGATGGCAAGCATTTCGCAGTGTGGGATGA TCCCTGGACAAAGCCTTGCTACTTGTTTGCACTTGTAGCTGGCCAGCTTGTGAGTCGTGATGACTCATTTACCACAATGT CGGGGCGCAAAGTGGCTCTTAGAATCTGGACTCCTCCGCAAGATATTCCTAAGACCGTTCACGCAATGAAATCTTTGCAA CTGGCTATGAAGTGGGATGAAGAGGTATTTGGTTTGGAATATGACCTAGACCTCTTCAACATTGTTGCTGTGCCGGATTT CAACATGGGTGCTATGGAGAATAAGAGCTTAAATATTTTTAATTCTAGGTTGGTATTAGCATCTCCTGAGACAGCTACAG ATGCAGATTACGCTGCTATTGAGGGAGTTATTGGCCACGAGTATTTTCACAATTGGACTGGCAACCGAGTGACATGTAGG GATTGGTTTCAGCTTAGTTTGAAGGAGGGACTTACTGTTTTCAGAGATCAGGAATTTTCATCTGATATGGGTAGTCGTGG AGTAAAACGCATTGCTGATGTGGGTCGTTTAAGGACTGCGCAATTTTCTCAGGATGCTGGTCCAATGGCACATCCAGTAC GGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTA GGAAATGCCGGCTTTAGAAAGGGTATGGACTTGTATTTTCAAAGGCACGACGGTCAGGCTGTCACTTGTGAGGAGTTCCT GGCTGCTATGTTCGATGCCAATAATGTTAAATTCCCGACATTTCCCCTTTGGTACGCCCAAGCGGGAACTCCAACTCTGA CTGTGACCACCTCATACGACGCTGGTGCACAAACATTCACCATCAAGTGCAAGCAAGAAGTTCCCCCCACCCCAGGACAA TTAAAGAAGGAACCAATGCTTCTACCTTTGGCTGTAGGGTTGCTTGATTCACACGGGCACGACATGCGTCTTACTTCTGT GAAAGATGGAACCTCCCTGCACAACTTGACTAGTGTAGATGGTGACTACACGACCACTGCAGTACTGCATGTTGACAAGG CTGAGCAAGAATTTACATTTGTGAATATCACGGAGAAGCCAGTTCCGTCACTGCTGCGTAATTTCAGTGCACCAGTTCGC CTTGTGTCTGATGTTACAAACGATGACCTTTTCTTTCTCCTGGCTCATGATTCCGATCAATTCAACAGGTGGGAAGCTGG CCAGACTCTAAGTCGCAAGCTTATGCTGGACCTGATCTCTGCGCAGCAGAAAGGAGAGGATCTTAGCGTAGACAGTGCTT TTATTGAAGGCATGCGAAGCACTGTTATGGATTCGTCACTTGACAAGGAATTCGTGGCAAAATGCCTTACATTACCGACA 
GAAAGTGAGATTGCGGACCTGATGGATGTGGCTGATCCTGATGCCATACATAATGTGAGACGGTTTATCATTATGGAGAT TGCTACTAATATGCGCGATGTCCTACTTAAAATGGTGGAGGCAAACAGAAGTTCAGCTACTTACGATCCAAGCCATGTGC ACAGGGCACAGCGTGCTTTAAAGAATGTTGCTTTGGGATACTTGGCGATGTTGAACGAACCTGAGATTATAGACCTGGCT GTGAACGAAAACAAGAATGCCACAAACATGACTGATCAAGTATCTGCCTTGGCTGCTATTTGCCAGAATTCTGGTGATGC ACGGTCTAAGGCACTGGCAGAGTTCTACGAACAATGGAAGGATGAGACTTTGGTCATGAACAAATGGCTAGCTTTGCAAG CCATGTCAAATATCCCAGGAAATGTAGAGAATGTGCGGGGTTTGATGGAGCACCCTGCTTTTGACATCCGGAATCCAAAC AAGGTCTATTCCTTGATTGGTGGTTTCTGCGCGTCCGCTGTAAATTTTCATGCTAAAGACGGTTCAGGGTACACGTTTTT GGCTGACGTTGTCCTTCAGTTGGATAAGCTGAATCCCCAGGTTGCGTCACGCATGGTGTCGGCTTTCTCCAGATGGCGGA GGTTTGATGAAGGCAGGCAGGCATTGGCCAAAGCACAATTGGAGCGGATTACGAGCCAGGATGGTCTGTCAGACAATGTT TTCGAAATTGCCTCAAAGAGTTTGGCGTCGTAA |
Microexon DNA seq | TCACT |
Microexon Amino Acid seq | VT |
Microexon-tag DNA Seq | GCACATCCAGTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGA |
Microexon-tag Amino Acid Seq | AHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYQTLLG |
Transcript ID | Pp.24480.1 |
Gene ID | Pp.24480 |
Gene Name | NA |
Pfam domain motif | Peptidase_M1 |
Motif E-value | 3.6e-49 |
Motif start | 235 |
Motif end | 439 |
Protein seq | >Pp.24480.1 MKMEESVKETPKEIFLKDYKAPHYAFEKVNLKFVLGEEKTLVTSNIRVLPRSSDGAPLILDGERLNLVTLNINGSPVPEG SYKFNSRQLNITSLPTSPFDMEIVTEIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYFQDRPDVMSKFTTRIEADKTL YPVLLSNGNLVDEGDLADGKHFAVWDDPWTKPCYLFALVAGQLVSRDDSFTTMSGRKVALRIWTPPQDIPKTVHAMKSLQ LAMKWDEEVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSRLVLASPETATDADYAAIEGVIGHEYFHNWTGNRVTCR DWFQLSLKEGLTVFRDQEFSSDMGSRGVKRIADVGRLRTAQFSQDAGPMAHPVRPHSYIKMDNFYTVTVYEKGAEVVRMY QTLLGNAGFRKGMDLYFQRHDGQAVTCEEFLAAMFDANNVKFPTFPLWYAQAGTPTLTVTTSYDAGAQTFTIKCKQEVPP TPGQLKKEPMLLPLAVGLLDSHGHDMRLTSVKDGTSLHNLTSVDGDYTTTAVLHVDKAEQEFTFVNITEKPVPSLLRNFS APVRLVSDVTNDDLFFLLAHDSDQFNRWEAGQTLSRKLMLDLISAQQKGEDLSVDSAFIEGMRSTVMDSSLDKEFVAKCL TLPTESEIADLMDVADPDAIHNVRRFIIMEIATNMRDVLLKMVEANRSSATYDPSHVHRAQRALKNVALGYLAMLNEPEI IDLAVNENKNATNMTDQVSALAAICQNSGDARSKALAEFYEQWKDETLVMNKWLALQAMSNIPGNVENVRGLMEHPAFDI RNPNKVYSLIGGFCASAVNFHAKDGSGYTFLADVVLQLDKLNPQVASRMVSAFSRWRRFDEGRQALAKAQLERITSQDGL SDNVFEIASKSLAS* |
CDS seq | >Pp.24480.1 ATGAAGATGGAGGAGTCCGTTAAGGAGACTCCTAAGGAGATCTTTCTGAAGGATTACAAGGCACCCCATTATGCTTTCGA AAAGGTAAATTTAAAGTTTGTGTTGGGAGAAGAGAAGACTCTAGTGACGTCCAACATACGTGTCCTTCCCAGGTCTTCTG ACGGTGCACCATTGATACTCGATGGAGAGAGGTTGAACTTGGTTACTCTAAATATCAATGGTTCGCCTGTCCCGGAGGGT AGTTACAAGTTTAATTCTCGGCAATTGAATATTACATCGTTGCCCACCAGCCCTTTCGACATGGAAATTGTCACGGAAAT TGAGCCACAAAATAATACTTCTCTGGAAGGATTGTATAAGTCATCTGGCAATTTCTGCACTCAATGTGAAGCAGAGGGCT TCCGTAAGATCACTTACTTCCAGGATCGCCCAGATGTGATGTCTAAGTTCACAACACGTATTGAGGCAGATAAAACTCTT TACCCAGTGTTGTTAAGCAACGGCAACTTGGTTGATGAGGGTGATTTGGCTGATGGCAAGCATTTCGCAGTGTGGGATGA TCCCTGGACAAAGCCTTGCTACTTGTTTGCACTTGTAGCTGGCCAGCTTGTGAGTCGTGATGACTCATTTACCACAATGT CGGGGCGCAAAGTGGCTCTTAGAATCTGGACTCCTCCGCAAGATATTCCTAAGACCGTTCACGCAATGAAATCTTTGCAA CTGGCTATGAAGTGGGATGAAGAGGTATTTGGTTTGGAATATGACCTAGACCTCTTCAACATTGTTGCTGTGCCGGATTT CAACATGGGTGCTATGGAGAATAAGAGCTTAAATATTTTTAATTCTAGGTTGGTATTAGCATCTCCTGAGACAGCTACAG ATGCAGATTACGCTGCTATTGAGGGAGTTATTGGCCACGAGTATTTTCACAATTGGACTGGCAACCGAGTGACATGTAGG GATTGGTTTCAGCTTAGTTTGAAGGAGGGACTTACTGTTTTCAGAGATCAGGAATTTTCATCTGATATGGGTAGTCGTGG AGTAAAACGCATTGCTGATGTGGGTCGTTTAAGGACTGCGCAATTTTCTCAGGATGCTGGTCCAATGGCACATCCAGTAC GGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTAC CAGACGCTACTAGGAAATGCCGGCTTTAGAAAGGGTATGGACTTGTATTTTCAAAGGCACGACGGTCAGGCTGTCACTTG TGAGGAGTTCCTGGCTGCTATGTTCGATGCCAATAATGTTAAATTCCCGACATTTCCCCTTTGGTACGCCCAAGCGGGAA CTCCAACTCTGACTGTGACCACCTCATACGACGCTGGTGCACAAACATTCACCATCAAGTGCAAGCAAGAAGTTCCCCCC ACCCCAGGACAATTAAAGAAGGAACCAATGCTTCTACCTTTGGCTGTAGGGTTGCTTGATTCACACGGGCACGACATGCG TCTTACTTCTGTGAAAGATGGAACCTCCCTGCACAACTTGACTAGTGTAGATGGTGACTACACGACCACTGCAGTACTGC ATGTTGACAAGGCTGAGCAAGAATTTACATTTGTGAATATCACGGAGAAGCCAGTTCCGTCACTGCTGCGTAATTTCAGT GCACCAGTTCGCCTTGTGTCTGATGTTACAAACGATGACCTTTTCTTTCTCCTGGCTCATGATTCCGATCAATTCAACAG GTGGGAAGCTGGCCAGACTCTAAGTCGCAAGCTTATGCTGGACCTGATCTCTGCGCAGCAGAAAGGAGAGGATCTTAGCG TAGACAGTGCTTTTATTGAAGGCATGCGAAGCACTGTTATGGATTCGTCACTTGACAAGGAATTCGTGGCAAAATGCCTT 
ACATTACCGACAGAAAGTGAGATTGCGGACCTGATGGATGTGGCTGATCCTGATGCCATACATAATGTGAGACGGTTTAT CATTATGGAGATTGCTACTAATATGCGCGATGTCCTACTTAAAATGGTGGAGGCAAACAGAAGTTCAGCTACTTACGATC CAAGCCATGTGCACAGGGCACAGCGTGCTTTAAAGAATGTTGCTTTGGGATACTTGGCGATGTTGAACGAACCTGAGATT ATAGACCTGGCTGTGAACGAAAACAAGAATGCCACAAACATGACTGATCAAGTATCTGCCTTGGCTGCTATTTGCCAGAA TTCTGGTGATGCACGGTCTAAGGCACTGGCAGAGTTCTACGAACAATGGAAGGATGAGACTTTGGTCATGAACAAATGGC TAGCTTTGCAAGCCATGTCAAATATCCCAGGAAATGTAGAGAATGTGCGGGGTTTGATGGAGCACCCTGCTTTTGACATC CGGAATCCAAACAAGGTCTATTCCTTGATTGGTGGTTTCTGCGCGTCCGCTGTAAATTTTCATGCTAAAGACGGTTCAGG GTACACGTTTTTGGCTGACGTTGTCCTTCAGTTGGATAAGCTGAATCCCCAGGTTGCGTCACGCATGGTGTCGGCTTTCT CCAGATGGCGGAGGTTTGATGAAGGCAGGCAGGCATTGGCCAAAGCACAATTGGAGCGGATTACGAGCCAGGATGGTCTG TCAGACAATGTTTTCGAAATTGCCTCAAAGAGTTTGGCGTCGTAA |
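This record lists two sequences under "Microexon-tag DNA Seq": a cluster-level one containing IUPAC ambiguity codes (W, Y, R, K, M) and the Physcomitrium patens one. The sketch below, offered only as an illustration, reports the positions where the species sequence is not covered by the degenerate sequence. It assumes the degenerate sequence is a consensus for cluster MEP07 that is aligned base for base with the species tag; the helper name `mismatches` is hypothetical. A consensus of this kind need not cover every member at every position.

```python
# IUPAC nucleotide ambiguity codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Both sequences are copied from this record.
CONSENSUS = "GCWCAYCCTGTTCGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGR"
INSTANCE = "GCACATCCAGTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGA"


def mismatches(consensus, seq):
    """Return (1-based position, consensus code, observed base) for each
    position where the observed base is not allowed by the IUPAC code."""
    if len(consensus) != len(seq):
        raise ValueError("sequences must have the same length")
    return [
        (i + 1, code, base)
        for i, (code, base) in enumerate(zip(consensus, seq))
        if base not in IUPAC[code]
    ]


diffs = mismatches(CONSENSUS, INSTANCE)
print(len(diffs), "positions fall outside the degenerate sequence")
for pos, code, base in diffs:
    print(pos, code, base)
```

The first reported difference is at position 9, where the degenerate sequence has T and the Physcomitrium patens tag has A.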