| Microexon ID | Pp_3:7088071-7088085:+ |
| Species | Physcomitrium patens |
| Coordinates | 3:7088071..7088085 |
| Microexon Cluster ID | MEP42 |
| Size | 15 |
| Phase | 0 |
| Pfam Domain Motif | bHLH-MYC_N |
| Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 48,15,45 |
| Microexon location in the Microexon-tag | 2 |
| Microexon-tag DNA Seq (consensus, IUPAC ambiguity codes) | GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY |
| Logo of Microexon-tag DNA Seq | (image not included) |
| Alignment of exons | (image not included) |
| Microexon DNA seq | ATGGCAGGAATTCAG |
| Microexon Amino Acid seq | MAGIQ |
| Microexon-tag DNA Seq | GCAGATAAGGCTTCTTATAAAATCTGCACACGAGCCAATTTAGCGAAGATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGT |
| Microexon-tag Amino Acid Seq | ADKASYKICTRANLAKMAGIQTILCVPIMNGVVELG |
| Microexon-tag spanning region | 7087588-7088319 |
| Microexon-tag prediction score | 0.8545 |
| Overlapped with the annotated transcript (%) | 100 |
| New Transcript ID | Pp3c3_9950V3.1x |
| Reference Transcript ID | Pp3c3_9950V3.1 |
| Gene ID | Pp3c3_9950 |
| Gene Name | NA |
| Transcript ID | Pp3c3_9950V3.1 |
| Protein ID | Pp3c3_9950V3.1 |
| Gene ID | Pp3c3_9950 |
| Gene Name | NA |
| Pfam domain motif | bHLH-MYC_N |
| Motif E-value | 2.5e-53 |
| Motif start | 51 |
| Motif end | 227 |
| Protein seq | >Pp3c3_9950V3.1 MMEMGTPNYWDAADPLMVEAFIGGYEIPGYETQDDLASTLGQDLEQNDSVLQRRLHRLVEESSEDWTYGIFWQLSLSPSG ESMLGWGDGYYKGPKDSDQFEPRKTQTEEHQLQRKKVLRELQALVSCPDDDGTEDVSDTEWFYLVSMCHSFAKGVGTPGQ ALAFGEYVWLEEADKASYKICTRANLAKMAGIQTILCVPIMNGVVELGSTDAIHERLDVVEYVKMVFQEPTWGLTNMSPI ISQSQVGKFDTTFMPHYPSIPFDSTSVSGVSSMTLNTDPGLADSESMDFGTRHSHMGKMVSHSGAFGFNGYDHVWGQTNE FHYNDPLPDDNVERDLGQPMCNILGSLPLQDEKLPLASSPPPKTLDSDSRYSIFQQNNVKKPPQLDHTQTSLPVTERLHP KPHTSQAFLHHNGSFDVGEMFNPPGHTQTVRSNPPSLDEQLHSPSMPAVEKLPIVEKPTSIYKPESVEKPMPVFKPLPQP PSPPASKPAVPVPANGLLLAGHLDQECVDTELITMKNNVVEAPKVPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYA LRAVVPNVSKMDKASLLGDAIAHINHLQEKLQDAEMRIKDLQRVASSKHEQDQEVLAIGTLKDAIQLKPEGNGTSPVFGT FSGGKRFSIAVDIVGEEAMIRISCLREAYSVVNMMMTLQELRLDIQHSNTSTTSDDILHIVIAKMKPTLKFTEEQLIALL ERSCQNTGYLRKREGSDRLLQRPDNSPQLQ* |
| CDS seq | >Pp3c3_9950V3.1 ATGATGGAGATGGGGACGCCAAACTACTGGGATGCCGCGGATCCGTTGATGGTGGAGGCCTTTATTGGAGGCTATGAGAT TCCGGGTTATGAGACACAGGATGATCTGGCTAGCACCTTGGGGCAGGATTTAGAGCAGAATGATTCTGTGCTGCAGCGTA GGTTGCACAGACTCGTGGAAGAGTCGTCGGAGGATTGGACCTATGGCATCTTCTGGCAGCTCTCTCTTTCGCCTTCCGGA GAGTCAATGTTGGGGTGGGGGGATGGGTATTACAAGGGACCGAAAGATAGTGACCAATTTGAGCCAAGGAAAACACAAAC CGAGGAGCATCAGCTACAGAGAAAGAAAGTACTACGAGAGCTTCAGGCTCTTGTTTCCTGTCCAGATGATGATGGCACTG AAGACGTCTCAGATACGGAGTGGTTTTACCTTGTTTCTATGTGTCACTCATTTGCAAAAGGGGTCGGTACCCCTGGTCAG GCATTGGCGTTTGGAGAATATGTATGGCTGGAGGAGGCAGATAAGGCTTCTTATAAAATCTGCACACGAGCCAATTTAGC GAAGATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGTTCAACTGATGCAATTC ACGAGCGCTTGGATGTTGTTGAGTATGTTAAGATGGTGTTCCAAGAGCCAACGTGGGGGTTGACTAATATGTCCCCAATT ATTTCCCAATCTCAAGTTGGAAAATTTGATACAACATTTATGCCTCATTATCCGAGTATACCTTTTGATTCGACGAGTGT CTCAGGTGTATCCTCCATGACACTGAACACTGATCCAGGTCTAGCAGACAGCGAAAGCATGGATTTTGGGACAAGACACA GCCATATGGGCAAGATGGTATCCCATTCAGGTGCCTTTGGTTTCAACGGATATGACCATGTATGGGGACAAACTAATGAG TTCCACTATAATGATCCACTTCCAGATGACAACGTCGAGAGAGATTTAGGACAACCTATGTGTAATATTTTAGGTAGCTT ACCTCTACAAGACGAGAAGCTCCCCTTAGCATCCAGTCCACCTCCTAAAACCCTAGACTCAGATTCGAGGTATTCAATTT TTCAGCAGAACAATGTAAAGAAGCCTCCCCAACTAGATCATACGCAGACATCTTTGCCTGTAACGGAAAGGTTACATCCT AAGCCACACACATCACAGGCCTTTCTGCATCATAATGGTTCCTTCGACGTGGGTGAAATGTTTAACCCCCCAGGGCATAC ACAGACTGTGCGATCAAATCCACCCAGCCTAGACGAGCAGTTGCATTCTCCTTCAATGCCTGCAGTCGAAAAGCTTCCAA TTGTTGAGAAACCCACGTCTATTTACAAACCTGAAAGTGTTGAGAAGCCTATGCCTGTTTTCAAGCCACTTCCACAACCA CCATCTCCTCCCGCTTCAAAGCCAGCAGTACCTGTCCCTGCAAATGGTTTACTGCTTGCCGGCCACCTGGACCAGGAGTG TGTTGATACAGAACTGATTACAATGAAAAATAATGTGGTCGAGGCTCCAAAAGTGCCTCGTAAACGGGGTCGGAAGCCCG CCAATGACCGGGAAGAGCCCCTGAACCATGTACAAGCTGAGCGGCAGCGGCGAGAGAAACTTAATAAACGATTTTATGCT CTTCGGGCTGTTGTGCCAAATGTCTCAAAGATGGACAAAGCTTCACTGTTAGGCGATGCAATTGCGCACATTAACCACCT GCAAGAGAAACTTCAGGATGCAGAAATGCGCATAAAGGATCTTCAGAGAGTTGCAAGTTCTAAGCACGAGCAAGACCAAG AGGTGCTTGCAATTGGTACGCTCAAGGATGCTATCCAACTGAAGCCTGAAGGGAATGGGACTAGCCCTGTGTTTGGCACA TTTTCTGGTGGTAAGAGGTTTAGTATTGCCGTAGATATCGTTGGAGAGGAGGCTATGATACGAATCAGCTGTCTGCGAGA AGCTTACTCTGTTGTCAATATGATGATGACTCTACAAGAATTACGACTCGACATACAACATTCTAATACATCCACCACAA GTGATGATATCCTGCATATTGTTATAGCCAAGATGAAACCAACCTTAAAGTTTACAGAGGAGCAGCTGATTGCTTTACTC GAAAGATCCTGTCAAAATACCGGGTACTTGAGGAAGCGGGAAGGAAGTGATAGACTTTTGCAAAGACCTGACAATTCTCC CCAACTTCAATAA |
| Microexon DNA seq | ATGGCAGGAATTCAG |
| Microexon Amino Acid seq | MAGIQ |
| Microexon-tag DNA Seq | ATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGT |
| Microexon-tag Amino Acid seq | MAGIQTILCVPIMNGVVELG |
| Transcript ID | Pp.18027.1 |
| Gene ID | Pp.18027 |
| Gene Name | NA |
| Pfam domain motif | bHLH-MYC_N |
| Motif E-value | 6.5e-11 |
| Motif start | 1 |
| Motif end | 39 |
| Protein seq | >Pp.18027.1 MAGIQTILCVPIMNGVVELGSTDAIHERLDVVEYVKMVFQEPTWGLTNMSPIISQSQVGKFDTTFMPHYPSIPFDSTSVS GVSSMTLNTDPGLADSESMDFGTRHSHMGKMVSHSGAFGFNGYDHVWGQTNEFHYNDPLPDDNVERDLGQPMCNILGSLP LQDEKLPLASSPPPKTLDSDSRYSIFQQNNVKKPPQLDHTQTSLPVTERLHPKPHTSQAFLHHNGSFDVGEMFNPPGHTQ TVRSNPPSLDEQLHSPSMPAVEKLPIVEKPTSIYKPESVEKPMPVFKPLPQPPSPPASKPAVPVPANGLLLAGHLDQECV DTELITMKNNVVEAPKVPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYALRAVVPNVSKMDKASLLGDAIAHINHLQ EKLQDAEMRIKDLQRVASSKHEQDQEVLAIGTLKDAIQLKPEGNGTSPVFGTFSGGKRFSIAVDIVGEEAMIRISCLREA YSVVNMMMTLQELRLDIQHSNTSTTSDDILHIVIAKMKPTLKFTEEQLIALLERSCQNTGYLRKREGSDRLLQRPDNSPQ LQ* |
| CDS seq | >Pp.18027.1 ATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGTTCAACTGATGCAATTCACGA GCGCTTGGATGTTGTTGAGTATGTTAAGATGGTGTTCCAAGAGCCAACGTGGGGGTTGACTAATATGTCCCCAATTATTT CCCAATCTCAAGTTGGAAAATTTGATACAACATTTATGCCTCATTATCCGAGTATACCTTTTGATTCGACGAGTGTCTCA GGTGTATCCTCCATGACACTGAACACTGATCCAGGTCTAGCAGACAGCGAAAGCATGGATTTTGGGACAAGACACAGCCA TATGGGCAAGATGGTATCCCATTCAGGTGCCTTTGGTTTCAACGGATATGACCATGTATGGGGACAAACTAATGAGTTCC ACTATAATGATCCACTTCCAGATGACAACGTCGAGAGAGATTTAGGACAACCTATGTGTAATATTTTAGGTAGCTTACCT CTACAAGACGAGAAGCTCCCCTTAGCATCCAGTCCACCTCCTAAAACCCTAGACTCAGATTCGAGGTATTCAATTTTTCA GCAGAACAATGTAAAGAAGCCTCCCCAACTAGATCATACGCAGACATCTTTGCCTGTAACGGAAAGGTTACATCCTAAGC CACACACATCACAGGCCTTTCTGCATCATAATGGTTCCTTCGACGTGGGTGAAATGTTTAACCCCCCAGGGCATACACAG ACTGTGCGATCAAATCCACCCAGCCTAGACGAGCAGTTGCATTCTCCTTCAATGCCTGCAGTCGAAAAGCTTCCAATTGT TGAGAAACCCACGTCTATTTACAAACCTGAAAGTGTTGAGAAGCCTATGCCTGTTTTCAAGCCACTTCCACAACCACCAT CTCCTCCCGCTTCAAAGCCAGCAGTACCTGTCCCTGCAAATGGTTTACTGCTTGCCGGCCACCTGGACCAGGAGTGTGTT GATACAGAACTGATTACAATGAAAAATAATGTGGTCGAGGCTCCAAAAGTGCCTCGTAAACGGGGTCGGAAGCCCGCCAA TGACCGGGAAGAGCCCCTGAACCATGTACAAGCTGAGCGGCAGCGGCGAGAGAAACTTAATAAACGATTTTATGCTCTTC GGGCTGTTGTGCCAAATGTCTCAAAGATGGACAAAGCTTCACTGTTAGGCGATGCAATTGCGCACATTAACCACCTGCAA GAGAAACTTCAGGATGCAGAAATGCGCATAAAGGATCTTCAGAGAGTTGCAAGTTCTAAGCACGAGCAAGACCAAGAGGT GCTTGCAATTGGTACGCTCAAGGATGCTATCCAACTGAAGCCTGAAGGGAATGGGACTAGCCCTGTGTTTGGCACATTTT CTGGTGGTAAGAGGTTTAGTATTGCCGTAGATATCGTTGGAGAGGAGGCTATGATACGAATCAGCTGTCTGCGAGAAGCT TACTCTGTTGTCAATATGATGATGACTCTACAAGAATTACGACTCGACATACAACATTCTAATACATCCACCACAAGTGA TGATATCCTGCATATTGTTATAGCCAAGATGAAACCAACCTTAAAGTTTACAGAGGAGCAGCTGATTGCTTTACTCGAAA GATCCTGTCAAAATACCGGGTACTTGAGGAAGCGGGAAGGAAGTGATAGACTTTTGCAAAGACCTGACAATTCTCCCCAA CTTCAATAA |
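
The fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the database itself, that recomputes the microexon size from the coordinates, splits the Pp3c3_9950V3.1 microexon-tag into its three exon segments using the listed sizes (48, 15, 45), and translates the segments. It assumes the standard genetic code and that the tag is in frame from its first base, which is consistent with the phase of 0 reported above. The helper names `translate` and `split_tag` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon (assumes phase 0)."""
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def split_tag(tag: str, sizes: tuple[int, ...]) -> list[str]:
    """Cut a microexon-tag into consecutive segments of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("segment sizes do not add up to the tag length")
    segments, pos = [], 0
    for size in sizes:
        segments.append(tag[pos:pos + size])
        pos += size
    return segments


# Values copied from the record above.
COORD_START, COORD_END = 7088071, 7088085
TAG_SIZES = (48, 15, 45)
MICROEXON_DNA = "ATGGCAGGAATTCAG"
TAG_DNA = (
    "GCAGATAAGGCTTCTTATAAAATCTGCACACGAGCCAATTTAGCGAAG"
    "ATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGT"
)

upstream, microexon, downstream = split_tag(TAG_DNA, TAG_SIZES)

print(COORD_END - COORD_START + 1)   # microexon size implied by the coordinates
print(microexon == MICROEXON_DNA)    # segment 2 should be the microexon itself
print(translate(microexon))          # compare with "Microexon Amino Acid seq"
print(translate(TAG_DNA))            # compare with "Microexon-tag Amino Acid Seq"
```

The same check applied to the Pp.18027.1 entry would use sizes `(15, 45)`, since that transcript's tag starts at the microexon and has no upstream flanking exon.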

