| Microexon ID | Pp_3:7099543-7099557:- |
| Species | Physcomitrium patens | Coordinates | 3:7099543..7099557 |
| Microexon Cluster ID | MEP42 |
| Size | 15 |
| Phase | 0 |
| Pfam Domain Motif | bHLH-MYC_N |
| Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 48,15,45 |
| Microexon location in the Microexon-tag | 2 |
| Microexon-tag DNA Seq | GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY |
| Logo of Microexon-tag DNA Seq | ![]() |
| Alignment of exons | ![]() |
| Microexon DNA seq | ATGGCTGGTATCCAG |
| Microexon Amino Acid seq | MAGIQ |
| Microexon-tag DNA Seq | GCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTGTTCCTACAAGAACTGGGGTTGTTGAGCTTGGA |
| Microexon-tag Amino Acid Seq | ANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELG |
| Microexon-tag spanning region | 7099304-7100083 |
| Microexon-tag prediction score | 0.858 |
| Overlapped with the annotated transcript (%) | 100 |
| New Transcript ID | Pp3c3_9970V3.1x |
| Reference Transcript ID | Pp3c3_9970V3.1 |
| Gene ID | Pp3c3_9970 |
| Gene Name | NA |
| Transcript ID | Pp3c3_9970V3.1 |
| Protein ID | Pp3c3_9970V3.1 |
| Gene ID | Pp3c3_9970 |
| Gene Name | NA |
| Pfam domain motif | bHLH-MYC_N |
| Motif E-value | 9e-43 |
| Motif start | 16 |
| Motif end | 163 |
| Protein seq | >Pp3c3_9970V3.1 MPVVLPTEYPSKYVCRMLGWGDGYFKGPKENEISEKRIDQGGSEEDQQLRRKVLRELQSLVSNTEEDVSDYVTDTEWFYL VSMSHSFAYGVGTPGQALATESPVWLTEANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELGSTDLISQNMDVVHHIK MVFDEPFWGANRSQVMAQSLLMDSDATFFPPSPSIMSMGTTSAFASSPSVASRGSTLGKDHESHYRGRNVSVEKIGSSMA STSFDTLDYMWQQSDEMQFNDGVSVGTTEKDQGQSRLYYPVLGPPVLVEKLPFSATSLISRTRAAEVKHSSMLQNVEKLA SEDQKPSSLPHIKVHTTHSYPEKTGAGELSQVLSTPDLRQSIEMKLPAQVETRRAPGITGGATKPVAEKAKPVPKPPQQQ QTAISGPPASASGRSSFDQSEHDSFQESEAEISFKESSAVEFSLNVGTKPPRKRGRKPANDREEPLSHVQAERQRREKLN QRFYALRAVVPNVSKMDKASLLGDAIAYINELTSKLQSAEAQIKDLKGHVVGSSDKSQESLSIARGSMDNSTIDGLSIRP QGSVNSTSISGNAPSGTKPTIAVHILGQEAMIRINCLKDSVALLQMMMALQELRLEVRHSNTSTTQDMVLHIVIVKIEPT EHYTQEQLCAILERSCQPYSCSTKDEGHGLSEKLGSSRRSQ* |
| CDS seq | >Pp3c3_9970V3.1 ATGCCTGTTGTTTTACCGACAGAGTATCCATCGAAATATGTTTGCAGGATGTTGGGGTGGGGTGATGGGTACTTCAAGGG GCCGAAAGAGAATGAAATTTCTGAGAAGCGTATAGATCAGGGAGGTAGTGAAGAGGACCAGCAACTGAGACGGAAAGTGT TGAGAGAACTGCAGTCTCTCGTGAGCAATACTGAAGAGGACGTTAGTGATTATGTAACAGATACAGAATGGTTCTACCTT GTCTCTATGTCGCACTCATTTGCGTATGGTGTGGGGACTCCGGGCCAAGCACTGGCAACTGAAAGTCCTGTGTGGCTTAC TGAGGCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTG TTCCTACAAGAACTGGGGTTGTTGAGCTTGGATCTACGGATTTGATAAGTCAAAACATGGACGTTGTTCACCATATAAAG ATGGTATTTGATGAACCATTTTGGGGAGCAAATCGAAGTCAAGTTATGGCTCAATCCTTGCTTATGGACAGCGATGCGAC TTTCTTCCCTCCAAGTCCAAGTATCATGTCGATGGGCACCACAAGTGCGTTTGCATCAAGCCCCAGCGTAGCTAGTAGAG GCTCTACTCTTGGTAAAGATCATGAATCACATTATAGAGGGAGGAATGTGTCAGTCGAAAAGATTGGTTCCTCAATGGCA AGCACTAGCTTTGATACTTTGGACTATATGTGGCAGCAATCAGACGAAATGCAATTTAATGATGGAGTATCAGTTGGTAC TACTGAGAAGGACCAAGGACAATCACGTTTGTATTATCCTGTTTTGGGACCACCTGTTCTGGTAGAGAAACTTCCATTTT CTGCCACTAGTTTGATAAGTAGAACCAGGGCAGCAGAAGTGAAGCATTCAAGCATGCTTCAGAATGTTGAAAAGTTGGCC TCTGAAGATCAAAAGCCTTCATCACTACCACATATCAAGGTACATACTACACACTCCTACCCCGAAAAGACTGGTGCTGG GGAATTGAGCCAGGTTTTAAGTACCCCAGATCTCAGACAATCGATAGAGATGAAACTTCCAGCGCAAGTAGAAACCAGGC GTGCTCCCGGAATAACTGGGGGCGCTACGAAGCCTGTGGCTGAGAAAGCCAAACCTGTTCCTAAACCTCCTCAGCAACAA CAGACGGCGATATCAGGACCTCCAGCTAGCGCAAGTGGGCGTTCAAGTTTTGATCAGTCAGAGCATGATTCCTTTCAAGA ATCAGAAGCTGAGATCTCTTTCAAGGAGAGCAGCGCAGTGGAGTTCAGTTTAAATGTTGGTACAAAACCTCCTCGAAAAC GGGGTCGAAAGCCAGCCAATGACAGAGAGGAGCCCTTAAGCCATGTGCAAGCGGAAAGGCAGAGAAGGGAGAAGCTCAAT CAGCGATTTTATGCTTTGCGAGCTGTGGTGCCAAACGTTTCCAAAATGGACAAAGCGTCGTTGTTGGGCGATGCTATTGC ATACATTAATGAGCTTACGAGCAAGCTGCAATCGGCAGAGGCTCAAATCAAAGACCTGAAAGGTCATGTGGTTGGTTCGA GTGACAAATCACAAGAATCACTTTCGATTGCAAGGGGTTCTATGGATAACTCGACTATAGATGGGTTGAGCATAAGACCA CAAGGGAGTGTTAACAGCACCTCGATTTCTGGAAATGCACCAAGTGGGACGAAACCTACTATTGCAGTGCACATTCTTGG GCAAGAGGCTATGATTCGTATAAACTGTTTAAAGGACTCTGTTGCCCTTCTTCAGATGATGATGGCATTGCAAGAGTTAC GTTTGGAGGTTCGGCACTCAAATACCTCCACCACGCAGGACATGGTTCTACACATTGTAATTGTGAAGATTGAGCCAACT GAACATTACACACAGGAGCAGCTATGTGCGATACTGGAGAGGTCATGTCAACCCTACAGCTGTTCTACCAAGGATGAAGG TCACGGGCTTTCAGAAAAGCTAGGTTCATCAAGACGATCTCAATAA |
| Microexon DNA seq | ATGGCTGGTATCCAG |
| Microexon Amino Acid seq | MAGIQ |
| Microexon-tag DNA Seq | GCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTGTTCCTACAAGAACTGGGGTTGTTGAGCTTGGA |
| Microexon-tag Amino Acid seq | ANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELG |
| Transcript ID | Pp.18029.1 |
| Gene ID | Pp.18029 |
| Gene Name | NA |
| Pfam domain motif | bHLH-MYC_N |
| Motif E-value | 9e-43 |
| Motif start | 16 |
| Motif end | 163 |
| Protein seq | >Pp.18029.1 MPVVLPTEYPSKYVCRMLGWGDGYFKGPKENEISEKRIDQGGSEEDQQLRRKVLRELQSLVSNTEEDVSDYVTDTEWFYL VSMSHSFAYGVGTPGQALATESPVWLTEANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELGSTDLISQNMDVVHHIK MVFDEPFWGANRSQVMAQSLLMDSDATFFPPSPSIMSMGTTSAFASSPSVASRGSTLGKDHESHYRGRNVSVEKIGSSMA STSFDTLDYMWQQSDEMQFNDGVSVGTTEKDQGQSRLYYPVLGPPVLVEKLPFSATSLISRTRAAEVKHSSMLQNVEKLA SEDQKPSSLPHIKVHTTHSYPEKTGAGELSQVLSTPDLRQSIEMKLPAQVETRRAPGITGGATKPVAEKAKPVPKPPQQQ QTAISGPPASASGRSSFDQSEHDSFQESEAEISFKESSAVEFSLNVGTKPPRKRGRKPANDREEPLSHVQAERQRREKLN QRFYALRAVVPNVSKMDKASLLGDAIAYINELTSKLQSAEAQIKDLKGHVVGSSDKSQESLSIARGSMDNSTIDGLSIRP QGSVNSTSISGNAPSGTKPTIAVHILGQEAMIRINCLKDSVALLQMMMALQELRLEVRHSNTSTTQDMVLHIVIVKIEPT EHYTQEQLCAILERSCQPYSCSTKDEGHGLSEKLGSSRRSQ* |
| CDS seq | >Pp.18029.1 ATGCCTGTTGTTTTACCGACAGAGTATCCATCGAAATATGTTTGCAGGATGTTGGGGTGGGGTGATGGGTACTTCAAGGG GCCGAAAGAGAATGAAATTTCTGAGAAGCGTATAGATCAGGGAGGTAGTGAAGAGGACCAGCAACTGAGACGGAAAGTGT TGAGAGAACTGCAGTCTCTCGTGAGCAATACTGAAGAGGACGTTAGTGATTATGTAACAGATACAGAATGGTTCTACCTT GTCTCTATGTCGCACTCATTTGCGTATGGTGTGGGGACTCCGGGCCAAGCACTGGCAACTGAAAGTCCTGTGTGGCTTAC TGAGGCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTG TTCCTACAAGAACTGGGGTTGTTGAGCTTGGATCTACGGATTTGATAAGTCAAAACATGGACGTTGTTCACCATATAAAG ATGGTATTTGATGAACCATTTTGGGGAGCAAATCGAAGTCAAGTTATGGCTCAATCCTTGCTTATGGACAGCGATGCGAC TTTCTTCCCTCCAAGTCCAAGTATCATGTCGATGGGCACCACAAGTGCGTTTGCATCAAGCCCCAGCGTAGCTAGTAGAG GCTCTACTCTTGGTAAAGATCATGAATCACATTATAGAGGGAGGAATGTGTCAGTCGAAAAGATTGGTTCCTCAATGGCA AGCACTAGCTTTGATACTTTGGACTATATGTGGCAGCAATCAGACGAAATGCAATTTAATGATGGAGTATCAGTTGGTAC TACTGAGAAGGACCAAGGACAATCACGTTTGTATTATCCTGTTTTGGGACCACCTGTTCTGGTAGAGAAACTTCCATTTT CTGCCACTAGTTTGATAAGTAGAACCAGGGCAGCAGAAGTGAAGCATTCAAGCATGCTTCAGAATGTTGAAAAGTTGGCC TCTGAAGATCAAAAGCCTTCATCACTACCACATATCAAGGTACATACTACACACTCCTACCCCGAAAAGACTGGTGCTGG GGAATTGAGCCAGGTTTTAAGTACCCCAGATCTCAGACAATCGATAGAGATGAAACTTCCAGCGCAAGTAGAAACCAGGC GTGCTCCCGGAATAACTGGGGGCGCTACGAAGCCTGTGGCTGAGAAAGCCAAACCTGTTCCTAAACCTCCTCAGCAACAA CAGACGGCGATATCAGGACCTCCAGCTAGCGCAAGTGGGCGTTCAAGTTTTGATCAGTCAGAGCATGATTCCTTTCAAGA ATCAGAAGCTGAGATCTCTTTCAAGGAGAGCAGCGCAGTGGAGTTCAGTTTAAATGTTGGTACAAAACCTCCTCGAAAAC GGGGTCGAAAGCCAGCCAATGACAGAGAGGAGCCCTTAAGCCATGTGCAAGCGGAAAGGCAGAGAAGGGAGAAGCTCAAT CAGCGATTTTATGCTTTGCGAGCTGTGGTGCCAAACGTTTCCAAAATGGACAAAGCGTCGTTGTTGGGCGATGCTATTGC ATACATTAATGAGCTTACGAGCAAGCTGCAATCGGCAGAGGCTCAAATCAAAGACCTGAAAGGTCATGTGGTTGGTTCGA GTGACAAATCACAAGAATCACTTTCGATTGCAAGGGGTTCTATGGATAACTCGACTATAGATGGGTTGAGCATAAGACCA CAAGGGAGTGTTAACAGCACCTCGATTTCTGGAAATGCACCAAGTGGGACGAAACCTACTATTGCAGTGCACATTCTTGG GCAAGAGGCTATGATTCGTATAAACTGTTTAAAGGACTCTGTTGCCCTTCTTCAGATGATGATGGCATTGCAAGAGTTAC GTTTGGAGGTTCGGCACTCAAATACCTCCACCACGCAGGACATGGTTCTACACATTGTAATTGTGAAGATTGAGCCAACT GAACATTACACACAGGAGCAGCTATGTGCGATACTGGAGAGGTCATGTCAACCCTACAGCTGTTCTACCAAGGATGAAGG TCACGGGCTTTCAGAAAAGCTAGGTTCATCAAGACGATCTCAATAA |

