
Microexon ID | Pp_3:7099543-7099557:- |
Species | Physcomitrium patens | Coordinates | 3:7099543..7099557 |
Microexon Cluster ID | MEP42 |
Size | 15 |
Phase | 0 |
Pfam Domain Motif | bHLH-MYC_N |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 48,15,45 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq | GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Microexon DNA seq | ATGGCTGGTATCCAG |
Microexon Amino Acid seq | MAGIQ |
Microexon-tag DNA Seq | GCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTGTTCCTACAAGAACTGGGGTTGTTGAGCTTGGA |
Microexon-tag Amino Acid Seq | ANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELG |
Microexon-tag spanning region | 7099304-7100083 |
Microexon-tag prediction score | 0.858 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | Pp3c3_9970V3.1x |
Reference Transcript ID | Pp3c3_9970V3.1 |
Gene ID | Pp3c3_9970 |
Gene Name | NA |
Transcript ID | Pp3c3_9970V3.1 |
Protein ID | Pp3c3_9970V3.1 |
Gene ID | Pp3c3_9970 |
Gene Name | NA |
Pfam domain motif | bHLH-MYC_N |
Motif E-value | 9e-43 |
Motif start | 16 |
Motif end | 163 |
Protein seq | >Pp3c3_9970V3.1 MPVVLPTEYPSKYVCRMLGWGDGYFKGPKENEISEKRIDQGGSEEDQQLRRKVLRELQSLVSNTEEDVSDYVTDTEWFYL VSMSHSFAYGVGTPGQALATESPVWLTEANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELGSTDLISQNMDVVHHIK MVFDEPFWGANRSQVMAQSLLMDSDATFFPPSPSIMSMGTTSAFASSPSVASRGSTLGKDHESHYRGRNVSVEKIGSSMA STSFDTLDYMWQQSDEMQFNDGVSVGTTEKDQGQSRLYYPVLGPPVLVEKLPFSATSLISRTRAAEVKHSSMLQNVEKLA SEDQKPSSLPHIKVHTTHSYPEKTGAGELSQVLSTPDLRQSIEMKLPAQVETRRAPGITGGATKPVAEKAKPVPKPPQQQ QTAISGPPASASGRSSFDQSEHDSFQESEAEISFKESSAVEFSLNVGTKPPRKRGRKPANDREEPLSHVQAERQRREKLN QRFYALRAVVPNVSKMDKASLLGDAIAYINELTSKLQSAEAQIKDLKGHVVGSSDKSQESLSIARGSMDNSTIDGLSIRP QGSVNSTSISGNAPSGTKPTIAVHILGQEAMIRINCLKDSVALLQMMMALQELRLEVRHSNTSTTQDMVLHIVIVKIEPT EHYTQEQLCAILERSCQPYSCSTKDEGHGLSEKLGSSRRSQ* |
CDS seq | >Pp3c3_9970V3.1 ATGCCTGTTGTTTTACCGACAGAGTATCCATCGAAATATGTTTGCAGGATGTTGGGGTGGGGTGATGGGTACTTCAAGGG GCCGAAAGAGAATGAAATTTCTGAGAAGCGTATAGATCAGGGAGGTAGTGAAGAGGACCAGCAACTGAGACGGAAAGTGT TGAGAGAACTGCAGTCTCTCGTGAGCAATACTGAAGAGGACGTTAGTGATTATGTAACAGATACAGAATGGTTCTACCTT GTCTCTATGTCGCACTCATTTGCGTATGGTGTGGGGACTCCGGGCCAAGCACTGGCAACTGAAAGTCCTGTGTGGCTTAC TGAGGCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTG TTCCTACAAGAACTGGGGTTGTTGAGCTTGGATCTACGGATTTGATAAGTCAAAACATGGACGTTGTTCACCATATAAAG ATGGTATTTGATGAACCATTTTGGGGAGCAAATCGAAGTCAAGTTATGGCTCAATCCTTGCTTATGGACAGCGATGCGAC TTTCTTCCCTCCAAGTCCAAGTATCATGTCGATGGGCACCACAAGTGCGTTTGCATCAAGCCCCAGCGTAGCTAGTAGAG GCTCTACTCTTGGTAAAGATCATGAATCACATTATAGAGGGAGGAATGTGTCAGTCGAAAAGATTGGTTCCTCAATGGCA AGCACTAGCTTTGATACTTTGGACTATATGTGGCAGCAATCAGACGAAATGCAATTTAATGATGGAGTATCAGTTGGTAC TACTGAGAAGGACCAAGGACAATCACGTTTGTATTATCCTGTTTTGGGACCACCTGTTCTGGTAGAGAAACTTCCATTTT CTGCCACTAGTTTGATAAGTAGAACCAGGGCAGCAGAAGTGAAGCATTCAAGCATGCTTCAGAATGTTGAAAAGTTGGCC TCTGAAGATCAAAAGCCTTCATCACTACCACATATCAAGGTACATACTACACACTCCTACCCCGAAAAGACTGGTGCTGG GGAATTGAGCCAGGTTTTAAGTACCCCAGATCTCAGACAATCGATAGAGATGAAACTTCCAGCGCAAGTAGAAACCAGGC GTGCTCCCGGAATAACTGGGGGCGCTACGAAGCCTGTGGCTGAGAAAGCCAAACCTGTTCCTAAACCTCCTCAGCAACAA CAGACGGCGATATCAGGACCTCCAGCTAGCGCAAGTGGGCGTTCAAGTTTTGATCAGTCAGAGCATGATTCCTTTCAAGA ATCAGAAGCTGAGATCTCTTTCAAGGAGAGCAGCGCAGTGGAGTTCAGTTTAAATGTTGGTACAAAACCTCCTCGAAAAC GGGGTCGAAAGCCAGCCAATGACAGAGAGGAGCCCTTAAGCCATGTGCAAGCGGAAAGGCAGAGAAGGGAGAAGCTCAAT CAGCGATTTTATGCTTTGCGAGCTGTGGTGCCAAACGTTTCCAAAATGGACAAAGCGTCGTTGTTGGGCGATGCTATTGC ATACATTAATGAGCTTACGAGCAAGCTGCAATCGGCAGAGGCTCAAATCAAAGACCTGAAAGGTCATGTGGTTGGTTCGA GTGACAAATCACAAGAATCACTTTCGATTGCAAGGGGTTCTATGGATAACTCGACTATAGATGGGTTGAGCATAAGACCA CAAGGGAGTGTTAACAGCACCTCGATTTCTGGAAATGCACCAAGTGGGACGAAACCTACTATTGCAGTGCACATTCTTGG GCAAGAGGCTATGATTCGTATAAACTGTTTAAAGGACTCTGTTGCCCTTCTTCAGATGATGATGGCATTGCAAGAGTTAC GTTTGGAGGTTCGGCACTCAAATACCTCCACCACGCAGGACATGGTTCTACACATTGTAATTGTGAAGATTGAGCCAACT GAACATTACACACAGGAGCAGCTATGTGCGATACTGGAGAGGTCATGTCAACCCTACAGCTGTTCTACCAAGGATGAAGG TCACGGGCTTTCAGAAAAGCTAGGTTCATCAAGACGATCTCAATAA |
Microexon DNA seq | ATGGCTGGTATCCAG |
Microexon Amino Acid seq | MAGIQ |
Microexon-tag DNA Seq | GCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTGTTCCTACAAGAACTGGGGTTGTTGAGCTTGGA |
Microexon-tag Amino Acid seq | ANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELG |
Transcript ID | Pp.18029.1 |
Gene ID | Pp.18029 |
Gene Name | NA |
Pfam domain motif | bHLH-MYC_N |
Motif E-value | 9e-43 |
Motif start | 16 |
Motif end | 163 |
Protein seq | >Pp.18029.1 MPVVLPTEYPSKYVCRMLGWGDGYFKGPKENEISEKRIDQGGSEEDQQLRRKVLRELQSLVSNTEEDVSDYVTDTEWFYL VSMSHSFAYGVGTPGQALATESPVWLTEANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELGSTDLISQNMDVVHHIK MVFDEPFWGANRSQVMAQSLLMDSDATFFPPSPSIMSMGTTSAFASSPSVASRGSTLGKDHESHYRGRNVSVEKIGSSMA STSFDTLDYMWQQSDEMQFNDGVSVGTTEKDQGQSRLYYPVLGPPVLVEKLPFSATSLISRTRAAEVKHSSMLQNVEKLA SEDQKPSSLPHIKVHTTHSYPEKTGAGELSQVLSTPDLRQSIEMKLPAQVETRRAPGITGGATKPVAEKAKPVPKPPQQQ QTAISGPPASASGRSSFDQSEHDSFQESEAEISFKESSAVEFSLNVGTKPPRKRGRKPANDREEPLSHVQAERQRREKLN QRFYALRAVVPNVSKMDKASLLGDAIAYINELTSKLQSAEAQIKDLKGHVVGSSDKSQESLSIARGSMDNSTIDGLSIRP QGSVNSTSISGNAPSGTKPTIAVHILGQEAMIRINCLKDSVALLQMMMALQELRLEVRHSNTSTTQDMVLHIVIVKIEPT EHYTQEQLCAILERSCQPYSCSTKDEGHGLSEKLGSSRRSQ* |
CDS seq | >Pp.18029.1 ATGCCTGTTGTTTTACCGACAGAGTATCCATCGAAATATGTTTGCAGGATGTTGGGGTGGGGTGATGGGTACTTCAAGGG GCCGAAAGAGAATGAAATTTCTGAGAAGCGTATAGATCAGGGAGGTAGTGAAGAGGACCAGCAACTGAGACGGAAAGTGT TGAGAGAACTGCAGTCTCTCGTGAGCAATACTGAAGAGGACGTTAGTGATTATGTAACAGATACAGAATGGTTCTACCTT GTCTCTATGTCGCACTCATTTGCGTATGGTGTGGGGACTCCGGGCCAAGCACTGGCAACTGAAAGTCCTGTGTGGCTTAC TGAGGCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTG TTCCTACAAGAACTGGGGTTGTTGAGCTTGGATCTACGGATTTGATAAGTCAAAACATGGACGTTGTTCACCATATAAAG ATGGTATTTGATGAACCATTTTGGGGAGCAAATCGAAGTCAAGTTATGGCTCAATCCTTGCTTATGGACAGCGATGCGAC TTTCTTCCCTCCAAGTCCAAGTATCATGTCGATGGGCACCACAAGTGCGTTTGCATCAAGCCCCAGCGTAGCTAGTAGAG GCTCTACTCTTGGTAAAGATCATGAATCACATTATAGAGGGAGGAATGTGTCAGTCGAAAAGATTGGTTCCTCAATGGCA AGCACTAGCTTTGATACTTTGGACTATATGTGGCAGCAATCAGACGAAATGCAATTTAATGATGGAGTATCAGTTGGTAC TACTGAGAAGGACCAAGGACAATCACGTTTGTATTATCCTGTTTTGGGACCACCTGTTCTGGTAGAGAAACTTCCATTTT CTGCCACTAGTTTGATAAGTAGAACCAGGGCAGCAGAAGTGAAGCATTCAAGCATGCTTCAGAATGTTGAAAAGTTGGCC TCTGAAGATCAAAAGCCTTCATCACTACCACATATCAAGGTACATACTACACACTCCTACCCCGAAAAGACTGGTGCTGG GGAATTGAGCCAGGTTTTAAGTACCCCAGATCTCAGACAATCGATAGAGATGAAACTTCCAGCGCAAGTAGAAACCAGGC GTGCTCCCGGAATAACTGGGGGCGCTACGAAGCCTGTGGCTGAGAAAGCCAAACCTGTTCCTAAACCTCCTCAGCAACAA CAGACGGCGATATCAGGACCTCCAGCTAGCGCAAGTGGGCGTTCAAGTTTTGATCAGTCAGAGCATGATTCCTTTCAAGA ATCAGAAGCTGAGATCTCTTTCAAGGAGAGCAGCGCAGTGGAGTTCAGTTTAAATGTTGGTACAAAACCTCCTCGAAAAC GGGGTCGAAAGCCAGCCAATGACAGAGAGGAGCCCTTAAGCCATGTGCAAGCGGAAAGGCAGAGAAGGGAGAAGCTCAAT CAGCGATTTTATGCTTTGCGAGCTGTGGTGCCAAACGTTTCCAAAATGGACAAAGCGTCGTTGTTGGGCGATGCTATTGC ATACATTAATGAGCTTACGAGCAAGCTGCAATCGGCAGAGGCTCAAATCAAAGACCTGAAAGGTCATGTGGTTGGTTCGA GTGACAAATCACAAGAATCACTTTCGATTGCAAGGGGTTCTATGGATAACTCGACTATAGATGGGTTGAGCATAAGACCA CAAGGGAGTGTTAACAGCACCTCGATTTCTGGAAATGCACCAAGTGGGACGAAACCTACTATTGCAGTGCACATTCTTGG GCAAGAGGCTATGATTCGTATAAACTGTTTAAAGGACTCTGTTGCCCTTCTTCAGATGATGATGGCATTGCAAGAGTTAC GTTTGGAGGTTCGGCACTCAAATACCTCCACCACGCAGGACATGGTTCTACACATTGTAATTGTGAAGATTGAGCCAACT GAACATTACACACAGGAGCAGCTATGTGCGATACTGGAGAGGTCATGTCAACCCTACAGCTGTTCTACCAAGGATGAAGG TCACGGGCTTTCAGAAAAGCTAGGTTCATCAAGACGATCTCAATAA |