Microexon ID Pp_3:7099543-7099557:-
Species Physcomitrium patens
Coordinates 3:7099543..7099557
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATGGCTGGTATCCAG
Microexon Amino Acid seq MAGIQ
Microexon-tag DNA Seq GCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTGTTCCTACAAGAACTGGGGTTGTTGAGCTTGGA
Microexon-tag Amino Acid Seq ANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELG
Microexon-tag spanning region7099304-7100083
Microexon-tag prediction score0.858
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c3_9970V3.1x
Reference Transcript ID Pp3c3_9970V3.1
Gene ID Pp3c3_9970
Gene Name NA
Transcript ID Pp3c3_9970V3.1
Protein ID Pp3c3_9970V3.1
Gene ID Pp3c3_9970
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 9e-43
Motif start 16
Motif end 163
Protein seq >Pp3c3_9970V3.1
MPVVLPTEYPSKYVCRMLGWGDGYFKGPKENEISEKRIDQGGSEEDQQLRRKVLRELQSLVSNTEEDVSDYVTDTEWFYL
VSMSHSFAYGVGTPGQALATESPVWLTEANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELGSTDLISQNMDVVHHIK
MVFDEPFWGANRSQVMAQSLLMDSDATFFPPSPSIMSMGTTSAFASSPSVASRGSTLGKDHESHYRGRNVSVEKIGSSMA
STSFDTLDYMWQQSDEMQFNDGVSVGTTEKDQGQSRLYYPVLGPPVLVEKLPFSATSLISRTRAAEVKHSSMLQNVEKLA
SEDQKPSSLPHIKVHTTHSYPEKTGAGELSQVLSTPDLRQSIEMKLPAQVETRRAPGITGGATKPVAEKAKPVPKPPQQQ
QTAISGPPASASGRSSFDQSEHDSFQESEAEISFKESSAVEFSLNVGTKPPRKRGRKPANDREEPLSHVQAERQRREKLN
QRFYALRAVVPNVSKMDKASLLGDAIAYINELTSKLQSAEAQIKDLKGHVVGSSDKSQESLSIARGSMDNSTIDGLSIRP
QGSVNSTSISGNAPSGTKPTIAVHILGQEAMIRINCLKDSVALLQMMMALQELRLEVRHSNTSTTQDMVLHIVIVKIEPT
EHYTQEQLCAILERSCQPYSCSTKDEGHGLSEKLGSSRRSQ*
CDS seq >Pp3c3_9970V3.1
ATGCCTGTTGTTTTACCGACAGAGTATCCATCGAAATATGTTTGCAGGATGTTGGGGTGGGGTGATGGGTACTTCAAGGG
GCCGAAAGAGAATGAAATTTCTGAGAAGCGTATAGATCAGGGAGGTAGTGAAGAGGACCAGCAACTGAGACGGAAAGTGT
TGAGAGAACTGCAGTCTCTCGTGAGCAATACTGAAGAGGACGTTAGTGATTATGTAACAGATACAGAATGGTTCTACCTT
GTCTCTATGTCGCACTCATTTGCGTATGGTGTGGGGACTCCGGGCCAAGCACTGGCAACTGAAAGTCCTGTGTGGCTTAC
TGAGGCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTG
TTCCTACAAGAACTGGGGTTGTTGAGCTTGGATCTACGGATTTGATAAGTCAAAACATGGACGTTGTTCACCATATAAAG
ATGGTATTTGATGAACCATTTTGGGGAGCAAATCGAAGTCAAGTTATGGCTCAATCCTTGCTTATGGACAGCGATGCGAC
TTTCTTCCCTCCAAGTCCAAGTATCATGTCGATGGGCACCACAAGTGCGTTTGCATCAAGCCCCAGCGTAGCTAGTAGAG
GCTCTACTCTTGGTAAAGATCATGAATCACATTATAGAGGGAGGAATGTGTCAGTCGAAAAGATTGGTTCCTCAATGGCA
AGCACTAGCTTTGATACTTTGGACTATATGTGGCAGCAATCAGACGAAATGCAATTTAATGATGGAGTATCAGTTGGTAC
TACTGAGAAGGACCAAGGACAATCACGTTTGTATTATCCTGTTTTGGGACCACCTGTTCTGGTAGAGAAACTTCCATTTT
CTGCCACTAGTTTGATAAGTAGAACCAGGGCAGCAGAAGTGAAGCATTCAAGCATGCTTCAGAATGTTGAAAAGTTGGCC
TCTGAAGATCAAAAGCCTTCATCACTACCACATATCAAGGTACATACTACACACTCCTACCCCGAAAAGACTGGTGCTGG
GGAATTGAGCCAGGTTTTAAGTACCCCAGATCTCAGACAATCGATAGAGATGAAACTTCCAGCGCAAGTAGAAACCAGGC
GTGCTCCCGGAATAACTGGGGGCGCTACGAAGCCTGTGGCTGAGAAAGCCAAACCTGTTCCTAAACCTCCTCAGCAACAA
CAGACGGCGATATCAGGACCTCCAGCTAGCGCAAGTGGGCGTTCAAGTTTTGATCAGTCAGAGCATGATTCCTTTCAAGA
ATCAGAAGCTGAGATCTCTTTCAAGGAGAGCAGCGCAGTGGAGTTCAGTTTAAATGTTGGTACAAAACCTCCTCGAAAAC
GGGGTCGAAAGCCAGCCAATGACAGAGAGGAGCCCTTAAGCCATGTGCAAGCGGAAAGGCAGAGAAGGGAGAAGCTCAAT
CAGCGATTTTATGCTTTGCGAGCTGTGGTGCCAAACGTTTCCAAAATGGACAAAGCGTCGTTGTTGGGCGATGCTATTGC
ATACATTAATGAGCTTACGAGCAAGCTGCAATCGGCAGAGGCTCAAATCAAAGACCTGAAAGGTCATGTGGTTGGTTCGA
GTGACAAATCACAAGAATCACTTTCGATTGCAAGGGGTTCTATGGATAACTCGACTATAGATGGGTTGAGCATAAGACCA
CAAGGGAGTGTTAACAGCACCTCGATTTCTGGAAATGCACCAAGTGGGACGAAACCTACTATTGCAGTGCACATTCTTGG
GCAAGAGGCTATGATTCGTATAAACTGTTTAAAGGACTCTGTTGCCCTTCTTCAGATGATGATGGCATTGCAAGAGTTAC
GTTTGGAGGTTCGGCACTCAAATACCTCCACCACGCAGGACATGGTTCTACACATTGTAATTGTGAAGATTGAGCCAACT
GAACATTACACACAGGAGCAGCTATGTGCGATACTGGAGAGGTCATGTCAACCCTACAGCTGTTCTACCAAGGATGAAGG
TCACGGGCTTTCAGAAAAGCTAGGTTCATCAAGACGATCTCAATAA
Microexon DNA seq ATGGCTGGTATCCAG
Microexon Amino Acid seq MAGIQ
Microexon-tag DNA Seq GCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTGTTCCTACAAGAACTGGGGTTGTTGAGCTTGGA
Microexon-tag Amino Acid seq ANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELG
Transcript ID Pp.18029.1
Gene ID Pp.18029
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 9e-43
Motif start 16
Motif end 163
Protein seq >Pp.18029.1
MPVVLPTEYPSKYVCRMLGWGDGYFKGPKENEISEKRIDQGGSEEDQQLRRKVLRELQSLVSNTEEDVSDYVTDTEWFYL
VSMSHSFAYGVGTPGQALATESPVWLTEANKAPNHICTRAHLAKMAGIQTIVCVPTRTGVVELGSTDLISQNMDVVHHIK
MVFDEPFWGANRSQVMAQSLLMDSDATFFPPSPSIMSMGTTSAFASSPSVASRGSTLGKDHESHYRGRNVSVEKIGSSMA
STSFDTLDYMWQQSDEMQFNDGVSVGTTEKDQGQSRLYYPVLGPPVLVEKLPFSATSLISRTRAAEVKHSSMLQNVEKLA
SEDQKPSSLPHIKVHTTHSYPEKTGAGELSQVLSTPDLRQSIEMKLPAQVETRRAPGITGGATKPVAEKAKPVPKPPQQQ
QTAISGPPASASGRSSFDQSEHDSFQESEAEISFKESSAVEFSLNVGTKPPRKRGRKPANDREEPLSHVQAERQRREKLN
QRFYALRAVVPNVSKMDKASLLGDAIAYINELTSKLQSAEAQIKDLKGHVVGSSDKSQESLSIARGSMDNSTIDGLSIRP
QGSVNSTSISGNAPSGTKPTIAVHILGQEAMIRINCLKDSVALLQMMMALQELRLEVRHSNTSTTQDMVLHIVIVKIEPT
EHYTQEQLCAILERSCQPYSCSTKDEGHGLSEKLGSSRRSQ*
CDS seq >Pp.18029.1
ATGCCTGTTGTTTTACCGACAGAGTATCCATCGAAATATGTTTGCAGGATGTTGGGGTGGGGTGATGGGTACTTCAAGGG
GCCGAAAGAGAATGAAATTTCTGAGAAGCGTATAGATCAGGGAGGTAGTGAAGAGGACCAGCAACTGAGACGGAAAGTGT
TGAGAGAACTGCAGTCTCTCGTGAGCAATACTGAAGAGGACGTTAGTGATTATGTAACAGATACAGAATGGTTCTACCTT
GTCTCTATGTCGCACTCATTTGCGTATGGTGTGGGGACTCCGGGCCAAGCACTGGCAACTGAAAGTCCTGTGTGGCTTAC
TGAGGCAAACAAGGCCCCCAATCATATTTGTACACGAGCTCATTTAGCCAAGATGGCTGGTATCCAGACAATCGTCTGTG
TTCCTACAAGAACTGGGGTTGTTGAGCTTGGATCTACGGATTTGATAAGTCAAAACATGGACGTTGTTCACCATATAAAG
ATGGTATTTGATGAACCATTTTGGGGAGCAAATCGAAGTCAAGTTATGGCTCAATCCTTGCTTATGGACAGCGATGCGAC
TTTCTTCCCTCCAAGTCCAAGTATCATGTCGATGGGCACCACAAGTGCGTTTGCATCAAGCCCCAGCGTAGCTAGTAGAG
GCTCTACTCTTGGTAAAGATCATGAATCACATTATAGAGGGAGGAATGTGTCAGTCGAAAAGATTGGTTCCTCAATGGCA
AGCACTAGCTTTGATACTTTGGACTATATGTGGCAGCAATCAGACGAAATGCAATTTAATGATGGAGTATCAGTTGGTAC
TACTGAGAAGGACCAAGGACAATCACGTTTGTATTATCCTGTTTTGGGACCACCTGTTCTGGTAGAGAAACTTCCATTTT
CTGCCACTAGTTTGATAAGTAGAACCAGGGCAGCAGAAGTGAAGCATTCAAGCATGCTTCAGAATGTTGAAAAGTTGGCC
TCTGAAGATCAAAAGCCTTCATCACTACCACATATCAAGGTACATACTACACACTCCTACCCCGAAAAGACTGGTGCTGG
GGAATTGAGCCAGGTTTTAAGTACCCCAGATCTCAGACAATCGATAGAGATGAAACTTCCAGCGCAAGTAGAAACCAGGC
GTGCTCCCGGAATAACTGGGGGCGCTACGAAGCCTGTGGCTGAGAAAGCCAAACCTGTTCCTAAACCTCCTCAGCAACAA
CAGACGGCGATATCAGGACCTCCAGCTAGCGCAAGTGGGCGTTCAAGTTTTGATCAGTCAGAGCATGATTCCTTTCAAGA
ATCAGAAGCTGAGATCTCTTTCAAGGAGAGCAGCGCAGTGGAGTTCAGTTTAAATGTTGGTACAAAACCTCCTCGAAAAC
GGGGTCGAAAGCCAGCCAATGACAGAGAGGAGCCCTTAAGCCATGTGCAAGCGGAAAGGCAGAGAAGGGAGAAGCTCAAT
CAGCGATTTTATGCTTTGCGAGCTGTGGTGCCAAACGTTTCCAAAATGGACAAAGCGTCGTTGTTGGGCGATGCTATTGC
ATACATTAATGAGCTTACGAGCAAGCTGCAATCGGCAGAGGCTCAAATCAAAGACCTGAAAGGTCATGTGGTTGGTTCGA
GTGACAAATCACAAGAATCACTTTCGATTGCAAGGGGTTCTATGGATAACTCGACTATAGATGGGTTGAGCATAAGACCA
CAAGGGAGTGTTAACAGCACCTCGATTTCTGGAAATGCACCAAGTGGGACGAAACCTACTATTGCAGTGCACATTCTTGG
GCAAGAGGCTATGATTCGTATAAACTGTTTAAAGGACTCTGTTGCCCTTCTTCAGATGATGATGGCATTGCAAGAGTTAC
GTTTGGAGGTTCGGCACTCAAATACCTCCACCACGCAGGACATGGTTCTACACATTGTAATTGTGAAGATTGAGCCAACT
GAACATTACACACAGGAGCAGCTATGTGCGATACTGGAGAGGTCATGTCAACCCTACAGCTGTTCTACCAAGGATGAAGG
TCACGGGCTTTCAGAAAAGCTAGGTTCATCAAGACGATCTCAATAA