
Microexon ID | Pp_13:8359289-8359303:- |
Species | Physcomitrium patens | Coordinates | 13:8359289..8359303 |
Microexon Cluster ID | MEP42 |
Size | 15 |
Phase | 0 |
Pfam Domain Motif | bHLH-MYC_N |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 48,15,45 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq | GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Microexon DNA seq | ATGGCAGGAATCCAG |
Microexon Amino Acid seq | MAGIQ |
Microexon-tag DNA Seq | GCAGATAAAGCATCTAATAAAATCTGCACACGAGCTAATTTAGCAAAGATGGCAGGAATCCAGACTATTTTATGTGTGCCAACCATGAATGGGGTGGTCGAGCTTGGT |
Microexon-tag Amino Acid Seq | ADKASNKICTRANLAKMAGIQTILCVPTMNGVVELG |
Microexon-tag spanning region | 8359035-8359701 |
Microexon-tag prediction score | 0.8588 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | Pp3c13_11550V3.1x |
Reference Transcript ID | Pp3c13_11550V3.1 |
Gene ID | Pp3c13_11550 |
Gene Name | NA |
Transcript ID | Pp3c13_11550V3.1 |
Protein ID | Pp3c13_11550V3.1 |
Gene ID | Pp3c13_11550 |
Gene Name | NA |
Pfam domain motif | bHLH-MYC_N |
Motif E-value | 5e-54 |
Motif start | 50 |
Motif end | 226 |
Protein seq | >Pp3c13_11550V3.1 METGPPNLWDATDPLMVEAFIGGYGIPGYETQVDLGCTAGQDLEQNDSVLQRRLHTLVEESSENWIYGIFWQRSLSPSGE SILGWGDGYYKGPNDSDEFDSRQTLTEEHQLQRKKVLRELQALVSCLDDDATEDVSNTEWFYLVSMCHSFALGVGTPGQA LALGQHIWLEEADKASNKICTRANLAKMAGIQTILCVPTMNGVVELGSTDLIHRRWDVVEHIKMVFQDSTWGLDDMQIMS HSQVANFDSTLMPYNSSMTLDPTSIAGTTSITSNTDPGLADHESVDFNARLFHTDKLLLHSGGLGFNGFDHLWGQTNDFH CNDPLPDDVENSLGQSMYDILGKLPLQEEQLPFLASSSFPKNPESNSRHSVFQMNNVGKLPQLDHRSASLPAMEKPHSKL QPTYTFPQYSGDFEVGEMYNPPGHIPTMRPKLPNQEERVQFPLMPAVEKASVIEKPMPLLKPIPQSPSPPVSKPAGPSVS ANGLKLTDHLGQDFVDPESVTIKVNVMEAPKLPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYALRAVVPNVSKMDK ASLLGDAIAHINYLQEKLHDAEMRIKDLQRVCSAKRERGQEALVIGAPKDDTQLKPERNGTRPVFGIFPGGKRFSIAVNV FGEEAMIRVNCVRDAYSVVNMMMALQELRLDIQHSNTSSTSDDILHIVVAKMKPTERYTQEQLAALLERSDQATGYLTKR EGSDTALQKLGNPP* |
CDS seq | >Pp3c13_11550V3.1 ATGGAGACGGGACCGCCGAACTTGTGGGATGCCACGGACCCGTTGATGGTGGAAGCCTTTATTGGGGGCTATGGGATTCC GGGGTATGAGACTCAGGTCGATCTGGGGTGCACCGCTGGACAGGATTTAGAGCAGAATGATTCTGTGTTGCAGCGGAGGT TGCACACGCTGGTGGAAGAATCGTCAGAGAATTGGATTTATGGCATCTTCTGGCAGCGGTCGCTTTCACCTTCTGGAGAG TCAATATTGGGGTGGGGGGATGGGTATTATAAGGGACCTAATGATAGCGACGAATTTGATTCAAGGCAAACACTAACAGA AGAGCATCAACTACAAAGGAAGAAAGTACTACGAGAGCTACAGGCTCTTGTTTCGTGTCTAGATGATGACGCCACTGAAG ACGTCTCAAACACGGAGTGGTTTTACCTTGTTTCTATGTGTCATTCATTTGCACTAGGGGTTGGCACTCCTGGTCAGGCA TTGGCATTGGGGCAACACATATGGCTGGAGGAAGCAGATAAAGCATCTAATAAAATCTGCACACGAGCTAATTTAGCAAA GATGGCAGGAATCCAGACTATTTTATGTGTGCCAACCATGAATGGGGTGGTCGAGCTTGGTTCAACTGATTTGATCCACA GACGCTGGGATGTTGTTGAGCATATTAAGATGGTGTTTCAAGACTCAACGTGGGGGTTAGATGATATGCAAATTATGTCG CATTCCCAAGTTGCAAATTTTGATTCTACATTGATGCCCTACAATTCAAGTATGACTTTGGATCCGACGAGTATCGCAGG TACAACCTCCATTACATCGAACACTGACCCAGGTCTAGCAGACCACGAAAGCGTAGACTTTAATGCAAGGCTCTTTCACA CGGATAAGCTGTTACTCCATTCGGGTGGACTTGGCTTCAACGGGTTCGACCATTTGTGGGGACAAACAAACGATTTCCAC TGCAATGATCCACTTCCAGATGATGTTGAGAATAGTTTAGGACAATCCATGTATGATATTTTAGGTAAATTACCTCTACA GGAAGAGCAGCTTCCTTTCCTAGCATCTAGTTCATTTCCTAAAAACCCCGAGTCAAACTCGCGACATTCAGTTTTTCAGA TGAACAATGTAGGGAAACTCCCGCAATTAGATCATAGATCGGCATCCTTGCCTGCAATGGAAAAACCACATTCCAAGCTG CAGCCAACCTATACCTTTCCGCAATACAGTGGTGACTTCGAAGTGGGTGAAATGTATAATCCTCCAGGGCATATTCCAAC AATGAGACCAAAGCTACCCAATCAAGAGGAACGGGTGCAATTTCCTTTAATGCCTGCAGTTGAAAAGGCTTCAGTTATTG AGAAGCCTATGCCTTTATTGAAGCCAATTCCACAATCTCCGTCTCCACCAGTTTCGAAGCCAGCGGGACCTTCTGTTTCT GCTAATGGATTGAAGCTTACCGATCACCTAGGCCAGGATTTTGTAGATCCGGAGTCTGTAACAATAAAAGTAAATGTGAT GGAGGCTCCAAAGCTCCCTCGCAAGCGAGGTCGGAAGCCTGCTAATGACAGGGAAGAGCCGCTGAACCATGTTCAAGCCG AGCGGCAACGGCGAGAGAAGCTCAATAAACGCTTTTATGCACTTCGAGCCGTTGTGCCAAATGTCTCAAAGATGGACAAA GCTTCATTGCTGGGCGATGCTATTGCGCACATCAACTACCTGCAGGAGAAACTTCATGACGCAGAAATGCGCATAAAGGA CCTTCAGAGGGTTTGCAGTGCGAAGCGCGAGCGTGGCCAAGAGGCTCTTGTAATTGGTGCACCTAAAGACGATACCCAAC TGAAGCCTGAGAGGAATGGCACTCGGCCTGTGTTTGGCATATTTCCTGGTGGTAAGAGGTTCAGCATTGCCGTGAACGTC TTCGGAGAGGAGGCAATGATACGAGTCAACTGCGTGCGAGATGCTTACTCTGTTGTCAACATGATGATGGCCCTGCAAGA ATTGCGCTTGGACATACAACATTCTAATACATCCTCCACAAGTGATGACATCTTGCATATTGTTGTTGCTAAGATGAAAC CAACTGAAAGGTACACGCAGGAGCAGCTCGCTGCGTTACTTGAAAGGTCTGATCAAGCCACTGGGTATTTGACGAAGCGG GAAGGAAGTGACACGGCTTTGCAAAAACTAGGCAATCCTCCCTAA |
Microexon DNA seq | ATGGCAGGAATCCAG |
Microexon Amino Acid seq | MAGIQ |
Microexon-tag DNA Seq | GCAGATAAAGCATCTAATAAAATCTGCACACGAGCTAATTTAGCAAAGATGGCAGGAATCCAGACTATTTTATGTGTGCCAACCATGAATGGGGTGGTCGAGCTTGGT |
Microexon-tag Amino Acid seq | ADKASNKICTRANLAKMAGIQTILCVPTMNGVVELG |
Transcript ID | Pp.4843.1 |
Gene ID | Pp.4843 |
Gene Name | NA |
Pfam domain motif | bHLH-MYC_N |
Motif E-value | 2.5e-22 |
Motif start | 51 |
Motif end | 121 |
Protein seq | >Pp.4843.1 MVLSLHLSSAFLFLRLLAILLFNLADVCRPSRIYYSSRKLDWVTLCFECSTPGQALALGQHIWLEEADKASNKICTRANL AKMAGIQTILCVPTMNGVVELGSTDLIHRRWDVVEHIKMVFQDSTWGLDDMQIMSHSQVANFDSTLMPYNSSMTLDPTSI AGTTSITSNTDPGLADHESVDFNARLFHTDKLLLHSGGLGFNGFDHLWGQTNDFHCNDPLPDDVENSLGQSMYDILGKLP LQEEQLPFLASSSFPKNPESNSRHSVFQMNNVGKLPQLDHRSASLPAMEKPHSKLQPTYTFPQYSGDFEVGEMYNPPGHI PTMRPKLPNQEERVQFPLMPAVEKASVIEKPMPLLKPIPQSPSPPVSKPAGPSVSANGLKLTDHLGQDFVDPESVTIKVN VMEAPKLPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYALRAVVPNVSKMDKASLLGDAIAHINYLQEKLHDAEMRI KDLQRVCSAKRERGQEALVIGAPKDDTQLKPERNGTRPVFGIFPGGKRFSIAVNVFGEEAMIRVNCVRDAYSVVNMMMAL QELRLDIQHSNTSSTSDDILHIVVAKMKPTERYTQEQLAALLERSDQATGYLTKREGSDTALQKLGNPP* |
CDS seq | >Pp.4843.1 ATGGTTTTGTCTCTGCATCTCTCTTCCGCTTTTCTTTTCCTTCGGCTATTAGCTATATTGCTTTTCAATTTGGCTGATGT ATGCAGACCAAGTCGCATCTATTACTCCTCGAGAAAGCTTGATTGGGTCACTCTTTGTTTTGAATGTAGCACTCCTGGTC AGGCATTGGCATTGGGGCAACACATATGGCTGGAGGAAGCAGATAAAGCATCTAATAAAATCTGCACACGAGCTAATTTA GCAAAGATGGCAGGAATCCAGACTATTTTATGTGTGCCAACCATGAATGGGGTGGTCGAGCTTGGTTCAACTGATTTGAT CCACAGACGCTGGGATGTTGTTGAGCATATTAAGATGGTGTTTCAAGACTCAACGTGGGGGTTAGATGATATGCAAATTA TGTCGCATTCCCAAGTTGCAAATTTTGATTCTACATTGATGCCCTACAATTCAAGTATGACTTTGGATCCGACGAGTATC GCAGGTACAACCTCCATTACATCGAACACTGACCCAGGTCTAGCAGACCACGAAAGCGTAGACTTTAATGCAAGGCTCTT TCACACGGATAAGCTGTTACTCCATTCGGGTGGACTTGGCTTCAACGGGTTCGACCATTTGTGGGGACAAACAAACGATT TCCACTGCAATGATCCACTTCCAGATGATGTTGAGAATAGTTTAGGACAATCCATGTATGATATTTTAGGTAAATTACCT CTACAGGAAGAGCAGCTTCCTTTCCTAGCATCTAGTTCATTTCCTAAAAACCCCGAGTCAAACTCGCGACATTCAGTTTT TCAGATGAACAATGTAGGGAAACTCCCGCAATTAGATCATAGATCGGCATCCTTGCCTGCAATGGAAAAACCACATTCCA AGCTGCAGCCAACCTATACCTTTCCGCAATACAGTGGTGACTTCGAAGTGGGTGAAATGTATAATCCTCCAGGGCATATT CCAACAATGAGACCAAAGCTACCCAATCAAGAGGAACGGGTGCAATTTCCTTTAATGCCTGCAGTTGAAAAGGCTTCAGT TATTGAGAAGCCTATGCCTTTATTGAAGCCAATTCCACAATCTCCGTCTCCACCAGTTTCGAAGCCAGCGGGACCTTCTG TTTCTGCTAATGGATTGAAGCTTACCGATCACCTAGGCCAGGATTTTGTAGATCCGGAGTCTGTAACAATAAAAGTAAAT GTGATGGAGGCTCCAAAGCTCCCTCGCAAGCGAGGTCGGAAGCCTGCTAATGACAGGGAAGAGCCGCTGAACCATGTTCA AGCCGAGCGGCAACGGCGAGAGAAGCTCAATAAACGCTTTTATGCACTTCGAGCCGTTGTGCCAAATGTCTCAAAGATGG ACAAAGCTTCATTGCTGGGCGATGCTATTGCGCACATCAACTACCTGCAGGAGAAACTTCATGACGCAGAAATGCGCATA AAGGACCTTCAGAGGGTTTGCAGTGCGAAGCGCGAGCGTGGCCAAGAGGCTCTTGTAATTGGTGCACCTAAAGACGATAC CCAACTGAAGCCTGAGAGGAATGGCACTCGGCCTGTGTTTGGCATATTTCCTGGTGGTAAGAGGTTCAGCATTGCCGTGA ACGTCTTCGGAGAGGAGGCAATGATACGAGTCAACTGCGTGCGAGATGCTTACTCTGTTGTCAACATGATGATGGCCCTG CAAGAATTGCGCTTGGACATACAACATTCTAATACATCCTCCACAAGTGATGACATCTTGCATATTGTTGTTGCTAAGAT GAAACCAACTGAAAGGTACACGCAGGAGCAGCTCGCTGCGTTACTTGAAAGGTCTGATCAAGCCACTGGGTATTTGACGA AGCGGGAAGGAAGTGACACGGCTTTGCAAAAACTAGGCAATCCTCCCTAA |