
Microexon ID | Pp_3:7088071-7088085:+ |
Species | Physcomitrium patens | Coordinates | 3:7088071..7088085 |
Microexon Cluster ID | MEP42 |
Size | 15 |
Phase | 0 |
Pfam Domain Motif | bHLH-MYC_N |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 48,15,45 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq | GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Microexon DNA seq | ATGGCAGGAATTCAG |
Microexon Amino Acid seq | MAGIQ |
Microexon-tag DNA Seq | GCAGATAAGGCTTCTTATAAAATCTGCACACGAGCCAATTTAGCGAAGATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGT |
Microexon-tag Amino Acid Seq | ADKASYKICTRANLAKMAGIQTILCVPIMNGVVELG |
Microexon-tag spanning region | 7087588-7088319 |
Microexon-tag prediction score | 0.8545 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | Pp3c3_9950V3.1x |
Reference Transcript ID | Pp3c3_9950V3.1 |
Gene ID | Pp3c3_9950 |
Gene Name | NA |
Transcript ID | Pp3c3_9950V3.1 |
Protein ID | Pp3c3_9950V3.1 |
Gene ID | Pp3c3_9950 |
Gene Name | NA |
Pfam domain motif | bHLH-MYC_N |
Motif E-value | 2.5e-53 |
Motif start | 51 |
Motif end | 227 |
Protein seq | >Pp3c3_9950V3.1 MMEMGTPNYWDAADPLMVEAFIGGYEIPGYETQDDLASTLGQDLEQNDSVLQRRLHRLVEESSEDWTYGIFWQLSLSPSG ESMLGWGDGYYKGPKDSDQFEPRKTQTEEHQLQRKKVLRELQALVSCPDDDGTEDVSDTEWFYLVSMCHSFAKGVGTPGQ ALAFGEYVWLEEADKASYKICTRANLAKMAGIQTILCVPIMNGVVELGSTDAIHERLDVVEYVKMVFQEPTWGLTNMSPI ISQSQVGKFDTTFMPHYPSIPFDSTSVSGVSSMTLNTDPGLADSESMDFGTRHSHMGKMVSHSGAFGFNGYDHVWGQTNE FHYNDPLPDDNVERDLGQPMCNILGSLPLQDEKLPLASSPPPKTLDSDSRYSIFQQNNVKKPPQLDHTQTSLPVTERLHP KPHTSQAFLHHNGSFDVGEMFNPPGHTQTVRSNPPSLDEQLHSPSMPAVEKLPIVEKPTSIYKPESVEKPMPVFKPLPQP PSPPASKPAVPVPANGLLLAGHLDQECVDTELITMKNNVVEAPKVPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYA LRAVVPNVSKMDKASLLGDAIAHINHLQEKLQDAEMRIKDLQRVASSKHEQDQEVLAIGTLKDAIQLKPEGNGTSPVFGT FSGGKRFSIAVDIVGEEAMIRISCLREAYSVVNMMMTLQELRLDIQHSNTSTTSDDILHIVIAKMKPTLKFTEEQLIALL ERSCQNTGYLRKREGSDRLLQRPDNSPQLQ* |
CDS seq | >Pp3c3_9950V3.1 ATGATGGAGATGGGGACGCCAAACTACTGGGATGCCGCGGATCCGTTGATGGTGGAGGCCTTTATTGGAGGCTATGAGAT TCCGGGTTATGAGACACAGGATGATCTGGCTAGCACCTTGGGGCAGGATTTAGAGCAGAATGATTCTGTGCTGCAGCGTA GGTTGCACAGACTCGTGGAAGAGTCGTCGGAGGATTGGACCTATGGCATCTTCTGGCAGCTCTCTCTTTCGCCTTCCGGA GAGTCAATGTTGGGGTGGGGGGATGGGTATTACAAGGGACCGAAAGATAGTGACCAATTTGAGCCAAGGAAAACACAAAC CGAGGAGCATCAGCTACAGAGAAAGAAAGTACTACGAGAGCTTCAGGCTCTTGTTTCCTGTCCAGATGATGATGGCACTG AAGACGTCTCAGATACGGAGTGGTTTTACCTTGTTTCTATGTGTCACTCATTTGCAAAAGGGGTCGGTACCCCTGGTCAG GCATTGGCGTTTGGAGAATATGTATGGCTGGAGGAGGCAGATAAGGCTTCTTATAAAATCTGCACACGAGCCAATTTAGC GAAGATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGTTCAACTGATGCAATTC ACGAGCGCTTGGATGTTGTTGAGTATGTTAAGATGGTGTTCCAAGAGCCAACGTGGGGGTTGACTAATATGTCCCCAATT ATTTCCCAATCTCAAGTTGGAAAATTTGATACAACATTTATGCCTCATTATCCGAGTATACCTTTTGATTCGACGAGTGT CTCAGGTGTATCCTCCATGACACTGAACACTGATCCAGGTCTAGCAGACAGCGAAAGCATGGATTTTGGGACAAGACACA GCCATATGGGCAAGATGGTATCCCATTCAGGTGCCTTTGGTTTCAACGGATATGACCATGTATGGGGACAAACTAATGAG TTCCACTATAATGATCCACTTCCAGATGACAACGTCGAGAGAGATTTAGGACAACCTATGTGTAATATTTTAGGTAGCTT ACCTCTACAAGACGAGAAGCTCCCCTTAGCATCCAGTCCACCTCCTAAAACCCTAGACTCAGATTCGAGGTATTCAATTT TTCAGCAGAACAATGTAAAGAAGCCTCCCCAACTAGATCATACGCAGACATCTTTGCCTGTAACGGAAAGGTTACATCCT AAGCCACACACATCACAGGCCTTTCTGCATCATAATGGTTCCTTCGACGTGGGTGAAATGTTTAACCCCCCAGGGCATAC ACAGACTGTGCGATCAAATCCACCCAGCCTAGACGAGCAGTTGCATTCTCCTTCAATGCCTGCAGTCGAAAAGCTTCCAA TTGTTGAGAAACCCACGTCTATTTACAAACCTGAAAGTGTTGAGAAGCCTATGCCTGTTTTCAAGCCACTTCCACAACCA CCATCTCCTCCCGCTTCAAAGCCAGCAGTACCTGTCCCTGCAAATGGTTTACTGCTTGCCGGCCACCTGGACCAGGAGTG TGTTGATACAGAACTGATTACAATGAAAAATAATGTGGTCGAGGCTCCAAAAGTGCCTCGTAAACGGGGTCGGAAGCCCG CCAATGACCGGGAAGAGCCCCTGAACCATGTACAAGCTGAGCGGCAGCGGCGAGAGAAACTTAATAAACGATTTTATGCT CTTCGGGCTGTTGTGCCAAATGTCTCAAAGATGGACAAAGCTTCACTGTTAGGCGATGCAATTGCGCACATTAACCACCT GCAAGAGAAACTTCAGGATGCAGAAATGCGCATAAAGGATCTTCAGAGAGTTGCAAGTTCTAAGCACGAGCAAGACCAAG AGGTGCTTGCAATTGGTACGCTCAAGGATGCTATCCAACTGAAGCCTGAAGGGAATGGGACTAGCCCTGTGTTTGGCACA TTTTCTGGTGGTAAGAGGTTTAGTATTGCCGTAGATATCGTTGGAGAGGAGGCTATGATACGAATCAGCTGTCTGCGAGA AGCTTACTCTGTTGTCAATATGATGATGACTCTACAAGAATTACGACTCGACATACAACATTCTAATACATCCACCACAA GTGATGATATCCTGCATATTGTTATAGCCAAGATGAAACCAACCTTAAAGTTTACAGAGGAGCAGCTGATTGCTTTACTC GAAAGATCCTGTCAAAATACCGGGTACTTGAGGAAGCGGGAAGGAAGTGATAGACTTTTGCAAAGACCTGACAATTCTCC CCAACTTCAATAA |
Microexon DNA seq | ATGGCAGGAATTCAG |
Microexon Amino Acid seq | MAGIQ |
Microexon-tag DNA Seq | ATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGT |
Microexon-tag Amino Acid seq | MAGIQTILCVPIMNGVVELG |
Transcript ID | Pp.18027.1 |
Gene ID | Pp.18027 |
Gene Name | NA |
Pfam domain motif | bHLH-MYC_N |
Motif E-value | 6.5e-11 |
Motif start | 1 |
Motif end | 39 |
Protein seq | >Pp.18027.1 MAGIQTILCVPIMNGVVELGSTDAIHERLDVVEYVKMVFQEPTWGLTNMSPIISQSQVGKFDTTFMPHYPSIPFDSTSVS GVSSMTLNTDPGLADSESMDFGTRHSHMGKMVSHSGAFGFNGYDHVWGQTNEFHYNDPLPDDNVERDLGQPMCNILGSLP LQDEKLPLASSPPPKTLDSDSRYSIFQQNNVKKPPQLDHTQTSLPVTERLHPKPHTSQAFLHHNGSFDVGEMFNPPGHTQ TVRSNPPSLDEQLHSPSMPAVEKLPIVEKPTSIYKPESVEKPMPVFKPLPQPPSPPASKPAVPVPANGLLLAGHLDQECV DTELITMKNNVVEAPKVPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYALRAVVPNVSKMDKASLLGDAIAHINHLQ EKLQDAEMRIKDLQRVASSKHEQDQEVLAIGTLKDAIQLKPEGNGTSPVFGTFSGGKRFSIAVDIVGEEAMIRISCLREA YSVVNMMMTLQELRLDIQHSNTSTTSDDILHIVIAKMKPTLKFTEEQLIALLERSCQNTGYLRKREGSDRLLQRPDNSPQ LQ* |
CDS seq | >Pp.18027.1 ATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGTTCAACTGATGCAATTCACGA GCGCTTGGATGTTGTTGAGTATGTTAAGATGGTGTTCCAAGAGCCAACGTGGGGGTTGACTAATATGTCCCCAATTATTT CCCAATCTCAAGTTGGAAAATTTGATACAACATTTATGCCTCATTATCCGAGTATACCTTTTGATTCGACGAGTGTCTCA GGTGTATCCTCCATGACACTGAACACTGATCCAGGTCTAGCAGACAGCGAAAGCATGGATTTTGGGACAAGACACAGCCA TATGGGCAAGATGGTATCCCATTCAGGTGCCTTTGGTTTCAACGGATATGACCATGTATGGGGACAAACTAATGAGTTCC ACTATAATGATCCACTTCCAGATGACAACGTCGAGAGAGATTTAGGACAACCTATGTGTAATATTTTAGGTAGCTTACCT CTACAAGACGAGAAGCTCCCCTTAGCATCCAGTCCACCTCCTAAAACCCTAGACTCAGATTCGAGGTATTCAATTTTTCA GCAGAACAATGTAAAGAAGCCTCCCCAACTAGATCATACGCAGACATCTTTGCCTGTAACGGAAAGGTTACATCCTAAGC CACACACATCACAGGCCTTTCTGCATCATAATGGTTCCTTCGACGTGGGTGAAATGTTTAACCCCCCAGGGCATACACAG ACTGTGCGATCAAATCCACCCAGCCTAGACGAGCAGTTGCATTCTCCTTCAATGCCTGCAGTCGAAAAGCTTCCAATTGT TGAGAAACCCACGTCTATTTACAAACCTGAAAGTGTTGAGAAGCCTATGCCTGTTTTCAAGCCACTTCCACAACCACCAT CTCCTCCCGCTTCAAAGCCAGCAGTACCTGTCCCTGCAAATGGTTTACTGCTTGCCGGCCACCTGGACCAGGAGTGTGTT GATACAGAACTGATTACAATGAAAAATAATGTGGTCGAGGCTCCAAAAGTGCCTCGTAAACGGGGTCGGAAGCCCGCCAA TGACCGGGAAGAGCCCCTGAACCATGTACAAGCTGAGCGGCAGCGGCGAGAGAAACTTAATAAACGATTTTATGCTCTTC GGGCTGTTGTGCCAAATGTCTCAAAGATGGACAAAGCTTCACTGTTAGGCGATGCAATTGCGCACATTAACCACCTGCAA GAGAAACTTCAGGATGCAGAAATGCGCATAAAGGATCTTCAGAGAGTTGCAAGTTCTAAGCACGAGCAAGACCAAGAGGT GCTTGCAATTGGTACGCTCAAGGATGCTATCCAACTGAAGCCTGAAGGGAATGGGACTAGCCCTGTGTTTGGCACATTTT CTGGTGGTAAGAGGTTTAGTATTGCCGTAGATATCGTTGGAGAGGAGGCTATGATACGAATCAGCTGTCTGCGAGAAGCT TACTCTGTTGTCAATATGATGATGACTCTACAAGAATTACGACTCGACATACAACATTCTAATACATCCACCACAAGTGA TGATATCCTGCATATTGTTATAGCCAAGATGAAACCAACCTTAAAGTTTACAGAGGAGCAGCTGATTGCTTTACTCGAAA GATCCTGTCAAAATACCGGGTACTTGAGGAAGCGGGAAGGAAGTGATAGACTTTTGCAAAGACCTGACAATTCTCCCCAA CTTCAATAA |