Microexon ID Pp_3:7088071-7088085:+
Species Physcomitrium patens
Coordinates 3:7088071..7088085
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
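Note on the ID and coordinate fields: the sketch below shows how the Microexon ID relates to the Coordinates and Size fields. It assumes the ID follows the pattern `<species prefix>_<chromosome>:<start>-<end>:<strand>` with 1-based inclusive coordinates; that layout is inferred from this record, not from a documented specification, and `parse_microexon_id` is a hypothetical helper name.

```python
import re

def parse_microexon_id(microexon_id):
    """Split an ID such as 'Pp_3:7088071-7088085:+' into its parts.

    Assumed layout (inferred from this record only):
    <species prefix>_<chromosome>:<start>-<end>:<strand>
    """
    m = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if m is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "prefix": prefix,
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive coordinates: both end points belong to the exon.
        "size": end - start + 1,
    }

record = parse_microexon_id("Pp_3:7088071-7088085:+")
print(record["size"])       # 15, matching the Size field
print(record["size"] % 3)   # 0, so the exon covers whole codons only
```

With Phase 0 and a size divisible by three, the microexon would encode five complete codons, which is consistent with the five-residue Microexon Amino Acid seq reported in this record.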
Microexon DNA seq ATGGCAGGAATTCAG
Microexon Amino Acid seq MAGIQ
Microexon-tag DNA Seq GCAGATAAGGCTTCTTATAAAATCTGCACACGAGCCAATTTAGCGAAGATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGT
Microexon-tag Amino Acid Seq ADKASYKICTRANLAKMAGIQTILCVPIMNGVVELG
Microexon-tag spanning region 7087588-7088319
Microexon-tag prediction score 0.8545
Overlapped with the annotated transcript (%) 100
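Note on the Microexon-tag fields: the sketch below shows how the tag structure (48,15,45), the Microexon DNA seq and the Microexon-tag Amino Acid Seq fit together. It assumes the standard genetic code and that the tag is read in frame from its first base. `split_tag` and `translate` are hypothetical helper names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def split_tag(tag, sizes):
    """Cut the tag into consecutive segments of the given sizes."""
    parts, start = [], 0
    for size in sizes:
        parts.append(tag[start:start + size])
        start += size
    return parts

# Microexon-tag DNA Seq and structure as given in this record.
TAG = (
    "GCAGATAAGGCTTCTTATAAAATCTGCACACGAGCCAATTTAGCGAAG"  # upstream flank, 48 nt
    "ATGGCAGGAATTCAG"                                    # microexon, 15 nt
    "ACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGT"      # downstream flank, 45 nt
)
STRUCTURE = (48, 15, 45)

upstream, microexon, downstream = split_tag(TAG, STRUCTURE)
print(microexon)             # ATGGCAGGAATTCAG
print(translate(microexon))  # MAGIQ
print(translate(TAG))        # ADKASYKICTRANLAKMAGIQTILCVPIMNGVVELG
```

Because all three segment sizes are multiples of three, each segment can be translated on its own and the pieces concatenate to the full tag peptide.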
New Transcript ID Pp3c3_9950V3.1x
Reference Transcript ID Pp3c3_9950V3.1
Gene ID Pp3c3_9950
Gene Name NA
Transcript ID Pp3c3_9950V3.1
Protein ID Pp3c3_9950V3.1
Pfam domain motif bHLH-MYC_N
Motif E-value 2.5e-53
Motif start 51
Motif end 227
Protein seq >Pp3c3_9950V3.1
MMEMGTPNYWDAADPLMVEAFIGGYEIPGYETQDDLASTLGQDLEQNDSVLQRRLHRLVEESSEDWTYGIFWQLSLSPSG
ESMLGWGDGYYKGPKDSDQFEPRKTQTEEHQLQRKKVLRELQALVSCPDDDGTEDVSDTEWFYLVSMCHSFAKGVGTPGQ
ALAFGEYVWLEEADKASYKICTRANLAKMAGIQTILCVPIMNGVVELGSTDAIHERLDVVEYVKMVFQEPTWGLTNMSPI
ISQSQVGKFDTTFMPHYPSIPFDSTSVSGVSSMTLNTDPGLADSESMDFGTRHSHMGKMVSHSGAFGFNGYDHVWGQTNE
FHYNDPLPDDNVERDLGQPMCNILGSLPLQDEKLPLASSPPPKTLDSDSRYSIFQQNNVKKPPQLDHTQTSLPVTERLHP
KPHTSQAFLHHNGSFDVGEMFNPPGHTQTVRSNPPSLDEQLHSPSMPAVEKLPIVEKPTSIYKPESVEKPMPVFKPLPQP
PSPPASKPAVPVPANGLLLAGHLDQECVDTELITMKNNVVEAPKVPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYA
LRAVVPNVSKMDKASLLGDAIAHINHLQEKLQDAEMRIKDLQRVASSKHEQDQEVLAIGTLKDAIQLKPEGNGTSPVFGT
FSGGKRFSIAVDIVGEEAMIRISCLREAYSVVNMMMTLQELRLDIQHSNTSTTSDDILHIVIAKMKPTLKFTEEQLIALL
ERSCQNTGYLRKREGSDRLLQRPDNSPQLQ*
CDS seq >Pp3c3_9950V3.1
ATGATGGAGATGGGGACGCCAAACTACTGGGATGCCGCGGATCCGTTGATGGTGGAGGCCTTTATTGGAGGCTATGAGAT
TCCGGGTTATGAGACACAGGATGATCTGGCTAGCACCTTGGGGCAGGATTTAGAGCAGAATGATTCTGTGCTGCAGCGTA
GGTTGCACAGACTCGTGGAAGAGTCGTCGGAGGATTGGACCTATGGCATCTTCTGGCAGCTCTCTCTTTCGCCTTCCGGA
GAGTCAATGTTGGGGTGGGGGGATGGGTATTACAAGGGACCGAAAGATAGTGACCAATTTGAGCCAAGGAAAACACAAAC
CGAGGAGCATCAGCTACAGAGAAAGAAAGTACTACGAGAGCTTCAGGCTCTTGTTTCCTGTCCAGATGATGATGGCACTG
AAGACGTCTCAGATACGGAGTGGTTTTACCTTGTTTCTATGTGTCACTCATTTGCAAAAGGGGTCGGTACCCCTGGTCAG
GCATTGGCGTTTGGAGAATATGTATGGCTGGAGGAGGCAGATAAGGCTTCTTATAAAATCTGCACACGAGCCAATTTAGC
GAAGATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGTTCAACTGATGCAATTC
ACGAGCGCTTGGATGTTGTTGAGTATGTTAAGATGGTGTTCCAAGAGCCAACGTGGGGGTTGACTAATATGTCCCCAATT
ATTTCCCAATCTCAAGTTGGAAAATTTGATACAACATTTATGCCTCATTATCCGAGTATACCTTTTGATTCGACGAGTGT
CTCAGGTGTATCCTCCATGACACTGAACACTGATCCAGGTCTAGCAGACAGCGAAAGCATGGATTTTGGGACAAGACACA
GCCATATGGGCAAGATGGTATCCCATTCAGGTGCCTTTGGTTTCAACGGATATGACCATGTATGGGGACAAACTAATGAG
TTCCACTATAATGATCCACTTCCAGATGACAACGTCGAGAGAGATTTAGGACAACCTATGTGTAATATTTTAGGTAGCTT
ACCTCTACAAGACGAGAAGCTCCCCTTAGCATCCAGTCCACCTCCTAAAACCCTAGACTCAGATTCGAGGTATTCAATTT
TTCAGCAGAACAATGTAAAGAAGCCTCCCCAACTAGATCATACGCAGACATCTTTGCCTGTAACGGAAAGGTTACATCCT
AAGCCACACACATCACAGGCCTTTCTGCATCATAATGGTTCCTTCGACGTGGGTGAAATGTTTAACCCCCCAGGGCATAC
ACAGACTGTGCGATCAAATCCACCCAGCCTAGACGAGCAGTTGCATTCTCCTTCAATGCCTGCAGTCGAAAAGCTTCCAA
TTGTTGAGAAACCCACGTCTATTTACAAACCTGAAAGTGTTGAGAAGCCTATGCCTGTTTTCAAGCCACTTCCACAACCA
CCATCTCCTCCCGCTTCAAAGCCAGCAGTACCTGTCCCTGCAAATGGTTTACTGCTTGCCGGCCACCTGGACCAGGAGTG
TGTTGATACAGAACTGATTACAATGAAAAATAATGTGGTCGAGGCTCCAAAAGTGCCTCGTAAACGGGGTCGGAAGCCCG
CCAATGACCGGGAAGAGCCCCTGAACCATGTACAAGCTGAGCGGCAGCGGCGAGAGAAACTTAATAAACGATTTTATGCT
CTTCGGGCTGTTGTGCCAAATGTCTCAAAGATGGACAAAGCTTCACTGTTAGGCGATGCAATTGCGCACATTAACCACCT
GCAAGAGAAACTTCAGGATGCAGAAATGCGCATAAAGGATCTTCAGAGAGTTGCAAGTTCTAAGCACGAGCAAGACCAAG
AGGTGCTTGCAATTGGTACGCTCAAGGATGCTATCCAACTGAAGCCTGAAGGGAATGGGACTAGCCCTGTGTTTGGCACA
TTTTCTGGTGGTAAGAGGTTTAGTATTGCCGTAGATATCGTTGGAGAGGAGGCTATGATACGAATCAGCTGTCTGCGAGA
AGCTTACTCTGTTGTCAATATGATGATGACTCTACAAGAATTACGACTCGACATACAACATTCTAATACATCCACCACAA
GTGATGATATCCTGCATATTGTTATAGCCAAGATGAAACCAACCTTAAAGTTTACAGAGGAGCAGCTGATTGCTTTACTC
GAAAGATCCTGTCAAAATACCGGGTACTTGAGGAAGCGGGAAGGAAGTGATAGACTTTTGCAAAGACCTGACAATTCTCC
CCAACTTCAATAA
Microexon DNA seq ATGGCAGGAATTCAG
Microexon Amino Acid seq MAGIQ
Microexon-tag DNA Seq ATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGT
Microexon-tag Amino Acid seq MAGIQTILCVPIMNGVVELG
Transcript ID Pp.18027.1
Gene ID Pp.18027
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 6.5e-11
Motif start 1
Motif end 39
Protein seq >Pp.18027.1
MAGIQTILCVPIMNGVVELGSTDAIHERLDVVEYVKMVFQEPTWGLTNMSPIISQSQVGKFDTTFMPHYPSIPFDSTSVS
GVSSMTLNTDPGLADSESMDFGTRHSHMGKMVSHSGAFGFNGYDHVWGQTNEFHYNDPLPDDNVERDLGQPMCNILGSLP
LQDEKLPLASSPPPKTLDSDSRYSIFQQNNVKKPPQLDHTQTSLPVTERLHPKPHTSQAFLHHNGSFDVGEMFNPPGHTQ
TVRSNPPSLDEQLHSPSMPAVEKLPIVEKPTSIYKPESVEKPMPVFKPLPQPPSPPASKPAVPVPANGLLLAGHLDQECV
DTELITMKNNVVEAPKVPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYALRAVVPNVSKMDKASLLGDAIAHINHLQ
EKLQDAEMRIKDLQRVASSKHEQDQEVLAIGTLKDAIQLKPEGNGTSPVFGTFSGGKRFSIAVDIVGEEAMIRISCLREA
YSVVNMMMTLQELRLDIQHSNTSTTSDDILHIVIAKMKPTLKFTEEQLIALLERSCQNTGYLRKREGSDRLLQRPDNSPQ
LQ*
CDS seq >Pp.18027.1
ATGGCAGGAATTCAGACTATTTTATGTGTGCCAATCATGAATGGGGTGGTTGAGCTTGGTTCAACTGATGCAATTCACGA
GCGCTTGGATGTTGTTGAGTATGTTAAGATGGTGTTCCAAGAGCCAACGTGGGGGTTGACTAATATGTCCCCAATTATTT
CCCAATCTCAAGTTGGAAAATTTGATACAACATTTATGCCTCATTATCCGAGTATACCTTTTGATTCGACGAGTGTCTCA
GGTGTATCCTCCATGACACTGAACACTGATCCAGGTCTAGCAGACAGCGAAAGCATGGATTTTGGGACAAGACACAGCCA
TATGGGCAAGATGGTATCCCATTCAGGTGCCTTTGGTTTCAACGGATATGACCATGTATGGGGACAAACTAATGAGTTCC
ACTATAATGATCCACTTCCAGATGACAACGTCGAGAGAGATTTAGGACAACCTATGTGTAATATTTTAGGTAGCTTACCT
CTACAAGACGAGAAGCTCCCCTTAGCATCCAGTCCACCTCCTAAAACCCTAGACTCAGATTCGAGGTATTCAATTTTTCA
GCAGAACAATGTAAAGAAGCCTCCCCAACTAGATCATACGCAGACATCTTTGCCTGTAACGGAAAGGTTACATCCTAAGC
CACACACATCACAGGCCTTTCTGCATCATAATGGTTCCTTCGACGTGGGTGAAATGTTTAACCCCCCAGGGCATACACAG
ACTGTGCGATCAAATCCACCCAGCCTAGACGAGCAGTTGCATTCTCCTTCAATGCCTGCAGTCGAAAAGCTTCCAATTGT
TGAGAAACCCACGTCTATTTACAAACCTGAAAGTGTTGAGAAGCCTATGCCTGTTTTCAAGCCACTTCCACAACCACCAT
CTCCTCCCGCTTCAAAGCCAGCAGTACCTGTCCCTGCAAATGGTTTACTGCTTGCCGGCCACCTGGACCAGGAGTGTGTT
GATACAGAACTGATTACAATGAAAAATAATGTGGTCGAGGCTCCAAAAGTGCCTCGTAAACGGGGTCGGAAGCCCGCCAA
TGACCGGGAAGAGCCCCTGAACCATGTACAAGCTGAGCGGCAGCGGCGAGAGAAACTTAATAAACGATTTTATGCTCTTC
GGGCTGTTGTGCCAAATGTCTCAAAGATGGACAAAGCTTCACTGTTAGGCGATGCAATTGCGCACATTAACCACCTGCAA
GAGAAACTTCAGGATGCAGAAATGCGCATAAAGGATCTTCAGAGAGTTGCAAGTTCTAAGCACGAGCAAGACCAAGAGGT
GCTTGCAATTGGTACGCTCAAGGATGCTATCCAACTGAAGCCTGAAGGGAATGGGACTAGCCCTGTGTTTGGCACATTTT
CTGGTGGTAAGAGGTTTAGTATTGCCGTAGATATCGTTGGAGAGGAGGCTATGATACGAATCAGCTGTCTGCGAGAAGCT
TACTCTGTTGTCAATATGATGATGACTCTACAAGAATTACGACTCGACATACAACATTCTAATACATCCACCACAAGTGA
TGATATCCTGCATATTGTTATAGCCAAGATGAAACCAACCTTAAAGTTTACAGAGGAGCAGCTGATTGCTTTACTCGAAA
GATCCTGTCAAAATACCGGGTACTTGAGGAAGCGGGAAGGAAGTGATAGACTTTTGCAAAGACCTGACAATTCTCCCCAA
CTTCAATAA