Microexon ID Pp_13:8359289-8359303:-
Species Physcomitrium patens
Coordinates 13:8359289..8359303
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATGGCAGGAATCCAG
Microexon Amino Acid seq MAGIQ
Microexon-tag DNA Seq GCAGATAAAGCATCTAATAAAATCTGCACACGAGCTAATTTAGCAAAGATGGCAGGAATCCAGACTATTTTATGTGTGCCAACCATGAATGGGGTGGTCGAGCTTGGT
Microexon-tag Amino Acid Seq ADKASNKICTRANLAKMAGIQTILCVPTMNGVVELG
Microexon-tag spanning region8359035-8359701
Microexon-tag prediction score0.8588
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c13_11550V3.1x
Reference Transcript ID Pp3c13_11550V3.1
Gene ID Pp3c13_11550
Gene Name NA
Transcript ID Pp3c13_11550V3.1
Protein ID Pp3c13_11550V3.1
Gene ID Pp3c13_11550
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 5e-54
Motif start 50
Motif end 226
Protein seq >Pp3c13_11550V3.1
METGPPNLWDATDPLMVEAFIGGYGIPGYETQVDLGCTAGQDLEQNDSVLQRRLHTLVEESSENWIYGIFWQRSLSPSGE
SILGWGDGYYKGPNDSDEFDSRQTLTEEHQLQRKKVLRELQALVSCLDDDATEDVSNTEWFYLVSMCHSFALGVGTPGQA
LALGQHIWLEEADKASNKICTRANLAKMAGIQTILCVPTMNGVVELGSTDLIHRRWDVVEHIKMVFQDSTWGLDDMQIMS
HSQVANFDSTLMPYNSSMTLDPTSIAGTTSITSNTDPGLADHESVDFNARLFHTDKLLLHSGGLGFNGFDHLWGQTNDFH
CNDPLPDDVENSLGQSMYDILGKLPLQEEQLPFLASSSFPKNPESNSRHSVFQMNNVGKLPQLDHRSASLPAMEKPHSKL
QPTYTFPQYSGDFEVGEMYNPPGHIPTMRPKLPNQEERVQFPLMPAVEKASVIEKPMPLLKPIPQSPSPPVSKPAGPSVS
ANGLKLTDHLGQDFVDPESVTIKVNVMEAPKLPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYALRAVVPNVSKMDK
ASLLGDAIAHINYLQEKLHDAEMRIKDLQRVCSAKRERGQEALVIGAPKDDTQLKPERNGTRPVFGIFPGGKRFSIAVNV
FGEEAMIRVNCVRDAYSVVNMMMALQELRLDIQHSNTSSTSDDILHIVVAKMKPTERYTQEQLAALLERSDQATGYLTKR
EGSDTALQKLGNPP*
CDS seq >Pp3c13_11550V3.1
ATGGAGACGGGACCGCCGAACTTGTGGGATGCCACGGACCCGTTGATGGTGGAAGCCTTTATTGGGGGCTATGGGATTCC
GGGGTATGAGACTCAGGTCGATCTGGGGTGCACCGCTGGACAGGATTTAGAGCAGAATGATTCTGTGTTGCAGCGGAGGT
TGCACACGCTGGTGGAAGAATCGTCAGAGAATTGGATTTATGGCATCTTCTGGCAGCGGTCGCTTTCACCTTCTGGAGAG
TCAATATTGGGGTGGGGGGATGGGTATTATAAGGGACCTAATGATAGCGACGAATTTGATTCAAGGCAAACACTAACAGA
AGAGCATCAACTACAAAGGAAGAAAGTACTACGAGAGCTACAGGCTCTTGTTTCGTGTCTAGATGATGACGCCACTGAAG
ACGTCTCAAACACGGAGTGGTTTTACCTTGTTTCTATGTGTCATTCATTTGCACTAGGGGTTGGCACTCCTGGTCAGGCA
TTGGCATTGGGGCAACACATATGGCTGGAGGAAGCAGATAAAGCATCTAATAAAATCTGCACACGAGCTAATTTAGCAAA
GATGGCAGGAATCCAGACTATTTTATGTGTGCCAACCATGAATGGGGTGGTCGAGCTTGGTTCAACTGATTTGATCCACA
GACGCTGGGATGTTGTTGAGCATATTAAGATGGTGTTTCAAGACTCAACGTGGGGGTTAGATGATATGCAAATTATGTCG
CATTCCCAAGTTGCAAATTTTGATTCTACATTGATGCCCTACAATTCAAGTATGACTTTGGATCCGACGAGTATCGCAGG
TACAACCTCCATTACATCGAACACTGACCCAGGTCTAGCAGACCACGAAAGCGTAGACTTTAATGCAAGGCTCTTTCACA
CGGATAAGCTGTTACTCCATTCGGGTGGACTTGGCTTCAACGGGTTCGACCATTTGTGGGGACAAACAAACGATTTCCAC
TGCAATGATCCACTTCCAGATGATGTTGAGAATAGTTTAGGACAATCCATGTATGATATTTTAGGTAAATTACCTCTACA
GGAAGAGCAGCTTCCTTTCCTAGCATCTAGTTCATTTCCTAAAAACCCCGAGTCAAACTCGCGACATTCAGTTTTTCAGA
TGAACAATGTAGGGAAACTCCCGCAATTAGATCATAGATCGGCATCCTTGCCTGCAATGGAAAAACCACATTCCAAGCTG
CAGCCAACCTATACCTTTCCGCAATACAGTGGTGACTTCGAAGTGGGTGAAATGTATAATCCTCCAGGGCATATTCCAAC
AATGAGACCAAAGCTACCCAATCAAGAGGAACGGGTGCAATTTCCTTTAATGCCTGCAGTTGAAAAGGCTTCAGTTATTG
AGAAGCCTATGCCTTTATTGAAGCCAATTCCACAATCTCCGTCTCCACCAGTTTCGAAGCCAGCGGGACCTTCTGTTTCT
GCTAATGGATTGAAGCTTACCGATCACCTAGGCCAGGATTTTGTAGATCCGGAGTCTGTAACAATAAAAGTAAATGTGAT
GGAGGCTCCAAAGCTCCCTCGCAAGCGAGGTCGGAAGCCTGCTAATGACAGGGAAGAGCCGCTGAACCATGTTCAAGCCG
AGCGGCAACGGCGAGAGAAGCTCAATAAACGCTTTTATGCACTTCGAGCCGTTGTGCCAAATGTCTCAAAGATGGACAAA
GCTTCATTGCTGGGCGATGCTATTGCGCACATCAACTACCTGCAGGAGAAACTTCATGACGCAGAAATGCGCATAAAGGA
CCTTCAGAGGGTTTGCAGTGCGAAGCGCGAGCGTGGCCAAGAGGCTCTTGTAATTGGTGCACCTAAAGACGATACCCAAC
TGAAGCCTGAGAGGAATGGCACTCGGCCTGTGTTTGGCATATTTCCTGGTGGTAAGAGGTTCAGCATTGCCGTGAACGTC
TTCGGAGAGGAGGCAATGATACGAGTCAACTGCGTGCGAGATGCTTACTCTGTTGTCAACATGATGATGGCCCTGCAAGA
ATTGCGCTTGGACATACAACATTCTAATACATCCTCCACAAGTGATGACATCTTGCATATTGTTGTTGCTAAGATGAAAC
CAACTGAAAGGTACACGCAGGAGCAGCTCGCTGCGTTACTTGAAAGGTCTGATCAAGCCACTGGGTATTTGACGAAGCGG
GAAGGAAGTGACACGGCTTTGCAAAAACTAGGCAATCCTCCCTAA
Microexon DNA seq ATGGCAGGAATCCAG
Microexon Amino Acid seq MAGIQ
Microexon-tag DNA Seq GCAGATAAAGCATCTAATAAAATCTGCACACGAGCTAATTTAGCAAAGATGGCAGGAATCCAGACTATTTTATGTGTGCCAACCATGAATGGGGTGGTCGAGCTTGGT
Microexon-tag Amino Acid seq ADKASNKICTRANLAKMAGIQTILCVPTMNGVVELG
Transcript ID Pp.4843.1
Gene ID Pp.4843
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 2.5e-22
Motif start 51
Motif end 121
Protein seq >Pp.4843.1
MVLSLHLSSAFLFLRLLAILLFNLADVCRPSRIYYSSRKLDWVTLCFECSTPGQALALGQHIWLEEADKASNKICTRANL
AKMAGIQTILCVPTMNGVVELGSTDLIHRRWDVVEHIKMVFQDSTWGLDDMQIMSHSQVANFDSTLMPYNSSMTLDPTSI
AGTTSITSNTDPGLADHESVDFNARLFHTDKLLLHSGGLGFNGFDHLWGQTNDFHCNDPLPDDVENSLGQSMYDILGKLP
LQEEQLPFLASSSFPKNPESNSRHSVFQMNNVGKLPQLDHRSASLPAMEKPHSKLQPTYTFPQYSGDFEVGEMYNPPGHI
PTMRPKLPNQEERVQFPLMPAVEKASVIEKPMPLLKPIPQSPSPPVSKPAGPSVSANGLKLTDHLGQDFVDPESVTIKVN
VMEAPKLPRKRGRKPANDREEPLNHVQAERQRREKLNKRFYALRAVVPNVSKMDKASLLGDAIAHINYLQEKLHDAEMRI
KDLQRVCSAKRERGQEALVIGAPKDDTQLKPERNGTRPVFGIFPGGKRFSIAVNVFGEEAMIRVNCVRDAYSVVNMMMAL
QELRLDIQHSNTSSTSDDILHIVVAKMKPTERYTQEQLAALLERSDQATGYLTKREGSDTALQKLGNPP*
CDS seq >Pp.4843.1
ATGGTTTTGTCTCTGCATCTCTCTTCCGCTTTTCTTTTCCTTCGGCTATTAGCTATATTGCTTTTCAATTTGGCTGATGT
ATGCAGACCAAGTCGCATCTATTACTCCTCGAGAAAGCTTGATTGGGTCACTCTTTGTTTTGAATGTAGCACTCCTGGTC
AGGCATTGGCATTGGGGCAACACATATGGCTGGAGGAAGCAGATAAAGCATCTAATAAAATCTGCACACGAGCTAATTTA
GCAAAGATGGCAGGAATCCAGACTATTTTATGTGTGCCAACCATGAATGGGGTGGTCGAGCTTGGTTCAACTGATTTGAT
CCACAGACGCTGGGATGTTGTTGAGCATATTAAGATGGTGTTTCAAGACTCAACGTGGGGGTTAGATGATATGCAAATTA
TGTCGCATTCCCAAGTTGCAAATTTTGATTCTACATTGATGCCCTACAATTCAAGTATGACTTTGGATCCGACGAGTATC
GCAGGTACAACCTCCATTACATCGAACACTGACCCAGGTCTAGCAGACCACGAAAGCGTAGACTTTAATGCAAGGCTCTT
TCACACGGATAAGCTGTTACTCCATTCGGGTGGACTTGGCTTCAACGGGTTCGACCATTTGTGGGGACAAACAAACGATT
TCCACTGCAATGATCCACTTCCAGATGATGTTGAGAATAGTTTAGGACAATCCATGTATGATATTTTAGGTAAATTACCT
CTACAGGAAGAGCAGCTTCCTTTCCTAGCATCTAGTTCATTTCCTAAAAACCCCGAGTCAAACTCGCGACATTCAGTTTT
TCAGATGAACAATGTAGGGAAACTCCCGCAATTAGATCATAGATCGGCATCCTTGCCTGCAATGGAAAAACCACATTCCA
AGCTGCAGCCAACCTATACCTTTCCGCAATACAGTGGTGACTTCGAAGTGGGTGAAATGTATAATCCTCCAGGGCATATT
CCAACAATGAGACCAAAGCTACCCAATCAAGAGGAACGGGTGCAATTTCCTTTAATGCCTGCAGTTGAAAAGGCTTCAGT
TATTGAGAAGCCTATGCCTTTATTGAAGCCAATTCCACAATCTCCGTCTCCACCAGTTTCGAAGCCAGCGGGACCTTCTG
TTTCTGCTAATGGATTGAAGCTTACCGATCACCTAGGCCAGGATTTTGTAGATCCGGAGTCTGTAACAATAAAAGTAAAT
GTGATGGAGGCTCCAAAGCTCCCTCGCAAGCGAGGTCGGAAGCCTGCTAATGACAGGGAAGAGCCGCTGAACCATGTTCA
AGCCGAGCGGCAACGGCGAGAGAAGCTCAATAAACGCTTTTATGCACTTCGAGCCGTTGTGCCAAATGTCTCAAAGATGG
ACAAAGCTTCATTGCTGGGCGATGCTATTGCGCACATCAACTACCTGCAGGAGAAACTTCATGACGCAGAAATGCGCATA
AAGGACCTTCAGAGGGTTTGCAGTGCGAAGCGCGAGCGTGGCCAAGAGGCTCTTGTAATTGGTGCACCTAAAGACGATAC
CCAACTGAAGCCTGAGAGGAATGGCACTCGGCCTGTGTTTGGCATATTTCCTGGTGGTAAGAGGTTCAGCATTGCCGTGA
ACGTCTTCGGAGAGGAGGCAATGATACGAGTCAACTGCGTGCGAGATGCTTACTCTGTTGTCAACATGATGATGGCCCTG
CAAGAATTGCGCTTGGACATACAACATTCTAATACATCCTCCACAAGTGATGACATCTTGCATATTGTTGTTGCTAAGAT
GAAACCAACTGAAAGGTACACGCAGGAGCAGCTCGCTGCGTTACTTGAAAGGTCTGATCAAGCCACTGGGTATTTGACGA
AGCGGGAAGGAAGTGACACGGCTTTGCAAAAACTAGGCAATCCTCCCTAA