Microexon ID Pp_3:10039830-10039842:-
Species Physcomitrium patens
Coordinates 3:10039830..10039842
Microexon Cluster ID MEP33
Size 13
Phase 2
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 47,13,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (with IUPAC ambiguity codes) GAVTTYYTRTCARTHTCRGARTTYCTWGTKGARRTWTCTGATGAYYTGTWTGAYTAYGAGGATGAYGTKTTRRAGAAYAAYTTCAAYATTYTGCGCATGTTTGTYRRA
Microexon DNA seq GTACGACTATGAG
Microexon Amino Acid seq LYDYE
Microexon-tag DNA Seq GAGTTTCTTTCTATCTCGGAGCTTCTCGTGGAGATGGCTGATGATTTGTACGACTATGAGGCAGATATAGGCAAGAATAGCTTCAACGTGCTTCGAATGTTTTTGTAC
Microexon-tag Amino Acid Seq EFLSISELLVEMADDLYDYEADIGKNSFNVLRMFLY
Microexon-tag spanning region 10039661-10040055
Microexon-tag prediction score 0.8966
Overlapped with the annotated transcript (%) 100
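
The fields above can be cross-checked against one another. The following is a minimal sketch, not part of the database record: it splits the microexon-tag by the listed exon sizes (47,13,48), recomputes the microexon size from its coordinates, and translates the tag with the standard genetic code. It assumes that "Phase" counts the nucleotides of a split codon that lie in the upstream flanking exon, and that the tag is read in frame from its first nucleotide. All variable and function names are hypothetical.

```python
# Values copied from the record above.
TAG_DNA = "GAGTTTCTTTCTATCTCGGAGCTTCTCGTGGAGATGGCTGATGATTTGTACGACTATGAGGCAGATATAGGCAAGAATAGCTTCAACGTGCTTCGAATGTTTTTGTAC"
TAG_STRUCTURE = (47, 13, 48)          # flanking exon, microexon, flanking exon
MICROEXON_DNA = "GTACGACTATGAG"
MICROEXON_START, MICROEXON_END = 10039830, 10039842
SPAN_START, SPAN_END = 10039661, 10040055

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons of `dna`, starting at its first nucleotide."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut `tag` into consecutive pieces with the given lengths."""
    parts, pos = [], 0
    for n in sizes:
        parts.append(tag[pos:pos + n])
        pos += n
    return parts


upstream, microexon, downstream = split_tag(TAG_DNA, TAG_STRUCTURE)

# Inclusive genomic coordinates, so size = end - start + 1.
microexon_size = MICROEXON_END - MICROEXON_START + 1
span_length = SPAN_END - SPAN_START + 1

# Assumption: phase = nucleotides of the split codon contributed upstream.
phase = len(upstream) % 3

# Residues whose codons overlap the microexon, including the split codon.
tag_protein = translate(TAG_DNA)
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
microexon_aa = tag_protein[first_codon:last_codon + 1]

print(microexon, microexon_size, phase, microexon_aa)
print(tag_protein)
```

Under these assumptions, the leading L of the microexon amino acid sequence comes from a codon split across the upstream splice junction: TT from the flanking exon and G from the microexon.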
New Transcript ID Pp3c3_14090V3.1x
Reference Transcript ID Pp3c3_14090V3.1
Gene ID Pp3c3_14090
Gene Name NA
Transcript ID Pp3c3_14090V3.1
Protein ID Pp3c3_14090V3.1
Gene ID Pp3c3_14090
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c3_14090V3.1
MEALERLQAVHAHLRLLHTHDIITTHSPSNRFLADFLLLLGESAGNSAEMEELCGVLVHAIPKMMRTSLFDEMACANASA
AGSRSVPNSPRSPGDELKIEDVTPPKSNRKGHRRSKNSEMSPFVREKTLTTPDHRSASYEGGVVNCLISFKSMERARSTL
EDFCRSYFMFHNMDVHNPIHVFRYLPLLVFVESYIYQLDEQNEDQLCFSALTGDSKPSCEVSSEFASGLGKDPFAGLRVV
LQEHDLLTKRIEDELMNGLHYWHLEHILCTAISAKEEVKADDVVEALRLKSFDYRVLNLLMYGLRNEPVNEAHFEFLSIS
ELLVEMADDLYDYEADIGKNSFNVLRMFLYIFGPSEAPTMLADFIGRSEDKYQILLTELEPHLQVQYNKRCEEAVKEGDE
LT*
CDS seq >Pp3c3_14090V3.1
ATGGAGGCACTGGAGCGTTTGCAAGCAGTTCATGCCCACCTTCGGCTTCTTCATACCCATGATATCATCACTACCCACAG
CCCTTCCAACCGATTTCTCGCTGATTTTCTCCTGCTGCTGGGTGAATCGGCCGGGAATTCTGCCGAGATGGAGGAGCTAT
GTGGAGTGTTGGTACATGCAATTCCGAAGATGATGAGGACATCCTTGTTTGATGAGATGGCATGTGCGAATGCGTCAGCT
GCTGGTAGTAGATCAGTACCTAACTCGCCCAGGAGCCCAGGAGATGAGCTCAAGATCGAGGATGTGACTCCGCCTAAAAG
TAATCGTAAAGGTCATCGAAGGAGTAAGAATTCAGAGATGAGCCCATTTGTAAGAGAGAAAACATTGACGACGCCAGACC
ACAGAAGTGCATCATACGAAGGTGGTGTAGTGAATTGCCTGATAAGTTTTAAGAGCATGGAGAGGGCTCGCTCCACACTT
GAAGATTTTTGCAGGTCGTACTTTATGTTTCATAACATGGACGTGCACAATCCCATTCACGTCTTTCGCTACCTGCCGCT
CCTTGTCTTTGTCGAGTCCTACATATATCAGCTAGATGAGCAGAACGAAGATCAATTATGTTTTTCAGCATTAACGGGAG
ACTCAAAACCTTCATGCGAGGTCTCAAGTGAGTTCGCAAGTGGATTAGGAAAGGATCCATTCGCAGGATTGCGAGTTGTA
CTGCAGGAACATGATTTACTAACAAAGAGGATTGAAGATGAGCTAATGAATGGTCTTCATTATTGGCATTTGGAGCATAT
TCTTTGTACTGCGATTTCAGCTAAAGAAGAGGTTAAGGCTGATGATGTTGTGGAAGCTTTACGCCTCAAGTCATTTGACT
ATCGTGTGCTAAACCTTCTCATGTATGGTTTGCGCAATGAACCGGTCAATGAGGCGCACTTTGAGTTTCTTTCTATCTCG
GAGCTTCTCGTGGAGATGGCTGATGATTTGTACGACTATGAGGCAGATATAGGCAAGAATAGCTTCAACGTGCTTCGAAT
GTTTTTGTACATATTCGGGCCTAGTGAAGCCCCAACAATGCTTGCTGATTTTATTGGAAGAAGTGAAGACAAGTATCAAA
TTCTGTTGACAGAGCTGGAACCGCATCTTCAAGTCCAGTATAACAAACGGTGCGAGGAGGCTGTCAAAGAAGGTGATGAG
CTCACGTAG
Transcript ID Pp3c3_14090V3.2
Gene ID Pp.18185
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c3_14090V3.2
MWSVGTCNSEAAGSRSVPNSPRSPGDELKIEDVTPPKSNRKGHRRSKNSEMSPFVREKTLTTPDHRSASYEGGVVNCLIS
FKSMERARSTLEDFCRSYFMFHNMDVHNPIHVFRYLPLLVFVESYIYQLDEQNEDQLCFSALTGDSKPSCEVSSEFASGL
GKDPFAGLRVVLQEHDLLTKRIEDELMNGLHYWHLEHILCTAISAKEEVKADDVVEALRLKSFDYRVLNLLMYGLRNEPV
NEAHFEFLSISELLVEMADDLYDYEADIGKNSFNVLRMFLYIFGPSEAPTMLADFIGRSEDKYQILLTELEPHLQVQYNK
RCEEAVKEGVYKGSHRCEGQAKTRMPTANTLLP*
CDS seq >Pp3c3_14090V3.2
ATGTGGAGTGTTGGTACATGCAATTCCGAAGCTGCTGGTAGTAGATCAGTACCTAACTCGCCCAGGAGCCCAGGAGATGA
GCTCAAGATCGAGGATGTGACTCCGCCTAAAAGTAATCGTAAAGGTCATCGAAGGAGTAAGAATTCAGAGATGAGCCCAT
TTGTAAGAGAGAAAACATTGACGACGCCAGACCACAGAAGTGCATCATACGAAGGTGGTGTAGTGAATTGCCTGATAAGT
TTTAAGAGCATGGAGAGGGCTCGCTCCACACTTGAAGATTTTTGCAGGTCGTACTTTATGTTTCATAACATGGACGTGCA
CAATCCCATTCACGTCTTTCGCTACCTGCCGCTCCTTGTCTTTGTCGAGTCCTACATATATCAGCTAGATGAGCAGAACG
AAGATCAATTATGTTTTTCAGCATTAACGGGAGACTCAAAACCTTCATGCGAGGTCTCAAGTGAGTTCGCAAGTGGATTA
GGAAAGGATCCATTCGCAGGATTGCGAGTTGTACTGCAGGAACATGATTTACTAACAAAGAGGATTGAAGATGAGCTAAT
GAATGGTCTTCATTATTGGCATTTGGAGCATATTCTTTGTACTGCGATTTCAGCTAAAGAAGAGGTTAAGGCTGATGATG
TTGTGGAAGCTTTACGCCTCAAGTCATTTGACTATCGTGTGCTAAACCTTCTCATGTATGGTTTGCGCAATGAACCGGTC
AATGAGGCGCACTTTGAGTTTCTTTCTATCTCGGAGCTTCTCGTGGAGATGGCTGATGATTTGTACGACTATGAGGCAGA
TATAGGCAAGAATAGCTTCAACGTGCTTCGAATGTTTTTGTACATATTCGGGCCTAGTGAAGCCCCAACAATGCTTGCTG
ATTTTATTGGAAGAAGTGAAGACAAGTATCAAATTCTGTTGACAGAGCTGGAACCGCATCTTCAAGTCCAGTATAACAAA
CGGTGCGAGGAGGCTGTCAAAGAAGGTGTTTACAAGGGTAGCCACAGATGTGAGGGGCAGGCCAAAACAAGGATGCCTAC
GGCAAACACACTACTTCCATAA