Microexon ID Pp_9:733216-733220:+
Species Physcomitrium patens
Coordinates 9:733216..733220
Microexon Cluster ID MEP07
Size 5
Phase 1
Pfam Domain Motif Peptidase_M1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 33,19,5,12,39
Microexon location in the Microexon-tag 3
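The Coordinates, Size, Structure and location fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming that the coordinates are 1-based and inclusive, that the structure field lists exon sizes in tag order, and that the location field is a 1-based index into that list. The helper names `parse_coordinates` and `microexon_span` are hypothetical and are not part of the database.

```python
def parse_coordinates(text):
    """Parse 'chrom:start..end' (assumed 1-based, inclusive) into (chrom, start, end)."""
    chrom, span = text.split(":")
    start, end = (int(value) for value in span.split(".."))
    return chrom, start, end


def microexon_span(sizes, location):
    """Return the 0-based, end-exclusive (start, end) of the exon at the
    1-based position `location` within the concatenated tag."""
    if not 1 <= location <= len(sizes):
        raise ValueError("location must be a 1-based index into sizes")
    start = sum(sizes[:location - 1])
    return start, start + sizes[location - 1]


# Values copied from the record above.
chrom, g_start, g_end = parse_coordinates("9:733216..733220")
genomic_size = g_end - g_start + 1          # inclusive coordinates

sizes = [33, 19, 5, 12, 39]
tag_start, tag_end = microexon_span(sizes, 3)
tag_length = sum(sizes)                     # total tag length in nucleotides

print(genomic_size, (tag_start, tag_end), tag_length)
```

Under these assumptions the genomic span and the third entry of the structure field both give a 5-nt microexon, and the five exons add up to a 108-nt tag, that is, 36 complete codons.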
Microexon-tag DNA Seq GCWCAYCCTGTTCGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TCACT
Microexon Amino Acid seq VT
Microexon-tag DNA Seq GCACATCCAGTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGA
Microexon-tag Amino Acid Seq AHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYQTLLG
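The four sequence fields above are mutually consistent, which can be checked by cutting the tag at the listed exon sizes and translating it. This is a minimal Python sketch, assuming the standard genetic code, a tag that starts in frame 0, and a reading of Phase as the number of bases of the split codon contributed by the upstream exon. The function names `split_exons` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def split_exons(tag, sizes):
    """Cut the tag sequence into consecutive exons of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("exon sizes do not add up to the tag length")
    exons, pos = [], 0
    for size in sizes:
        exons.append(tag[pos:pos + size])
        pos += size
    return exons


def translate(dna):
    """Translate complete codons from frame 0, ignoring any trailing bases."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record above.
TAG = ("GCACATCCAGTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACT"
       "GTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGA")
SIZES = [33, 19, 5, 12, 39]
LOCATION = 3  # 1-based position of the microexon in SIZES

exons = split_exons(TAG, SIZES)
microexon = exons[LOCATION - 1]
peptide = translate(TAG)

# Codons that overlap the microexon (0-based codon indices).
start = sum(SIZES[:LOCATION - 1])
end = start + SIZES[LOCATION - 1]
microexon_residues = peptide[start // 3:(end - 1) // 3 + 1]
phase = start % 3  # bases of the first overlapping codon that lie upstream

print(microexon, microexon_residues, phase)
print(peptide)
```

With the values from this record, the third slice reproduces the microexon DNA sequence, the translation reproduces the tag amino acid sequence, and the two codons overlapping the microexon give the listed residues. The computed offset of 1 agrees with the Phase field only under the reading of phase stated above.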
Microexon-tag spanning region 732997-733623
Microexon-tag prediction score 0.9449
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c9_1240V3.3x
Reference Transcript ID Pp3c9_1240V3.3
Gene ID Pp3c9_1240
Gene Name NA
Transcript ID Pp3c9_1240V3.1
Protein ID Pp3c9_1240V3.1
Gene ID Pp3c9_1240
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 6.7e-43
Motif start 235
Motif end 435
Protein seq >Pp3c9_1240V3.1
MKMEESVKETPKEIFLKDYKAPHYAFEKVNLKFVLGEEKTLVTSNIRVLPRSSDGAPLILDGERLNLVTLNINGSPVPEG
SYKFNSRQLNITSLPTSPFDMEIVTEIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYFQDRPDVMSKFTTRIEADKTL
YPVLLSNGNLVDEGDLADGKHFAVWDDPWTKPCYLFALVAGQLVSRDDSFTTMSGRKVALRIWTPPQDIPKTVHAMKSLQ
LAMKWDEEVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSRLVLASPETATDADYAAIEGVIGHEYFHNWTGNRVTCR
DWFQLSLKEGLTVFRDQEFSSDMGSRGVKRIADVGRLRTAQFSQDAGPMAHPVRPHSYIKMDNFYTVTGAEVVRMYQTLL
GNAGFRKGMDLYFQRHDGQAVTCEEFLAAMFDANNVKFPTFPLWYAQAGTPTLTVTTSYDAGAQTFTIKCKQEVPPTPGQ
LKKEPMLLPLAVGLLDSHGHDMRLTSVKDGTSLHNLTSVDGDYTTTAVLHVDKAEQEFTFVNITEKPVPSLLRNFSAPVR
LVSDVTNDDLFFLLAHDSDQFNRWEAGQTLSRKLMLDLISAQQKGEDLSVDSAFIEGMRSTVMDSSLDKEFVAKCLTLPT
ESEIADLMDVADPDAIHNVRRFIIMEIATNMRDVLLKMVEANRSSATYDPSHVHRAQRALKNVALGYLAMLNEPEIIDLA
VNENKNATNMTDQVSALAAICQNSGDARSKALAEFYEQWKDETLVMNKWLALQAMSNIPGNVENVRGLMEHPAFDIRNPN
KVYSLIGGFCASAVNFHAKDGSGYTFLADVVLQLDKLNPQVASRMVSAFSRWRRFDEGRQALAKAQLERITSQDGLSDNV
FEIASKSLAS*
CDS seq >Pp3c9_1240V3.1
ATGAAGATGGAGGAGTCCGTTAAGGAGACTCCTAAGGAGATCTTTCTGAAGGATTACAAGGCACCCCATTATGCTTTCGA
AAAGGTAAATTTAAAGTTTGTGTTGGGAGAAGAGAAGACTCTAGTGACGTCCAACATACGTGTCCTTCCCAGGTCTTCTG
ACGGTGCACCATTGATACTCGATGGAGAGAGGTTGAACTTGGTTACTCTAAATATCAATGGTTCGCCTGTCCCGGAGGGT
AGTTACAAGTTTAATTCTCGGCAATTGAATATTACATCGTTGCCCACCAGCCCTTTCGACATGGAAATTGTCACGGAAAT
TGAGCCACAAAATAATACTTCTCTGGAAGGATTGTATAAGTCATCTGGCAATTTCTGCACTCAATGTGAAGCAGAGGGCT
TCCGTAAGATCACTTACTTCCAGGATCGCCCAGATGTGATGTCTAAGTTCACAACACGTATTGAGGCAGATAAAACTCTT
TACCCAGTGTTGTTAAGCAACGGCAACTTGGTTGATGAGGGTGATTTGGCTGATGGCAAGCATTTCGCAGTGTGGGATGA
TCCCTGGACAAAGCCTTGCTACTTGTTTGCACTTGTAGCTGGCCAGCTTGTGAGTCGTGATGACTCATTTACCACAATGT
CGGGGCGCAAAGTGGCTCTTAGAATCTGGACTCCTCCGCAAGATATTCCTAAGACCGTTCACGCAATGAAATCTTTGCAA
CTGGCTATGAAGTGGGATGAAGAGGTATTTGGTTTGGAATATGACCTAGACCTCTTCAACATTGTTGCTGTGCCGGATTT
CAACATGGGTGCTATGGAGAATAAGAGCTTAAATATTTTTAATTCTAGGTTGGTATTAGCATCTCCTGAGACAGCTACAG
ATGCAGATTACGCTGCTATTGAGGGAGTTATTGGCCACGAGTATTTTCACAATTGGACTGGCAACCGAGTGACATGTAGG
GATTGGTTTCAGCTTAGTTTGAAGGAGGGACTTACTGTTTTCAGAGATCAGGAATTTTCATCTGATATGGGTAGTCGTGG
AGTAAAACGCATTGCTGATGTGGGTCGTTTAAGGACTGCGCAATTTTCTCAGGATGCTGGTCCAATGGCACATCCAGTAC
GGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTA
GGAAATGCCGGCTTTAGAAAGGGTATGGACTTGTATTTTCAAAGGCACGACGGTCAGGCTGTCACTTGTGAGGAGTTCCT
GGCTGCTATGTTCGATGCCAATAATGTTAAATTCCCGACATTTCCCCTTTGGTACGCCCAAGCGGGAACTCCAACTCTGA
CTGTGACCACCTCATACGACGCTGGTGCACAAACATTCACCATCAAGTGCAAGCAAGAAGTTCCCCCCACCCCAGGACAA
TTAAAGAAGGAACCAATGCTTCTACCTTTGGCTGTAGGGTTGCTTGATTCACACGGGCACGACATGCGTCTTACTTCTGT
GAAAGATGGAACCTCCCTGCACAACTTGACTAGTGTAGATGGTGACTACACGACCACTGCAGTACTGCATGTTGACAAGG
CTGAGCAAGAATTTACATTTGTGAATATCACGGAGAAGCCAGTTCCGTCACTGCTGCGTAATTTCAGTGCACCAGTTCGC
CTTGTGTCTGATGTTACAAACGATGACCTTTTCTTTCTCCTGGCTCATGATTCCGATCAATTCAACAGGTGGGAAGCTGG
CCAGACTCTAAGTCGCAAGCTTATGCTGGACCTGATCTCTGCGCAGCAGAAAGGAGAGGATCTTAGCGTAGACAGTGCTT
TTATTGAAGGCATGCGAAGCACTGTTATGGATTCGTCACTTGACAAGGAATTCGTGGCAAAATGCCTTACATTACCGACA
GAAAGTGAGATTGCGGACCTGATGGATGTGGCTGATCCTGATGCCATACATAATGTGAGACGGTTTATCATTATGGAGAT
TGCTACTAATATGCGCGATGTCCTACTTAAAATGGTGGAGGCAAACAGAAGTTCAGCTACTTACGATCCAAGCCATGTGC
ACAGGGCACAGCGTGCTTTAAAGAATGTTGCTTTGGGATACTTGGCGATGTTGAACGAACCTGAGATTATAGACCTGGCT
GTGAACGAAAACAAGAATGCCACAAACATGACTGATCAAGTATCTGCCTTGGCTGCTATTTGCCAGAATTCTGGTGATGC
ACGGTCTAAGGCACTGGCAGAGTTCTACGAACAATGGAAGGATGAGACTTTGGTCATGAACAAATGGCTAGCTTTGCAAG
CCATGTCAAATATCCCAGGAAATGTAGAGAATGTGCGGGGTTTGATGGAGCACCCTGCTTTTGACATCCGGAATCCAAAC
AAGGTCTATTCCTTGATTGGTGGTTTCTGCGCGTCCGCTGTAAATTTTCATGCTAAAGACGGTTCAGGGTACACGTTTTT
GGCTGACGTTGTCCTTCAGTTGGATAAGCTGAATCCCCAGGTTGCGTCACGCATGGTGTCGGCTTTCTCCAGATGGCGGA
GGTTTGATGAAGGCAGGCAGGCATTGGCCAAAGCACAATTGGAGCGGATTACGAGCCAGGATGGTCTGTCAGACAATGTT
TTCGAAATTGCCTCAAAGAGTTTGGCGTCGTAA
Microexon DNA seq TCACT
Microexon Amino Acid seq VT
Microexon-tag DNA Seq GCACATCCAGTACGGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTACCAGACGCTACTAGGA
Microexon-tag Amino Acid seq AHPVRPHSYIKMDNFYTVTVYEKGAEVVRMYQTLLG
Transcript ID Pp.24480.1
Gene ID Pp.24480
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 3.6e-49
Motif start 235
Motif end 439
Protein seq >Pp.24480.1
MKMEESVKETPKEIFLKDYKAPHYAFEKVNLKFVLGEEKTLVTSNIRVLPRSSDGAPLILDGERLNLVTLNINGSPVPEG
SYKFNSRQLNITSLPTSPFDMEIVTEIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYFQDRPDVMSKFTTRIEADKTL
YPVLLSNGNLVDEGDLADGKHFAVWDDPWTKPCYLFALVAGQLVSRDDSFTTMSGRKVALRIWTPPQDIPKTVHAMKSLQ
LAMKWDEEVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSRLVLASPETATDADYAAIEGVIGHEYFHNWTGNRVTCR
DWFQLSLKEGLTVFRDQEFSSDMGSRGVKRIADVGRLRTAQFSQDAGPMAHPVRPHSYIKMDNFYTVTVYEKGAEVVRMY
QTLLGNAGFRKGMDLYFQRHDGQAVTCEEFLAAMFDANNVKFPTFPLWYAQAGTPTLTVTTSYDAGAQTFTIKCKQEVPP
TPGQLKKEPMLLPLAVGLLDSHGHDMRLTSVKDGTSLHNLTSVDGDYTTTAVLHVDKAEQEFTFVNITEKPVPSLLRNFS
APVRLVSDVTNDDLFFLLAHDSDQFNRWEAGQTLSRKLMLDLISAQQKGEDLSVDSAFIEGMRSTVMDSSLDKEFVAKCL
TLPTESEIADLMDVADPDAIHNVRRFIIMEIATNMRDVLLKMVEANRSSATYDPSHVHRAQRALKNVALGYLAMLNEPEI
IDLAVNENKNATNMTDQVSALAAICQNSGDARSKALAEFYEQWKDETLVMNKWLALQAMSNIPGNVENVRGLMEHPAFDI
RNPNKVYSLIGGFCASAVNFHAKDGSGYTFLADVVLQLDKLNPQVASRMVSAFSRWRRFDEGRQALAKAQLERITSQDGL
SDNVFEIASKSLAS*
CDS seq >Pp.24480.1
ATGAAGATGGAGGAGTCCGTTAAGGAGACTCCTAAGGAGATCTTTCTGAAGGATTACAAGGCACCCCATTATGCTTTCGA
AAAGGTAAATTTAAAGTTTGTGTTGGGAGAAGAGAAGACTCTAGTGACGTCCAACATACGTGTCCTTCCCAGGTCTTCTG
ACGGTGCACCATTGATACTCGATGGAGAGAGGTTGAACTTGGTTACTCTAAATATCAATGGTTCGCCTGTCCCGGAGGGT
AGTTACAAGTTTAATTCTCGGCAATTGAATATTACATCGTTGCCCACCAGCCCTTTCGACATGGAAATTGTCACGGAAAT
TGAGCCACAAAATAATACTTCTCTGGAAGGATTGTATAAGTCATCTGGCAATTTCTGCACTCAATGTGAAGCAGAGGGCT
TCCGTAAGATCACTTACTTCCAGGATCGCCCAGATGTGATGTCTAAGTTCACAACACGTATTGAGGCAGATAAAACTCTT
TACCCAGTGTTGTTAAGCAACGGCAACTTGGTTGATGAGGGTGATTTGGCTGATGGCAAGCATTTCGCAGTGTGGGATGA
TCCCTGGACAAAGCCTTGCTACTTGTTTGCACTTGTAGCTGGCCAGCTTGTGAGTCGTGATGACTCATTTACCACAATGT
CGGGGCGCAAAGTGGCTCTTAGAATCTGGACTCCTCCGCAAGATATTCCTAAGACCGTTCACGCAATGAAATCTTTGCAA
CTGGCTATGAAGTGGGATGAAGAGGTATTTGGTTTGGAATATGACCTAGACCTCTTCAACATTGTTGCTGTGCCGGATTT
CAACATGGGTGCTATGGAGAATAAGAGCTTAAATATTTTTAATTCTAGGTTGGTATTAGCATCTCCTGAGACAGCTACAG
ATGCAGATTACGCTGCTATTGAGGGAGTTATTGGCCACGAGTATTTTCACAATTGGACTGGCAACCGAGTGACATGTAGG
GATTGGTTTCAGCTTAGTTTGAAGGAGGGACTTACTGTTTTCAGAGATCAGGAATTTTCATCTGATATGGGTAGTCGTGG
AGTAAAACGCATTGCTGATGTGGGTCGTTTAAGGACTGCGCAATTTTCTCAGGATGCTGGTCCAATGGCACATCCAGTAC
GGCCTCATTCTTACATCAAAATGGATAATTTCTATACAGTCACTGTTTATGAGAAGGGTGCTGAGGTTGTACGAATGTAC
CAGACGCTACTAGGAAATGCCGGCTTTAGAAAGGGTATGGACTTGTATTTTCAAAGGCACGACGGTCAGGCTGTCACTTG
TGAGGAGTTCCTGGCTGCTATGTTCGATGCCAATAATGTTAAATTCCCGACATTTCCCCTTTGGTACGCCCAAGCGGGAA
CTCCAACTCTGACTGTGACCACCTCATACGACGCTGGTGCACAAACATTCACCATCAAGTGCAAGCAAGAAGTTCCCCCC
ACCCCAGGACAATTAAAGAAGGAACCAATGCTTCTACCTTTGGCTGTAGGGTTGCTTGATTCACACGGGCACGACATGCG
TCTTACTTCTGTGAAAGATGGAACCTCCCTGCACAACTTGACTAGTGTAGATGGTGACTACACGACCACTGCAGTACTGC
ATGTTGACAAGGCTGAGCAAGAATTTACATTTGTGAATATCACGGAGAAGCCAGTTCCGTCACTGCTGCGTAATTTCAGT
GCACCAGTTCGCCTTGTGTCTGATGTTACAAACGATGACCTTTTCTTTCTCCTGGCTCATGATTCCGATCAATTCAACAG
GTGGGAAGCTGGCCAGACTCTAAGTCGCAAGCTTATGCTGGACCTGATCTCTGCGCAGCAGAAAGGAGAGGATCTTAGCG
TAGACAGTGCTTTTATTGAAGGCATGCGAAGCACTGTTATGGATTCGTCACTTGACAAGGAATTCGTGGCAAAATGCCTT
ACATTACCGACAGAAAGTGAGATTGCGGACCTGATGGATGTGGCTGATCCTGATGCCATACATAATGTGAGACGGTTTAT
CATTATGGAGATTGCTACTAATATGCGCGATGTCCTACTTAAAATGGTGGAGGCAAACAGAAGTTCAGCTACTTACGATC
CAAGCCATGTGCACAGGGCACAGCGTGCTTTAAAGAATGTTGCTTTGGGATACTTGGCGATGTTGAACGAACCTGAGATT
ATAGACCTGGCTGTGAACGAAAACAAGAATGCCACAAACATGACTGATCAAGTATCTGCCTTGGCTGCTATTTGCCAGAA
TTCTGGTGATGCACGGTCTAAGGCACTGGCAGAGTTCTACGAACAATGGAAGGATGAGACTTTGGTCATGAACAAATGGC
TAGCTTTGCAAGCCATGTCAAATATCCCAGGAAATGTAGAGAATGTGCGGGGTTTGATGGAGCACCCTGCTTTTGACATC
CGGAATCCAAACAAGGTCTATTCCTTGATTGGTGGTTTCTGCGCGTCCGCTGTAAATTTTCATGCTAAAGACGGTTCAGG
GTACACGTTTTTGGCTGACGTTGTCCTTCAGTTGGATAAGCTGAATCCCCAGGTTGCGTCACGCATGGTGTCGGCTTTCT
CCAGATGGCGGAGGTTTGATGAAGGCAGGCAGGCATTGGCCAAAGCACAATTGGAGCGGATTACGAGCCAGGATGGTCTG
TCAGACAATGTTTTCGAAATTGCCTCAAAGAGTTTGGCGTCGTAA