Microexon ID Pp_5:3115413-3115425:-
Species Physcomitrium patens
Coordinates 5:3115413..3115425
Microexon Cluster ID MEP32
Size 13
Phase 0
Pfam Domain Motif MCM6_C
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,13,47
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq AMYRTTGAWSTKWCWGATGTYCTYTCTARYTTCCCKGACATMTCARTGGHWCTGRYTGAAGAWATYATGGAKARRCTWSTWAAMSAWRRTRTACTRTCAARRRCRGGA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
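The Microexon ID packs the chromosome, the coordinates, and the strand into one string. The following is a minimal Python sketch, assuming the layout `<prefix>_<chromosome>:<start>-<end>:<strand>` with inclusive coordinates (an assumption inferred from this record, not a documented format). The function name `parse_microexon_id` is hypothetical. It recovers the fields and checks them against the Size and Microexon-tag structure listed above.

```python
import re

def parse_microexon_id(microexon_id):
    """Split an ID such as 'Pp_5:3115413-3115425:-' into its parts.

    Assumed layout: <prefix>_<chromosome>:<start>-<end>:<strand>,
    with inclusive start and end coordinates.
    """
    match = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chromosome, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "prefix": prefix,
        "chromosome": chromosome,
        "start": start,
        "end": end,
        "strand": strand,
        "size": end - start + 1,  # inclusive coordinates
    }

record = parse_microexon_id("Pp_5:3115413-3115425:-")

# Values copied from this record: Size, and the
# (flanking exon, microexon, flanking exon) sizes of the Microexon-tag.
reported_size = 13
tag_structure = (48, 13, 47)

size_matches = record["size"] == reported_size == tag_structure[1]
tag_length = sum(tag_structure)
```

Under these assumptions the coordinate span equals the reported Size, and the three segment sizes sum to the length of the Microexon-tag.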
Transcript ID Pp3c5_4780V3.2
Protein ID Pp3c5_4780V3.2
Gene ID Pp3c5_4780
Gene Name NA
Pfam domain motif MCM6_C
Motif E-value 0.0075
Motif start 364
Motif end 448
Protein seq >Pp3c5_4780V3.2
MTLLRVEATPAEATRQWKEVSEAESLQLTRNLLRIAVFNISYIRGLFPDSFFQDKFVPALGKHSGHFSSLWSVQPSQYFL
EQAGRTVSAELYLFFLGVMNVYFVHRYLCDCLTDCNLAACEFYDMQSIFFKSRSKSDVRNAVTEMNVKKLQPKDMESKRF
IEWIEKVSFKYSAKGPSEDITSMTIESVSGKPVRTDTTNQTTPQQMKQSACKMVRTLVQLMHTLDHVPKERTIIMKLHYY
DDVTPEDYEPPFFSSSTYEDPSPWAGEPLKMKVGYVDSRHYKLAIKVKSVLDPCEDENELAKNEMCSGTDVMVTSDDDSS
TSSGANDDNTEPSQKSQPRGDVAIQSPENKEQLESFDEETSLANKFKKMRTTAAVAHISDTDRTEDSEFEAESLRSVRIY
LQNREDKTVHMNDIMKEFPQISKVVMGELLDRLVEEEVLIRLKKDLFRVKKIQINPKERTKYSNETLQHNGINCKSLTSN
VKPNKIPVPTQKANESLPDEQQGLYEKALLVTLPREYIAVAQLQEAFQDLSPGKIRQLISRMWLDGYIEKVPHIRCKGRQ
VFHTPATNKKLHELRVSYCSKMQKDEPGRKKYDNMDASQDWGVLIREKAQSRTLDELVCGRTQALGSDDTRQDQPAVKKH
TKMAPAFKEPALPQKPLRERAKITPSTLGAKGKRIRVLNSDSLMNVNKDSQGEVVDRAGKASKVLSK*
CDS seq >Pp3c5_4780V3.2
ATGACCTTACTTCGGGTTGAAGCCACTCCTGCAGAGGCTACTCGGCAATGGAAGGAGGTTTCGGAAGCGGAGTCATTGCA
GCTTACGAGGAACCTCCTCCGGATTGCTGTATTCAATATTAGCTATATCAGGGGCCTGTTTCCTGATAGCTTTTTTCAGG
ACAAGTTTGTGCCTGCATTAGGTAAGCATAGTGGACATTTCAGCAGCCTCTGGAGTGTACAACCTTCGCAGTATTTCTTG
GAACAGGCCGGTCGTACAGTGTCTGCCGAATTGTATCTTTTCTTTTTGGGAGTGATGAATGTTTACTTTGTCCATCGTTA
CCTGTGTGATTGTCTCACCGATTGTAACCTCGCAGCTTGCGAATTCTATGACATGCAATCAATTTTCTTCAAAAGTCGCT
CTAAATCTGACGTTCGAAATGCTGTGACAGAGATGAACGTCAAGAAGTTGCAGCCAAAGGATATGGAATCGAAGAGATTC
ATCGAATGGATTGAGAAAGTTTCGTTTAAGTACTCTGCTAAAGGGCCGTCGGAGGATATCACCTCCATGACTATTGAGAG
CGTAAGTGGCAAGCCTGTGCGCACTGACACAACTAATCAGACGACTCCCCAGCAGATGAAGCAGTCCGCATGTAAGATGG
TGCGCACACTTGTGCAGCTGATGCACACACTGGACCATGTCCCGAAAGAGCGCACCATCATTATGAAGCTCCACTACTAC
GACGACGTAACACCAGAAGATTACGAGCCTCCTTTTTTCTCATCTAGTACTTATGAAGACCCTAGTCCATGGGCCGGGGA
GCCTTTGAAGATGAAGGTTGGTTATGTCGACAGCAGACATTACAAGCTCGCAATCAAGGTGAAGAGCGTGTTAGATCCTT
GTGAAGATGAGAACGAGTTGGCCAAGAATGAGATGTGCTCTGGTACTGATGTAATGGTCACAAGTGATGATGATTCGTCA
ACCAGCTCTGGGGCCAATGATGATAATACAGAGCCCAGTCAGAAATCCCAGCCAAGAGGGGATGTTGCAATACAGAGTCC
AGAAAACAAAGAGCAACTTGAGTCATTTGATGAAGAAACGTCTCTTGCGAACAAGTTTAAGAAAATGCGGACAACTGCTG
CAGTTGCTCATATTAGTGACACTGATAGAACAGAAGATTCCGAGTTCGAGGCTGAGAGTTTGCGAAGCGTGAGAATTTAC
CTTCAGAATAGAGAAGATAAGACTGTCCACATGAACGACATTATGAAGGAGTTCCCTCAAATTTCAAAGGTTGTCATGGG
AGAGCTTCTGGATCGCCTCGTTGAAGAAGAGGTGCTAATAAGGCTGAAAAAGGACCTATTCCGTGTGAAGAAAATTCAGA
TTAATCCGAAAGAGCGCACGAAGTACTCGAACGAAACCCTGCAGCATAATGGAATAAACTGTAAGTCTCTTACCTCAAAT
GTCAAACCAAACAAAATACCCGTACCAACACAAAAGGCCAACGAATCATTGCCCGATGAACAGCAGGGCCTCTACGAAAA
AGCCCTTCTTGTCACATTGCCGCGGGAATACATAGCAGTGGCGCAGCTTCAAGAGGCCTTTCAAGATCTTAGCCCTGGAA
AAATTCGCCAACTAATATCTCGAATGTGGCTCGATGGTTACATAGAGAAAGTCCCTCACATAAGATGCAAAGGTCGCCAG
GTTTTTCACACCCCCGCAACAAACAAGAAACTTCACGAATTGCGTGTGTCATATTGTTCAAAGATGCAGAAGGACGAGCC
TGGGAGAAAGAAATACGATAACATGGACGCATCGCAAGATTGGGGCGTTCTCATTCGCGAAAAAGCTCAATCCAGGACCC
TAGACGAATTGGTGTGCGGTCGCACGCAAGCTCTTGGCTCCGATGATACTCGACAGGACCAGCCAGCTGTTAAAAAGCAC
ACCAAAATGGCCCCAGCTTTCAAAGAACCAGCTCTGCCACAAAAGCCACTACGAGAACGAGCTAAAATTACACCATCAAC
TCTAGGAGCCAAGGGGAAAAGGATCAGAGTGCTCAATTCAGACTCACTTATGAATGTCAATAAGGACAGCCAAGGCGAGG
TGGTAGACAGGGCAGGAAAAGCCAGCAAGGTATTGTCAAAGTAA
Microexon DNA seq GTTGTCATGGGAG
Microexon Amino Acid seq VVMGE
Microexon-tag DNA Seq ACTGTCCACATGAACGACATTATGAAGGAGTTCCCTCAAATTTCAAAGGTTGTCATGGGAGAGCTTCTGGATCGCCTCGTTGAAGAAGAGGTGCTAATAAGGCTGAAA
Microexon-tag Amino Acid seq TVHMNDIMKEFPQISKVVMGELLDRLVEEEVLIRLK
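The Microexon-tag fields can be cross-checked against each other. The sketch below splits the tag DNA sequence with the 48,13,47 structure and translates it with the standard genetic code. It assumes that phase 0 means the microexon starts on the first base of a codon, and that the Microexon Amino Acid seq includes the codon completed by the first two bases of the downstream flanking exon. The helper names `translate` and `split_tag` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate complete codons from the first base; ignore a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def split_tag(tag, sizes):
    """Cut the tag into consecutive segments of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("segment sizes do not add up to the tag length")
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    return parts

# Microexon-tag DNA Seq and structure copied from this record.
TAG = (
    "ACTGTCCACATGAACGACATTATGAAGGAGTTCCCTCAAATTTCAAAG"
    "GTTGTCATGGGAG"
    "AGCTTCTGGATCGCCTCGTTGAAGAAGAGGTGCTAATAAGGCTGAAA"
)
SIZES = (48, 13, 47)

upstream, microexon, downstream = split_tag(TAG, SIZES)

tag_protein = translate(TAG)
# Borrow two bases from the downstream exon to complete the last codon.
microexon_protein = translate(microexon + downstream[:2])
starts_on_codon_boundary = len(upstream) % 3 == 0
```

With the values in this record, the middle segment is the Microexon DNA seq, and the translations reproduce the two amino acid sequences listed above.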
Transcript ID Pp.20522.1
Gene ID Pp.20522
Gene Name NA
Pfam domain motif MCM6_C
Motif E-value 0.0069
Motif start 312
Motif end 396
Protein seq >Pp.20522.1
MTLLRVEATPAEATRQWKEVSEAESLQLTRNLLRIAVFNISYIRGLFPDSFFQDKFVPALEMNVKKLQPKDMESKRFIEW
IEKGVFDALKKKYLKTLFLTICRGQDGPLIEEYMFSFKYSAKGPSEDITSMTIESVSGKPVRTDTTNQTTPQQMKQSACK
MVRTLVQLMHTLDHVPKERTIIMKLHYYDDVTPEDYEPPFFSSSTYEDPSPWAGEPLKMKVGYVDSRHYKLAIKVKSVLD
PCEDENELAKNEMCSGTDVMVTSDDDSSTSSGANDDNTEPSQKSQPRGDVAIQSPENKEQLESFDEETSLANKFKKMRTT
AAVAHISDTDRTEDSEFEAESLRSVRIYLQNREDKTVHMNDIMKEFPQISKVVMGELLDRLVEEEVLIRLKKDLFRVKKI
QINPKERTKYSNETLQHNGINCKSLTSNVKPNKIPVPTQKANESLPDEQQGLYEKALLVTLPREYIAVAQLQEAFQDLSP
GKIRQLISRMWLDGYIEKVPHIRCKGRQVFHTPATNKKLHELRVSYCSKMQKDEPGRKKYDNMDASQDWGVLIREKAQSR
TLDELVCGRTQALGSDDTRQDQPAVKKHTKMAPAFKEPALPQKPLRERAKITPSTLGAKGKRIRVLNSDSLMNVNKDSQG
EVVDRAGKASKVDGPIYQPPFKRVRTE*
CDS seq >Pp.20522.1
ATGACCTTACTTCGGGTTGAAGCCACTCCTGCAGAGGCTACTCGGCAATGGAAGGAGGTTTCGGAAGCGGAGTCATTGCA
GCTTACGAGGAACCTCCTCCGGATTGCTGTATTCAATATTAGCTATATCAGGGGCCTGTTTCCTGATAGCTTTTTTCAGG
ACAAGTTTGTGCCTGCATTAGAGATGAACGTCAAGAAGTTGCAGCCAAAGGATATGGAATCGAAGAGATTCATCGAATGG
ATTGAGAAAGGAGTTTTCGATGCCTTAAAAAAGAAGTATCTGAAGACGCTTTTCTTGACGATCTGTCGTGGACAAGATGG
CCCACTGATTGAGGAATATATGTTTTCGTTTAAGTACTCTGCTAAAGGGCCGTCGGAGGATATCACCTCCATGACTATTG
AGAGCGTAAGTGGCAAGCCTGTGCGCACTGACACAACTAATCAGACGACTCCCCAGCAGATGAAGCAGTCCGCATGTAAG
ATGGTGCGCACACTTGTGCAGCTGATGCACACACTGGACCATGTCCCGAAAGAGCGCACCATCATTATGAAGCTCCACTA
CTACGACGACGTAACACCAGAAGATTACGAGCCTCCTTTTTTCTCATCTAGTACTTATGAAGACCCTAGTCCATGGGCCG
GGGAGCCTTTGAAGATGAAGGTTGGTTATGTCGACAGCAGACATTACAAGCTCGCAATCAAGGTGAAGAGCGTGTTAGAT
CCTTGTGAAGATGAGAACGAGTTGGCCAAGAATGAGATGTGCTCTGGTACTGATGTAATGGTCACAAGTGATGATGATTC
GTCAACCAGCTCTGGGGCCAATGATGATAATACAGAGCCCAGTCAGAAATCCCAGCCAAGAGGGGATGTTGCAATACAGA
GTCCAGAAAACAAAGAGCAACTTGAGTCATTTGATGAAGAAACGTCTCTTGCGAACAAGTTTAAGAAAATGCGGACAACT
GCTGCAGTTGCTCATATTAGTGACACTGATAGAACAGAAGATTCCGAGTTCGAGGCTGAGAGTTTGCGAAGCGTGAGAAT
TTACCTTCAGAATAGAGAAGATAAGACTGTCCACATGAACGACATTATGAAGGAGTTCCCTCAAATTTCAAAGGTTGTCA
TGGGAGAGCTTCTGGATCGCCTCGTTGAAGAAGAGGTGCTAATAAGGCTGAAAAAGGACCTATTCCGTGTGAAGAAAATT
CAGATTAATCCGAAAGAGCGCACGAAGTACTCGAACGAAACCCTGCAGCATAATGGAATAAACTGTAAGTCTCTTACCTC
AAATGTCAAACCAAACAAAATACCCGTACCAACACAAAAGGCCAACGAATCATTGCCCGATGAACAGCAGGGCCTCTACG
AAAAAGCCCTTCTTGTCACATTGCCGCGGGAATACATAGCAGTGGCGCAGCTTCAAGAGGCCTTTCAAGATCTTAGCCCT
GGAAAAATTCGCCAACTAATATCTCGAATGTGGCTCGATGGTTACATAGAGAAAGTCCCTCACATAAGATGCAAAGGTCG
CCAGGTTTTTCACACCCCCGCAACAAACAAGAAACTTCACGAATTGCGTGTGTCATATTGTTCAAAGATGCAGAAGGACG
AGCCTGGGAGAAAGAAATACGATAACATGGACGCATCGCAAGATTGGGGCGTTCTCATTCGCGAAAAAGCTCAATCCAGG
ACCCTAGACGAATTGGTGTGCGGTCGCACGCAAGCTCTTGGCTCCGATGATACTCGACAGGACCAGCCAGCTGTTAAAAA
GCACACCAAAATGGCCCCAGCTTTCAAAGAACCAGCTCTGCCACAAAAGCCACTACGAGAACGAGCTAAAATTACACCAT
CAACTCTAGGAGCCAAGGGGAAAAGGATCAGAGTGCTCAATTCAGACTCACTTATGAATGTCAATAAGGACAGCCAAGGC
GAGGTGGTAGACAGGGCAGGAAAAGCCAGCAAGGTTGACGGACCGATTTATCAACCACCCTTCAAACGTGTTCGGACAGA
ATGA
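The two transcripts report the same Pfam motif at different protein coordinates. The sketch below compares the two hits, assuming that Motif start and Motif end are inclusive residue positions (an assumption; the record does not state the convention). The `MotifHit` class is hypothetical and holds only values copied from this record.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class MotifHit:
    transcript_id: str
    motif: str
    start: int      # assumed inclusive residue position
    end: int        # assumed inclusive residue position
    e_value: float

    @property
    def length(self):
        return self.end - self.start + 1

hits = [
    MotifHit("Pp3c5_4780V3.2", "MCM6_C", 364, 448, 0.0075),
    MotifHit("Pp.20522.1", "MCM6_C", 312, 396, 0.0069),
]

# Same motif length in both transcripts means the hit is shifted, not resized.
same_length = hits[0].length == hits[1].length
shift = hits[0].start - hits[1].start
```

Here both hits span 85 residues and differ by a constant shift of 52 residues, which is consistent with the two proteins differing upstream of the motif rather than within it.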