Microexon ID Pp_23:8897688-8897696:-
Species Physcomitrium patens
Coordinates 23:8897688..8897696
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTATTTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAAGAATTGTTGGAATGATAAACAAAAGAAGAAAGGACGACAAGTTTATTTAGGAGCATACGATGAAGAAGAGGCGGCTGCCAGGGCTTACGACCTTGCTGCG
Microexon-tag Amino Acid Seq WDKNCWNDKQKKKGRQVYLGAYDEEEAAARAYDLAA
Microexon-tag spanning region8897443-8897923
Microexon-tag prediction score0.9582
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c23_13360V3.1x
Reference Transcript ID Pp3c23_13360V3.1
Gene ID Pp3c23_13360
Gene Name NA
Transcript ID Pp3c23_13360V3.1
Protein ID Pp3c23_13360V3.1
Gene ID Pp3c23_13360
Gene Name NA
Pfam domain motif AP2
Motif E-value 2e-13
Motif start 59
Motif end 117
Protein seq >Pp3c23_13360V3.1
MEPRQRSLMLSSLVVAEDKVCKRTVRQYPARSKSKRKSFAERELAGADACGGPLKRSSSFRGVTRHRWTGRFEAHLWDKN
CWNDKQKKKGRQVYLGAYDEEEAAARAYDLAALKYWGQSTVINFKLEDYQQQLEEMRNITREEYLATLRRKSSGFSRGVS
KYRGVARHHHNGRWEARIGRVDGNKYLYLGTFGTQEEAARAYDRAAIEYRGPAAVTNFDLTCYTQQLARKPVQSKLAFEE
GQGASSVSVGGSCTNDQMYPPIPLEQSCDMQLLPTLDGDDSQLVDRKSIIFDSGSCRDLSQLLQSPFSPGSDAGGSCIKS
LTKLRYACDGGSSNNLQSLAYEDPAQITESRCSSQYQYTDDSMEQDVIFNANPYLTNDFSHQEYQIVDEKPKICLEDITY
VVNECIPGDKIPEMETSGSSVVTTMMIPRESNYLKSFEIDFDDDSHTECESQIESQVDNNSFWDEFLGSNDPLLQETSFP
DVFDEQQTFRCPSSDSDLTLTFDM*
CDS seq >Pp3c23_13360V3.1
ATGGAGCCGCGGCAGCGAAGCCTGATGCTCAGTAGTTTGGTTGTCGCGGAGGACAAGGTTTGTAAGAGGACTGTGAGGCA
GTACCCTGCGAGGAGTAAAAGCAAGCGGAAGTCGTTTGCGGAGCGAGAGCTCGCGGGAGCTGATGCATGCGGTGGCCCTT
TGAAGAGAAGCTCGTCGTTTAGAGGAGTGACCAGGCACCGATGGACAGGAAGATTTGAGGCACACCTCTGGGACAAGAAT
TGTTGGAATGATAAACAAAAGAAGAAAGGACGACAAGTTTATTTAGGAGCATACGATGAAGAAGAGGCGGCTGCCAGGGC
TTACGACCTTGCTGCGCTCAAATACTGGGGACAGAGTACTGTTATTAACTTCAAGTTGGAAGATTATCAACAACAGCTTG
AGGAGATGAGGAACATTACCCGTGAGGAGTACCTTGCCACTCTTCGAAGAAAAAGCAGCGGCTTCTCGCGGGGAGTTTCC
AAATATCGAGGTGTTGCTAGGCATCATCACAACGGACGCTGGGAAGCTCGCATTGGTCGTGTTGATGGCAACAAGTACTT
GTATCTCGGCACTTTCGGTACACAAGAAGAAGCCGCTCGAGCGTATGACAGGGCTGCCATTGAGTACCGTGGGCCAGCAG
CTGTTACCAACTTCGATCTCACGTGTTACACCCAGCAATTGGCGCGAAAGCCCGTGCAATCGAAGTTGGCATTCGAGGAG
GGTCAAGGCGCAAGCTCAGTCTCAGTAGGCGGCTCGTGCACTAACGATCAGATGTATCCACCAATCCCCTTAGAACAATC
ATGCGACATGCAGTTGCTGCCGACTCTTGATGGAGATGATTCCCAGCTGGTGGACAGGAAGAGCATCATCTTTGACAGTG
GCTCTTGCAGGGACCTCTCCCAGCTCCTTCAATCACCCTTCAGTCCAGGCTCGGACGCAGGGGGCTCGTGCATCAAGTCA
TTGACGAAACTACGGTACGCTTGCGATGGTGGCTCAAGCAACAACCTCCAGAGCCTCGCGTACGAGGATCCTGCTCAGAT
CACAGAGTCTCGTTGTTCGTCTCAGTACCAATACACAGACGATAGCATGGAGCAGGACGTGATCTTCAACGCCAATCCTT
ATCTTACAAATGACTTTTCCCATCAAGAATATCAGATTGTAGACGAGAAGCCCAAAATATGTCTAGAAGACATCACCTAT
GTCGTCAACGAATGCATTCCTGGGGACAAGATCCCGGAGATGGAGACGAGCGGCTCTAGCGTTGTGACCACCATGATGAT
TCCGAGGGAATCCAACTACTTGAAGAGCTTTGAGATTGACTTCGACGACGACTCTCACACGGAATGTGAGAGTCAGATAG
AGAGTCAAGTGGATAATAATTCTTTCTGGGATGAATTTCTGGGGTCGAACGACCCTCTATTGCAGGAGACCTCCTTCCCC
GACGTATTCGACGAGCAGCAGACGTTCAGGTGTCCTTCTTCTGACAGCGACTTAACTTTAACGTTTGACATGTAG
Microexon DNA seq TTTATTTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAAGAATTGTTGGAATGATAAACAAAAGAAGAAAGGACGACAAGTTTATTTAGGAGCATACGATGAAGAAGAGGCGGCTGCCAGGGCTTACGACCTTGCTGCG
Microexon-tag Amino Acid seq WDKNCWNDKQKKKGRQVYLGAYDEEEAAARAYDLAA
Transcript ID Pp3c23_13360V3.1
Gene ID Pp.15148
Gene Name NA
Pfam domain motif AP2
Motif E-value 2e-13
Motif start 59
Motif end 117
Protein seq >Pp3c23_13360V3.1
MEPRQRSLMLSSLVVAEDKVCKRTVRQYPARSKSKRKSFAERELAGADACGGPLKRSSSFRGVTRHRWTGRFEAHLWDKN
CWNDKQKKKGRQVYLGAYDEEEAAARAYDLAALKYWGQSTVINFKLEDYQQQLEEMRNITREEYLATLRRKSSGFSRGVS
KYRGVARHHHNGRWEARIGRVDGNKYLYLGTFGTQEEAARAYDRAAIEYRGPAAVTNFDLTCYTQQLARKPVQSKLAFEE
GQGASSVSVGGSCTNDQMYPPIPLEQSCDMQLLPTLDGDDSQLVDRKSIIFDSGSCRDLSQLLQSPFSPGSDAGGSCIKS
LTKLRYACDGGSSNNLQSLAYEDPAQITESRCSSQYQYTDDSMEQDVIFNANPYLTNDFSHQEYQIVDEKPKICLEDITY
VVNECIPGDKIPEMETSGSSVVTTMMIPRESNYLKSFEIDFDDDSHTECESQIESQVDNNSFWDEFLGSNDPLLQETSFP
DVFDEQQTFRCPSSDSDLTLTFDM*
CDS seq >Pp3c23_13360V3.1
ATGGAGCCGCGGCAGCGAAGCCTGATGCTCAGTAGTTTGGTTGTCGCGGAGGACAAGGTTTGTAAGAGGACTGTGAGGCA
GTACCCTGCGAGGAGTAAAAGCAAGCGGAAGTCGTTTGCGGAGCGAGAGCTCGCGGGAGCTGATGCATGCGGTGGCCCTT
TGAAGAGAAGCTCGTCGTTTAGAGGAGTGACCAGGCACCGATGGACAGGAAGATTTGAGGCACACCTCTGGGACAAGAAT
TGTTGGAATGATAAACAAAAGAAGAAAGGACGACAAGTTTATTTAGGAGCATACGATGAAGAAGAGGCGGCTGCCAGGGC
TTACGACCTTGCTGCGCTCAAATACTGGGGACAGAGTACTGTTATTAACTTCAAGTTGGAAGATTATCAACAACAGCTTG
AGGAGATGAGGAACATTACCCGTGAGGAGTACCTTGCCACTCTTCGAAGAAAAAGCAGCGGCTTCTCGCGGGGAGTTTCC
AAATATCGAGGTGTTGCTAGGCATCATCACAACGGACGCTGGGAAGCTCGCATTGGTCGTGTTGATGGCAACAAGTACTT
GTATCTCGGCACTTTCGGTACACAAGAAGAAGCCGCTCGAGCGTATGACAGGGCTGCCATTGAGTACCGTGGGCCAGCAG
CTGTTACCAACTTCGATCTCACGTGTTACACCCAGCAATTGGCGCGAAAGCCCGTGCAATCGAAGTTGGCATTCGAGGAG
GGTCAAGGCGCAAGCTCAGTCTCAGTAGGCGGCTCGTGCACTAACGATCAGATGTATCCACCAATCCCCTTAGAACAATC
ATGCGACATGCAGTTGCTGCCGACTCTTGATGGAGATGATTCCCAGCTGGTGGACAGGAAGAGCATCATCTTTGACAGTG
GCTCTTGCAGGGACCTCTCCCAGCTCCTTCAATCACCCTTCAGTCCAGGCTCGGACGCAGGGGGCTCGTGCATCAAGTCA
TTGACGAAACTACGGTACGCTTGCGATGGTGGCTCAAGCAACAACCTCCAGAGCCTCGCGTACGAGGATCCTGCTCAGAT
CACAGAGTCTCGTTGTTCGTCTCAGTACCAATACACAGACGATAGCATGGAGCAGGACGTGATCTTCAACGCCAATCCTT
ATCTTACAAATGACTTTTCCCATCAAGAATATCAGATTGTAGACGAGAAGCCCAAAATATGTCTAGAAGACATCACCTAT
GTCGTCAACGAATGCATTCCTGGGGACAAGATCCCGGAGATGGAGACGAGCGGCTCTAGCGTTGTGACCACCATGATGAT
TCCGAGGGAATCCAACTACTTGAAGAGCTTTGAGATTGACTTCGACGACGACTCTCACACGGAATGTGAGAGTCAGATAG
AGAGTCAAGTGGATAATAATTCTTTCTGGGATGAATTTCTGGGGTCGAACGACCCTCTATTGCAGGAGACCTCCTTCCCC
GACGTATTCGACGAGCAGCAGACGTTCAGGTGTCCTTCTTCTGACAGCGACTTAACTTTAACGTTTGACATGTAG