Microexon ID Pp_8:2974751-2974758:+
Species Physcomitrium patens
Coordinates 8:2974751..2974758
Microexon Cluster ID MEP19
Size 8
Phase 2
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 50,8,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GYRKCWMAYCGTGAWCCTCGWTTTMGRTCYMRWAYKCRWGAYRRTGAAGGRTCTCAAGGTAARYCTGARGTRTCWRCYRTTGTTTATAAAGYTGGTGARTGCATGCAA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
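The first Microexon-tag DNA Seq above contains degenerate IUPAC nucleotide codes (R, Y, W, K, M), which suggests it is a consensus for cluster MEP19 rather than a single genomic sequence; that reading is an assumption, not something the record states. Under that assumption, the following Python sketch checks position by position whether a member sequence is compatible with the consensus. The function name `compatible_positions` is hypothetical and introduced only for illustration.

```python
# Standard IUPAC nucleotide ambiguity codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Both sequences are copied from this record.
CONSENSUS = ("GYRKCWMAYCGTGAWCCTCGWTTTMGRTCYMRWAYKCRWGAYRRTGAAGG"
             "RTCTCAAG"
             "GTAARYCTGARGTRTCWRCYRTTGTTTATAAAGYTGGTGARTGCATGCAA")
MEMBER = ("CCAGCAAACCGAGACCCTCGGTCTCGGTTTCGGCCCAAAGATATTGAAGG"
          "GGCTCAAG"
          "GCAGGGCAGAAAGTTCATTCATTAATTACAGAGTTGGGGAAGGAATGCCG")


def compatible_positions(consensus, member):
    """Return one boolean per position: True if the member base is allowed
    by the IUPAC code at the same position of the consensus."""
    if len(consensus) != len(member):
        raise ValueError("sequences must have the same length")
    return [base in IUPAC[code] for code, base in zip(consensus, member)]


flags = compatible_positions(CONSENSUS, MEMBER)
# Positions 50..57 (0-based) hold the 8-nt microexon in the 50,8,50 layout.
microexon_flags = flags[50:58]
print(f"{sum(flags)} of {len(flags)} tag positions are compatible")
print(f"{sum(microexon_flags)} of {len(microexon_flags)} microexon positions are compatible")
```

A member of a cluster need not match the consensus at every position, so partial agreement here is expected rather than an error.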
Microexon DNA seq GGCTCAAG
Microexon Amino Acid seq GAQG
Microexon-tag DNA Seq CCAGCAAACCGAGACCCTCGGTCTCGGTTTCGGCCCAAAGATATTGAAGGGGCTCAAGGCAGGGCAGAAAGTTCATTCATTAATTACAGAGTTGGGGAAGGAATGCCG
Microexon-tag Amino Acid Seq PANRDPRSRFRPKDIEGAQGRAESSFINYRVGEGMP
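The fields above (structure 50,8,50, size 8, phase 2, and the DNA and amino acid sequences) can be cross-checked against each other. The Python sketch below splits the microexon-tag into its three parts, translates it with the standard genetic code, and derives the phase. It assumes that "phase" means the number of nucleotides of a split codon lying upstream of the microexon, that is, the upstream flank length modulo 3. The helper names (`split_tag`, `translate`, `overlapping_codons`) are hypothetical.

```python
# Values copied from this record.
TAG_DNA = ("CCAGCAAACCGAGACCCTCGGTCTCGGTTTCGGCCCAAAGATATTGAAGG"
           "GGCTCAAG"
           "GCAGGGCAGAAAGTTCATTCATTAATTACAGAGTTGGGGAAGGAATGCCG")
STRUCTURE = (50, 8, 50)  # flanking exon, microexon, flanking exon

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, structure):
    """Split the tag into (upstream flank, microexon, downstream flank)."""
    up, micro, down = structure
    if len(tag) != up + micro + down:
        raise ValueError("tag length does not match the declared structure")
    return tag[:up], tag[up:up + micro], tag[up + micro:]


def overlapping_codons(tag, start, size):
    """Return the frame-0 codons that overlap tag[start:start + size]."""
    first = start - start % 3          # round down to a codon boundary
    last = start + size                # exclusive end of the microexon
    end = last + (-last) % 3           # round up to a codon boundary
    return tag[first:end]


upstream, microexon, downstream = split_tag(TAG_DNA, STRUCTURE)
phase = len(upstream) % 3              # assumed definition of "Phase"
tag_peptide = translate(TAG_DNA)
micro_peptide = translate(overlapping_codons(TAG_DNA, len(upstream), len(microexon)))

print(microexon, phase, micro_peptide)
print(tag_peptide)
```

Because the microexon starts two nucleotides into a codon, its 8 nt touch four codons, which is why an 8-nt exon is reported with a four-residue amino acid sequence.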
Microexon-tag spanning region 2974197-2974976
Microexon-tag prediction score 0.8699
Overlapped with the annotated transcript (%) 100
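The coordinate fields can be related to one another with simple interval arithmetic. The Python sketch below parses the Coordinates string, confirms the reported size under 1-based inclusive coordinates, and estimates the lengths of the two introns flanking the microexon. The intron estimate rests on two assumptions not stated in the record: the spanning region starts at the first base of the upstream 50-nt flank and ends at the last base of the downstream 50-nt flank, and "upstream" means lower coordinates because the microexon is on the + strand. The helper names are hypothetical.

```python
import re


def parse_region(text):
    """Parse 'chrom:start..end' into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised region: {text!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def inclusive_length(start, end):
    """Length of a 1-based interval whose two ends are both included."""
    return end - start + 1


# Values copied from this record.
chrom, exon_start, exon_end = parse_region("8:2974751..2974758")
span_start, span_end = 2974197, 2974976
flank = 50  # flanking exon length used in the 50,8,50 tag structure

exon_size = inclusive_length(exon_start, exon_end)
span_length = inclusive_length(span_start, span_end)

# Assumption: the span is flank + intron + microexon + intron + flank.
upstream_intron = exon_start - (span_start + flank)
downstream_intron = (span_end - flank) - exon_end

print(chrom, exon_size, span_length, upstream_intron, downstream_intron)
```

If the assumption holds, the two intron lengths plus the 108-nt tag should add up to the full length of the spanning region, which gives a quick consistency check on the record.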
New Transcript ID Pp3c8_5360V3.1x
Reference Transcript ID Pp3c8_5360V3.1
Gene ID Pp3c8_5360
Gene Name NA
Transcript ID Pp3c8_5360V3.1
Protein ID Pp3c8_5360V3.1
Gene ID Pp3c8_5360
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c8_5360V3.1
MSLPVKRQHEETASVVGKALEGSGTGTGKQNYAGVGSGMVMEPASAFGGELRPAKVARHGDRGVDPAEKVEFFNSSHANF
KPGRDGGGSGSIEGTVVEGNSIKEGQVQTLRIKFSGGSAEGKSDREGRDPGRLDLERKTSGGKDYGDVLKESSGNKELLG
RVNVKSEEQLLQSGGSGREVGQELVDNARGEGNSEGLRKKSPREGEGKVAVKDVPTDVKREEPHEKEKEKKDEKRSEREE
DALPSLQQQQQQQLLPQSHLDVTNEKDNERDEKERDKGDKEKERMWEREGSRDKEREDARSEKREKREKERDRDFHRRER
EERDDNLVNKGEKDDGKPVKVEDIDTDRKLTREVDERKLVEKEARERVRERDRDKERDEDGEKEKRKKRSREQEKNRDSL
GTMDTEVAGGEKEKDAAHGHGVQQRKRMLRPRGQSNPANRDPRSRFRPKDIEGAQGRAESSFINYRVGEGMPELAKLRKE
YDSGERNSSDNGWGPAVEIRIPAEHATTNNRQVRGSQLWGTDIYTDDSDIVAVLLHTGYYSPSPTPPPDSISELRATIRI
LESQNMYTSTLRNSIRSRAWGGGSGCSYNVERCRIVKQGGGTVELEPSLTRTPPFVPTLAPAASERTVTTRAASSSPYRQ
QRFMQEVTIQYNLCNEPWAKYSMSIVADRGLKKSQYTSARLKKGEVLYVETQVHRYELAYDGERTTCNGATTATSSFPPG
TSGMEKGKEKWGSTDRTLAVGDKEPILQTHNGEKSHGNHPSNGLTGGHHGHSSSEPHEYYRWSKCKRPLSLSSMKKKGVP
LSEEFIEVLEEGLAWEEIQWSPTGVWVRGTVYILSRAQFFSSDKDDMEE*
CDS seq >Pp3c8_5360V3.1
ATGAGTCTTCCGGTGAAGCGGCAGCATGAGGAGACGGCGTCGGTAGTCGGGAAGGCGCTGGAGGGCAGTGGCACTGGAAC
TGGCAAGCAAAATTATGCGGGAGTGGGGAGCGGAATGGTGATGGAGCCTGCGTCTGCATTTGGAGGAGAGTTGCGTCCCG
CTAAAGTGGCGCGCCATGGTGATCGGGGAGTGGACCCGGCAGAGAAGGTTGAGTTTTTCAACAGTTCACATGCGAATTTC
AAACCTGGAAGAGATGGAGGTGGCTCTGGGTCGATTGAGGGAACTGTCGTAGAGGGGAACAGTATCAAGGAAGGGCAAGT
CCAGACACTGAGGATTAAGTTCTCGGGTGGGAGTGCTGAGGGAAAGAGTGATCGGGAGGGTCGGGATCCAGGGCGGCTGG
ATTTAGAGAGGAAGACGAGTGGTGGGAAGGATTACGGAGATGTGCTGAAGGAGAGCAGCGGCAACAAGGAGCTTCTTGGT
AGGGTTAACGTAAAGAGCGAGGAGCAGCTGCTGCAATCGGGAGGGAGTGGTCGGGAAGTAGGGCAAGAATTAGTTGATAA
CGCCCGCGGGGAAGGAAATTCTGAGGGCTTGAGAAAGAAGAGCCCTCGCGAGGGTGAGGGTAAAGTAGCCGTGAAGGATG
TGCCAACTGATGTGAAGAGGGAGGAGCCGCATGAGAAGGAGAAGGAGAAGAAGGATGAGAAGCGTAGTGAGCGTGAAGAA
GATGCGCTGCCGTCACTGCAGCAACAACAGCAGCAGCAGCTGCTGCCCCAATCGCATCTGGATGTTACTAATGAGAAAGA
CAATGAGAGGGATGAGAAAGAGAGGGACAAGGGAGATAAGGAGAAAGAGAGGATGTGGGAACGGGAGGGGTCACGCGATA
AGGAAAGAGAGGATGCACGTTCGGAGAAACGAGAGAAGAGGGAGAAAGAGCGTGATCGGGATTTTCATCGGCGAGAGCGT
GAAGAGCGAGATGATAATCTTGTGAACAAAGGAGAGAAAGATGATGGGAAACCTGTTAAGGTTGAAGACATAGATACAGA
TAGGAAGCTTACGCGTGAGGTTGACGAACGAAAATTGGTCGAGAAGGAAGCTCGCGAGCGTGTTAGGGAAAGAGACAGAG
ACAAGGAGAGAGATGAGGATGGGGAAAAAGAGAAGCGCAAGAAACGGAGTCGCGAGCAAGAGAAAAATCGTGATTCCTTG
GGTACAATGGACACTGAAGTCGCAGGGGGGGAGAAGGAAAAGGATGCCGCTCATGGTCATGGAGTGCAGCAGCGCAAGAG
GATGTTGCGTCCTAGAGGCCAGTCAAACCCAGCAAACCGAGACCCTCGGTCTCGGTTTCGGCCCAAAGATATTGAAGGGG
CTCAAGGCAGGGCAGAAAGTTCATTCATTAATTACAGAGTTGGGGAAGGAATGCCGGAACTTGCAAAACTTCGGAAGGAG
TACGATTCTGGGGAACGAAATAGCTCAGATAATGGGTGGGGCCCTGCCGTTGAAATTCGTATCCCTGCTGAGCACGCTAC
TACGAATAACCGTCAGGTCAGAGGGAGCCAGTTATGGGGAACAGACATATATACAGACGACTCTGACATAGTTGCAGTCT
TGCTACATACAGGATACTACTCACCTTCACCCACTCCGCCACCAGATTCAATATCAGAGCTGCGGGCCACCATTCGGATT
CTTGAATCCCAAAATATGTACACTTCTACATTACGAAATAGTATTCGATCACGTGCGTGGGGAGGCGGAAGTGGATGCAG
CTACAACGTTGAAAGGTGCCGCATAGTGAAGCAAGGAGGGGGTACTGTAGAGCTGGAGCCATCTTTGACTCGCACTCCCC
CATTTGTTCCGACACTGGCTCCTGCAGCATCAGAGCGAACCGTTACTACGAGAGCTGCCTCTTCTAGTCCATATCGGCAA
CAAAGATTTATGCAAGAAGTGACTATACAATACAACTTATGCAATGAACCCTGGGCTAAATACAGCATGAGCATTGTGGC
CGACCGTGGACTAAAGAAATCTCAATACACTTCTGCTCGACTCAAAAAAGGGGAAGTGCTCTATGTAGAGACCCAGGTTC
ATCGGTATGAACTGGCATATGATGGAGAGCGTACAACGTGCAATGGCGCAACCACTGCCACCTCTTCTTTCCCTCCAGGG
ACTTCTGGAATGGAAAAGGGCAAAGAGAAATGGGGTTCGACAGATAGAACTTTGGCAGTTGGCGACAAAGAACCAATTTT
GCAAACCCACAATGGAGAAAAATCACATGGAAATCATCCAAGCAATGGGTTAACAGGTGGCCATCATGGTCACTCTAGTA
GTGAGCCACATGAATACTACAGATGGTCCAAGTGTAAACGGCCTCTGTCGCTATCGTCTATGAAAAAGAAAGGTGTACCC
TTGTCAGAAGAATTTATTGAGGTTTTGGAAGAAGGTTTGGCTTGGGAGGAGATTCAGTGGTCCCCAACGGGTGTGTGGGT
TCGGGGAACAGTGTACATCCTTAGTAGAGCCCAATTTTTTTCTTCCGATAAGGATGATATGGAAGAATAG
Microexon DNA seq GGCTCAAG
Microexon Amino Acid seq GAQG
Microexon-tag DNA Seq CCAGCAAACCGAGACCCTCGGTCTCGGTTTCGGCCCAAAGATATTGAAGGGGCTCAAGGCAGGGCAGAAAGTTCATTCATTAATTACAGAGTTGGGGAAGGAATGCCG
Microexon-tag Amino Acid seq PANRDPRSRFRPKDIEGAQGRAESSFINYRVGEGMP
Transcript ID Pp3c8_5360V3.3
Gene ID Pp.23676
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c8_5360V3.3
MSLPVKRQHEETASVVGKALEGSGTGTGKQNYAGVGSGMVMEPASAFGGELRPAKVARHGDRGVDPAEKVEFFNSSHANF
KPGRDGGGSGSIEGTVVEGNSIKEGQVQTLRIKFSGGSAEGKSDREGRDPGRLDLERKTSGGKDYGDVLKESSGNKELLG
RVNVKSEEQLLQSGGSGREVGQELVDNARGEGNSEGLRKKSPREGEGKVAVKDVPTDVKREEPHEKEKEKKDEKRSEREE
DALPSLQQQQQQQLLPQSHLDVTNEKDNERDEKERDKGDKEKERMWEREGSRDKEREDARSEKREKREKERDRDFHRRER
EERDDNLVNKGEKDDGKPVKVEDIDTDRKLTREVDERKLVEKEARERVRERDRDKERDEDGEKEKRKKRSREQEKNRDSL
GTMDTEVAGGEKEKDAAHGHGVQQRKRMLRPRGQSNPANRDPRSRFRPKDIEGAQGRAESSFINYRVGEGMPELAKLRKE
YDSGERNSSDNGWGPAVEIRIPAEHATTNNRQVRGSQLWGTDIYTDDSDIVAVLLHTGYYSPSPTPPPDSISELRATIRI
LESQNMYTSTLRNSIRSRAWGGGSGCSYNVERCRIVKQGGGTVELEPSLTRTPPFVPTLAPAASERTVTTRAASSSPYRQ
QRFMQEVTIQYNLCNEPWAKYSMSIVADRGLKKSQYTSARLKKGEVLYVETQVHRYELAYDGERTTCNGATTATSSFPPG
TSGMEKGKEKWGSTDRTLAVGDKEPILQTHNGEKSHGNHPSNGLTGGHHGHSSSEPHEYYRWSKCKRPLSLSSMKKKGVP
LSEEFIEVLEEGLAWEEIQWSPTGVWVRGTVYILSRAQFFSSDKDDMEE*
CDS seq >Pp3c8_5360V3.3
ATGAGTCTTCCGGTGAAGCGGCAGCATGAGGAGACGGCGTCGGTAGTCGGGAAGGCGCTGGAGGGCAGTGGCACTGGAAC
TGGCAAGCAAAATTATGCGGGAGTGGGGAGCGGAATGGTGATGGAGCCTGCGTCTGCATTTGGAGGAGAGTTGCGTCCCG
CTAAAGTGGCGCGCCATGGTGATCGGGGAGTGGACCCGGCAGAGAAGGTTGAGTTTTTCAACAGTTCACATGCGAATTTC
AAACCTGGAAGAGATGGAGGTGGCTCTGGGTCGATTGAGGGAACTGTCGTAGAGGGGAACAGTATCAAGGAAGGGCAAGT
CCAGACACTGAGGATTAAGTTCTCGGGTGGGAGTGCTGAGGGAAAGAGTGATCGGGAGGGTCGGGATCCAGGGCGGCTGG
ATTTAGAGAGGAAGACGAGTGGTGGGAAGGATTACGGAGATGTGCTGAAGGAGAGCAGCGGCAACAAGGAGCTTCTTGGT
AGGGTTAACGTAAAGAGCGAGGAGCAGCTGCTGCAATCGGGAGGGAGTGGTCGGGAAGTAGGGCAAGAATTAGTTGATAA
CGCCCGCGGGGAAGGAAATTCTGAGGGCTTGAGAAAGAAGAGCCCTCGCGAGGGTGAGGGTAAAGTAGCCGTGAAGGATG
TGCCAACTGATGTGAAGAGGGAGGAGCCGCATGAGAAGGAGAAGGAGAAGAAGGATGAGAAGCGTAGTGAGCGTGAAGAA
GATGCGCTGCCGTCACTGCAGCAACAACAGCAGCAGCAGCTGCTGCCCCAATCGCATCTGGATGTTACTAATGAGAAAGA
CAATGAGAGGGATGAGAAAGAGAGGGACAAGGGAGATAAGGAGAAAGAGAGGATGTGGGAACGGGAGGGGTCACGCGATA
AGGAAAGAGAGGATGCACGTTCGGAGAAACGAGAGAAGAGGGAGAAAGAGCGTGATCGGGATTTTCATCGGCGAGAGCGT
GAAGAGCGAGATGATAATCTTGTGAACAAAGGAGAGAAAGATGATGGGAAACCTGTTAAGGTTGAAGACATAGATACAGA
TAGGAAGCTTACGCGTGAGGTTGACGAACGAAAATTGGTCGAGAAGGAAGCTCGCGAGCGTGTTAGGGAAAGAGACAGAG
ACAAGGAGAGAGATGAGGATGGGGAAAAAGAGAAGCGCAAGAAACGGAGTCGCGAGCAAGAGAAAAATCGTGATTCCTTG
GGTACAATGGACACTGAAGTCGCAGGGGGGGAGAAGGAAAAGGATGCCGCTCATGGTCATGGAGTGCAGCAGCGCAAGAG
GATGTTGCGTCCTAGAGGCCAGTCAAACCCAGCAAACCGAGACCCTCGGTCTCGGTTTCGGCCCAAAGATATTGAAGGGG
CTCAAGGCAGGGCAGAAAGTTCATTCATTAATTACAGAGTTGGGGAAGGAATGCCGGAACTTGCAAAACTTCGGAAGGAG
TACGATTCTGGGGAACGAAATAGCTCAGATAATGGGTGGGGCCCTGCCGTTGAAATTCGTATCCCTGCTGAGCACGCTAC
TACGAATAACCGTCAGGTCAGAGGGAGCCAGTTATGGGGAACAGACATATATACAGACGACTCTGACATAGTTGCAGTCT
TGCTACATACAGGATACTACTCACCTTCACCCACTCCGCCACCAGATTCAATATCAGAGCTGCGGGCCACCATTCGGATT
CTTGAATCCCAAAATATGTACACTTCTACATTACGAAATAGTATTCGATCACGTGCGTGGGGAGGCGGAAGTGGATGCAG
CTACAACGTTGAAAGGTGCCGCATAGTGAAGCAAGGAGGGGGTACTGTAGAGCTGGAGCCATCTTTGACTCGCACTCCCC
CATTTGTTCCGACACTGGCTCCTGCAGCATCAGAGCGAACCGTTACTACGAGAGCTGCCTCTTCTAGTCCATATCGGCAA
CAAAGATTTATGCAAGAAGTGACTATACAATACAACTTATGCAATGAACCCTGGGCTAAATACAGCATGAGCATTGTGGC
CGACCGTGGACTAAAGAAATCTCAATACACTTCTGCTCGACTCAAAAAAGGGGAAGTGCTCTATGTAGAGACCCAGGTTC
ATCGGTATGAACTGGCATATGATGGAGAGCGTACAACGTGCAATGGCGCAACCACTGCCACCTCTTCTTTCCCTCCAGGG
ACTTCTGGAATGGAAAAGGGCAAAGAGAAATGGGGTTCGACAGATAGAACTTTGGCAGTTGGCGACAAAGAACCAATTTT
GCAAACCCACAATGGAGAAAAATCACATGGAAATCATCCAAGCAATGGGTTAACAGGTGGCCATCATGGTCACTCTAGTA
GTGAGCCACATGAATACTACAGATGGTCCAAGTGTAAACGGCCTCTGTCGCTATCGTCTATGAAAAAGAAAGGTGTACCC
TTGTCAGAAGAATTTATTGAGGTTTTGGAAGAAGGTTTGGCTTGGGAGGAGATTCAGTGGTCCCCAACGGGTGTGTGGGT
TCGGGGAACAGTGTACATCCTTAGTAGAGCCCAATTTTTTTCTTCCGATAAGGATGATATGGAAGAATAG