Microexon ID Pp_22:1994941-1994955:-
Species Physcomitrium patens
Coordinates 22:1994941..1994955
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AGTTCTAACATCCAG
Microexon Amino Acid seq SSNIQ
Microexon-tag DNA Seq GCCCATGAGCAGGAGACGGCAGTTTTCACACGCTCTCTTCCTGCCAAGAGTTCTAACATCCAGACGGTAGTCTGCATTCCTCTAAAGAATGGCGTCCTTGAATTCGGA
Microexon-tag Amino Acid Seq AHEQETAVFTRSLPAKSSNIQTVVCIPLKNGVLEFG
Microexon-tag spanning region1994768-1995480
Microexon-tag prediction score0.8767
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c22_2930V3.1x
Reference Transcript ID Pp3c22_2930V3.1
Gene ID Pp3c22_2930
Gene Name NA
Transcript ID Pp3c22_2930V3.1
Protein ID Pp3c22_2930V3.1
Gene ID Pp3c22_2930
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 2.4e-39
Motif start 190
Motif end 373
Protein seq >Pp3c22_2930V3.1
MTLKCIFLKLLERAYRYVIRVSRRFDEVTYTRDMRLPFVACDVSNGILPCVKDAHTPCAERCRKSESANASRNRERGVEM
WSRQSRWSCFLVWKIGVRLSNYFQHVVRLGVIKATAVSEDVSLRSCARVPSTCEVFADLYNRAGKPTHLDTSVYERNSTE
EQTIGNSPHVKHFSASEIFTGMTPELPSKLRLQLQVATLEKNWTYSAFWKPAFVNQKKILVWGDGYYNGVIKTYKTIHGM
ELTPKEFGLQRSQQLRDLCLTLDSRTRDQHASKPFALKVDDLADPEWFFLLSMIYDFAENEGMVGKTAARGQYTWLRQAH
EQETAVFTRSLPAKSSNIQTVVCIPLKNGVLEFGTSEDVPDDSKLVQSILAFFMEPLNRTPSKNTVSSERKRVHSETTRQ
KLVCAGHLSQLESSVARSPKRVKRSSFSTAETGEAIPLNNENIQMEPSCEGPRISNRCDEIKKDASRPNTFHAWKKNPAP
LQKCLHPEKL*
CDS seq >Pp3c22_2930V3.1
ATGACTTTGAAGTGCATTTTTTTGAAATTGCTGGAGAGAGCTTACCGGTACGTCATTCGCGTATCCAGACGATTCGATGA
AGTAACTTACACTCGCGATATGAGGTTGCCATTTGTCGCATGCGATGTGTCGAATGGAATACTGCCTTGTGTGAAAGATG
CTCATACACCTTGTGCCGAACGCTGCAGGAAGAGCGAATCCGCGAACGCGTCGCGGAATCGAGAGCGTGGTGTGGAGATG
TGGTCTAGGCAGTCGCGGTGGTCATGCTTCCTTGTTTGGAAGATAGGCGTCCGATTATCGAACTATTTCCAGCACGTTGT
GAGGTTGGGCGTGATCAAAGCGACGGCGGTATCCGAAGATGTTAGTCTGAGGTCTTGCGCTCGCGTTCCTTCGACGTGCG
AAGTCTTTGCGGATTTGTATAATCGTGCTGGAAAACCTACTCATTTAGACACATCTGTGTATGAGCGCAACTCGACAGAA
GAACAAACAATCGGCAATTCACCTCATGTGAAGCACTTTTCGGCGAGTGAGATTTTCACAGGAATGACGCCGGAGTTGCC
CTCAAAGTTGCGGCTCCAATTGCAAGTCGCTACCCTTGAGAAAAATTGGACATATTCCGCTTTTTGGAAGCCTGCCTTTG
TGAATCAGAAGAAGATATTGGTCTGGGGTGATGGCTATTATAATGGAGTAATCAAAACATATAAAACCATTCACGGCATG
GAGTTGACACCCAAAGAGTTTGGCTTGCAGCGTTCGCAGCAGCTCCGTGACCTCTGCCTTACATTAGATTCCCGTACCAG
GGACCAGCATGCCAGTAAGCCGTTTGCGTTGAAAGTCGATGACCTTGCAGACCCCGAGTGGTTTTTCCTCCTGAGCATGA
TCTACGATTTCGCTGAAAATGAAGGGATGGTAGGAAAAACAGCAGCAAGAGGCCAATATACATGGCTGCGCCAAGCCCAT
GAGCAGGAGACGGCAGTTTTCACACGCTCTCTTCCTGCCAAGAGTTCTAACATCCAGACGGTAGTCTGCATTCCTCTAAA
GAATGGCGTCCTTGAATTCGGAACATCAGAGGATGTGCCAGACGACTCAAAACTTGTGCAAAGTATTTTAGCCTTCTTTA
TGGAGCCGTTGAACAGGACGCCCTCGAAAAATACTGTGTCTAGTGAGCGTAAGCGCGTTCATTCAGAGACCACTCGCCAG
AAATTGGTTTGTGCGGGTCATCTCAGCCAACTTGAAAGTTCCGTTGCGAGGAGTCCCAAGAGGGTGAAGCGTTCCAGTTT
TTCGACAGCTGAGACTGGGGAAGCGATCCCACTGAATAATGAGAACATCCAGATGGAACCATCTTGTGAGGGGCCTAGGA
TATCTAATCGATGTGACGAAATCAAGAAGGATGCTTCGCGTCCGAACACATTTCATGCCTGGAAGAAAAATCCTGCACCC
TTGCAGAAGTGTCTGCATCCTGAGAAACTGTAA
Microexon DNA seq AGTTCTAACATCCAG
Microexon Amino Acid seq SSNIQ
Microexon-tag DNA Seq GCCCATGAGCAGGAGACGGCAGTTTTCACACGCTCTCTTCCTGCCAAGAGTTCTAACATCCAGACGGTAGTCTGCATTCCTCTAAAGAATGGCGTCCTTGAATTCGGA
Microexon-tag Amino Acid seq AHEQETAVFTRSLPAKSSNIQTVVCIPLKNGVLEFG
Transcript ID Pp.14040.1
Gene ID Pp.14040
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 7.8e-40
Motif start 9
Motif end 192
Protein seq >Pp.14040.1
MTPELPSKLRLQLQVATLEKNWTYSAFWKPAFVNQKKILVWGDGYYNGVIKTYKTIHGMELTPKEFGLQRSQQLRDLCLT
LDSRTRDQHASKPFALKVDDLADPEWFFLLSMIYDFAENEGMVGKTAARGQYTWLRQAHEQETAVFTRSLPAKSSNIQTV
VCIPLKNGVLEFGTSEDVPDDSKLVQSILAFFMEPLNRTPSKNTVSSERKRVHSETTRQKLVCAGHLSQLESSVARSPKR
VKRSSFSTAETGEAIPLNNENIQMEPSCEGPRISNRCDEIKKDASRPNTFHAWKKNPAPLQKCLHPEKL*
CDS seq >Pp.14040.1
ATGACGCCGGAGTTGCCCTCAAAGTTGCGGCTCCAATTGCAAGTCGCTACCCTTGAGAAAAATTGGACATATTCCGCTTT
TTGGAAGCCTGCCTTTGTGAATCAGAAGAAGATATTGGTCTGGGGTGATGGCTATTATAATGGAGTAATCAAAACATATA
AAACCATTCACGGCATGGAGTTGACACCCAAAGAGTTTGGCTTGCAGCGTTCGCAGCAGCTCCGTGACCTCTGCCTTACA
TTAGATTCCCGTACCAGGGACCAGCATGCCAGTAAGCCGTTTGCGTTGAAAGTCGATGACCTTGCAGACCCCGAGTGGTT
TTTCCTCCTGAGCATGATCTACGATTTCGCTGAAAATGAAGGGATGGTAGGAAAAACAGCAGCAAGAGGCCAATATACAT
GGCTGCGCCAAGCCCATGAGCAGGAGACGGCAGTTTTCACACGCTCTCTTCCTGCCAAGAGTTCTAACATCCAGACGGTA
GTCTGCATTCCTCTAAAGAATGGCGTCCTTGAATTCGGAACATCAGAGGATGTGCCAGACGACTCAAAACTTGTGCAAAG
TATTTTAGCCTTCTTTATGGAGCCGTTGAACAGGACGCCCTCGAAAAATACTGTGTCTAGTGAGCGTAAGCGCGTTCATT
CAGAGACCACTCGCCAGAAATTGGTTTGTGCGGGTCATCTCAGCCAACTTGAAAGTTCCGTTGCGAGGAGTCCCAAGAGG
GTGAAGCGTTCCAGTTTTTCGACAGCTGAGACTGGGGAAGCGATCCCACTGAATAATGAGAACATCCAGATGGAACCATC
TTGTGAGGGGCCTAGGATATCTAATCGATGTGACGAAATCAAGAAGGATGCTTCGCGTCCGAACACATTTCATGCCTGGA
AGAAAAATCCTGCACCCTTGCAGAAGTGTCTGCATCCTGAGAAACTGTAA