Microexon ID Pp_22:8216799-8216806:-
Species Physcomitrium patens
Coordinates 22:8216799..8216806
Microexon Cluster ID Unclassified
Size 8
Pp_22:8216799-8216806:- does not have available information here.
Transcript ID Pp3c22_12680V3.2
Protein ID Pp3c22_12680V3.2
Gene ID Pp3c22_12680
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c22_12680V3.2
MGEGLNDGDNEFGAEGEDERNDVSSDLPQENGTNNDFPSPRSDQTVVRAGSLESELFIYTGRMRALHIFRAEESKADEEI
NLIRAATFVGQHAYPRISAEEVETQMDEMAAALEPQLPSTGGRYTMRMINTINKYLYHELGYKGEINYLDPDNSCINMVL
ESREVTHFWSTSIGRITAYNVFGVHGTCLTCGFTNARDGKITFLQDAEERLSVVYGVLVEINPEFLKHTSITNRVFLLRL
LSNLKRMYFERKDPVSTLCIIVYLRFCDQVSFKRDYGICLYLLNRFSEAIPCLDASGSAYCNRRRVYEVTSHKNATKQIV
LRKRYFLE*
CDS seq >Pp3c22_12680V3.2
ATGGGAGAAGGATTGAATGATGGTGATAATGAGTTTGGCGCTGAAGGAGAAGATGAGAGGAATGATGTCTCCAGTGATCT
CCCCCAGGAAAATGGTACTAACAATGATTTTCCAAGTCCCCGATCAGACCAAACAGTCGTTAGAGCGGGTTCACTGGAGT
CTGAGCTGTTCATATACACGGGTCGGATGAGAGCTCTACATATCTTCAGGGCAGAAGAGTCAAAGGCTGACGAAGAGATC
AACTTAATCCGGGCTGCGACTTTTGTGGGTCAGCATGCGTACCCAAGAATCAGTGCCGAAGAAGTCGAAACTCAGATGGA
TGAGATGGCTGCTGCACTTGAACCTCAGCTACCGTCTACTGGTGGGCGGTACACAATGCGAATGATCAATACCATCAATA
AGTACCTTTATCATGAACTAGGATACAAGGGTGAAATAAACTACCTTGATCCTGATAATTCATGTATAAACATGGTGTTA
GAGAGCCGAGAAGTAACGCACTTTTGGTCCACTTCGATTGGCAGGATTACCGCTTACAATGTCTTTGGTGTACATGGAAC
TTGCTTAACGTGTGGGTTTACTAATGCGAGGGATGGCAAGATCACTTTCCTGCAGGACGCTGAAGAGAGGTTATCTGTAG
TTTATGGTGTGCTGGTGGAGATCAATCCTGAATTCTTGAAGCACACATCCATCACTAATCGAGTCTTCTTGCTACGCCTT
CTATCGAACTTGAAACGAATGTATTTTGAACGGAAAGATCCCGTCAGCACACTATGCATCATTGTTTACCTAAGATTTTG
CGACCAGGTGTCATTCAAGAGGGATTATGGAATATGCTTGTATCTTTTAAACCGTTTCTCAGAAGCAATACCTTGTCTTG
ATGCTTCAGGAAGCGCCTATTGCAACCGACGCAGAGTCTATGAGGTTACTTCTCACAAAAATGCGACGAAACAAATTGTC
TTGAGGAAACGATACTTTCTGGAATAA
Microexon DNA seq ATGGCAAG
Microexon Amino Acid seq DGK
Microexon-tag DNA Seq TTTGGTGTACATGGAACTTGCTTAACGTGTGGGTTTACTAATGCGAGGGATGGCAAGATCACTTTCCTGCAGGACGCTGAAGAGAGGTTATCTGTAGTTTATGGTGTG
Microexon-tag Amino Acid seq FGVHGTCLTCGFTNARDGKITFLQDAEERLSVVYGV
Transcript ID Pp3c22_12680V3.2
Gene ID Pp.14377
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c22_12680V3.2
MGEGLNDGDNEFGAEGEDERNDVSSDLPQENGTNNDFPSPRSDQTVVRAGSLESELFIYTGRMRALHIFRAEESKADEEI
NLIRAATFVGQHAYPRISAEEVETQMDEMAAALEPQLPSTGGRYTMRMINTINKYLYHELGYKGEINYLDPDNSCINMVL
ESREVTHFWSTSIGRITAYNVFGVHGTCLTCGFTNARDGKITFLQDAEERLSVVYGVLVEINPEFLKHTSITNRVFLLRL
LSNLKRMYFERKDPVSTLCIIVYLRFCDQVSFKRDYGICLYLLNRFSEAIPCLDASGSAYCNRRRVYEVTSHKNATKQIV
LRKRYFLE*
CDS seq >Pp3c22_12680V3.2
ATGGGAGAAGGATTGAATGATGGTGATAATGAGTTTGGCGCTGAAGGAGAAGATGAGAGGAATGATGTCTCCAGTGATCT
CCCCCAGGAAAATGGTACTAACAATGATTTTCCAAGTCCCCGATCAGACCAAACAGTCGTTAGAGCGGGTTCACTGGAGT
CTGAGCTGTTCATATACACGGGTCGGATGAGAGCTCTACATATCTTCAGGGCAGAAGAGTCAAAGGCTGACGAAGAGATC
AACTTAATCCGGGCTGCGACTTTTGTGGGTCAGCATGCGTACCCAAGAATCAGTGCCGAAGAAGTCGAAACTCAGATGGA
TGAGATGGCTGCTGCACTTGAACCTCAGCTACCGTCTACTGGTGGGCGGTACACAATGCGAATGATCAATACCATCAATA
AGTACCTTTATCATGAACTAGGATACAAGGGTGAAATAAACTACCTTGATCCTGATAATTCATGTATAAACATGGTGTTA
GAGAGCCGAGAAGTAACGCACTTTTGGTCCACTTCGATTGGCAGGATTACCGCTTACAATGTCTTTGGTGTACATGGAAC
TTGCTTAACGTGTGGGTTTACTAATGCGAGGGATGGCAAGATCACTTTCCTGCAGGACGCTGAAGAGAGGTTATCTGTAG
TTTATGGTGTGCTGGTGGAGATCAATCCTGAATTCTTGAAGCACACATCCATCACTAATCGAGTCTTCTTGCTACGCCTT
CTATCGAACTTGAAACGAATGTATTTTGAACGGAAAGATCCCGTCAGCACACTATGCATCATTGTTTACCTAAGATTTTG
CGACCAGGTGTCATTCAAGAGGGATTATGGAATATGCTTGTATCTTTTAAACCGTTTCTCAGAAGCAATACCTTGTCTTG
ATGCTTCAGGAAGCGCCTATTGCAACCGACGCAGAGTCTATGAGGTTACTTCTCACAAAAATGCGACGAAACAAATTGTC
TTGAGGAAACGATACTTTCTGGAATAA