Microexon ID Pp_26:856783-856787:+
Species Physcomitrium patens
Coordinates 26:856783..856787
Microexon Cluster ID Unclassified
Size 5
No cluster information is available for Pp_26:856783-856787:+.
Transcript ID Pp3c26_1820V3.1
Protein ID Pp3c26_1820V3.1
Gene ID Pp3c26_1820
Gene Name NA
Pfam domain motif DUF535
Motif E-value 0.87
Motif start 97
Motif end 144
Protein seq >Pp3c26_1820V3.1
MRGCGLFTLDTQVGQCPVKMASGEDSQRSSSRKAHLFDFCIKFLRISGIRLPIPPHSGDFTGSSSKIRQTYFLKITLLHG
TEQQIIGEFEVVGDECIINHQISCQIAYARPDKLHVYIDQNWSAGLSGHITVDFKEFFEDLDAEICLEYPLFSSGPSHVR
SPAKLEMSVLCACSTKSALFSNQLHLNQEYMNFEDLSKIRPSLEDLWYPPDQLNPESGQEALSMLPGSVLKQVTPSNLCL
DKVDREIEMLKEMEGDLIELNSKLNKYVAAQSLEGYAPEQYTRKIPTRPDVVAAAFFAAGMAFQRQVDNIKRHRMYSGTE
DPLLPGESPRLLLEDVSSTFKSEAPTNQEVQEVDTFCHGSSRSLVFSENSHSCGLPQSCSCSSEAATSLAGDIDEDENDQ
SFVDSSVCSGDTSLSLSEPELSSPRTSSDTRSSLGECSTSEGTMNFVRNDILSEPHGRLKKNNVLSVVDSVLGALAIIGS
PGSRSKDIHPGGSKDERVTIAGSMLGRQQVTSIQAIRKSNPGLGWLTNSMQFE*
CDS seq >Pp3c26_1820V3.1
ATGCGAGGTTGTGGATTATTTACACTTGACACTCAAGTAGGACAATGTCCTGTGAAGATGGCATCGGGTGAAGATTCGCA
AAGATCTTCGTCGCGTAAAGCTCATCTATTTGACTTTTGTATCAAGTTTTTAAGGATATCTGGGATACGTCTTCCTATCC
CGCCACATTCTGGAGACTTCACAGGGAGCTCTTCAAAGATTCGACAGACATATTTTCTGAAGATCACTCTTCTACACGGA
ACTGAACAACAGATAATTGGAGAGTTTGAAGTAGTTGGCGACGAATGCATCATCAATCACCAGATCTCGTGCCAAATTGC
GTATGCTAGGCCTGATAAGCTTCACGTGTACATAGACCAGAATTGGAGTGCTGGTTTGTCTGGACATATTACCGTGGATT
TTAAGGAGTTTTTTGAGGACCTTGATGCTGAAATCTGTCTTGAATACCCTCTATTTTCAAGTGGACCGTCACATGTTCGA
AGTCCAGCTAAGTTGGAGATGTCGGTCTTATGTGCCTGCTCAACCAAGTCCGCTTTGTTTTCAAATCAACTTCATCTTAA
CCAAGAATATATGAACTTCGAGGATTTAAGTAAAATTAGGCCATCCTTAGAGGATCTTTGGTACCCACCAGATCAGTTGA
ACCCTGAAAGCGGCCAGGAGGCTCTATCAATGCTACCCGGGAGCGTTCTCAAACAGGTCACACCATCAAATCTGTGCCTT
GATAAAGTTGATAGAGAGATTGAAATGCTGAAGGAAATGGAAGGTGACCTCATTGAACTAAATTCGAAGTTAAACAAATA
TGTAGCAGCACAAAGCTTGGAGGGATATGCACCTGAACAATATACGCGTAAGATTCCGACTAGACCGGACGTTGTGGCAG
CTGCGTTTTTCGCAGCAGGCATGGCTTTCCAAAGACAAGTCGACAATATTAAGAGACATAGAATGTACAGCGGAACCGAG
GATCCACTTTTGCCTGGAGAAAGTCCACGCTTGCTTCTAGAAGATGTTTCGTCAACCTTCAAGAGTGAAGCGCCTACAAA
CCAAGAAGTTCAAGAGGTTGACACATTTTGCCATGGATCAAGTAGGAGTCTCGTTTTTAGTGAAAACTCGCATAGTTGCG
GCCTTCCACAGTCTTGCAGCTGCAGCAGTGAAGCTGCCACAAGTTTGGCAGGAGACATAGATGAAGACGAGAATGACCAA
AGCTTTGTAGATTCTTCTGTATGCTCCGGGGATACGTCCTTATCCCTTTCGGAACCAGAACTATCAAGTCCAAGGACCTC
TAGTGATACACGTTCATCACTTGGTGAATGTAGTACTTCAGAAGGGACCATGAATTTCGTTAGGAATGATATTCTCAGTG
AGCCACATGGTAGGTTAAAAAAAAACAACGTATTGAGTGTTGTAGATAGCGTACTTGGGGCTTTGGCAATCATAGGTTCA
CCAGGTTCTAGATCCAAGGATATTCATCCTGGAGGCAGCAAAGACGAGAGAGTAACTATTGCAGGATCAATGCTTGGGAG
ACAACAGGTCACGAGTATCCAAGCAATCCGTAAGAGCAATCCAGGTTTGGGTTGGCTCACAAATTCAATGCAGTTTGAGT
GA
Microexon DNA seq ACCAG
Microexon Amino Acid seq DQ
Microexon-tag DNA Seq ATCTCGTGCCAAATTGCGTATGCTAGGCCTGATAAGCTTCACGTGTACATAGACCAGAATTGGAGTGCTGGTTTGTCTGGACATATTACCGTGGATTTTAAGGAGTTT
Microexon-tag Amino Acid seq ISCQIAYARPDKLHVYIDQNWSAGLSGHITVDFKEF
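The microexon and tag fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it assumes the standard genetic code and that the tag DNA starts on a codon boundary (reading frame 0). The helper `translate` and all variable names are hypothetical and introduced only for illustration. It translates the tag, locates the 5-nt microexon inside it, and reports which residues the microexon overlaps.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record fields above.
MICROEXON_DNA = "ACCAG"
TAG_DNA = (
    "ATCTCGTGCCAAATTGCGTATGCTAGGCCTGATAAGCTTCACGTGTACATAGACCAGAATTGG"
    "AGTGCTGGTTTGTCTGGACATATTACCGTGGATTTTAAGGAGTTT"
)
TAG_AA = "ISCQIAYARPDKLHVYIDQNWSAGLSGHITVDFKEF"

tag_protein = translate(TAG_DNA)

# 0-based offset of the first occurrence of the microexon within the tag.
offset = TAG_DNA.find(MICROEXON_DNA)

# Codons (0-based) that the microexon overlaps, and the residues they encode.
first_codon = offset // 3
last_codon = (offset + len(MICROEXON_DNA) - 1) // 3
microexon_aa = tag_protein[first_codon:last_codon + 1]

print("tag translation matches record:", tag_protein == TAG_AA)
print("microexon offset in tag:", offset)
print("residues overlapped by microexon:", microexon_aa)
```

Because the microexon is 5 nt long, it does not fill whole codons on its own; the overlapped residues also draw bases from the flanking exons, which is why the check works on the tag rather than on the microexon sequence alone.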
Transcript ID Pp3c26_1820V3.1
Gene ID Pp.16908