Microexon ID Pp_4:1098800-1098807:+
Species Physcomitrium patens
Coordinates 4:1098800..1098807
Microexon Cluster ID Unclassified
Size 8
No additional information is available for microexon Pp_4:1098800-1098807:+.
Transcript ID Pp3c4_1850V3.5
Protein ID Pp3c4_1850V3.5
Gene ID Pp3c4_1850
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq
>Pp3c4_1850V3.5
MWEEFERFYFDKRLQARNQWEMGRESPASNPSLAPLHQSQSTTVRPRNSIFKRPVSPLKLYSCTSLQEGREEEMSSSDNA
QRDDEIARLPSLDFSAGEDSNFKLDVDSSVPVNPGPMGPPASLNSWNSDLLELPVGEGGPPNKRKSCILSLDGGGMRGLI
AARILSHLENILQEKVGEKVKLCDYFDLLAGTSTGAVLATMLVTPDANGNPTFTAEGCCEFYKKNGRLIFQHRWYDPFHG
SVRQLYRPKYSGRRFEDLLKKYTFIDGKFLTLLDTLKPLVVTSFDISQATPFFFVRQAAQKDQSRNFRLWEVCRATAAAP
TYFPPASVRSVDGRVQGTLIDGGAVQNNPALVATTHAISNNEEFPYVNGLEDVLILSIGAGQMDKKHDLQKARKWGMTKW
VRPIMDIMMDGTADTVDYQLAAAYAGNNCSENYLRIQLSGLPNKTSVMDCATQKNIHDLITISDDLIKRKAIMRNAYGEK
VTLDQTNEERLSWFADQLIMQKTIRENPQDPQFAQHAYPAYVKEGLGPAGHLVSPLFARLFEQRQHLQLGAQGKKSRSLR
F*
CDS seq
>Pp3c4_1850V3.5
ATGTGGGAGGAGTTCGAAAGATTCTACTTCGACAAGCGCCTCCAGGCCCGGAATCAATGGGAAATGGGGAGAGAATCACC
CGCCTCAAATCCTTCTCTTGCGCCACTTCATCAGTCTCAAAGCACCACAGTTCGTCCTCGAAACAGTATATTTAAAAGAC
CTGTTAGCCCTCTGAAACTGTACAGTTGCACCTCACTACAAGAAGGTAGAGAGGAAGAGATGAGCAGCTCCGACAATGCT
CAACGTGATGATGAAATTGCTCGACTTCCAAGCTTGGATTTCAGTGCAGGAGAAGATTCGAATTTCAAGCTTGATGTTGA
TAGCAGTGTCCCTGTAAACCCTGGCCCTATGGGTCCGCCAGCGTCCTTGAATAGTTGGAATAGCGACCTGTTGGAACTTC
CCGTCGGTGAAGGTGGCCCTCCAAACAAGAGAAAATCATGTATCTTGAGTTTAGATGGTGGAGGTATGCGGGGGCTTATT
GCAGCAAGGATCCTCTCTCACCTTGAAAACATTTTGCAGGAAAAGGTGGGTGAGAAGGTGAAACTATGCGACTACTTCGA
CTTGCTTGCCGGTACCAGCACAGGCGCTGTTCTTGCTACAATGTTGGTAACCCCAGACGCAAATGGGAATCCCACTTTCA
CCGCTGAGGGTTGTTGCGAATTTTACAAAAAAAACGGACGACTTATATTTCAGCATCGGTGGTACGATCCATTCCATGGA
TCGGTACGGCAGTTGTACCGACCCAAGTACTCAGGTCGACGTTTTGAGGATCTTCTCAAGAAGTACACATTCATTGATGG
GAAGTTTCTTACCCTCCTTGACACCCTGAAACCCCTTGTGGTGACCTCCTTCGACATATCTCAAGCCACACCATTCTTCT
TCGTGCGACAAGCTGCTCAGAAAGATCAAAGTCGAAACTTCAGGTTGTGGGAGGTTTGTCGGGCAACAGCAGCAGCTCCT
ACATACTTTCCACCAGCGTCAGTGCGATCAGTGGATGGGAGAGTTCAAGGCACTCTCATCGATGGCGGTGCCGTTCAGAA
CAATCCCGCGCTTGTTGCAACTACGCACGCAATTAGCAACAACGAGGAGTTTCCATACGTTAATGGTCTTGAGGACGTGT
TGATATTGTCGATTGGGGCTGGTCAGATGGACAAGAAGCATGACCTACAGAAAGCGAGGAAATGGGGCATGACCAAATGG
GTTCGCCCGATAATGGATATTATGATGGACGGCACTGCAGACACGGTGGACTATCAATTAGCTGCTGCCTATGCTGGAAA
TAATTGCTCTGAGAACTATCTTCGTATTCAGTTGTCCGGGCTCCCAAACAAGACATCAGTGATGGATTGTGCAACTCAGA
AGAATATCCATGACTTGATTACAATAAGTGACGATTTAATCAAGCGCAAGGCTATCATGCGGAACGCATATGGAGAGAAG
GTAACGCTTGATCAAACAAACGAAGAGCGGCTTTCGTGGTTTGCTGACCAGTTGATCATGCAAAAGACCATACGAGAAAA
TCCTCAAGATCCTCAATTTGCCCAACATGCTTATCCGGCGTACGTAAAAGAAGGTCTTGGTCCTGCTGGTCATCTCGTCT
CTCCTCTATTTGCCCGTTTATTTGAGCAACGCCAACACTTGCAACTTGGTGCTCAGGGCAAAAAATCAAGATCACTCCGA
TTCTAG
Microexon DNA seq GGAAATGG
Microexon Amino Acid seq WEMG
Microexon-tag DNA Seq GAGTTCGAAAGATTCTACTTCGACAAGCGCCTCCAGGCCCGGAATCAATGGGAAATGGGGAGAGAATCACCCGCCTCAAATCCTTCTCTTGCGCCACTTCATCAGTCT
Microexon-tag Amino Acid seq EFERFYFDKRLQARNQWEMGRESPASNPSLAPLHQS
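The fields above are related to one another, and the relationships can be checked with a short script. The following is a minimal sketch, not the database's own pipeline. It assumes that the coordinates in the microexon ID are 1-based and inclusive, that the microexon-tag DNA is in frame from its first base, and that the standard genetic code applies. The function names (`parse_microexon_id`, `translate`, `microexon_residues`) are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the record above.
MICROEXON_ID = "Pp_4:1098800-1098807:+"
MICROEXON_DNA = "GGAAATGG"
TAG_DNA = "GAGTTCGAAAGATTCTACTTCGACAAGCGCCTCCAGGCCCGGAATCAATGGGAAATGGGGAGAGAATCACCCGCCTCAAATCCTTCTCTTGCGCCACTTCATCAGTCT"
TAG_PROTEIN = "EFERFYFDKRLQARNQWEMGRESPASNPSLAPLHQS"

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_microexon_id(microexon_id):
    """Split an ID of the form name:start-end:strand into its parts."""
    name, span, strand = microexon_id.split(":")
    start, end = (int(value) for value in span.split("-"))
    return name, start, end, strand


def translate(dna):
    """Translate complete codons of an in-frame DNA string."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def microexon_residues(tag_dna, microexon_dna):
    """Return the 0-based offset of the microexon in the tag and the
    amino acids of every codon that the microexon overlaps.

    Assumption: the tag is in frame starting at its first base.
    """
    offset = tag_dna.find(microexon_dna)
    if offset < 0:
        raise ValueError("microexon not found in tag")
    first_codon = offset // 3
    last_codon = (offset + len(microexon_dna) - 1) // 3
    overlapped = tag_dna[3 * first_codon:3 * (last_codon + 1)]
    return offset, translate(overlapped)


if __name__ == "__main__":
    name, start, end, strand = parse_microexon_id(MICROEXON_ID)
    print("size from coordinates:", end - start + 1)
    print("tag translation matches:", translate(TAG_DNA) == TAG_PROTEIN)
    print("offset and residues:", microexon_residues(TAG_DNA, MICROEXON_DNA))
```

Under these assumptions the size derived from the coordinates agrees with the Size field and with the length of the microexon DNA sequence, and the microexon sits inside the tag with equal-length flanks on both sides. Because the microexon does not start on a codon boundary, the residues reported for it come from every codon it overlaps, which is why an 8-nt microexon corresponds to four amino acids.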
Transcript ID Pp.19222.1
Gene ID Pp.19222
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq
>Pp.19222.1
MWEEFERFYFDKRLQARNQWEMGRESPASNPSLAPLHQSQSTTVRPRNSIFKRPVSPLKLYSCTSLQEGREEEMSSSDNA
QRDDEIARLPSLDFSAGEDSNFKLDVDSSVPVNPGPMGPPASLNSWNSDLLELPVGEGGPPNKRKSCILSLDGGGMRGLI
AARILSHLENILQEKVGEKVKLCDYFDLLAGTSTGAVLATMLVTPDANGNPTFTAEGCCEFYKKNGRLIFQHRWYDPFHG
SVRQLYRPKYSGRRFEDLLKKYTFIDGKFLTLLDTLKPLVVTSFDISQATPFFFVRQAAQKDQSRNFRLWEVCRATAAAP
TYFPPASVRSVDGRVQGTLIDGGAVQNNPALVATTHAISNNEEFPYVNGLEDVLELQYNKYF*
CDS seq
>Pp.19222.1
ATGTGGGAGGAGTTCGAAAGATTCTACTTCGACAAGCGCCTCCAGGCCCGGAATCAATGGGAAATGGGGAGAGAATCACC
CGCCTCAAATCCTTCTCTTGCGCCACTTCATCAGTCTCAAAGCACCACAGTTCGTCCTCGAAACAGTATATTTAAAAGAC
CTGTTAGCCCTCTGAAACTGTACAGTTGCACCTCACTACAAGAAGGTAGAGAGGAAGAGATGAGCAGCTCCGACAATGCT
CAACGTGATGATGAAATTGCTCGACTTCCAAGCTTGGATTTCAGTGCAGGAGAAGATTCGAATTTCAAGCTTGATGTTGA
TAGCAGTGTCCCTGTAAACCCTGGCCCTATGGGTCCGCCAGCGTCCTTGAATAGTTGGAATAGCGACCTGTTGGAACTTC
CCGTCGGTGAAGGTGGCCCTCCAAACAAGAGAAAATCATGTATCTTGAGTTTAGATGGTGGAGGTATGCGGGGGCTTATT
GCAGCAAGGATCCTCTCTCACCTTGAAAACATTTTGCAGGAAAAGGTGGGTGAGAAGGTGAAACTATGCGACTACTTCGA
CTTGCTTGCCGGTACCAGCACAGGCGCTGTTCTTGCTACAATGTTGGTAACCCCAGACGCAAATGGGAATCCCACTTTCA
CCGCTGAGGGTTGTTGCGAATTTTACAAAAAAAACGGACGACTTATATTTCAGCATCGGTGGTACGATCCATTCCATGGA
TCGGTACGGCAGTTGTACCGACCCAAGTACTCAGGTCGACGTTTTGAGGATCTTCTCAAGAAGTACACATTCATTGATGG
GAAGTTTCTTACCCTCCTTGACACCCTGAAACCCCTTGTGGTGACCTCCTTCGACATATCTCAAGCCACACCATTCTTCT
TCGTGCGACAAGCTGCTCAGAAAGATCAAAGTCGAAACTTCAGGTTGTGGGAGGTTTGTCGGGCAACAGCAGCAGCTCCT
ACATACTTTCCACCAGCGTCAGTGCGATCAGTGGATGGGAGAGTTCAAGGCACTCTCATCGATGGCGGTGCCGTTCAGAA
CAATCCCGCGCTTGTTGCAACTACGCACGCAATTAGCAACAACGAGGAGTTTCCATACGTTAATGGTCTTGAGGACGTGT
TGGAACTTCAGTACAATAAATATTTCTGA