
Microexon ID | Pp_4:1098800-1098807:+ |
Species | Physcomitrium patens |
Coordinates | 4:1098800..1098807 |
Microexon Cluster ID | Unclassified |
Size | 8 nt |
No additional information is available for Pp_4:1098800-1098807:+.
Transcript ID | Pp3c4_1850V3.5 |
Protein ID | Pp3c4_1850V3.5 |
Gene ID | Pp3c4_1850 |
Gene Name | NA |
Pfam domain motif | Unknown |
Motif E-value | NA |
Motif start | NA |
Motif end | NA |
Protein seq | >Pp3c4_1850V3.5 MWEEFERFYFDKRLQARNQWEMGRESPASNPSLAPLHQSQSTTVRPRNSIFKRPVSPLKLYSCTSLQEGREEEMSSSDNA QRDDEIARLPSLDFSAGEDSNFKLDVDSSVPVNPGPMGPPASLNSWNSDLLELPVGEGGPPNKRKSCILSLDGGGMRGLI AARILSHLENILQEKVGEKVKLCDYFDLLAGTSTGAVLATMLVTPDANGNPTFTAEGCCEFYKKNGRLIFQHRWYDPFHG SVRQLYRPKYSGRRFEDLLKKYTFIDGKFLTLLDTLKPLVVTSFDISQATPFFFVRQAAQKDQSRNFRLWEVCRATAAAP TYFPPASVRSVDGRVQGTLIDGGAVQNNPALVATTHAISNNEEFPYVNGLEDVLILSIGAGQMDKKHDLQKARKWGMTKW VRPIMDIMMDGTADTVDYQLAAAYAGNNCSENYLRIQLSGLPNKTSVMDCATQKNIHDLITISDDLIKRKAIMRNAYGEK VTLDQTNEERLSWFADQLIMQKTIRENPQDPQFAQHAYPAYVKEGLGPAGHLVSPLFARLFEQRQHLQLGAQGKKSRSLR F* |
CDS seq | >Pp3c4_1850V3.5 ATGTGGGAGGAGTTCGAAAGATTCTACTTCGACAAGCGCCTCCAGGCCCGGAATCAATGGGAAATGGGGAGAGAATCACC CGCCTCAAATCCTTCTCTTGCGCCACTTCATCAGTCTCAAAGCACCACAGTTCGTCCTCGAAACAGTATATTTAAAAGAC CTGTTAGCCCTCTGAAACTGTACAGTTGCACCTCACTACAAGAAGGTAGAGAGGAAGAGATGAGCAGCTCCGACAATGCT CAACGTGATGATGAAATTGCTCGACTTCCAAGCTTGGATTTCAGTGCAGGAGAAGATTCGAATTTCAAGCTTGATGTTGA TAGCAGTGTCCCTGTAAACCCTGGCCCTATGGGTCCGCCAGCGTCCTTGAATAGTTGGAATAGCGACCTGTTGGAACTTC CCGTCGGTGAAGGTGGCCCTCCAAACAAGAGAAAATCATGTATCTTGAGTTTAGATGGTGGAGGTATGCGGGGGCTTATT GCAGCAAGGATCCTCTCTCACCTTGAAAACATTTTGCAGGAAAAGGTGGGTGAGAAGGTGAAACTATGCGACTACTTCGA CTTGCTTGCCGGTACCAGCACAGGCGCTGTTCTTGCTACAATGTTGGTAACCCCAGACGCAAATGGGAATCCCACTTTCA CCGCTGAGGGTTGTTGCGAATTTTACAAAAAAAACGGACGACTTATATTTCAGCATCGGTGGTACGATCCATTCCATGGA TCGGTACGGCAGTTGTACCGACCCAAGTACTCAGGTCGACGTTTTGAGGATCTTCTCAAGAAGTACACATTCATTGATGG GAAGTTTCTTACCCTCCTTGACACCCTGAAACCCCTTGTGGTGACCTCCTTCGACATATCTCAAGCCACACCATTCTTCT TCGTGCGACAAGCTGCTCAGAAAGATCAAAGTCGAAACTTCAGGTTGTGGGAGGTTTGTCGGGCAACAGCAGCAGCTCCT ACATACTTTCCACCAGCGTCAGTGCGATCAGTGGATGGGAGAGTTCAAGGCACTCTCATCGATGGCGGTGCCGTTCAGAA CAATCCCGCGCTTGTTGCAACTACGCACGCAATTAGCAACAACGAGGAGTTTCCATACGTTAATGGTCTTGAGGACGTGT TGATATTGTCGATTGGGGCTGGTCAGATGGACAAGAAGCATGACCTACAGAAAGCGAGGAAATGGGGCATGACCAAATGG GTTCGCCCGATAATGGATATTATGATGGACGGCACTGCAGACACGGTGGACTATCAATTAGCTGCTGCCTATGCTGGAAA TAATTGCTCTGAGAACTATCTTCGTATTCAGTTGTCCGGGCTCCCAAACAAGACATCAGTGATGGATTGTGCAACTCAGA AGAATATCCATGACTTGATTACAATAAGTGACGATTTAATCAAGCGCAAGGCTATCATGCGGAACGCATATGGAGAGAAG GTAACGCTTGATCAAACAAACGAAGAGCGGCTTTCGTGGTTTGCTGACCAGTTGATCATGCAAAAGACCATACGAGAAAA TCCTCAAGATCCTCAATTTGCCCAACATGCTTATCCGGCGTACGTAAAAGAAGGTCTTGGTCCTGCTGGTCATCTCGTCT CTCCTCTATTTGCCCGTTTATTTGAGCAACGCCAACACTTGCAACTTGGTGCTCAGGGCAAAAAATCAAGATCACTCCGA TTCTAG |
Microexon DNA seq | GGAAATGG |
Microexon Amino Acid seq | WEMG |
Microexon-tag DNA Seq | GAGTTCGAAAGATTCTACTTCGACAAGCGCCTCCAGGCCCGGAATCAATGGGAAATGGGGAGAGAATCACCCGCCTCAAATCCTTCTCTTGCGCCACTTCATCAGTCT |
Microexon-tag Amino Acid seq | EFERFYFDKRLQARNQWEMGRESPASNPSLAPLHQS |
Transcript ID | Pp.19222.1 |
Gene ID | Pp.19222 |
Gene Name | NA |
Pfam domain motif | Unknown |
Motif E-value | NA |
Motif start | NA |
Motif end | NA |
Protein seq | >Pp.19222.1 MWEEFERFYFDKRLQARNQWEMGRESPASNPSLAPLHQSQSTTVRPRNSIFKRPVSPLKLYSCTSLQEGREEEMSSSDNA QRDDEIARLPSLDFSAGEDSNFKLDVDSSVPVNPGPMGPPASLNSWNSDLLELPVGEGGPPNKRKSCILSLDGGGMRGLI AARILSHLENILQEKVGEKVKLCDYFDLLAGTSTGAVLATMLVTPDANGNPTFTAEGCCEFYKKNGRLIFQHRWYDPFHG SVRQLYRPKYSGRRFEDLLKKYTFIDGKFLTLLDTLKPLVVTSFDISQATPFFFVRQAAQKDQSRNFRLWEVCRATAAAP TYFPPASVRSVDGRVQGTLIDGGAVQNNPALVATTHAISNNEEFPYVNGLEDVLELQYNKYF* |
CDS seq | >Pp.19222.1 ATGTGGGAGGAGTTCGAAAGATTCTACTTCGACAAGCGCCTCCAGGCCCGGAATCAATGGGAAATGGGGAGAGAATCACC CGCCTCAAATCCTTCTCTTGCGCCACTTCATCAGTCTCAAAGCACCACAGTTCGTCCTCGAAACAGTATATTTAAAAGAC CTGTTAGCCCTCTGAAACTGTACAGTTGCACCTCACTACAAGAAGGTAGAGAGGAAGAGATGAGCAGCTCCGACAATGCT CAACGTGATGATGAAATTGCTCGACTTCCAAGCTTGGATTTCAGTGCAGGAGAAGATTCGAATTTCAAGCTTGATGTTGA TAGCAGTGTCCCTGTAAACCCTGGCCCTATGGGTCCGCCAGCGTCCTTGAATAGTTGGAATAGCGACCTGTTGGAACTTC CCGTCGGTGAAGGTGGCCCTCCAAACAAGAGAAAATCATGTATCTTGAGTTTAGATGGTGGAGGTATGCGGGGGCTTATT GCAGCAAGGATCCTCTCTCACCTTGAAAACATTTTGCAGGAAAAGGTGGGTGAGAAGGTGAAACTATGCGACTACTTCGA CTTGCTTGCCGGTACCAGCACAGGCGCTGTTCTTGCTACAATGTTGGTAACCCCAGACGCAAATGGGAATCCCACTTTCA CCGCTGAGGGTTGTTGCGAATTTTACAAAAAAAACGGACGACTTATATTTCAGCATCGGTGGTACGATCCATTCCATGGA TCGGTACGGCAGTTGTACCGACCCAAGTACTCAGGTCGACGTTTTGAGGATCTTCTCAAGAAGTACACATTCATTGATGG GAAGTTTCTTACCCTCCTTGACACCCTGAAACCCCTTGTGGTGACCTCCTTCGACATATCTCAAGCCACACCATTCTTCT TCGTGCGACAAGCTGCTCAGAAAGATCAAAGTCGAAACTTCAGGTTGTGGGAGGTTTGTCGGGCAACAGCAGCAGCTCCT ACATACTTTCCACCAGCGTCAGTGCGATCAGTGGATGGGAGAGTTCAAGGCACTCTCATCGATGGCGGTGCCGTTCAGAA CAATCCCGCGCTTGTTGCAACTACGCACGCAATTAGCAACAACGAGGAGTTTCCATACGTTAATGGTCTTGAGGACGTGT TGGAACTTCAGTACAATAAATATTTCTGA |
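
The fields listed for transcript Pp3c4_1850V3.5 can be cross-checked against one another. The Python sketch below is not part of the database; it is a minimal illustration that assumes the coordinates are 1-based and inclusive, that the microexon-tag starts on a codon boundary, and that the standard genetic code applies. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this example.

```python
# Illustrative consistency check for the record above (not a database tool).
# Assumptions: 1-based inclusive coordinates, standard genetic code,
# and a microexon-tag that begins on a codon boundary.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Build the standard codon table in the conventional T, C, A, G order.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record.
start, end = 1098800, 1098807
microexon = "GGAAATGG"
tag = "GAGTTCGAAAGATTCTACTTCGACAAGCGCCTCCAGGCCCGGAATCAATGGGAAATGGGGAGAGAATCACCCGCCTCAAATCCTTCTCTTGCGCCACTTCATCAGTCT"
tag_aa = "EFERFYFDKRLQARNQWEMGRESPASNPSLAPLHQS"

# Size implied by the coordinates.
size = end - start + 1

# Position of the microexon inside the tag and the length of each flank.
offset = tag.find(microexon)
upstream = offset
downstream = len(tag) - offset - len(microexon)

# Codons that overlap the microexon, mapped onto the tag's amino-acid sequence.
first_codon = offset // 3
last_codon = (offset + len(microexon) - 1) // 3
microexon_aa = tag_aa[first_codon:last_codon + 1]

print("size from coordinates:", size)
print("flanks (upstream, downstream):", upstream, downstream)
print("tag translation matches:", translate(tag) == tag_aa)
print("residues overlapping the microexon:", microexon_aa)
```

Under these assumptions the fields agree: the coordinates give a size of 8 nt, the microexon sits 50 nt from each end of the tag, the tag translates to the listed tag amino-acid sequence, and the four codons overlapping the microexon encode WEMG. Because an 8-nt exon is not a multiple of three, the microexon only partially covers its first and last codons.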