
Microexon ID | Pp_9:5739735-5739737:- |
Species | Physcomitrium patens |
Coordinates | 9:5739735..5739737 |
Microexon Cluster ID | Unclassified |
Size | 3 |
No further information is available for Pp_9:5739735-5739737:-.
Transcript ID | Pp3c9_9170V3.1 |
Protein ID | Pp3c9_9170V3.1 |
Gene ID | Pp3c9_9170 |
Gene Name | NA |
Pfam domain motif | DUF1296 |
Motif E-value | 2.4e-30 |
Motif start | 13 |
Motif end | 72 |
Protein seq | >Pp3c9_9170V3.1 MSTGRGGGGAVDIPASTKKVVQDLKEVVGNSEEEIYAMLKECNMDPNEAAQRLLNQGDPFHEVKRKRDKKKETGGMKARD SDMRGRPGSGGYIRGGGRGGGGTGGSLPRYGSQGSQEYGGGRGRAHGRENGIHPGLRGSSAASNSTTAPCQAKPSAPSSV PPSTGAQTAAAAAAAAPVQAPSPASSSSGGAVNGSSGFVRPARPPQGAWASGHGTMADLLKARAAPPPPPSSFQSPATSP AVPVVQTALSPSYPAAVAPVAASEPSPESSGFYSSSADPVLHTSVDLRVVGGQSAIGAVGNQRPIGDRPVSSSNAEAAID SSAASPSAQASSVPESEGSGDLETDMERELETARPPSPPAATASAPSSRDVSVDEAAVSMEAGNSQGHGSMSVLGGSQFN GRPLYGSPQQPVGTQKAAGAGGLEWKAKAPGHNLLGSSADGGSQGLGAVDPTSIAQAYQSMSIQEDEPVIIPTHLRVPEA DRSHLSFGSFGSDFVSAFGTSFGHAEVEENKKNVETIAVEEAPVELPAPSSVDTQVEMAQSYGLQQVATPVESIDASVEA PPVVQQDPAVPAQEQHTKPDPVVQSAPYFFGASNYPGFGLMPQMPGGQYGYEQTESAPQDAPRIPSMVPAYDPTTSYYTS AFRGAESDRLYPPYVPTNTASKYSGNIGLMAAPSLQASQEGGNPMLASATPSVSSSQTSQPGSNVQPAQVVPQQALPMHY SQPPSGPFGNYVSYQYMPASYPYLQPPYPHHVYNSSSTAYAQPPAGSTYTPPSAASSYPAGGATAVKFPMPQYKPGAAAG NAPHPAPAVGYGGYTTTPSGYASSPAVTVGNASGYEDMNTSRYKDSTLYIPSQQGDSSTVWIQAAMPRDIGPSAAMQTSL YYSLAGQGQHSGYAHSQQPTHGHAHPNAAYTNLYHPSQTGPAPSHQMLQQPQGMGGAGGNSQAGAYQQQPQRAQQSWNNS NY* |
CDS seq | >Pp3c9_9170V3.1 ATGAGTACCGGCAGGGGCGGCGGCGGTGCCGTCGACATCCCGGCCAGCACTAAGAAAGTGGTGCAGGACCTCAAGGAGGT GGTCGGCAACAGCGAAGAGGAGATTTATGCCATGCTCAAAGAGTGCAACATGGACCCGAACGAGGCCGCGCAACGGCTGC TGAACCAAGGAGATCCGTTTCACGAAGTGAAGAGGAAGCGGGACAAGAAGAAAGAGACTGGAGGAATGAAAGCGAGAGAT TCGGATATGAGGGGACGGCCTGGTAGCGGTGGTTATATCAGAGGAGGTGGACGTGGCGGTGGTGGAACCGGGGGTTCTCT GCCTCGTTACGGTTCTCAGGGTTCACAGGAGTATGGTGGTGGGCGAGGTCGAGCGCACGGAAGAGAGAATGGCATTCATC CTGGTTTGCGTGGTTCTAGCGCAGCTTCAAATTCAACTACTGCACCCTGTCAAGCAAAGCCGTCGGCCCCGAGCTCTGTA CCACCATCAACAGGCGCTCAAACAGCAGCAGCAGCAGCAGCAGCAGCACCAGTTCAAGCTCCAAGTCCAGCTAGCAGTAG CTCCGGAGGAGCAGTTAATGGAAGCAGCGGGTTTGTGAGACCCGCACGACCCCCACAAGGGGCATGGGCCTCAGGGCATG GCACAATGGCAGATCTATTGAAAGCGCGGGCAGCTCCTCCTCCACCTCCCTCGAGCTTCCAATCTCCGGCAACTTCCCCA GCCGTGCCGGTCGTTCAAACGGCGTTGTCTCCATCATATCCGGCTGCCGTAGCCCCAGTAGCAGCTTCTGAGCCTTCTCC GGAGTCCTCGGGATTTTATTCGTCGAGTGCGGACCCGGTGCTTCATACATCGGTAGATTTGCGAGTTGTGGGCGGTCAAA GTGCCATCGGTGCGGTGGGCAACCAGCGTCCTATTGGAGATCGGCCTGTAAGTTCATCAAATGCAGAGGCGGCAATCGAT TCAAGTGCGGCGTCCCCATCGGCTCAAGCGAGCTCTGTCCCAGAATCCGAAGGCTCTGGCGATCTGGAGACAGACATGGA GAGGGAATTAGAGACAGCAAGACCGCCTTCTCCTCCTGCTGCGACGGCGTCTGCACCGTCTAGTCGAGATGTAAGTGTGG ATGAAGCTGCCGTGAGCATGGAGGCAGGCAACTCGCAAGGACACGGGTCAATGAGCGTACTGGGTGGTAGCCAATTTAAC GGGCGACCTTTGTATGGGTCCCCACAGCAGCCCGTGGGCACTCAGAAAGCTGCAGGTGCCGGCGGCTTGGAGTGGAAGGC GAAAGCTCCTGGGCACAACTTGTTAGGATCGTCAGCGGACGGTGGTTCTCAAGGTCTAGGTGCTGTGGACCCGACGAGCA TAGCTCAAGCATATCAGTCTATGAGCATACAAGAAGACGAGCCGGTGATAATTCCGACGCACCTTCGAGTGCCGGAAGCA GACCGTTCGCATTTGAGCTTCGGTAGCTTTGGTTCAGATTTTGTGTCAGCTTTCGGGACGAGCTTTGGTCATGCAGAAGT TGAAGAGAACAAGAAGAACGTGGAGACTATTGCTGTGGAAGAGGCTCCGGTCGAGTTGCCGGCTCCTTCAAGTGTGGATA CCCAAGTGGAGATGGCCCAGTCATATGGGCTGCAGCAGGTAGCTACTCCGGTGGAGAGCATTGACGCGAGCGTGGAGGCA CCACCTGTAGTTCAGCAAGACCCGGCCGTTCCAGCCCAGGAGCAGCACACGAAACCTGATCCAGTGGTGCAGTCGGCGCC GTATTTTTTTGGGGCGTCAAACTATCCTGGGTTTGGGTTGATGCCACAAATGCCTGGCGGTCAGTATGGTTACGAGCAAA CGGAGTCGGCACCTCAGGATGCACCTCGGATTCCAAGCATGGTGCCGGCGTACGACCCGACGACGAGCTACTATACTTCA 
GCGTTCCGTGGGGCAGAATCCGATCGGCTTTATCCTCCCTATGTTCCAACGAACACAGCGAGCAAGTATTCTGGGAATAT TGGTTTGATGGCGGCTCCATCTCTTCAAGCATCCCAGGAGGGAGGAAATCCGATGCTAGCTTCTGCCACACCGAGCGTGA GTTCTTCGCAGACTTCTCAACCGGGAAGTAACGTGCAGCCTGCACAAGTGGTGCCGCAGCAAGCTTTGCCGATGCATTAT TCACAGCCACCTTCGGGACCCTTTGGGAACTACGTGAGTTATCAATATATGCCCGCAAGTTACCCGTATTTGCAGCCCCC ATATCCGCACCACGTTTACAACTCGAGCAGCACAGCTTATGCTCAGCCTCCGGCTGGCTCGACGTACACTCCTCCATCTG CGGCATCGTCTTATCCAGCTGGCGGAGCGACTGCTGTGAAGTTTCCGATGCCCCAGTACAAGCCCGGGGCAGCTGCCGGA AACGCTCCCCATCCTGCACCAGCCGTAGGATACGGCGGGTACACGACAACGCCTTCAGGATATGCGTCAAGTCCTGCGGT GACAGTCGGAAATGCTTCTGGATACGAGGACATGAACACATCGCGTTATAAGGATAGCACTCTCTACATTCCGAGCCAAC AGGGAGATAGCTCGACAGTATGGATTCAGGCCGCGATGCCCCGCGATATTGGACCGTCAGCGGCCATGCAGACGAGCTTG TACTACAGTTTGGCCGGGCAAGGCCAGCACAGTGGATACGCGCATTCTCAGCAACCGACGCACGGGCATGCGCATCCAAA CGCGGCATATACCAACTTGTATCATCCGTCACAGACAGGGCCGGCCCCAAGCCATCAGATGCTTCAACAGCCCCAAGGAA TGGGAGGCGCAGGGGGCAACAGCCAAGCAGGAGCCTACCAGCAGCAACCTCAACGTGCGCAACAGTCATGGAACAATTCG AACTATTAG |
Microexon DNA seq | GAG |
Microexon Amino Acid seq | GD |
Microexon-tag DNA Seq | AAAGAGTGCAACATGGACCCGAACGAGGCCGCGCAACGGCTGCTGAACCAAGGAGATCCGTTTCACGAAGTGAAGAGGAAGCGGGACAAGAAGAAAGAGACTGGAGGA |
Microexon-tag Amino Acid seq | KECNMDPNEAAQRLLNQGDPFHEVKRKRDKKKETGG |
Transcript ID | Pp.24803.2 |
Gene ID | Pp.24803 |
Gene Name | NA |
Pfam domain motif | DUF1296 |
Motif E-value | 2.4e-30 |
Motif start | 13 |
Motif end | 72 |
Protein seq | >Pp.24803.2 MSTGRGGGGAVDIPASTKKVVQDLKEVVGNSEEEIYAMLKECNMDPNEAAQRLLNQGDPFHEVKRKRDKKKETGGMKARD SDMRGRPGSGGYIRGGGRGGGGTGGSLPRYGSQGSQEYGGGRGRAHGRENGIHPGLRGSSAASNSTTAPCQAKPSAPSSV PPSTGAQTAAAAAAAAPVQAPSPASSSSGGAVNGSSGFVRPARPPQGAWASGHGTMADLLKARAAPPPPPSSFQSPATSP AVPVVQTALSPSYPAAVAPVAASEPSPESSGFYSSSADPVLHTSVDLRVVGGQSAIGAVGNQRPIGDRPVSSSNAEAAID SSAASPSAQASSVPESEGSGDLETDMERELETARPPSPPAATASAPSSRDVSVDEAAVSMEAGNSQGHGSMSVLGGSQFN GRPLYGSPQQPVGTQKAAGAGGLEWKAKAPGHNLLGSSADGGSQGLGAVDPTSIAQAYQSMSIQEDEPVIIPTHLRVPEA DRSHLSFGSFGSDFVSAFGTSFGHAEVEENKKNVETIAVEEAPVELPAPSSVDTQVEMAQSYGLQQVATPVESIDASVEA PPVVQQDPAVPAQEQHTKPDPVVQSAPYFFGASNYPGFGLMPQMPGGQYGYEQTESAPQDAPRIPSMVPAYDPTTSYYTS AFRGAESDRLYPPYVPTNTASKYSGNIGLMAAPSLQASQEGGNPMLASATPSVSSSQTSQPGSNVQPAQVVPQQALPMHY SQPPSGPFGNYVSYQYMPASYPYLQPPYPHHVYNSSSTAYAQPPAGSTYTPPSAASSYPAGGATAVKFPMPQYKPGAAAG NAPHPAPAVGYGGYTTTPSGYASSPAVTVGNASGYEDMNTSRYKDSTLYIPSQQQGDSSTVWIQAAMPRDIGPSAAMQTS LYYSLAGQGQHSGYAHSQQPTHGHAHPNAAYTNLYHPSQTGPAPSHQMLQQPQGMGGAGGNSQAGAYQQQPQRAQQSWNN SNY* |
CDS seq | >Pp.24803.2 ATGAGTACCGGCAGGGGCGGCGGCGGTGCCGTCGACATCCCGGCCAGCACTAAGAAAGTGGTGCAGGACCTCAAGGAGGT GGTCGGCAACAGCGAAGAGGAGATTTATGCCATGCTCAAAGAGTGCAACATGGACCCGAACGAGGCCGCGCAACGGCTGC TGAACCAAGGAGATCCGTTTCACGAAGTGAAGAGGAAGCGGGACAAGAAGAAAGAGACTGGAGGAATGAAAGCGAGAGAT TCGGATATGAGGGGACGGCCTGGTAGCGGTGGTTATATCAGAGGAGGTGGACGTGGCGGTGGTGGAACCGGGGGTTCTCT GCCTCGTTACGGTTCTCAGGGTTCACAGGAGTATGGTGGTGGGCGAGGTCGAGCGCACGGAAGAGAGAATGGCATTCATC CTGGTTTGCGTGGTTCTAGCGCAGCTTCAAATTCAACTACTGCACCCTGTCAAGCAAAGCCGTCGGCCCCGAGCTCTGTA CCACCATCAACAGGCGCTCAAACAGCAGCAGCAGCAGCAGCAGCAGCACCAGTTCAAGCTCCAAGTCCAGCTAGCAGTAG CTCCGGAGGAGCAGTTAATGGAAGCAGCGGGTTTGTGAGACCCGCACGACCCCCACAAGGGGCATGGGCCTCAGGGCATG GCACAATGGCAGATCTATTGAAAGCGCGGGCAGCTCCTCCTCCACCTCCCTCGAGCTTCCAATCTCCGGCAACTTCCCCA GCCGTGCCGGTCGTTCAAACGGCGTTGTCTCCATCATATCCGGCTGCCGTAGCCCCAGTAGCAGCTTCTGAGCCTTCTCC GGAGTCCTCGGGATTTTATTCGTCGAGTGCGGACCCGGTGCTTCATACATCGGTAGATTTGCGAGTTGTGGGCGGTCAAA GTGCCATCGGTGCGGTGGGCAACCAGCGTCCTATTGGAGATCGGCCTGTAAGTTCATCAAATGCAGAGGCGGCAATCGAT TCAAGTGCGGCGTCCCCATCGGCTCAAGCGAGCTCTGTCCCAGAATCCGAAGGCTCTGGCGATCTGGAGACAGACATGGA GAGGGAATTAGAGACAGCAAGACCGCCTTCTCCTCCTGCTGCGACGGCGTCTGCACCGTCTAGTCGAGATGTAAGTGTGG ATGAAGCTGCCGTGAGCATGGAGGCAGGCAACTCGCAAGGACACGGGTCAATGAGCGTACTGGGTGGTAGCCAATTTAAC GGGCGACCTTTGTATGGGTCCCCACAGCAGCCCGTGGGCACTCAGAAAGCTGCAGGTGCCGGCGGCTTGGAGTGGAAGGC GAAAGCTCCTGGGCACAACTTGTTAGGATCGTCAGCGGACGGTGGTTCTCAAGGTCTAGGTGCTGTGGACCCGACGAGCA TAGCTCAAGCATATCAGTCTATGAGCATACAAGAAGACGAGCCGGTGATAATTCCGACGCACCTTCGAGTGCCGGAAGCA GACCGTTCGCATTTGAGCTTCGGTAGCTTTGGTTCAGATTTTGTGTCAGCTTTCGGGACGAGCTTTGGTCATGCAGAAGT TGAAGAGAACAAGAAGAACGTGGAGACTATTGCTGTGGAAGAGGCTCCGGTCGAGTTGCCGGCTCCTTCAAGTGTGGATA CCCAAGTGGAGATGGCCCAGTCATATGGGCTGCAGCAGGTAGCTACTCCGGTGGAGAGCATTGACGCGAGCGTGGAGGCA CCACCTGTAGTTCAGCAAGACCCGGCCGTTCCAGCCCAGGAGCAGCACACGAAACCTGATCCAGTGGTGCAGTCGGCGCC GTATTTTTTTGGGGCGTCAAACTATCCTGGGTTTGGGTTGATGCCACAAATGCCTGGCGGTCAGTATGGTTACGAGCAAA CGGAGTCGGCACCTCAGGATGCACCTCGGATTCCAAGCATGGTGCCGGCGTACGACCCGACGACGAGCTACTATACTTCA 
GCGTTCCGTGGGGCAGAATCCGATCGGCTTTATCCTCCCTATGTTCCAACGAACACAGCGAGCAAGTATTCTGGGAATAT TGGTTTGATGGCGGCTCCATCTCTTCAAGCATCCCAGGAGGGAGGAAATCCGATGCTAGCTTCTGCCACACCGAGCGTGA GTTCTTCGCAGACTTCTCAACCGGGAAGTAACGTGCAGCCTGCACAAGTGGTGCCGCAGCAAGCTTTGCCGATGCATTAT TCACAGCCACCTTCGGGACCCTTTGGGAACTACGTGAGTTATCAATATATGCCCGCAAGTTACCCGTATTTGCAGCCCCC ATATCCGCACCACGTTTACAACTCGAGCAGCACAGCTTATGCTCAGCCTCCGGCTGGCTCGACGTACACTCCTCCATCTG CGGCATCGTCTTATCCAGCTGGCGGAGCGACTGCTGTGAAGTTTCCGATGCCCCAGTACAAGCCCGGGGCAGCTGCCGGA AACGCTCCCCATCCTGCACCAGCCGTAGGATACGGCGGGTACACGACAACGCCTTCAGGATATGCGTCAAGTCCTGCGGT GACAGTCGGAAATGCTTCTGGATACGAGGACATGAACACATCGCGTTATAAGGATAGCACTCTCTACATTCCGAGCCAAC AGCAGGGAGATAGCTCGACAGTATGGATTCAGGCCGCGATGCCCCGCGATATTGGACCGTCAGCGGCCATGCAGACGAGC TTGTACTACAGTTTGGCCGGGCAAGGCCAGCACAGTGGATACGCGCATTCTCAGCAACCGACGCACGGGCATGCGCATCC AAACGCGGCATATACCAACTTGTATCATCCGTCACAGACAGGGCCGGCCCCAAGCCATCAGATGCTTCAACAGCCCCAAG GAATGGGAGGCGCAGGGGGCAACAGCCAAGCAGGAGCCTACCAGCAGCAACCTCAACGTGCGCAACAGTCATGGAACAAT TCGAACTATTAG |
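The microexon-tag fields listed for transcript Pp3c9_9170V3.1 can be cross-checked against each other with a short script. The sketch below is a minimal illustration, not part of the database tooling: it assumes the standard genetic code and assumes the tag DNA is in frame from its first base. The names `translate`, `TAG_DNA`, `TAG_PEPTIDE`, `MICROEXON_DNA` and `PROTEIN_NTERM` are hypothetical and introduced only for this example. `PROTEIN_NTERM` holds just the first 80 residues of the protein sequence given above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon (frame assumed, not inferred)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Values copied from the record for transcript Pp3c9_9170V3.1.
MICROEXON_DNA = "GAG"
TAG_DNA = (
    "AAAGAGTGCAACATGGACCCGAACGAGGCCGCGCAACGGCTGCTGAACCAAGGAGATCCG"
    "TTTCACGAAGTGAAGAGGAAGCGGGACAAGAAGAAAGAGACTGGAGGA"
)
TAG_PEPTIDE = "KECNMDPNEAAQRLLNQGDPFHEVKRKRDKKKETGG"
# First 80 residues of the protein sequence only; enough to contain the tag.
PROTEIN_NTERM = (
    "MSTGRGGGGAVDIPASTKKVVQDLKEVVGNSEEEIYAMLKECNMDPNEAAQRLLNQGDPF"
    "HEVKRKRDKKKETGGMKARD"
)

# 1. The tag DNA should translate to the tag peptide.
tag_matches = translate(TAG_DNA) == TAG_PEPTIDE

# 2. The tag peptide should occur in the protein; offset is 0-based.
offset = PROTEIN_NTERM.find(TAG_PEPTIDE)

# 3. 1-based protein positions of the "GD" residues listed for the microexon.
gd_start = offset + TAG_PEPTIDE.find("GD") + 1
gd_end = gd_start + 1

print("tag translation matches:", tag_matches)
print("tag starts at protein residue:", offset + 1)
print("GD residues at:", gd_start, gd_end)
```

If the Motif start and Motif end values are 1-based protein coordinates (an assumption), the positions reported for the GD residues can be compared directly against the 13 to 72 range of the DUF1296 motif.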