Microexon ID Pp_9:5739735-5739737:-
Species Physcomitrium patens
Coordinates 9:5739735..5739737
Microexon Cluster ID Unclassified
Size 3
Pp_9:5739735-5739737:- does not have available information here.
Transcript ID Pp3c9_9170V3.1
Protein ID Pp3c9_9170V3.1
Gene ID Pp3c9_9170
Gene Name NA
Pfam domain motif DUF1296
Motif E-value 2.4e-30
Motif start 13
Motif end 72
Protein seq >Pp3c9_9170V3.1
MSTGRGGGGAVDIPASTKKVVQDLKEVVGNSEEEIYAMLKECNMDPNEAAQRLLNQGDPFHEVKRKRDKKKETGGMKARD
SDMRGRPGSGGYIRGGGRGGGGTGGSLPRYGSQGSQEYGGGRGRAHGRENGIHPGLRGSSAASNSTTAPCQAKPSAPSSV
PPSTGAQTAAAAAAAAPVQAPSPASSSSGGAVNGSSGFVRPARPPQGAWASGHGTMADLLKARAAPPPPPSSFQSPATSP
AVPVVQTALSPSYPAAVAPVAASEPSPESSGFYSSSADPVLHTSVDLRVVGGQSAIGAVGNQRPIGDRPVSSSNAEAAID
SSAASPSAQASSVPESEGSGDLETDMERELETARPPSPPAATASAPSSRDVSVDEAAVSMEAGNSQGHGSMSVLGGSQFN
GRPLYGSPQQPVGTQKAAGAGGLEWKAKAPGHNLLGSSADGGSQGLGAVDPTSIAQAYQSMSIQEDEPVIIPTHLRVPEA
DRSHLSFGSFGSDFVSAFGTSFGHAEVEENKKNVETIAVEEAPVELPAPSSVDTQVEMAQSYGLQQVATPVESIDASVEA
PPVVQQDPAVPAQEQHTKPDPVVQSAPYFFGASNYPGFGLMPQMPGGQYGYEQTESAPQDAPRIPSMVPAYDPTTSYYTS
AFRGAESDRLYPPYVPTNTASKYSGNIGLMAAPSLQASQEGGNPMLASATPSVSSSQTSQPGSNVQPAQVVPQQALPMHY
SQPPSGPFGNYVSYQYMPASYPYLQPPYPHHVYNSSSTAYAQPPAGSTYTPPSAASSYPAGGATAVKFPMPQYKPGAAAG
NAPHPAPAVGYGGYTTTPSGYASSPAVTVGNASGYEDMNTSRYKDSTLYIPSQQGDSSTVWIQAAMPRDIGPSAAMQTSL
YYSLAGQGQHSGYAHSQQPTHGHAHPNAAYTNLYHPSQTGPAPSHQMLQQPQGMGGAGGNSQAGAYQQQPQRAQQSWNNS
NY*
CDS seq >Pp3c9_9170V3.1
ATGAGTACCGGCAGGGGCGGCGGCGGTGCCGTCGACATCCCGGCCAGCACTAAGAAAGTGGTGCAGGACCTCAAGGAGGT
GGTCGGCAACAGCGAAGAGGAGATTTATGCCATGCTCAAAGAGTGCAACATGGACCCGAACGAGGCCGCGCAACGGCTGC
TGAACCAAGGAGATCCGTTTCACGAAGTGAAGAGGAAGCGGGACAAGAAGAAAGAGACTGGAGGAATGAAAGCGAGAGAT
TCGGATATGAGGGGACGGCCTGGTAGCGGTGGTTATATCAGAGGAGGTGGACGTGGCGGTGGTGGAACCGGGGGTTCTCT
GCCTCGTTACGGTTCTCAGGGTTCACAGGAGTATGGTGGTGGGCGAGGTCGAGCGCACGGAAGAGAGAATGGCATTCATC
CTGGTTTGCGTGGTTCTAGCGCAGCTTCAAATTCAACTACTGCACCCTGTCAAGCAAAGCCGTCGGCCCCGAGCTCTGTA
CCACCATCAACAGGCGCTCAAACAGCAGCAGCAGCAGCAGCAGCAGCACCAGTTCAAGCTCCAAGTCCAGCTAGCAGTAG
CTCCGGAGGAGCAGTTAATGGAAGCAGCGGGTTTGTGAGACCCGCACGACCCCCACAAGGGGCATGGGCCTCAGGGCATG
GCACAATGGCAGATCTATTGAAAGCGCGGGCAGCTCCTCCTCCACCTCCCTCGAGCTTCCAATCTCCGGCAACTTCCCCA
GCCGTGCCGGTCGTTCAAACGGCGTTGTCTCCATCATATCCGGCTGCCGTAGCCCCAGTAGCAGCTTCTGAGCCTTCTCC
GGAGTCCTCGGGATTTTATTCGTCGAGTGCGGACCCGGTGCTTCATACATCGGTAGATTTGCGAGTTGTGGGCGGTCAAA
GTGCCATCGGTGCGGTGGGCAACCAGCGTCCTATTGGAGATCGGCCTGTAAGTTCATCAAATGCAGAGGCGGCAATCGAT
TCAAGTGCGGCGTCCCCATCGGCTCAAGCGAGCTCTGTCCCAGAATCCGAAGGCTCTGGCGATCTGGAGACAGACATGGA
GAGGGAATTAGAGACAGCAAGACCGCCTTCTCCTCCTGCTGCGACGGCGTCTGCACCGTCTAGTCGAGATGTAAGTGTGG
ATGAAGCTGCCGTGAGCATGGAGGCAGGCAACTCGCAAGGACACGGGTCAATGAGCGTACTGGGTGGTAGCCAATTTAAC
GGGCGACCTTTGTATGGGTCCCCACAGCAGCCCGTGGGCACTCAGAAAGCTGCAGGTGCCGGCGGCTTGGAGTGGAAGGC
GAAAGCTCCTGGGCACAACTTGTTAGGATCGTCAGCGGACGGTGGTTCTCAAGGTCTAGGTGCTGTGGACCCGACGAGCA
TAGCTCAAGCATATCAGTCTATGAGCATACAAGAAGACGAGCCGGTGATAATTCCGACGCACCTTCGAGTGCCGGAAGCA
GACCGTTCGCATTTGAGCTTCGGTAGCTTTGGTTCAGATTTTGTGTCAGCTTTCGGGACGAGCTTTGGTCATGCAGAAGT
TGAAGAGAACAAGAAGAACGTGGAGACTATTGCTGTGGAAGAGGCTCCGGTCGAGTTGCCGGCTCCTTCAAGTGTGGATA
CCCAAGTGGAGATGGCCCAGTCATATGGGCTGCAGCAGGTAGCTACTCCGGTGGAGAGCATTGACGCGAGCGTGGAGGCA
CCACCTGTAGTTCAGCAAGACCCGGCCGTTCCAGCCCAGGAGCAGCACACGAAACCTGATCCAGTGGTGCAGTCGGCGCC
GTATTTTTTTGGGGCGTCAAACTATCCTGGGTTTGGGTTGATGCCACAAATGCCTGGCGGTCAGTATGGTTACGAGCAAA
CGGAGTCGGCACCTCAGGATGCACCTCGGATTCCAAGCATGGTGCCGGCGTACGACCCGACGACGAGCTACTATACTTCA
GCGTTCCGTGGGGCAGAATCCGATCGGCTTTATCCTCCCTATGTTCCAACGAACACAGCGAGCAAGTATTCTGGGAATAT
TGGTTTGATGGCGGCTCCATCTCTTCAAGCATCCCAGGAGGGAGGAAATCCGATGCTAGCTTCTGCCACACCGAGCGTGA
GTTCTTCGCAGACTTCTCAACCGGGAAGTAACGTGCAGCCTGCACAAGTGGTGCCGCAGCAAGCTTTGCCGATGCATTAT
TCACAGCCACCTTCGGGACCCTTTGGGAACTACGTGAGTTATCAATATATGCCCGCAAGTTACCCGTATTTGCAGCCCCC
ATATCCGCACCACGTTTACAACTCGAGCAGCACAGCTTATGCTCAGCCTCCGGCTGGCTCGACGTACACTCCTCCATCTG
CGGCATCGTCTTATCCAGCTGGCGGAGCGACTGCTGTGAAGTTTCCGATGCCCCAGTACAAGCCCGGGGCAGCTGCCGGA
AACGCTCCCCATCCTGCACCAGCCGTAGGATACGGCGGGTACACGACAACGCCTTCAGGATATGCGTCAAGTCCTGCGGT
GACAGTCGGAAATGCTTCTGGATACGAGGACATGAACACATCGCGTTATAAGGATAGCACTCTCTACATTCCGAGCCAAC
AGGGAGATAGCTCGACAGTATGGATTCAGGCCGCGATGCCCCGCGATATTGGACCGTCAGCGGCCATGCAGACGAGCTTG
TACTACAGTTTGGCCGGGCAAGGCCAGCACAGTGGATACGCGCATTCTCAGCAACCGACGCACGGGCATGCGCATCCAAA
CGCGGCATATACCAACTTGTATCATCCGTCACAGACAGGGCCGGCCCCAAGCCATCAGATGCTTCAACAGCCCCAAGGAA
TGGGAGGCGCAGGGGGCAACAGCCAAGCAGGAGCCTACCAGCAGCAACCTCAACGTGCGCAACAGTCATGGAACAATTCG
AACTATTAG
Microexon DNA seq GAG
Microexon Amino Acid seq GD
Microexon-tag DNA Seq AAAGAGTGCAACATGGACCCGAACGAGGCCGCGCAACGGCTGCTGAACCAAGGAGATCCGTTTCACGAAGTGAAGAGGAAGCGGGACAAGAAGAAAGAGACTGGAGGA
Microexon-tag Amino Acid seq KECNMDPNEAAQRLLNQGDPFHEVKRKRDKKKETGG
Transcript ID Pp.24803.2
Gene ID Pp.24803
Gene Name NA
Pfam domain motif DUF1296
Motif E-value 2.4e-30
Motif start 13
Motif end 72
Protein seq >Pp.24803.2
MSTGRGGGGAVDIPASTKKVVQDLKEVVGNSEEEIYAMLKECNMDPNEAAQRLLNQGDPFHEVKRKRDKKKETGGMKARD
SDMRGRPGSGGYIRGGGRGGGGTGGSLPRYGSQGSQEYGGGRGRAHGRENGIHPGLRGSSAASNSTTAPCQAKPSAPSSV
PPSTGAQTAAAAAAAAPVQAPSPASSSSGGAVNGSSGFVRPARPPQGAWASGHGTMADLLKARAAPPPPPSSFQSPATSP
AVPVVQTALSPSYPAAVAPVAASEPSPESSGFYSSSADPVLHTSVDLRVVGGQSAIGAVGNQRPIGDRPVSSSNAEAAID
SSAASPSAQASSVPESEGSGDLETDMERELETARPPSPPAATASAPSSRDVSVDEAAVSMEAGNSQGHGSMSVLGGSQFN
GRPLYGSPQQPVGTQKAAGAGGLEWKAKAPGHNLLGSSADGGSQGLGAVDPTSIAQAYQSMSIQEDEPVIIPTHLRVPEA
DRSHLSFGSFGSDFVSAFGTSFGHAEVEENKKNVETIAVEEAPVELPAPSSVDTQVEMAQSYGLQQVATPVESIDASVEA
PPVVQQDPAVPAQEQHTKPDPVVQSAPYFFGASNYPGFGLMPQMPGGQYGYEQTESAPQDAPRIPSMVPAYDPTTSYYTS
AFRGAESDRLYPPYVPTNTASKYSGNIGLMAAPSLQASQEGGNPMLASATPSVSSSQTSQPGSNVQPAQVVPQQALPMHY
SQPPSGPFGNYVSYQYMPASYPYLQPPYPHHVYNSSSTAYAQPPAGSTYTPPSAASSYPAGGATAVKFPMPQYKPGAAAG
NAPHPAPAVGYGGYTTTPSGYASSPAVTVGNASGYEDMNTSRYKDSTLYIPSQQQGDSSTVWIQAAMPRDIGPSAAMQTS
LYYSLAGQGQHSGYAHSQQPTHGHAHPNAAYTNLYHPSQTGPAPSHQMLQQPQGMGGAGGNSQAGAYQQQPQRAQQSWNN
SNY*
CDS seq >Pp.24803.2
ATGAGTACCGGCAGGGGCGGCGGCGGTGCCGTCGACATCCCGGCCAGCACTAAGAAAGTGGTGCAGGACCTCAAGGAGGT
GGTCGGCAACAGCGAAGAGGAGATTTATGCCATGCTCAAAGAGTGCAACATGGACCCGAACGAGGCCGCGCAACGGCTGC
TGAACCAAGGAGATCCGTTTCACGAAGTGAAGAGGAAGCGGGACAAGAAGAAAGAGACTGGAGGAATGAAAGCGAGAGAT
TCGGATATGAGGGGACGGCCTGGTAGCGGTGGTTATATCAGAGGAGGTGGACGTGGCGGTGGTGGAACCGGGGGTTCTCT
GCCTCGTTACGGTTCTCAGGGTTCACAGGAGTATGGTGGTGGGCGAGGTCGAGCGCACGGAAGAGAGAATGGCATTCATC
CTGGTTTGCGTGGTTCTAGCGCAGCTTCAAATTCAACTACTGCACCCTGTCAAGCAAAGCCGTCGGCCCCGAGCTCTGTA
CCACCATCAACAGGCGCTCAAACAGCAGCAGCAGCAGCAGCAGCAGCACCAGTTCAAGCTCCAAGTCCAGCTAGCAGTAG
CTCCGGAGGAGCAGTTAATGGAAGCAGCGGGTTTGTGAGACCCGCACGACCCCCACAAGGGGCATGGGCCTCAGGGCATG
GCACAATGGCAGATCTATTGAAAGCGCGGGCAGCTCCTCCTCCACCTCCCTCGAGCTTCCAATCTCCGGCAACTTCCCCA
GCCGTGCCGGTCGTTCAAACGGCGTTGTCTCCATCATATCCGGCTGCCGTAGCCCCAGTAGCAGCTTCTGAGCCTTCTCC
GGAGTCCTCGGGATTTTATTCGTCGAGTGCGGACCCGGTGCTTCATACATCGGTAGATTTGCGAGTTGTGGGCGGTCAAA
GTGCCATCGGTGCGGTGGGCAACCAGCGTCCTATTGGAGATCGGCCTGTAAGTTCATCAAATGCAGAGGCGGCAATCGAT
TCAAGTGCGGCGTCCCCATCGGCTCAAGCGAGCTCTGTCCCAGAATCCGAAGGCTCTGGCGATCTGGAGACAGACATGGA
GAGGGAATTAGAGACAGCAAGACCGCCTTCTCCTCCTGCTGCGACGGCGTCTGCACCGTCTAGTCGAGATGTAAGTGTGG
ATGAAGCTGCCGTGAGCATGGAGGCAGGCAACTCGCAAGGACACGGGTCAATGAGCGTACTGGGTGGTAGCCAATTTAAC
GGGCGACCTTTGTATGGGTCCCCACAGCAGCCCGTGGGCACTCAGAAAGCTGCAGGTGCCGGCGGCTTGGAGTGGAAGGC
GAAAGCTCCTGGGCACAACTTGTTAGGATCGTCAGCGGACGGTGGTTCTCAAGGTCTAGGTGCTGTGGACCCGACGAGCA
TAGCTCAAGCATATCAGTCTATGAGCATACAAGAAGACGAGCCGGTGATAATTCCGACGCACCTTCGAGTGCCGGAAGCA
GACCGTTCGCATTTGAGCTTCGGTAGCTTTGGTTCAGATTTTGTGTCAGCTTTCGGGACGAGCTTTGGTCATGCAGAAGT
TGAAGAGAACAAGAAGAACGTGGAGACTATTGCTGTGGAAGAGGCTCCGGTCGAGTTGCCGGCTCCTTCAAGTGTGGATA
CCCAAGTGGAGATGGCCCAGTCATATGGGCTGCAGCAGGTAGCTACTCCGGTGGAGAGCATTGACGCGAGCGTGGAGGCA
CCACCTGTAGTTCAGCAAGACCCGGCCGTTCCAGCCCAGGAGCAGCACACGAAACCTGATCCAGTGGTGCAGTCGGCGCC
GTATTTTTTTGGGGCGTCAAACTATCCTGGGTTTGGGTTGATGCCACAAATGCCTGGCGGTCAGTATGGTTACGAGCAAA
CGGAGTCGGCACCTCAGGATGCACCTCGGATTCCAAGCATGGTGCCGGCGTACGACCCGACGACGAGCTACTATACTTCA
GCGTTCCGTGGGGCAGAATCCGATCGGCTTTATCCTCCCTATGTTCCAACGAACACAGCGAGCAAGTATTCTGGGAATAT
TGGTTTGATGGCGGCTCCATCTCTTCAAGCATCCCAGGAGGGAGGAAATCCGATGCTAGCTTCTGCCACACCGAGCGTGA
GTTCTTCGCAGACTTCTCAACCGGGAAGTAACGTGCAGCCTGCACAAGTGGTGCCGCAGCAAGCTTTGCCGATGCATTAT
TCACAGCCACCTTCGGGACCCTTTGGGAACTACGTGAGTTATCAATATATGCCCGCAAGTTACCCGTATTTGCAGCCCCC
ATATCCGCACCACGTTTACAACTCGAGCAGCACAGCTTATGCTCAGCCTCCGGCTGGCTCGACGTACACTCCTCCATCTG
CGGCATCGTCTTATCCAGCTGGCGGAGCGACTGCTGTGAAGTTTCCGATGCCCCAGTACAAGCCCGGGGCAGCTGCCGGA
AACGCTCCCCATCCTGCACCAGCCGTAGGATACGGCGGGTACACGACAACGCCTTCAGGATATGCGTCAAGTCCTGCGGT
GACAGTCGGAAATGCTTCTGGATACGAGGACATGAACACATCGCGTTATAAGGATAGCACTCTCTACATTCCGAGCCAAC
AGCAGGGAGATAGCTCGACAGTATGGATTCAGGCCGCGATGCCCCGCGATATTGGACCGTCAGCGGCCATGCAGACGAGC
TTGTACTACAGTTTGGCCGGGCAAGGCCAGCACAGTGGATACGCGCATTCTCAGCAACCGACGCACGGGCATGCGCATCC
AAACGCGGCATATACCAACTTGTATCATCCGTCACAGACAGGGCCGGCCCCAAGCCATCAGATGCTTCAACAGCCCCAAG
GAATGGGAGGCGCAGGGGGCAACAGCCAAGCAGGAGCCTACCAGCAGCAACCTCAACGTGCGCAACAGTCATGGAACAAT
TCGAACTATTAG