Microexon ID Pp_22:2822258-2822269:-
Species Physcomitrium patens
Coordinates 22:2822258..2822269
Microexon Cluster ID Unclassified
Size 12
Pp_22:2822258-2822269:- does not have available information here.
Transcript ID Pp3c22_4740V3.2
Protein ID Pp3c22_4740V3.2
Gene ID Pp3c22_4740
Gene Name NA
Pfam domain motif IMS
Motif E-value 1.3e-32
Motif start 382
Motif end 513
Protein seq >Pp3c22_4740V3.2
MAGEEKEEESLKRKKVDNVPRGMAWGAQSFASARISSIREKDPSSVSDFGRYMAEKRRKLNVQYNEDSSNKLLNNATAAV
AALHSKEGQIVVAEISTKSRIFNGISIWVDGFTVPTHQELRHLMLQYGGVFENYFRRELVTHIICSTLPDSKIKNPRSLS
RGLPIVKPEWIVDSIAAGRVLSWAPYQLQRIALEHPNQRTLQSAYLNPKLDHCPLPLDWRDNYPTYSNMMDIDLEPRSEL
LALENVPTSDNQGPSSVQDGAVNFTLLEEDLKGIEVKADECSETENQGVLESHGKKDSVAIVDRSKRNTGAGYSDAAAAA
THSTLGDDNFVQNYFKSSRLHFIGTWRNRYQSRYGSFNDNSEGQDQQVFRAKSGSDPAVIHIDMDCFFVAVVVRDRPELH
DKPVAVCHSDSVRGSGEISSANYPARAFGVRAGMFVRDAKAKCPDLEIVPYNFEAYEQVADKAVSCDEAFLDVTDQGDPC
QIASVIRSQIFEATRCTASAGVSKNKLLARLATRKAKPNGLYYIAPSEVEGFMDELAVEDLPGVGWTLKEKLKSHNLHKC
SDLLRVSKDFLQREFGVKTGDMLWSYARGIDTREVQQAQIRKSIGAEVNWGVRFLVPEDAHRFLVTLSTEVATRLQTAAV
KGRTITLKVKRRKEGSGEPSKFMGCGVCDNFSRSETVGYATDSQEILLRVAKQLFNSFAFDVRDVRGVGLQVTRLEAVGS
GQSLKGGQGQQRALKSWLAPEISKVMLDESAPADRNSGNQTIIHRSTEPLKTEAGESIPVKSPALDVPKLPLNTSAQDNN
DLHAAAESGMRNFIAPDTKDRIQVGRSSTKAAPPPTNLPPLSELDPSVLASLPPEILAEIREVYGELAAQPVVSSKPAKN
YLSPVAVMKGAHKTKTSEVSNRVDRREDGIRRVLPLEPLTERGESSRPPSVRPEIVALPPASQLDPSVMSALPLSLRREL
EQEYKRQQVKKPDKSTLRVSEPEKLCVERCSSTSLEDLWTGTPPKWVHMFKESQDAEFSNLCVLAEHMAGAVRPFSVVML
SVLPHLSNACSATVSQKVIQSCMELIKQYVQQMIACDLEEVCMIMRVLKRVAAKSQLWRVVEDLITPFVQGIVGECYGGH
LKI*
CDS seq >Pp3c22_4740V3.2
ATGGCAGGTGAAGAGAAAGAGGAAGAATCGCTCAAAAGAAAAAAAGTGGACAATGTTCCGCGTGGCATGGCATGGGGAGC
TCAATCCTTTGCTTCTGCAAGAATCAGCAGCATTCGCGAGAAAGATCCCTCGAGCGTCAGTGACTTTGGCAGATACATGG
CTGAAAAGCGCCGCAAGCTAAATGTGCAATATAATGAGGACTCTTCTAACAAGCTTCTCAACAATGCGACAGCGGCTGTT
GCAGCGCTACATTCAAAGGAGGGGCAAATTGTAGTCGCTGAGATTTCAACCAAGTCACGCATATTCAATGGAATTTCAAT
TTGGGTGGACGGTTTCACTGTACCTACTCATCAGGAGCTACGACATCTAATGCTACAATATGGAGGTGTTTTTGAGAATT
ACTTTAGAAGGGAGCTTGTCACTCACATCATTTGCAGCACTCTTCCTGATAGCAAAATTAAAAATCCAAGATCCCTCAGT
CGAGGTTTGCCAATAGTAAAACCAGAATGGATCGTGGACAGCATAGCTGCAGGCCGAGTCCTATCTTGGGCTCCATATCA
ACTCCAAAGAATAGCATTGGAACATCCAAATCAGCGTACTCTGCAGTCGGCATATTTGAATCCCAAACTCGATCACTGCC
CTCTTCCTCTTGACTGGAGGGATAATTACCCAACATATTCAAACATGATGGACATAGACTTAGAACCCCGATCGGAGCTG
CTAGCACTTGAAAATGTTCCAACGTCAGACAACCAAGGTCCATCTTCTGTACAAGATGGAGCTGTTAATTTTACCCTTTT
GGAGGAGGACCTGAAAGGCATTGAAGTGAAAGCTGATGAATGCTCTGAGACTGAAAACCAAGGAGTGCTCGAGTCCCACG
GCAAGAAAGACAGTGTAGCTATTGTAGACAGAAGTAAGCGAAATACCGGCGCTGGCTACTCAGATGCCGCTGCTGCTGCG
ACTCACTCAACGTTGGGAGACGATAATTTTGTACAAAACTACTTCAAGTCTTCTAGACTTCATTTCATAGGAACATGGCG
TAACCGTTACCAAAGTCGGTATGGGTCCTTCAACGATAATAGTGAAGGTCAAGACCAACAAGTTTTTAGAGCTAAATCAG
GAAGTGATCCTGCTGTCATCCACATCGATATGGACTGCTTCTTTGTGGCTGTGGTTGTGAGAGACCGACCTGAACTTCAC
GACAAGCCAGTTGCTGTCTGTCATTCTGATAGTGTACGCGGTAGTGGAGAGATATCATCTGCAAATTACCCAGCAAGAGC
TTTTGGAGTTCGGGCAGGCATGTTTGTGCGTGATGCGAAAGCAAAATGTCCTGACCTGGAGATAGTACCCTATAATTTTG
AGGCTTATGAGCAGGTAGCTGATAAGGCTGTTAGCTGTGATGAGGCCTTTTTGGACGTCACAGATCAAGGGGACCCATGC
CAAATTGCATCAGTGATAAGGAGCCAGATTTTTGAAGCAACCCGGTGCACTGCTAGTGCAGGTGTCTCAAAGAATAAGTT
GTTAGCACGTCTGGCCACACGAAAGGCCAAGCCCAACGGTCTATACTACATTGCACCCTCAGAGGTAGAGGGATTCATGG
ATGAGCTCGCCGTGGAGGATCTTCCTGGTGTTGGATGGACGTTAAAGGAAAAATTAAAATCTCACAATCTTCACAAGTGC
TCTGATCTGCTAAGAGTTTCAAAGGACTTTTTGCAAAGGGAATTCGGTGTGAAAACAGGAGACATGCTGTGGTCTTATGC
ACGAGGAATTGATACTCGTGAAGTTCAACAAGCTCAGATACGCAAGTCAATTGGTGCGGAAGTTAACTGGGGTGTTCGTT
TTCTTGTGCCTGAGGATGCTCATCGTTTTTTGGTAACGTTAAGCACAGAGGTCGCGACACGCTTACAAACTGCTGCTGTC
AAAGGTCGCACTATTACCTTGAAGGTGAAGAGACGGAAAGAAGGCTCTGGGGAGCCTTCCAAATTCATGGGTTGTGGAGT
TTGTGACAACTTCAGTCGTTCTGAAACGGTTGGATACGCTACTGATTCTCAGGAGATCCTACTTCGTGTGGCAAAACAAC
TTTTCAACTCTTTTGCATTCGATGTGCGTGATGTCCGTGGGGTAGGTTTGCAAGTCACAAGGCTTGAAGCAGTTGGTTCT
GGCCAGTCACTGAAAGGTGGACAAGGCCAACAGCGTGCACTGAAGTCATGGTTAGCTCCTGAGATCAGCAAAGTAATGCT
GGATGAAAGTGCTCCTGCGGATCGTAACTCAGGCAATCAGACAATCATACACAGAAGCACTGAGCCACTCAAGACAGAGG
CTGGGGAATCGATTCCTGTCAAAAGTCCAGCTTTAGATGTTCCGAAACTACCTCTCAATACATCAGCGCAAGACAATAAT
GATTTACATGCAGCTGCGGAATCTGGTATGAGGAATTTCATTGCACCAGATACGAAGGATCGCATTCAGGTTGGGAGGAG
TTCAACAAAAGCTGCACCACCTCCAACAAACTTACCCCCGCTCTCTGAATTAGACCCCTCTGTTTTAGCTTCACTTCCGC
CTGAAATTCTGGCTGAAATTCGAGAAGTTTATGGGGAACTCGCAGCCCAGCCTGTGGTCTCCAGTAAGCCTGCAAAAAAT
TATTTATCTCCTGTGGCAGTAATGAAGGGTGCCCATAAAACTAAAACATCTGAGGTAAGTAATAGAGTGGACAGAAGAGA
AGATGGAATTAGAAGGGTTTTACCTTTGGAGCCCTTGACTGAACGAGGAGAATCTTCACGGCCACCTTCAGTCCGCCCCG
AAATAGTAGCACTTCCACCAGCATCACAGTTGGATCCCTCAGTTATGTCTGCGCTCCCGCTGTCTTTGCGAAGAGAGCTT
GAGCAAGAATATAAGCGACAGCAGGTGAAGAAGCCTGATAAGTCGACATTGCGCGTGAGTGAACCGGAAAAGCTCTGCGT
TGAGAGATGTTCATCAACTTCACTTGAAGATCTTTGGACTGGTACACCACCTAAGTGGGTACACATGTTCAAAGAATCCC
AAGATGCAGAATTTAGCAACTTGTGCGTGCTAGCGGAGCACATGGCAGGTGCAGTCAGACCGTTTTCAGTTGTAATGCTG
TCGGTTCTCCCTCATCTTTCGAATGCATGTTCTGCAACAGTAAGCCAGAAAGTTATTCAAAGTTGTATGGAGTTGATTAA
ACAGTATGTTCAACAAATGATAGCGTGCGACTTGGAGGAAGTTTGTATGATAATGCGTGTGCTAAAAAGAGTTGCTGCCA
AGTCTCAACTGTGGAGGGTTGTTGAGGATCTCATTACTCCCTTCGTTCAGGGTATTGTTGGAGAGTGTTATGGAGGTCAC
CTCAAAATCTAA
Microexon DNA seq GTAGCTGATAAG
Microexon Amino Acid seq VADK
Microexon-tag DNA Seq TGTCCTGACCTGGAGATAGTACCCTATAATTTTGAGGCTTATGAGCAGGTAGCTGATAAGGCTGTTAGCTGTGATGAGGCCTTTTTGGACGTCACAGATCAAGGGGAC
Microexon-tag Amino Acid seq CPDLEIVPYNFEAYEQVADKAVSCDEAFLDVTDQGD
Transcript ID Pp3c22_4740V3.2
Gene ID Pp.14100
Gene Name NA
Pfam domain motif IMS
Motif E-value 1.3e-32
Motif start 382
Motif end 513
Protein seq >Pp3c22_4740V3.2
MAGEEKEEESLKRKKVDNVPRGMAWGAQSFASARISSIREKDPSSVSDFGRYMAEKRRKLNVQYNEDSSNKLLNNATAAV
AALHSKEGQIVVAEISTKSRIFNGISIWVDGFTVPTHQELRHLMLQYGGVFENYFRRELVTHIICSTLPDSKIKNPRSLS
RGLPIVKPEWIVDSIAAGRVLSWAPYQLQRIALEHPNQRTLQSAYLNPKLDHCPLPLDWRDNYPTYSNMMDIDLEPRSEL
LALENVPTSDNQGPSSVQDGAVNFTLLEEDLKGIEVKADECSETENQGVLESHGKKDSVAIVDRSKRNTGAGYSDAAAAA
THSTLGDDNFVQNYFKSSRLHFIGTWRNRYQSRYGSFNDNSEGQDQQVFRAKSGSDPAVIHIDMDCFFVAVVVRDRPELH
DKPVAVCHSDSVRGSGEISSANYPARAFGVRAGMFVRDAKAKCPDLEIVPYNFEAYEQVADKAVSCDEAFLDVTDQGDPC
QIASVIRSQIFEATRCTASAGVSKNKLLARLATRKAKPNGLYYIAPSEVEGFMDELAVEDLPGVGWTLKEKLKSHNLHKC
SDLLRVSKDFLQREFGVKTGDMLWSYARGIDTREVQQAQIRKSIGAEVNWGVRFLVPEDAHRFLVTLSTEVATRLQTAAV
KGRTITLKVKRRKEGSGEPSKFMGCGVCDNFSRSETVGYATDSQEILLRVAKQLFNSFAFDVRDVRGVGLQVTRLEAVGS
GQSLKGGQGQQRALKSWLAPEISKVMLDESAPADRNSGNQTIIHRSTEPLKTEAGESIPVKSPALDVPKLPLNTSAQDNN
DLHAAAESGMRNFIAPDTKDRIQVGRSSTKAAPPPTNLPPLSELDPSVLASLPPEILAEIREVYGELAAQPVVSSKPAKN
YLSPVAVMKGAHKTKTSEVSNRVDRREDGIRRVLPLEPLTERGESSRPPSVRPEIVALPPASQLDPSVMSALPLSLRREL
EQEYKRQQVKKPDKSTLRVSEPEKLCVERCSSTSLEDLWTGTPPKWVHMFKESQDAEFSNLCVLAEHMAGAVRPFSVVML
SVLPHLSNACSATVSQKVIQSCMELIKQYVQQMIACDLEEVCMIMRVLKRVAAKSQLWRVVEDLITPFVQGIVGECYGGH
LKI*
CDS seq >Pp3c22_4740V3.2
ATGGCAGGTGAAGAGAAAGAGGAAGAATCGCTCAAAAGAAAAAAAGTGGACAATGTTCCGCGTGGCATGGCATGGGGAGC
TCAATCCTTTGCTTCTGCAAGAATCAGCAGCATTCGCGAGAAAGATCCCTCGAGCGTCAGTGACTTTGGCAGATACATGG
CTGAAAAGCGCCGCAAGCTAAATGTGCAATATAATGAGGACTCTTCTAACAAGCTTCTCAACAATGCGACAGCGGCTGTT
GCAGCGCTACATTCAAAGGAGGGGCAAATTGTAGTCGCTGAGATTTCAACCAAGTCACGCATATTCAATGGAATTTCAAT
TTGGGTGGACGGTTTCACTGTACCTACTCATCAGGAGCTACGACATCTAATGCTACAATATGGAGGTGTTTTTGAGAATT
ACTTTAGAAGGGAGCTTGTCACTCACATCATTTGCAGCACTCTTCCTGATAGCAAAATTAAAAATCCAAGATCCCTCAGT
CGAGGTTTGCCAATAGTAAAACCAGAATGGATCGTGGACAGCATAGCTGCAGGCCGAGTCCTATCTTGGGCTCCATATCA
ACTCCAAAGAATAGCATTGGAACATCCAAATCAGCGTACTCTGCAGTCGGCATATTTGAATCCCAAACTCGATCACTGCC
CTCTTCCTCTTGACTGGAGGGATAATTACCCAACATATTCAAACATGATGGACATAGACTTAGAACCCCGATCGGAGCTG
CTAGCACTTGAAAATGTTCCAACGTCAGACAACCAAGGTCCATCTTCTGTACAAGATGGAGCTGTTAATTTTACCCTTTT
GGAGGAGGACCTGAAAGGCATTGAAGTGAAAGCTGATGAATGCTCTGAGACTGAAAACCAAGGAGTGCTCGAGTCCCACG
GCAAGAAAGACAGTGTAGCTATTGTAGACAGAAGTAAGCGAAATACCGGCGCTGGCTACTCAGATGCCGCTGCTGCTGCG
ACTCACTCAACGTTGGGAGACGATAATTTTGTACAAAACTACTTCAAGTCTTCTAGACTTCATTTCATAGGAACATGGCG
TAACCGTTACCAAAGTCGGTATGGGTCCTTCAACGATAATAGTGAAGGTCAAGACCAACAAGTTTTTAGAGCTAAATCAG
GAAGTGATCCTGCTGTCATCCACATCGATATGGACTGCTTCTTTGTGGCTGTGGTTGTGAGAGACCGACCTGAACTTCAC
GACAAGCCAGTTGCTGTCTGTCATTCTGATAGTGTACGCGGTAGTGGAGAGATATCATCTGCAAATTACCCAGCAAGAGC
TTTTGGAGTTCGGGCAGGCATGTTTGTGCGTGATGCGAAAGCAAAATGTCCTGACCTGGAGATAGTACCCTATAATTTTG
AGGCTTATGAGCAGGTAGCTGATAAGGCTGTTAGCTGTGATGAGGCCTTTTTGGACGTCACAGATCAAGGGGACCCATGC
CAAATTGCATCAGTGATAAGGAGCCAGATTTTTGAAGCAACCCGGTGCACTGCTAGTGCAGGTGTCTCAAAGAATAAGTT
GTTAGCACGTCTGGCCACACGAAAGGCCAAGCCCAACGGTCTATACTACATTGCACCCTCAGAGGTAGAGGGATTCATGG
ATGAGCTCGCCGTGGAGGATCTTCCTGGTGTTGGATGGACGTTAAAGGAAAAATTAAAATCTCACAATCTTCACAAGTGC
TCTGATCTGCTAAGAGTTTCAAAGGACTTTTTGCAAAGGGAATTCGGTGTGAAAACAGGAGACATGCTGTGGTCTTATGC
ACGAGGAATTGATACTCGTGAAGTTCAACAAGCTCAGATACGCAAGTCAATTGGTGCGGAAGTTAACTGGGGTGTTCGTT
TTCTTGTGCCTGAGGATGCTCATCGTTTTTTGGTAACGTTAAGCACAGAGGTCGCGACACGCTTACAAACTGCTGCTGTC
AAAGGTCGCACTATTACCTTGAAGGTGAAGAGACGGAAAGAAGGCTCTGGGGAGCCTTCCAAATTCATGGGTTGTGGAGT
TTGTGACAACTTCAGTCGTTCTGAAACGGTTGGATACGCTACTGATTCTCAGGAGATCCTACTTCGTGTGGCAAAACAAC
TTTTCAACTCTTTTGCATTCGATGTGCGTGATGTCCGTGGGGTAGGTTTGCAAGTCACAAGGCTTGAAGCAGTTGGTTCT
GGCCAGTCACTGAAAGGTGGACAAGGCCAACAGCGTGCACTGAAGTCATGGTTAGCTCCTGAGATCAGCAAAGTAATGCT
GGATGAAAGTGCTCCTGCGGATCGTAACTCAGGCAATCAGACAATCATACACAGAAGCACTGAGCCACTCAAGACAGAGG
CTGGGGAATCGATTCCTGTCAAAAGTCCAGCTTTAGATGTTCCGAAACTACCTCTCAATACATCAGCGCAAGACAATAAT
GATTTACATGCAGCTGCGGAATCTGGTATGAGGAATTTCATTGCACCAGATACGAAGGATCGCATTCAGGTTGGGAGGAG
TTCAACAAAAGCTGCACCACCTCCAACAAACTTACCCCCGCTCTCTGAATTAGACCCCTCTGTTTTAGCTTCACTTCCGC
CTGAAATTCTGGCTGAAATTCGAGAAGTTTATGGGGAACTCGCAGCCCAGCCTGTGGTCTCCAGTAAGCCTGCAAAAAAT
TATTTATCTCCTGTGGCAGTAATGAAGGGTGCCCATAAAACTAAAACATCTGAGGTAAGTAATAGAGTGGACAGAAGAGA
AGATGGAATTAGAAGGGTTTTACCTTTGGAGCCCTTGACTGAACGAGGAGAATCTTCACGGCCACCTTCAGTCCGCCCCG
AAATAGTAGCACTTCCACCAGCATCACAGTTGGATCCCTCAGTTATGTCTGCGCTCCCGCTGTCTTTGCGAAGAGAGCTT
GAGCAAGAATATAAGCGACAGCAGGTGAAGAAGCCTGATAAGTCGACATTGCGCGTGAGTGAACCGGAAAAGCTCTGCGT
TGAGAGATGTTCATCAACTTCACTTGAAGATCTTTGGACTGGTACACCACCTAAGTGGGTACACATGTTCAAAGAATCCC
AAGATGCAGAATTTAGCAACTTGTGCGTGCTAGCGGAGCACATGGCAGGTGCAGTCAGACCGTTTTCAGTTGTAATGCTG
TCGGTTCTCCCTCATCTTTCGAATGCATGTTCTGCAACAGTAAGCCAGAAAGTTATTCAAAGTTGTATGGAGTTGATTAA
ACAGTATGTTCAACAAATGATAGCGTGCGACTTGGAGGAAGTTTGTATGATAATGCGTGTGCTAAAAAGAGTTGCTGCCA
AGTCTCAACTGTGGAGGGTTGTTGAGGATCTCATTACTCCCTTCGTTCAGGGTATTGTTGGAGAGTGTTATGGAGGTCAC
CTCAAAATCTAA