
Microexon ID | Pp_14:4075744-4075750:+ |
Species | Physcomitrium patens | Coordinates | 14:4075744..4075750 |
Microexon Cluster ID | MEP13 |
Size | 7 |
Phase | 0 |
Pfam Domain Motif | DNA_ligase_A_M |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 13,38,7,50 |
Microexon location in the Microexon-tag | 3 |
Microexon-tag DNA Seq | AAYCAGGAAATWGCMAAGGCWGCAARRGAKGGRYTKGMSASTGATMGACAGTTRTGYTATGTTGCWTTTGAYRTTCTKTATGYTGGAGAYACYAGYGTYATYCAYCAR |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Pp_14:4075744-4075750:+ does not have available information here.
Transcript ID | Pp3c14_5920V3.1 |
Protein ID | Pp3c14_5920V3.1 |
Gene ID | Pp3c14_5920 |
Gene Name | NA |
Pfam domain motif | DNA_ligase_A_M |
Motif E-value | 7.3e-42 |
Motif start | 229 |
Motif end | 445 |
Protein seq | >Pp3c14_5920V3.1 MKRMEETVEFGVLCSMFEAIVRCKKGALKRKHVRTFLEHVYNGQEHFSAMRLILPDLDKERANYGLREAVLAKFLADALG LSKESEDAKKLINWRKGGQRAGSNAGNFSMVASEVLCRRQKTAPGGLMIKEVNDLLDRLAAAQDKEEKTAVLAELINKTN AQEMRWIIMIILKDLKLGISEKTVFSEFHPDAEDYFNVTCDLKLVCEKLRDRSIRYKRQDIEVGKAVRPQLAARAANVED AWKKMRGKDVVVECKFDGDRIQVHKNGNNLNFWSRTFNDHPEFKEAIGDVLCQRIIPEKCILDGEMLVWDRVSQRFAEFG SNRGVAKAAKDGLDSDLQICYVAFDILYDGDSSVIHRPLRERQRLLQNAIRPLSGRLEVLIPAAGGLNAKHAVGDPRWSI LATSPEEVEQFFQETIDNREEGVVLKDLDSKWEPSDRSGKWLKLKPDYVHSESDLDALIIGGYFGTGRRGGEVAQFLLGL AEPSEVGGYPTKFRSFCRVGTGLTDDEAEQLVHKLKPHFRRNHKSTKPPSCYVLTNSSKERPDVWIERPEKSVILQITSD IRTIRSEVFATPYSLRFPRIQRIRYDKPWYDCLDVQTLVDTVHAKSQNDGQAEGGNQFKYRARRAKQEKPERASLVPSHM LVTDVSHVKQATRIFKGLVFYVANTSSEYPVEKIHKLVVENGGTFSMNLSSTVTHAVAFEKKGLKYQASFMNGDVIHLSW LLDCMAQKVLLVVSPKYYLNMSDATKERLKGEVDEFGDFYCNEVDETDLTQIFENIDVNKQYHDIDRVNYYIKKYSPSPT WCLFSGCCVFFLRPLHSANENTLQVANLTLRRLALEVEMHEGTVSDKLTRSITHVVMYVPTESPVSFQIILRSASSEERR LLLSKRVKVVSHRWIEDSVNRSIYRHPADEEYDLRTGEVLLSDEEEEELVSKTSVPIKADSMVPITPDKRVVPRSLRKPA TRRRALNLGRPSRRSKGTEEELPRVEAQKSGKEEHVEDDRAPTEITNKGTRNDPKAVEPFQLDDSASRTPEKRVLRVTRS RKKDKPSSGNERLPLKRRLIHDEGEASLTQPSDKALTGAEPFIPRMSTRLAEKRKAFESKREADTYKLECTAADLREEDQ EEPVEASEFEGLQNTSSLKEPTGRDDSRLHFNDFDHQSPAIDDMINTLLPNYASSQQANLSQQTNRESALPSRSNSFFIK PTLAPQPTQGVLGSYSRPYLPSHCASEALQPVPDAPDNSDLFSSVHRLASSQDHLTGTVTVESSSQGLATYLPPTNNPEK KKRSFKDLVNQMLNGN* |
CDS seq | >Pp3c14_5920V3.1 ATGAAGAGGATGGAGGAAACTGTGGAGTTCGGAGTATTGTGTAGCATGTTTGAGGCCATCGTTAGGTGTAAGAAGGGAGC TCTGAAGCGGAAACATGTGCGGACGTTTTTGGAGCATGTTTACAATGGCCAAGAGCATTTTAGCGCTATGCGTTTGATTC TTCCCGACTTGGATAAAGAGCGTGCCAACTATGGACTCCGCGAGGCTGTGTTGGCCAAATTCTTAGCCGATGCTCTGGGG CTGTCTAAAGAATCTGAGGACGCTAAGAAACTTATCAATTGGCGTAAAGGAGGCCAAAGAGCCGGCAGCAATGCCGGCAA TTTCTCTATGGTTGCGTCTGAGGTGCTCTGCAGGAGGCAGAAAACAGCACCTGGTGGTCTAATGATTAAAGAGGTCAATG ATCTTCTCGATCGCTTAGCGGCAGCACAAGACAAGGAAGAGAAAACAGCTGTGCTAGCTGAGCTCATCAACAAGACAAAT GCTCAGGAAATGAGGTGGATTATTATGATTATTCTGAAAGATTTGAAACTCGGGATTAGTGAGAAGACAGTTTTCAGCGA GTTTCATCCCGACGCGGAAGATTATTTCAATGTCACATGCGATCTGAAGTTGGTTTGTGAAAAGCTTCGTGATCGCAGCA TACGTTACAAGCGCCAGGATATCGAAGTGGGGAAGGCAGTGCGACCACAATTAGCCGCAAGGGCTGCCAATGTCGAAGAT GCCTGGAAGAAGATGCGAGGCAAAGACGTTGTGGTAGAATGCAAGTTTGATGGAGACCGTATTCAAGTGCACAAAAATGG AAATAATCTGAACTTCTGGTCAAGGACGTTTAATGACCATCCAGAGTTCAAGGAGGCTATTGGTGACGTATTGTGTCAAC GTATCATTCCCGAGAAGTGTATACTTGACGGAGAGATGTTGGTTTGGGATCGAGTGTCGCAAAGATTTGCTGAGTTTGGC TCTAATCGTGGAGTTGCAAAGGCTGCTAAGGATGGCTTGGATTCTGATCTTCAGATTTGTTATGTTGCATTCGATATTCT GTATGATGGTGACAGCAGTGTTATCCATCGTCCGTTGCGTGAACGGCAACGGCTCCTTCAGAACGCTATTCGTCCTCTCA GTGGCCGTCTGGAGGTTCTTATACCTGCAGCCGGAGGGCTTAATGCAAAGCATGCTGTTGGAGATCCGAGGTGGTCAATC TTGGCCACAAGTCCCGAGGAGGTGGAACAGTTTTTTCAAGAGACGATCGACAATAGGGAGGAAGGGGTTGTGTTAAAAGA CCTAGATTCCAAATGGGAGCCCAGTGATCGTAGTGGAAAATGGCTTAAACTCAAGCCTGATTACGTTCACTCTGAATCTG ACCTTGATGCGCTGATAATAGGTGGTTATTTTGGAACTGGACGGCGTGGTGGTGAGGTTGCTCAATTTTTATTGGGATTG GCTGAGCCGTCAGAAGTGGGTGGCTACCCTACAAAATTTCGTTCCTTCTGCAGAGTGGGTACAGGCCTCACTGATGATGA GGCTGAGCAGTTAGTGCACAAGCTCAAACCCCACTTCAGGAGGAATCATAAAAGCACCAAACCACCGAGCTGTTATGTGC TGACTAACTCATCCAAAGAGCGGCCTGATGTCTGGATTGAACGACCTGAAAAGTCCGTAATACTTCAAATCACAAGTGAT ATCCGAACCATACGCTCTGAGGTATTTGCAACGCCGTATAGTCTACGGTTTCCTCGAATCCAAAGAATTCGATATGATAA ACCTTGGTATGACTGCCTCGATGTTCAGACATTAGTAGATACGGTACATGCAAAGAGCCAGAACGATGGTCAAGCGGAAG GTGGGAATCAATTCAAATATCGAGCAAGACGGGCCAAACAGGAAAAGCCTGAACGAGCATCATTGGTCCCCTCCCACATG TTGGTAACAGACGTATCGCATGTAAAGCAAGCCACTCGCATCTTCAAGGGCCTTGTCTTCTACGTTGCCAACACTTCCAG TGAGTACCCAGTTGAGAAAATTCACAAATTGGTTGTAGAGAATGGAGGCACCTTCTCTATGAACTTGAGCAGTACTGTTA CTCACGCGGTCGCATTCGAAAAGAAAGGACTCAAGTATCAGGCATCTTTTATGAATGGGGATGTCATTCATCTATCATGG CTTTTAGACTGCATGGCTCAGAAAGTACTCCTTGTAGTAAGCCCCAAGTATTACTTGAATATGTCAGATGCCACAAAGGA AAGACTGAAAGGTGAAGTTGACGAATTTGGAGATTTTTACTGCAACGAAGTTGATGAAACTGACCTTACACAGATCTTCG AAAACATAGATGTGAACAAGCAGTACCACGACATTGACAGAGTGAATTACTACATCAAGAAATATTCCCCATCTCCAACT TGGTGCCTTTTCTCCGGCTGCTGCGTCTTCTTTCTCCGTCCTCTTCACTCCGCGAATGAAAATACTCTACAGGTGGCAAA CCTAACATTGAGAAGACTTGCACTCGAGGTGGAGATGCACGAAGGCACCGTTTCTGACAAATTGACGCGTAGCATCACAC ACGTTGTGATGTATGTACCTACGGAAAGTCCCGTATCGTTCCAGATTATATTACGAAGTGCGTCATCGGAGGAAAGAAGG CTGCTATTATCGAAGCGGGTCAAAGTTGTAAGCCATCGCTGGATAGAGGATTCTGTTAACAGAAGTATCTACCGTCACCC CGCAGACGAAGAGTATGATCTCAGGACTGGAGAAGTGCTTTTGTCCGATGAGGAAGAAGAAGAGCTGGTGTCAAAAACTT CAGTCCCCATAAAGGCTGACTCGATGGTACCCATCACCCCAGACAAGAGGGTTGTACCTAGATCTCTTCGAAAACCAGCT ACGCGGAGACGAGCCCTGAACCTTGGAAGGCCTAGTAGAAGATCCAAAGGAACAGAGGAGGAACTTCCAAGAGTAGAGGC CCAAAAGTCGGGCAAAGAAGAGCACGTAGAGGATGACAGAGCACCAACAGAAATAACCAATAAGGGAACCAGGAATGATC CGAAGGCAGTGGAACCTTTTCAATTAGATGATTCTGCTTCTCGTACACCAGAAAAGAGAGTTCTTAGAGTTACTCGAAGC AGGAAAAAAGACAAACCATCATCTGGGAATGAGCGCCTACCTTTAAAACGACGTCTCATTCATGATGAAGGTGAGGCATC TCTGACCCAACCTTCAGACAAGGCACTCACAGGAGCGGAACCTTTCATACCAAGAATGTCGACCAGATTGGCTGAGAAAA GAAAAGCATTCGAAAGCAAGAGAGAAGCCGACACATATAAGCTAGAATGTACTGCAGCGGATCTTCGTGAGGAAGACCAA GAAGAGCCCGTGGAAGCTTCAGAATTCGAGGGGTTGCAGAACACCAGTTCTTTGAAAGAGCCAACTGGAAGGGACGACAG CAGATTACATTTCAACGATTTCGACCATCAAAGCCCAGCCATTGATGATATGATCAATACTCTACTTCCAAATTACGCCT CATCACAACAGGCTAACTTATCTCAACAAACGAACAGAGAGAGTGCCTTGCCCTCAAGGTCGAACTCATTTTTTATCAAA CCGACGTTGGCCCCACAACCCACTCAGGGAGTTCTAGGTAGCTACTCTCGACCATATCTCCCTTCACATTGTGCATCTGA AGCCCTCCAGCCGGTACCAGATGCTCCAGATAACTCTGATTTATTTTCATCAGTGCATCGTTTGGCAAGTTCACAAGACC ACTTGACTGGGACTGTGACGGTTGAATCATCCTCACAGGGGTTGGCAACTTATCTACCTCCGACCAACAACCCTGAGAAG AAGAAAAGAAGCTTTAAGGATCTGGTCAACCAGATGCTCAACGGTAACTGA |
Microexon DNA seq | ATTTGTT |
Microexon Amino Acid seq | ICY |
Microexon-tag DNA Seq | AATCGTGGAGTTGCAAAGGCTGCTAAGGATGGCTTGGATTCTGATCTTCAGATTTGTTATGTTGCATTCGATATTCTGTATGATGGTGACAGCAGTGTTATCCATCGT |
Microexon-tag Amino Acid seq | NRGVAKAAKDGLDSDLQICYVAFDILYDGDSSVIHR |
Transcript ID | Pp3c14_5920V3.1 |
Gene ID | Pp.5522 |
Gene Name | NA |
Pfam domain motif | DNA_ligase_A_M |
Motif E-value | 7.3e-42 |
Motif start | 229 |
Motif end | 445 |
Protein seq | >Pp3c14_5920V3.1 MKRMEETVEFGVLCSMFEAIVRCKKGALKRKHVRTFLEHVYNGQEHFSAMRLILPDLDKERANYGLREAVLAKFLADALG LSKESEDAKKLINWRKGGQRAGSNAGNFSMVASEVLCRRQKTAPGGLMIKEVNDLLDRLAAAQDKEEKTAVLAELINKTN AQEMRWIIMIILKDLKLGISEKTVFSEFHPDAEDYFNVTCDLKLVCEKLRDRSIRYKRQDIEVGKAVRPQLAARAANVED AWKKMRGKDVVVECKFDGDRIQVHKNGNNLNFWSRTFNDHPEFKEAIGDVLCQRIIPEKCILDGEMLVWDRVSQRFAEFG SNRGVAKAAKDGLDSDLQICYVAFDILYDGDSSVIHRPLRERQRLLQNAIRPLSGRLEVLIPAAGGLNAKHAVGDPRWSI LATSPEEVEQFFQETIDNREEGVVLKDLDSKWEPSDRSGKWLKLKPDYVHSESDLDALIIGGYFGTGRRGGEVAQFLLGL AEPSEVGGYPTKFRSFCRVGTGLTDDEAEQLVHKLKPHFRRNHKSTKPPSCYVLTNSSKERPDVWIERPEKSVILQITSD IRTIRSEVFATPYSLRFPRIQRIRYDKPWYDCLDVQTLVDTVHAKSQNDGQAEGGNQFKYRARRAKQEKPERASLVPSHM LVTDVSHVKQATRIFKGLVFYVANTSSEYPVEKIHKLVVENGGTFSMNLSSTVTHAVAFEKKGLKYQASFMNGDVIHLSW LLDCMAQKVLLVVSPKYYLNMSDATKERLKGEVDEFGDFYCNEVDETDLTQIFENIDVNKQYHDIDRVNYYIKKYSPSPT WCLFSGCCVFFLRPLHSANENTLQVANLTLRRLALEVEMHEGTVSDKLTRSITHVVMYVPTESPVSFQIILRSASSEERR LLLSKRVKVVSHRWIEDSVNRSIYRHPADEEYDLRTGEVLLSDEEEEELVSKTSVPIKADSMVPITPDKRVVPRSLRKPA TRRRALNLGRPSRRSKGTEEELPRVEAQKSGKEEHVEDDRAPTEITNKGTRNDPKAVEPFQLDDSASRTPEKRVLRVTRS RKKDKPSSGNERLPLKRRLIHDEGEASLTQPSDKALTGAEPFIPRMSTRLAEKRKAFESKREADTYKLECTAADLREEDQ EEPVEASEFEGLQNTSSLKEPTGRDDSRLHFNDFDHQSPAIDDMINTLLPNYASSQQANLSQQTNRESALPSRSNSFFIK PTLAPQPTQGVLGSYSRPYLPSHCASEALQPVPDAPDNSDLFSSVHRLASSQDHLTGTVTVESSSQGLATYLPPTNNPEK KKRSFKDLVNQMLNGN* |
CDS seq | >Pp3c14_5920V3.1 ATGAAGAGGATGGAGGAAACTGTGGAGTTCGGAGTATTGTGTAGCATGTTTGAGGCCATCGTTAGGTGTAAGAAGGGAGC TCTGAAGCGGAAACATGTGCGGACGTTTTTGGAGCATGTTTACAATGGCCAAGAGCATTTTAGCGCTATGCGTTTGATTC TTCCCGACTTGGATAAAGAGCGTGCCAACTATGGACTCCGCGAGGCTGTGTTGGCCAAATTCTTAGCCGATGCTCTGGGG CTGTCTAAAGAATCTGAGGACGCTAAGAAACTTATCAATTGGCGTAAAGGAGGCCAAAGAGCCGGCAGCAATGCCGGCAA TTTCTCTATGGTTGCGTCTGAGGTGCTCTGCAGGAGGCAGAAAACAGCACCTGGTGGTCTAATGATTAAAGAGGTCAATG ATCTTCTCGATCGCTTAGCGGCAGCACAAGACAAGGAAGAGAAAACAGCTGTGCTAGCTGAGCTCATCAACAAGACAAAT GCTCAGGAAATGAGGTGGATTATTATGATTATTCTGAAAGATTTGAAACTCGGGATTAGTGAGAAGACAGTTTTCAGCGA GTTTCATCCCGACGCGGAAGATTATTTCAATGTCACATGCGATCTGAAGTTGGTTTGTGAAAAGCTTCGTGATCGCAGCA TACGTTACAAGCGCCAGGATATCGAAGTGGGGAAGGCAGTGCGACCACAATTAGCCGCAAGGGCTGCCAATGTCGAAGAT GCCTGGAAGAAGATGCGAGGCAAAGACGTTGTGGTAGAATGCAAGTTTGATGGAGACCGTATTCAAGTGCACAAAAATGG AAATAATCTGAACTTCTGGTCAAGGACGTTTAATGACCATCCAGAGTTCAAGGAGGCTATTGGTGACGTATTGTGTCAAC GTATCATTCCCGAGAAGTGTATACTTGACGGAGAGATGTTGGTTTGGGATCGAGTGTCGCAAAGATTTGCTGAGTTTGGC TCTAATCGTGGAGTTGCAAAGGCTGCTAAGGATGGCTTGGATTCTGATCTTCAGATTTGTTATGTTGCATTCGATATTCT GTATGATGGTGACAGCAGTGTTATCCATCGTCCGTTGCGTGAACGGCAACGGCTCCTTCAGAACGCTATTCGTCCTCTCA GTGGCCGTCTGGAGGTTCTTATACCTGCAGCCGGAGGGCTTAATGCAAAGCATGCTGTTGGAGATCCGAGGTGGTCAATC TTGGCCACAAGTCCCGAGGAGGTGGAACAGTTTTTTCAAGAGACGATCGACAATAGGGAGGAAGGGGTTGTGTTAAAAGA CCTAGATTCCAAATGGGAGCCCAGTGATCGTAGTGGAAAATGGCTTAAACTCAAGCCTGATTACGTTCACTCTGAATCTG ACCTTGATGCGCTGATAATAGGTGGTTATTTTGGAACTGGACGGCGTGGTGGTGAGGTTGCTCAATTTTTATTGGGATTG GCTGAGCCGTCAGAAGTGGGTGGCTACCCTACAAAATTTCGTTCCTTCTGCAGAGTGGGTACAGGCCTCACTGATGATGA GGCTGAGCAGTTAGTGCACAAGCTCAAACCCCACTTCAGGAGGAATCATAAAAGCACCAAACCACCGAGCTGTTATGTGC TGACTAACTCATCCAAAGAGCGGCCTGATGTCTGGATTGAACGACCTGAAAAGTCCGTAATACTTCAAATCACAAGTGAT ATCCGAACCATACGCTCTGAGGTATTTGCAACGCCGTATAGTCTACGGTTTCCTCGAATCCAAAGAATTCGATATGATAA ACCTTGGTATGACTGCCTCGATGTTCAGACATTAGTAGATACGGTACATGCAAAGAGCCAGAACGATGGTCAAGCGGAAG GTGGGAATCAATTCAAATATCGAGCAAGACGGGCCAAACAGGAAAAGCCTGAACGAGCATCATTGGTCCCCTCCCACATG TTGGTAACAGACGTATCGCATGTAAAGCAAGCCACTCGCATCTTCAAGGGCCTTGTCTTCTACGTTGCCAACACTTCCAG TGAGTACCCAGTTGAGAAAATTCACAAATTGGTTGTAGAGAATGGAGGCACCTTCTCTATGAACTTGAGCAGTACTGTTA CTCACGCGGTCGCATTCGAAAAGAAAGGACTCAAGTATCAGGCATCTTTTATGAATGGGGATGTCATTCATCTATCATGG CTTTTAGACTGCATGGCTCAGAAAGTACTCCTTGTAGTAAGCCCCAAGTATTACTTGAATATGTCAGATGCCACAAAGGA AAGACTGAAAGGTGAAGTTGACGAATTTGGAGATTTTTACTGCAACGAAGTTGATGAAACTGACCTTACACAGATCTTCG AAAACATAGATGTGAACAAGCAGTACCACGACATTGACAGAGTGAATTACTACATCAAGAAATATTCCCCATCTCCAACT TGGTGCCTTTTCTCCGGCTGCTGCGTCTTCTTTCTCCGTCCTCTTCACTCCGCGAATGAAAATACTCTACAGGTGGCAAA CCTAACATTGAGAAGACTTGCACTCGAGGTGGAGATGCACGAAGGCACCGTTTCTGACAAATTGACGCGTAGCATCACAC ACGTTGTGATGTATGTACCTACGGAAAGTCCCGTATCGTTCCAGATTATATTACGAAGTGCGTCATCGGAGGAAAGAAGG CTGCTATTATCGAAGCGGGTCAAAGTTGTAAGCCATCGCTGGATAGAGGATTCTGTTAACAGAAGTATCTACCGTCACCC CGCAGACGAAGAGTATGATCTCAGGACTGGAGAAGTGCTTTTGTCCGATGAGGAAGAAGAAGAGCTGGTGTCAAAAACTT CAGTCCCCATAAAGGCTGACTCGATGGTACCCATCACCCCAGACAAGAGGGTTGTACCTAGATCTCTTCGAAAACCAGCT ACGCGGAGACGAGCCCTGAACCTTGGAAGGCCTAGTAGAAGATCCAAAGGAACAGAGGAGGAACTTCCAAGAGTAGAGGC CCAAAAGTCGGGCAAAGAAGAGCACGTAGAGGATGACAGAGCACCAACAGAAATAACCAATAAGGGAACCAGGAATGATC CGAAGGCAGTGGAACCTTTTCAATTAGATGATTCTGCTTCTCGTACACCAGAAAAGAGAGTTCTTAGAGTTACTCGAAGC AGGAAAAAAGACAAACCATCATCTGGGAATGAGCGCCTACCTTTAAAACGACGTCTCATTCATGATGAAGGTGAGGCATC TCTGACCCAACCTTCAGACAAGGCACTCACAGGAGCGGAACCTTTCATACCAAGAATGTCGACCAGATTGGCTGAGAAAA GAAAAGCATTCGAAAGCAAGAGAGAAGCCGACACATATAAGCTAGAATGTACTGCAGCGGATCTTCGTGAGGAAGACCAA GAAGAGCCCGTGGAAGCTTCAGAATTCGAGGGGTTGCAGAACACCAGTTCTTTGAAAGAGCCAACTGGAAGGGACGACAG CAGATTACATTTCAACGATTTCGACCATCAAAGCCCAGCCATTGATGATATGATCAATACTCTACTTCCAAATTACGCCT CATCACAACAGGCTAACTTATCTCAACAAACGAACAGAGAGAGTGCCTTGCCCTCAAGGTCGAACTCATTTTTTATCAAA CCGACGTTGGCCCCACAACCCACTCAGGGAGTTCTAGGTAGCTACTCTCGACCATATCTCCCTTCACATTGTGCATCTGA AGCCCTCCAGCCGGTACCAGATGCTCCAGATAACTCTGATTTATTTTCATCAGTGCATCGTTTGGCAAGTTCACAAGACC ACTTGACTGGGACTGTGACGGTTGAATCATCCTCACAGGGGTTGGCAACTTATCTACCTCCGACCAACAACCCTGAGAAG AAGAAAAGAAGCTTTAAGGATCTGGTCAACCAGATGCTCAACGGTAACTGA |