Microexon ID Pp_26:516011-516022:+
Species Physcomitrium patens
Coordinates 26:516011..516022
Microexon Cluster ID MEP28
Size 12
Phase 0
Pfam Domain Motif Peptidase_M1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 24,19,5,12,48
Microexon location in the Microexon-tag 4
Microexon-tag DNA Seq GTTMGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGRAGTYMAGGR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Pp_26:516011-516022:+ does not have available information here.
Transcript ID Pp3c26_1330V3.1
Protein ID Pp3c26_1330V3.1
Gene ID Pp3c26_1330
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 4.7e-49
Motif start 370
Motif end 574
Protein seq >Pp3c26_1330V3.1
MAAPALSVLLERSPASALYASAADSSSRSGAWIAGSAAGGLVLMPVPRLRRSMSMACLRSSARSLLRAAATKNPTFKSAR
VQRGIVSRQTLRASPLSRSVRALGSQSYFSNNLLHLRDSVSSRVQCTLAAQPQDTMAKDEVVKEGPKEIFLRDYKAPDYA
FEKVDLKFELGEEKTLVTSNIRVLPKSSGAPLILDGDSSLKLVTLKINGSSVPEEDFKITPRKLHLTSLPTGPFELEIVT
EIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYYQDRPDVMSKFTTRIEADKALYPVLLGNGNLIEEGDLTDGKHYAVW
EDPWTKPCYLFALVAGQLVSRDDTFTTMSGRKVVLRIWTPPQDIPKTDHAMKSLINSMKWDEEVFGLEYDLDLFNVVAVP
DFNMGAMENKSLNIFNSRLVLASPETATDSDYAAIEGVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGS
RGVKRISDVSRLRISQFPQDAGPMAHPVRPQSYIKMDNFYTVTVYEKGAEVVRMYQTLLGKEGFRKGMDLYFQRHDGQAV
SCEEFMAAMFDANDAKFPTFPLWYAQAGTPILTVTTSYDADAQSFTIKCKQEVPPTAGQPQKEPMLLPLAVGLLDSQGND
MRLSSVKDGASLHNLSSSDGDFTTTAVLHVDKAEQEFTFLGITEKPVPSLLRNFSAPVRLVSDVTDDELFFLLAHDSDQF
NRWEAGQTLARKLMLDLISKQQKGEELNVNSAFIEGIRSTVTNSSLDKEFVAKCMSLPGESEIADLMEVADPDAIHTVRR
FVIKQIAASLREELLSTVEENRSSAAYDPSHEHRARRSLKNISLSYMASLNEPEITELLVNEYKNATNMTDQVSALAALC
QNPGDARSQALSDFYDQWKEEALVMNKWLALQAMSDVPGNVEHVRSLLEHPAFDIRNPNKVYSLIGGFCASAVNFHAKDG
SGYKFLADIVLELDKLNPQVASRMISAFTRWRRFDEERQALTKAQLERIKSQDGLSDNVFEIASKSLAS*
CDS seq >Pp3c26_1330V3.1
ATGGCGGCGCCCGCTTTGTCAGTGCTACTAGAGCGGAGCCCCGCAAGTGCTCTGTATGCTTCCGCTGCAGACTCGAGCTC
GCGTTCTGGCGCTTGGATTGCTGGGAGTGCAGCGGGAGGGCTGGTGCTTATGCCGGTGCCGCGGTTGCGACGGAGTATGT
CTATGGCATGCTTGCGATCTTCCGCCCGCTCTTTGCTGCGCGCGGCTGCGACTAAGAATCCGACCTTCAAATCAGCTCGG
GTCCAGCGTGGTATCGTTTCTAGGCAGACTTTGAGGGCCAGTCCTCTTTCCAGGAGTGTCAGGGCACTTGGCTCACAGTC
ATATTTCAGCAACAATCTTCTTCACTTGCGGGATTCGGTGTCTAGTCGTGTTCAGTGTACATTAGCAGCCCAGCCTCAGG
ACACCATGGCGAAGGATGAGGTTGTCAAGGAGGGTCCCAAGGAAATCTTTCTTAGGGATTACAAGGCACCAGATTATGCT
TTTGAAAAGGTAGATTTGAAATTCGAGTTGGGGGAGGAGAAGACATTAGTTACATCCAACATTCGCGTCCTTCCTAAATC
TTCTGGTGCCCCATTGATACTTGACGGTGACTCAAGCTTGAAATTAGTTACTTTGAAAATCAATGGCTCATCCGTACCGG
AGGAAGATTTTAAGATCACTCCGCGGAAATTGCACTTGACTTCACTGCCTACTGGTCCCTTTGAGTTGGAAATTGTGACC
GAGATCGAGCCACAGAACAACACTTCTTTAGAAGGGCTGTACAAGTCATCTGGGAATTTCTGCACCCAATGTGAAGCAGA
GGGTTTCCGCAAGATCACCTACTACCAGGACCGTCCTGATGTTATGTCTAAGTTCACAACACGCATAGAAGCCGATAAAG
CCCTTTACCCAGTGTTGTTGGGTAATGGCAACCTGATCGAGGAAGGTGACTTAACAGATGGCAAGCATTATGCAGTGTGG
GAGGATCCATGGACCAAACCTTGCTACTTATTTGCACTGGTAGCTGGTCAACTTGTAAGTCGTGATGATACTTTCACCAC
TATGTCAGGCCGCAAAGTCGTTCTACGAATTTGGACTCCTCCACAAGATATTCCTAAGACCGACCACGCAATGAAGTCTT
TAATCAATTCTATGAAATGGGACGAAGAGGTATTCGGCTTGGAGTATGATCTAGACCTCTTCAACGTTGTTGCTGTGCCA
GATTTCAACATGGGTGCTATGGAGAATAAGAGTTTGAACATTTTCAACTCGAGGTTGGTTTTGGCGTCTCCTGAGACAGC
CACAGACTCAGACTACGCTGCTATTGAGGGTGTTATTGGCCACGAGTATTTTCACAACTGGACAGGAAATCGGGTGACAT
GCAGGGATTGGTTTCAGCTTAGTTTGAAGGAAGGTCTTACTGTATTTAGAGATCAGGAATTCTCATCTGACATGGGTAGC
CGTGGAGTAAAGCGTATTTCTGATGTGTCCCGGCTACGAATTTCTCAATTTCCTCAGGATGCTGGTCCGATGGCACATCC
TGTGCGACCTCAGTCCTATATAAAGATGGATAACTTCTACACCGTTACTGTTTATGAGAAGGGTGCTGAGGTCGTCCGTA
TGTATCAGACTTTGCTTGGAAAGGAAGGCTTTAGAAAGGGTATGGATTTGTACTTCCAAAGGCACGATGGTCAGGCTGTT
TCTTGTGAGGAATTTATGGCGGCCATGTTTGATGCCAATGATGCGAAGTTCCCTACGTTCCCTCTTTGGTATGCTCAAGC
AGGTACTCCGATTCTGACTGTCACCACCTCTTATGACGCTGATGCGCAATCTTTCACGATCAAGTGCAAGCAAGAAGTTC
CCCCCACAGCAGGCCAGCCACAGAAGGAGCCAATGCTTCTACCTCTGGCTGTGGGACTTCTTGACTCACAAGGAAATGAC
ATGCGTCTGAGCTCTGTTAAGGATGGCGCTTCACTTCACAATTTGAGCAGTTCGGATGGTGATTTTACCACTACTGCTGT
ATTGCATGTTGACAAGGCCGAGCAAGAGTTTACATTTTTGGGCATTACTGAGAAGCCAGTCCCATCATTGTTGCGGAACT
TCAGTGCGCCGGTGCGCCTAGTCTCTGATGTTACAGATGACGAGCTATTTTTTCTCCTGGCTCATGATTCTGATCAATTT
AACAGGTGGGAAGCTGGTCAGACTCTAGCTCGCAAGCTTATGCTGGACTTGATTTCTAAGCAACAGAAGGGAGAAGAGCT
TAACGTTAACAGTGCTTTTATTGAGGGCATCCGCAGTACCGTAACGAACTCCTCCCTCGACAAGGAGTTTGTGGCGAAAT
GTATGTCGTTACCTGGAGAAAGTGAGATTGCCGACCTTATGGAGGTGGCTGATCCAGATGCCATACACACTGTTAGACGG
TTTGTTATTAAACAAATTGCTGCGAGCCTTCGCGAGGAACTTCTGAGCACGGTGGAAGAAAATCGTAGTTCTGCTGCTTA
TGACCCCAGCCACGAGCACAGAGCTCGGCGGTCTTTGAAAAATATTTCATTGAGTTACATGGCATCTTTGAATGAACCTG
AGATAACGGAATTGTTAGTGAATGAATATAAGAATGCTACAAACATGACCGATCAAGTTTCTGCTCTAGCTGCACTCTGT
CAGAACCCTGGAGATGCTCGCTCTCAGGCTCTATCAGACTTCTACGACCAGTGGAAGGAAGAGGCTTTGGTCATGAACAA
ATGGCTAGCTTTGCAAGCTATGTCTGATGTTCCAGGCAATGTTGAGCATGTGCGGAGTCTGCTGGAGCACCCTGCTTTTG
ACATCCGTAACCCCAACAAGGTTTATTCTTTGATTGGTGGTTTCTGTGCGTCCGCTGTCAATTTCCACGCCAAGGACGGT
TCAGGGTACAAGTTTTTGGCCGACATTGTTCTTGAGCTGGATAAGCTGAATCCTCAGGTTGCATCACGCATGATTTCGGC
TTTCACTAGATGGAGGAGGTTTGATGAAGAAAGGCAAGCATTGACCAAAGCACAGTTGGAGCGAATAAAAAGCCAGGACG
GTCTATCTGATAATGTTTTTGAGATTGCCTCAAAGAGTCTGGCTTCATAG
Microexon DNA seq GTTTATGAGAAG
Microexon Amino Acid seq VYEK
Microexon-tag DNA Seq GTGCGACCTCAGTCCTATATAAAGATGGATAACTTCTACACCGTTACTGTTTATGAGAAGGGTGCTGAGGTCGTCCGTATGTATCAGACTTTGCTTGGAAAGGAAGGC
Microexon-tag Amino Acid seq VRPQSYIKMDNFYTVTVYEKGAEVVRMYQTLLGKEG
Transcript ID Pp3c26_1330V3.2
Gene ID Pp.16894
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 4.7e-49
Motif start 370
Motif end 574
Protein seq >Pp3c26_1330V3.2
MAAPALSVLLERSPASALYASAADSSSRSGAWIAGSAAGGLVLMPVPRLRRSMSMACLRSSARSLLRAAATKNPTFKSAR
VQRGIVSRQTLRASPLSRSVRALGSQSYFSNNLLHLRDSVSSRVQCTLAAQPQDTMAKDEVVKEGPKEIFLRDYKAPDYA
FEKVDLKFELGEEKTLVTSNIRVLPKSSGAPLILDGDSSLKLVTLKINGSSVPEEDFKITPRKLHLTSLPTGPFELEIVT
EIEPQNNTSLEGLYKSSGNFCTQCEAEGFRKITYYQDRPDVMSKFTTRIEADKALYPVLLGNGNLIEEGDLTDGKHYAVW
EDPWTKPCYLFALVAGQLVSRDDTFTTMSGRKVVLRIWTPPQDIPKTDHAMKSLINSMKWDEEVFGLEYDLDLFNVVAVP
DFNMGAMENKSLNIFNSRLVLASPETATDSDYAAIEGVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGS
RGVKRISDVSRLRISQFPQDAGPMAHPVRPQSYIKMDNFYTVTVYEKGAEVVRMYQTLLGKEGFRKGMDLYFQRHDGQAV
SCEEFMAAMFDANDAKFPTFPLWYAQAGTPILTVTTSYDADAQSFTIKCKQEVPPTAGQPQKEPMLLPLAVGLLDSQGND
MRLSSVKDGASLHNLSSSDGDFTTTAVLHVDKAEQEFTFLGITEKPVPSLLRNFSAPVRLVSDVTDDELFFLLAHDSDQF
NRWEAGQTLARKLMLDLISKQQKGEELNVNSAFIEGIRSTVTNSSLDKEFVAKCMSLPGESEIADLMEVADPDAIHTVRR
FVIKQIAASLREELLSTVEENRSSAAYDPSHEHRARRSLKNISLSYMASLNEPEITELLVNEYKNATNMTDQVSALAALC
QNPGDARSQALSDFYDQWKEEALVMNKWLALQAMSDVPGNVEHVRSLLEHPAFDIRNPNKVYSLIGGFCASAVNFHAKDG
SGYKFLADIVLELDKLNPQVASRMISAFTRWRRFDEERQALTKAQLERIKSQDGLSDNVFEIASKSLAS*
CDS seq >Pp3c26_1330V3.2
ATGGCGGCGCCCGCTTTGTCAGTGCTACTAGAGCGGAGCCCCGCAAGTGCTCTGTATGCTTCCGCTGCAGACTCGAGCTC
GCGTTCTGGCGCTTGGATTGCTGGGAGTGCAGCGGGAGGGCTGGTGCTTATGCCGGTGCCGCGGTTGCGACGGAGTATGT
CTATGGCATGCTTGCGATCTTCCGCCCGCTCTTTGCTGCGCGCGGCTGCGACTAAGAATCCGACCTTCAAATCAGCTCGG
GTCCAGCGTGGTATCGTTTCTAGGCAGACTTTGAGGGCCAGTCCTCTTTCCAGGAGTGTCAGGGCACTTGGCTCACAGTC
ATATTTCAGCAACAATCTTCTTCACTTGCGGGATTCGGTGTCTAGTCGTGTTCAGTGTACATTAGCAGCCCAGCCTCAGG
ACACCATGGCGAAGGATGAGGTTGTCAAGGAGGGTCCCAAGGAAATCTTTCTTAGGGATTACAAGGCACCAGATTATGCT
TTTGAAAAGGTAGATTTGAAATTCGAGTTGGGGGAGGAGAAGACATTAGTTACATCCAACATTCGCGTCCTTCCTAAATC
TTCTGGTGCCCCATTGATACTTGACGGTGACTCAAGCTTGAAATTAGTTACTTTGAAAATCAATGGCTCATCCGTACCGG
AGGAAGATTTTAAGATCACTCCGCGGAAATTGCACTTGACTTCACTGCCTACTGGTCCCTTTGAGTTGGAAATTGTGACC
GAGATCGAGCCACAGAACAACACTTCTTTAGAAGGGCTGTACAAGTCATCTGGGAATTTCTGCACCCAATGTGAAGCAGA
GGGTTTCCGCAAGATCACCTACTACCAGGACCGTCCTGATGTTATGTCTAAGTTCACAACACGCATAGAAGCCGATAAAG
CCCTTTACCCAGTGTTGTTGGGTAATGGCAACCTGATCGAGGAAGGTGACTTAACAGATGGCAAGCATTATGCAGTGTGG
GAGGATCCATGGACCAAACCTTGCTACTTATTTGCACTGGTAGCTGGTCAACTTGTAAGTCGTGATGATACTTTCACCAC
TATGTCAGGCCGCAAAGTCGTTCTACGAATTTGGACTCCTCCACAAGATATTCCTAAGACCGACCACGCAATGAAGTCTT
TAATCAATTCTATGAAATGGGACGAAGAGGTATTCGGCTTGGAGTATGATCTAGACCTCTTCAACGTTGTTGCTGTGCCA
GATTTCAACATGGGTGCTATGGAGAATAAGAGTTTGAACATTTTCAACTCGAGGTTGGTTTTGGCGTCTCCTGAGACAGC
CACAGACTCAGACTACGCTGCTATTGAGGGTGTTATTGGCCACGAGTATTTTCACAACTGGACAGGAAATCGGGTGACAT
GCAGGGATTGGTTTCAGCTTAGTTTGAAGGAAGGTCTTACTGTATTTAGAGATCAGGAATTCTCATCTGACATGGGTAGC
CGTGGAGTAAAGCGTATTTCTGATGTGTCCCGGCTACGAATTTCTCAATTTCCTCAGGATGCTGGTCCGATGGCACATCC
TGTGCGACCTCAGTCCTATATAAAGATGGATAACTTCTACACCGTTACTGTTTATGAGAAGGGTGCTGAGGTCGTCCGTA
TGTATCAGACTTTGCTTGGAAAGGAAGGCTTTAGAAAGGGTATGGATTTGTACTTCCAAAGGCACGATGGTCAGGCTGTT
TCTTGTGAGGAATTTATGGCGGCCATGTTTGATGCCAATGATGCGAAGTTCCCTACGTTCCCTCTTTGGTATGCTCAAGC
AGGTACTCCGATTCTGACTGTCACCACCTCTTATGACGCTGATGCGCAATCTTTCACGATCAAGTGCAAGCAAGAAGTTC
CCCCCACAGCAGGCCAGCCACAGAAGGAGCCAATGCTTCTACCTCTGGCTGTGGGACTTCTTGACTCACAAGGAAATGAC
ATGCGTCTGAGCTCTGTTAAGGATGGCGCTTCACTTCACAATTTGAGCAGTTCGGATGGTGATTTTACCACTACTGCTGT
ATTGCATGTTGACAAGGCCGAGCAAGAGTTTACATTTTTGGGCATTACTGAGAAGCCAGTCCCATCATTGTTGCGGAACT
TCAGTGCGCCGGTGCGCCTAGTCTCTGATGTTACAGATGACGAGCTATTTTTTCTCCTGGCTCATGATTCTGATCAATTT
AACAGGTGGGAAGCTGGTCAGACTCTAGCTCGCAAGCTTATGCTGGACTTGATTTCTAAGCAACAGAAGGGAGAAGAGCT
TAACGTTAACAGTGCTTTTATTGAGGGCATCCGCAGTACCGTAACGAACTCCTCCCTCGACAAGGAGTTTGTGGCGAAAT
GTATGTCGTTACCTGGAGAAAGTGAGATTGCCGACCTTATGGAGGTGGCTGATCCAGATGCCATACACACTGTTAGACGG
TTTGTTATTAAACAAATTGCTGCGAGCCTTCGCGAGGAACTTCTGAGCACGGTGGAAGAAAATCGTAGTTCTGCTGCTTA
TGACCCCAGCCACGAGCACAGAGCTCGGCGGTCTTTGAAAAATATTTCATTGAGTTACATGGCATCTTTGAATGAACCTG
AGATAACGGAATTGTTAGTGAATGAATATAAGAATGCTACAAACATGACCGATCAAGTTTCTGCTCTAGCTGCACTCTGT
CAGAACCCTGGAGATGCTCGCTCTCAGGCTCTATCAGACTTCTACGACCAGTGGAAGGAAGAGGCTTTGGTCATGAACAA
ATGGCTAGCTTTGCAAGCTATGTCTGATGTTCCAGGCAATGTTGAGCATGTGCGGAGTCTGCTGGAGCACCCTGCTTTTG
ACATCCGTAACCCCAACAAGGTTTATTCTTTGATTGGTGGTTTCTGTGCGTCCGCTGTCAATTTCCACGCCAAGGACGGT
TCAGGGTACAAGTTTTTGGCCGACATTGTTCTTGAGCTGGATAAGCTGAATCCTCAGGTTGCATCACGCATGATTTCGGC
TTTCACTAGATGGAGGAGGTTTGATGAAGAAAGGCAAGCATTGACCAAAGCACAGTTGGAGCGAATAAAAAGCCAGGACG
GTCTATCTGATAATGTTTTTGAGATTGCCTCAAAGAGTCTGGCTTCATAG