Microexon ID Pp_18:5033402-5033412:-
Species Physcomitrium patens
Coordinates 18:5033402..5033412
Microexon Cluster ID MEP27
Size 11
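The Size field agrees with the coordinates if they are read as 1-based and inclusive: 5033412 - 5033402 + 1 = 11. A minimal Python sketch of that check follows. The `<prefix>_<chrom>:<start>-<end>:<strand>` layout of the ID is inferred from this record only, and `parse_microexon_id` is a hypothetical helper, not part of any database API.

```python
import re

def parse_microexon_id(microexon_id):
    """Split an ID such as 'Pp_18:5033402-5033412:-' into its parts.

    Assumes the layout <prefix>_<chrom>:<start>-<end>:<strand> with
    1-based inclusive coordinates (an assumption, see the text above).
    """
    m = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if m is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "prefix": prefix,
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "size": end - start + 1,  # inclusive interval length
    }

rec = parse_microexon_id("Pp_18:5033402-5033412:-")
print(rec["chrom"], rec["strand"], rec["size"])  # 18 - 11
```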
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
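The sequence above contains letters other than A, C, G and T, which read as IUPAC nucleotide ambiguity codes. It is presumably a consensus for the microexon cluster (an assumption, the record does not say so). Under that reading, the sketch below lists the positions where the Physcomitrium patens microexon-tag (the second Microexon-tag DNA Seq field in this record) falls outside the consensus. A cluster consensus need not cover every member at every position. The function name `mismatches` is illustrative.

```python
# IUPAC nucleotide ambiguity codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def mismatches(consensus, seq):
    """Return 1-based positions where seq is not allowed by the consensus."""
    if len(consensus) != len(seq):
        raise ValueError("sequences must have the same length")
    return [
        i
        for i, (c, s) in enumerate(zip(consensus.upper(), seq.upper()), start=1)
        if s not in IUPAC[c]
    ]

CONSENSUS = (
    "MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAG"
    "GTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT"
)
TAG = (
    "AGAGATGTTCCGAAGGATCCACGACTGTATGCTTGCAGTCTGTCACAAGGAAATTTGAAG"
    "GTGATCGAAGTACATAATTTCACACAGGATGATCTTCTGACGGAAGAT"
)

positions = mismatches(CONSENSUS, TAG)
print(len(CONSENSUS), len(TAG))  # 108 108
print(positions)                 # positions not covered by the consensus
```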
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAATTTGAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AGAGATGTTCCGAAGGATCCACGACTGTATGCTTGCAGTCTGTCACAAGGAAATTTGAAGGTGATCGAAGTACATAATTTCACACAGGATGATCTTCTGACGGAAGAT
Microexon-tag Amino Acid Seq RDVPKDPRLYACSLSQGNLKVIEVHNFTQDDLLTED
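The fields above fit together as follows: the structure `49,11,48` sums to the 108 nt of the microexon-tag, the middle 11 nt are the microexon, and translating the tag in frame gives the 36-residue amino acid sequence. A minimal Python sketch of that consistency check, using the standard genetic code; the helper names `split_tag` and `translate` are illustrative. Since 49 mod 3 = 1, the microexon starts at the second base of a codon, which is consistent with the Phase 1 field if phase counts the bases of the split codon contributed by the upstream exon (an assumption about the database's convention).

```python
from itertools import product

# Standard genetic code in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for n in sizes:
        parts.append(tag[pos:pos + n])
        pos += n
    if pos != len(tag):
        raise ValueError("sizes do not add up to the tag length")
    return parts

def translate(dna):
    """Translate complete codons from the first base onward."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

TAG = (
    "AGAGATGTTCCGAAGGATCCACGACTGTATGCTTGCAGTCTGTCACAAGGAAATTTGAAG"
    "GTGATCGAAGTACATAATTTCACACAGGATGATCTTCTGACGGAAGAT"
)
upstream, microexon, downstream = split_tag(TAG, (49, 11, 48))
tag_protein = translate(TAG)

# Codons that overlap the microexon (0-based nucleotide span 49..59).
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
micro_aa = tag_protein[first_codon:last_codon + 1]

print(microexon)    # GAAATTTGAAG
print(tag_protein)  # RDVPKDPRLYACSLSQGNLKVIEVHNFTQDDLLTED
print(micro_aa)     # GNLK
```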
Microexon-tag spanning region 5033125-5033705
Microexon-tag prediction score 0.899
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c18_7160V3.1x
Reference Transcript ID Pp3c18_7160V3.1
Gene ID Pp3c18_7160
Gene Name NA
Transcript ID Pp3c18_7160V3.1
Protein ID Pp3c18_7160V3.1
Gene ID Pp3c18_7160
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c18_7160V3.1
MAVSMKNVDIAFQGVGQKPGMEIWRIEDFKPTPLPTESYGKFYSGDSYIVLRTTALKTGGFHYDIHFWLGKNTTQDEAGT
AAIKTVELDAALGGRAVQYRETQEHETDLFLSYFKPCIIPLEGGVASGFNKVEVEKVEPRLFIVKGRRAVRVSQVPFARS
SLNHNDVFVLDTESTIFQFNGATSSIQERGKALEVVQYIKDTYHDGKCEVIIIDDGTLGSEADTGQFWVLFGGFAPLARK
AAVADDAPKLTKPKLFCIIEASFKEVEISKDILDSSKCYLLDCGNELYIWAGRNTSLDARKAAVSTVENFITNEKRPKHS
QIIRIIEGFETLEFRSHFDNWPLHEQYPISEEGRGKVAALLKQQGLNTKGILKGSPVREESPSLPSLSGKLEVWRIVCGM
KKQIAAEEIGRFYENSCYIVLYTYQGEERKEEYLLCNWSGRHSPLEDKDASLKVMKDMSVALKGRAVQAYVAQGREPIQF
LALFKCMCILKEPPSLGQKDNNAVMLVRVRAAGPKIVQAVQVEPSSASLNSSDCFLLQTNSKLYAWSGNLSTFESQKASL
LVAEILKPGVIARAMKEGLEPPLFWSSLGGKRKYASQREARDVPKDPRLYACSLSQGNLKVIEVHNFTQDDLLTEDIMIL
DCHNIIYEWIGHNTSTDNKEHSLSIAKRFLERAEKLDGAQPDTPIFILAEGYEPIFFTSFFSWDSSKVNVNGDAYSRKLA
GLQGRRIPSEKPQRHLTSSSSVGAKDESTQRAAAMAALSSQLTKEGKFSKVVQNIINQNNSASAPVSPRFHRPSTANSQR
AAAMAALSLMFGTKKAGLASSVSVDSDWIAGSSPFTKMEASGDTESVTSSKSEDGGDEGEEITEFFSYDRLKASPTDPDL
KINVKRKEAYLSPEDFEKLFGMPRSQFYELPKWKQDQRKRNLQLF*
CDS seq >Pp3c18_7160V3.1
ATGGCTGTGTCCATGAAGAATGTCGACATTGCTTTTCAAGGAGTTGGGCAGAAACCAGGAATGGAAATCTGGCGCATTGA
AGATTTCAAACCAACGCCCTTACCTACGGAATCCTATGGAAAATTTTACTCCGGCGATTCTTACATTGTCCTCAGGACCA
CAGCACTAAAGACTGGAGGTTTCCACTACGACATTCACTTTTGGCTAGGAAAGAACACAACCCAGGATGAAGCTGGCACT
GCGGCAATTAAAACAGTTGAGCTGGACGCTGCTTTAGGTGGTCGCGCTGTTCAATACAGAGAAACTCAAGAACATGAAAC
GGATCTTTTTCTGTCTTACTTCAAACCTTGTATCATTCCTTTAGAAGGGGGTGTTGCTTCTGGATTCAACAAAGTGGAGG
TTGAGAAGGTGGAGCCTCGTTTGTTCATTGTCAAAGGGAGGCGCGCAGTTCGAGTTTCACAGGTCCCCTTCGCCCGTTCC
TCTCTAAACCATAATGATGTCTTTGTTCTGGATACGGAATCAACAATCTTTCAGTTCAACGGAGCAACGTCTAGCATCCA
AGAGCGGGGAAAAGCTTTAGAAGTTGTCCAATATATCAAGGATACATATCACGATGGAAAATGTGAAGTTATCATTATAG
ATGATGGTACGCTCGGGTCCGAGGCAGACACAGGGCAGTTTTGGGTTTTATTCGGGGGTTTCGCTCCACTTGCAAGGAAA
GCTGCCGTTGCAGATGATGCTCCTAAGTTGACGAAACCTAAGCTTTTCTGTATCATTGAAGCCAGCTTCAAGGAAGTAGA
AATCTCTAAAGACATATTGGACAGCAGCAAGTGTTATCTGCTTGATTGTGGTAATGAGCTCTACATATGGGCGGGTCGTA
ATACATCTCTTGATGCACGAAAGGCTGCTGTTTCAACAGTAGAGAATTTCATCACTAATGAGAAAAGACCAAAGCACAGT
CAGATTATCCGAATCATTGAGGGATTTGAAACATTAGAGTTTCGGTCACATTTTGACAACTGGCCATTACATGAACAGTA
TCCTATCTCTGAAGAAGGAAGGGGCAAAGTTGCAGCTTTGTTGAAGCAGCAAGGCCTCAACACGAAGGGCATTCTTAAGG
GTTCGCCTGTCCGAGAAGAATCACCATCACTTCCAAGTTTGAGTGGCAAGCTTGAGGTTTGGAGGATAGTATGTGGTATG
AAGAAACAAATTGCTGCTGAAGAAATTGGAAGATTTTACGAAAACAGCTGCTACATTGTACTGTATACTTATCAAGGAGA
AGAACGTAAAGAGGAATACCTTCTTTGCAATTGGAGTGGCCGGCATTCCCCTCTGGAAGATAAGGATGCGTCCCTGAAAG
TTATGAAAGACATGAGTGTAGCACTTAAAGGGCGTGCAGTGCAGGCTTACGTTGCACAAGGGAGGGAACCCATTCAGTTT
TTGGCGCTGTTCAAATGCATGTGCATTCTGAAGGAGCCTCCCAGTTTAGGCCAAAAAGACAATAACGCAGTAATGTTGGT
GCGGGTGCGAGCTGCTGGTCCGAAAATTGTACAAGCCGTACAAGTAGAGCCTTCATCCGCTTCGCTGAACTCCTCCGATT
GCTTCCTGCTTCAAACCAACTCAAAATTGTATGCCTGGTCAGGCAATCTTAGTACTTTTGAGAGTCAAAAGGCGAGTCTG
CTAGTGGCGGAAATTTTGAAGCCTGGTGTTATAGCAAGGGCTATGAAAGAGGGTTTAGAGCCTCCGCTATTTTGGAGTTC
TCTTGGAGGGAAACGGAAATATGCAAGTCAGAGAGAAGCAAGAGATGTTCCGAAGGATCCACGACTGTATGCTTGCAGTC
TGTCACAAGGAAATTTGAAGGTGATCGAAGTACATAATTTCACACAGGATGATCTTCTGACGGAAGATATCATGATCCTG
GACTGTCACAATATTATATACGAGTGGATTGGCCATAATACGAGCACAGACAACAAAGAGCATTCCTTAAGCATTGCCAA
GAGATTTCTTGAACGAGCAGAAAAATTGGATGGAGCACAACCTGATACTCCCATCTTTATACTTGCAGAAGGCTACGAAC
CAATATTTTTCACCTCCTTCTTCTCATGGGATTCCAGCAAGGTCAATGTCAATGGAGACGCATATTCAAGAAAGCTTGCA
GGGCTTCAAGGACGACGTATTCCTTCAGAGAAACCCCAAAGACATCTTACATCAAGTTCTTCAGTTGGAGCCAAAGACGA
ATCCACACAACGGGCTGCAGCTATGGCAGCTCTCTCATCGCAGTTGACGAAAGAGGGGAAGTTTTCCAAGGTTGTCCAAA
ATATAATCAATCAGAATAATTCAGCCTCTGCTCCAGTGAGTCCTAGATTTCATCGACCATCAACTGCTAACTCTCAAAGA
GCTGCTGCAATGGCAGCTCTATCGTTAATGTTTGGTACGAAAAAAGCAGGATTAGCCTCTTCAGTTTCAGTTGATTCTGA
TTGGATCGCTGGAAGTTCGCCGTTCACGAAAATGGAGGCATCAGGGGATACAGAATCTGTAACAAGCTCTAAGTCTGAAG
ATGGAGGGGATGAAGGGGAGGAAATCACCGAATTTTTCAGTTATGATCGCCTGAAAGCATCGCCCACGGATCCTGATTTA
AAAATAAATGTAAAAAGAAAAGAGGCTTACTTATCTCCCGAAGATTTTGAGAAGCTGTTCGGAATGCCCAGAAGCCAGTT
TTATGAGTTGCCAAAGTGGAAACAGGATCAACGCAAACGCAATTTACAACTTTTTTAG
Transcript ID Pp3c18_7160V3.1
Gene ID Pp.9411
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA