Microexon ID Pp_22:11378446-11378452:+
Species Physcomitrium patens
Coordinates 22:11378446..11378452
Microexon Cluster ID MEP15
Size 7
Phase 2
Pfam Domain Motif GBP
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 50,7,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq ATATCTMGRYTSTCATTTGCTGTTGARMTTGCTGAAGARTTYTATGGAAGAGTGAAGGGRCAAGATGTTGCWTTTGARCCWGCWAARCTYYTGTGGCTTATCCARMGT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GGTTAAG
Microexon Amino Acid seq RVK
Microexon-tag DNA Seq ATCGCCAAGCTGTCGTTTGCAGTAGACTTGGCAGAAGAGTTTTATGGTCGGGTTAAGGGAAGAGAAAGTATTTTGGAACCTGCTAAGCTGCTCTGGTTGATCCAACGC
Microexon-tag Amino Acid Seq IAKLSFAVDLAEEFYGRVKGRESILEPAKLLWLIQR
Microexon-tag spanning region11378188-11378696
Microexon-tag prediction score0.8615
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c22_17680V3.1x
Reference Transcript ID Pp3c22_17680V3.1
Gene ID Pp3c22_17680
Gene Name NA
Transcript ID Pp3c22_17680V3.1
Protein ID Pp3c22_17680V3.1
Gene ID Pp3c22_17680
Gene Name NA
Pfam domain motif GBP
Motif E-value 9.8e-37
Motif start 59
Motif end 300
Protein seq >Pp3c22_17680V3.1
MDGRQSKTCSRVAFLSFVLCYFIGVVGYGDMSGSHLRGSTSSQFHKAFSIVEPDPGHTKLAVSRSGLNAIARITNPIAVV
AVIGPYRSGKSFLLNQLLSLSCNEGFGVGHMRDTKTRGIWAWGEPLEINLDGVKTSVLFLDTEGFESIGKSNVYDDRIFA
LAAIMSSVLIYNLPETVREADIAKLSFAVDLAEEFYGRVKGRESILEPAKLLWLIQRDFLEGKSVQAMVEEALQTVPNPD
NNKDIFQVNRIRKSLSLMAENSTAFSLPQPHLERTKLCEMGDSELHHSYVSQRERLKKVVTSMIRPKIVQGGTITGKDFV
SLLEQTLDALNKGEIPSAGYVVEAFNRAVVDRCVALYNTRMSKFQLPVREEELLSGHESTVHEVTSLFGKERFGRQKDND
ESSRMLLMQVEKAYVTLVEINTFRSSRQCDNIYTSCEDQLDSLQGLRLPSMAKFSAGITSCNITFTKNCLGPSKPVYQKR
LEKMWLKARSHFINNYNQRLFNWLVVLSLVMVVVGRFVIKFLLLEIAAWGLFIFLETYTRIFWSAESLYYNPTWRVVVSA
WEATVYGPLLDLDRWFIPLSWIIFFGFITYRLKFGRRKKGPAPLLPSVDIPRTSRRSRFLKM*
CDS seq >Pp3c22_17680V3.1
ATGGATGGGAGGCAAAGTAAAACGTGCTCAAGGGTGGCCTTTCTTTCCTTCGTACTCTGCTATTTCATCGGAGTGGTTGG
ATATGGAGATATGTCCGGAAGTCATTTACGCGGTTCTACTTCCTCACAATTCCACAAAGCGTTTTCAATCGTGGAGCCGG
ATCCAGGGCACACAAAATTAGCGGTATCTAGAAGCGGTTTGAATGCTATAGCGCGAATCACGAATCCAATTGCGGTTGTT
GCGGTCATCGGACCTTATCGTTCTGGAAAATCATTTCTCCTCAATCAGCTTCTGTCACTTTCCTGCAATGAAGGGTTTGG
CGTTGGCCACATGCGGGACACGAAGACGAGAGGTATTTGGGCGTGGGGGGAGCCTCTGGAAATCAACCTCGACGGTGTGA
AGACCTCGGTCTTATTCTTGGACACGGAAGGCTTTGAGAGTATTGGAAAGTCCAATGTTTATGATGATAGGATATTTGCA
CTAGCAGCTATAATGAGTTCCGTTCTTATTTATAATTTGCCGGAGACGGTACGTGAGGCTGATATCGCCAAGCTGTCGTT
TGCAGTAGACTTGGCAGAAGAGTTTTATGGTCGGGTTAAGGGAAGAGAAAGTATTTTGGAACCTGCTAAGCTGCTCTGGT
TGATCCAACGCGACTTCCTTGAGGGAAAGTCAGTTCAGGCGATGGTGGAAGAAGCGCTGCAGACCGTTCCCAATCCTGAC
AACAACAAGGACATATTCCAGGTGAATCGCATACGGAAATCCTTGTCGTTAATGGCAGAGAATAGCACAGCTTTTAGCTT
ACCTCAGCCACACTTGGAGCGTACAAAGCTTTGTGAAATGGGAGACTCAGAACTTCATCATTCATACGTGAGCCAGAGGG
AGCGCTTAAAAAAGGTTGTGACTTCTATGATTCGGCCAAAAATAGTCCAAGGGGGGACAATCACTGGAAAAGATTTCGTC
TCTCTTCTAGAACAGACGCTGGATGCTCTAAACAAAGGGGAGATTCCGTCAGCTGGCTATGTGGTGGAAGCTTTCAACAG
AGCTGTAGTTGATCGATGTGTAGCGTTGTACAATACTCGTATGTCCAAATTCCAGTTGCCTGTTCGTGAAGAAGAGCTCC
TCAGTGGTCATGAGAGTACCGTTCACGAGGTCACTAGCTTGTTTGGAAAGGAGCGATTTGGGAGGCAAAAAGATAACGAC
GAATCGTCAAGAATGCTTCTCATGCAAGTGGAGAAGGCTTATGTAACTCTCGTGGAAATTAATACTTTTAGATCGTCAAG
ACAGTGTGACAATATCTACACATCTTGTGAAGATCAACTAGATAGTCTACAGGGGCTGCGGTTACCTTCGATGGCGAAAT
TCAGTGCTGGAATAACATCATGCAACATCACATTCACCAAAAACTGCCTGGGACCTTCAAAGCCTGTGTATCAAAAGCGT
TTGGAGAAGATGTGGTTGAAGGCAAGGAGCCATTTCATAAACAACTACAACCAGCGTCTTTTCAACTGGCTTGTAGTTCT
GTCTTTAGTCATGGTGGTTGTAGGTCGCTTTGTTATCAAATTCTTGCTCCTGGAGATTGCAGCTTGGGGGCTCTTCATTT
TCTTAGAAACCTACACACGCATATTTTGGTCTGCCGAGTCCCTTTACTACAACCCCACTTGGCGAGTAGTGGTCTCGGCG
TGGGAAGCAACTGTTTACGGCCCATTGCTTGATTTGGACAGGTGGTTTATCCCTCTGTCTTGGATTATATTTTTTGGATT
CATAACATACCGATTGAAATTCGGGAGGCGTAAGAAAGGGCCAGCACCCCTGCTGCCAAGTGTCGACATTCCCAGAACCT
CGAGAAGAAGTCGATTTCTGAAAATGTAG
Microexon DNA seq GGTTAAG
Microexon Amino Acid seq RVK
Microexon-tag DNA Seq ATCGCCAAGCTGTCGTTTGCAGTAGACTTGGCAGAAGAGTTTTATGGTCGGGTTAAGGGAAGAGAAAGTATTTTGGAACCTGCTAAGCTGCTCTGGTTGATCCAACGC
Microexon-tag Amino Acid seq IAKLSFAVDLAEEFYGRVKGRESILEPAKLLWLIQR
Transcript ID Pp3c22_17680V3.1
Gene ID Pp.14569
Gene Name NA
Pfam domain motif GBP
Motif E-value 9.8e-37
Motif start 59
Motif end 300
Protein seq >Pp3c22_17680V3.1
MDGRQSKTCSRVAFLSFVLCYFIGVVGYGDMSGSHLRGSTSSQFHKAFSIVEPDPGHTKLAVSRSGLNAIARITNPIAVV
AVIGPYRSGKSFLLNQLLSLSCNEGFGVGHMRDTKTRGIWAWGEPLEINLDGVKTSVLFLDTEGFESIGKSNVYDDRIFA
LAAIMSSVLIYNLPETVREADIAKLSFAVDLAEEFYGRVKGRESILEPAKLLWLIQRDFLEGKSVQAMVEEALQTVPNPD
NNKDIFQVNRIRKSLSLMAENSTAFSLPQPHLERTKLCEMGDSELHHSYVSQRERLKKVVTSMIRPKIVQGGTITGKDFV
SLLEQTLDALNKGEIPSAGYVVEAFNRAVVDRCVALYNTRMSKFQLPVREEELLSGHESTVHEVTSLFGKERFGRQKDND
ESSRMLLMQVEKAYVTLVEINTFRSSRQCDNIYTSCEDQLDSLQGLRLPSMAKFSAGITSCNITFTKNCLGPSKPVYQKR
LEKMWLKARSHFINNYNQRLFNWLVVLSLVMVVVGRFVIKFLLLEIAAWGLFIFLETYTRIFWSAESLYYNPTWRVVVSA
WEATVYGPLLDLDRWFIPLSWIIFFGFITYRLKFGRRKKGPAPLLPSVDIPRTSRRSRFLKM*
CDS seq >Pp3c22_17680V3.1
ATGGATGGGAGGCAAAGTAAAACGTGCTCAAGGGTGGCCTTTCTTTCCTTCGTACTCTGCTATTTCATCGGAGTGGTTGG
ATATGGAGATATGTCCGGAAGTCATTTACGCGGTTCTACTTCCTCACAATTCCACAAAGCGTTTTCAATCGTGGAGCCGG
ATCCAGGGCACACAAAATTAGCGGTATCTAGAAGCGGTTTGAATGCTATAGCGCGAATCACGAATCCAATTGCGGTTGTT
GCGGTCATCGGACCTTATCGTTCTGGAAAATCATTTCTCCTCAATCAGCTTCTGTCACTTTCCTGCAATGAAGGGTTTGG
CGTTGGCCACATGCGGGACACGAAGACGAGAGGTATTTGGGCGTGGGGGGAGCCTCTGGAAATCAACCTCGACGGTGTGA
AGACCTCGGTCTTATTCTTGGACACGGAAGGCTTTGAGAGTATTGGAAAGTCCAATGTTTATGATGATAGGATATTTGCA
CTAGCAGCTATAATGAGTTCCGTTCTTATTTATAATTTGCCGGAGACGGTACGTGAGGCTGATATCGCCAAGCTGTCGTT
TGCAGTAGACTTGGCAGAAGAGTTTTATGGTCGGGTTAAGGGAAGAGAAAGTATTTTGGAACCTGCTAAGCTGCTCTGGT
TGATCCAACGCGACTTCCTTGAGGGAAAGTCAGTTCAGGCGATGGTGGAAGAAGCGCTGCAGACCGTTCCCAATCCTGAC
AACAACAAGGACATATTCCAGGTGAATCGCATACGGAAATCCTTGTCGTTAATGGCAGAGAATAGCACAGCTTTTAGCTT
ACCTCAGCCACACTTGGAGCGTACAAAGCTTTGTGAAATGGGAGACTCAGAACTTCATCATTCATACGTGAGCCAGAGGG
AGCGCTTAAAAAAGGTTGTGACTTCTATGATTCGGCCAAAAATAGTCCAAGGGGGGACAATCACTGGAAAAGATTTCGTC
TCTCTTCTAGAACAGACGCTGGATGCTCTAAACAAAGGGGAGATTCCGTCAGCTGGCTATGTGGTGGAAGCTTTCAACAG
AGCTGTAGTTGATCGATGTGTAGCGTTGTACAATACTCGTATGTCCAAATTCCAGTTGCCTGTTCGTGAAGAAGAGCTCC
TCAGTGGTCATGAGAGTACCGTTCACGAGGTCACTAGCTTGTTTGGAAAGGAGCGATTTGGGAGGCAAAAAGATAACGAC
GAATCGTCAAGAATGCTTCTCATGCAAGTGGAGAAGGCTTATGTAACTCTCGTGGAAATTAATACTTTTAGATCGTCAAG
ACAGTGTGACAATATCTACACATCTTGTGAAGATCAACTAGATAGTCTACAGGGGCTGCGGTTACCTTCGATGGCGAAAT
TCAGTGCTGGAATAACATCATGCAACATCACATTCACCAAAAACTGCCTGGGACCTTCAAAGCCTGTGTATCAAAAGCGT
TTGGAGAAGATGTGGTTGAAGGCAAGGAGCCATTTCATAAACAACTACAACCAGCGTCTTTTCAACTGGCTTGTAGTTCT
GTCTTTAGTCATGGTGGTTGTAGGTCGCTTTGTTATCAAATTCTTGCTCCTGGAGATTGCAGCTTGGGGGCTCTTCATTT
TCTTAGAAACCTACACACGCATATTTTGGTCTGCCGAGTCCCTTTACTACAACCCCACTTGGCGAGTAGTGGTCTCGGCG
TGGGAAGCAACTGTTTACGGCCCATTGCTTGATTTGGACAGGTGGTTTATCCCTCTGTCTTGGATTATATTTTTTGGATT
CATAACATACCGATTGAAATTCGGGAGGCGTAAGAAAGGGCCAGCACCCCTGCTGCCAAGTGTCGACATTCCCAGAACCT
CGAGAAGAAGTCGATTTCTGAAAATGTAG