Microexon ID Pp_1:20496374-20496388:+
Species Physcomitrium patens
Coordinates 1:20496374..20496388
Microexon Cluster ID MEP41
Size 15
Phase 0
Pfam Domain Motif DUF974
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
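The ID, Size, and Microexon-tag structure fields above can be cross-checked against each other. The following is a minimal sketch, assuming the ID follows a `species_chromosome:start-end:strand` layout with 1-based inclusive coordinates. That layout is inferred from this record, not taken from a documented specification, and the helper name `parse_microexon_id` is hypothetical.

```python
def parse_microexon_id(microexon_id):
    """Split an ID such as 'Pp_1:20496374-20496388:+' into its parts.

    Assumes the layout species_chrom:start-end:strand seen in this record.
    """
    species, locus = microexon_id.split("_", 1)
    chrom, span, strand = locus.split(":")
    start, end = (int(x) for x in span.split("-"))
    return {"species": species, "chrom": chrom,
            "start": start, "end": end, "strand": strand}


record = parse_microexon_id("Pp_1:20496374-20496388:+")

# Assuming 1-based inclusive coordinates, the span length should equal Size.
size = record["end"] - record["start"] + 1

# Microexon-tag structure: flanking exon, microexon, flanking exon (nt).
tag_structure = [48, 15, 45]
microexon_location = 2  # 1-based position of the microexon within the tag

tag_length = sum(tag_structure)
print(size, tag_structure[microexon_location - 1], tag_length, tag_length // 3)
```

With the values in this record, the coordinate span and the second tag element both come to 15 nt, and the 108-nt tag corresponds to 36 codons.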
Microexon-tag DNA Seq CARTTYTTCAAGTTYATTGTTKCWAAYCCACTTTCWGTTAGRACAAAGGTYCGYRYTRTCAAGGAAACTACMTWTYTRGARGCTTGYATWGARAAYCATACAAAATCA
Microexon DNA seq GTGCGCATAGTGAAG
Microexon Amino Acid seq VRIVK
Microexon-tag DNA Seq CAATATTTTAAGTTTATGGCATCAAATCCTCTCTCAGTCCGGACGAAGGTGCGCATAGTGAAGGATACTACATATCTGGAAGCTTGCATAGAGAATAGCACAAAATCA
Microexon-tag Amino Acid Seq QYFKFMASNPLSVRTKVRIVKDTTYLEACIENSTKS
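The amino acid fields above can be reproduced from the DNA fields by translating in frame. The sketch below assumes the standard genetic code and treats Phase 0 as "start translating at the first base". The `translate` helper and its `offset` parameter are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate DNA starting at `offset`, ignoring a trailing partial codon."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


microexon_dna = "GTGCGCATAGTGAAG"
tag_dna = (
    "CAATATTTTAAGTTTATGGCATCAAATCCTCTCTCAGTCCGGACGAAG"  # 5' flanking exon, 48 nt
    "GTGCGCATAGTGAAG"                                   # microexon, 15 nt
    "GATACTACATATCTGGAAGCTTGCATAGAGAATAGCACAAAATCA"     # 3' flanking exon, 45 nt
)

microexon_aa = translate(microexon_dna)
tag_aa = translate(tag_dna)
print(microexon_aa)
print(tag_aa)
```

Because the 5' flanking exon is 48 nt long (a multiple of three), the microexon stays in frame inside the tag, so its five residues appear unchanged at residues 17 to 21 of the tag peptide.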
Microexon-tag spanning region 20496109-20496627
Microexon-tag prediction score 0.9329
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c1_28670V3.1x
Reference Transcript ID Pp3c1_28670V3.1
Gene ID Pp3c1_28670
Gene Name NA
Transcript ID Pp3c1_28670V3.1
Protein ID Pp3c1_28670V3.1
Gene ID Pp3c1_28670
Gene Name NA
Pfam domain motif DUF974
Motif E-value 8.3e-59
Motif start 81
Motif end 311
Protein seq >Pp3c1_28670V3.1
MSSGPGGTGHSLAFRVMRLCRPALQVDLGLRFDPMDLVQGEDLHDSEELQASIESRDKEGPYWRRSELEKPIDALGLPGL
LVLPQTFGSIYLGESFCSYISVGNHSNHDVRDVGIKAELQTERQRVTLYDNTKAPMDFICAGGRHDFIIEHDIKELGPHT
LVCMAVYTDADAERKYLPQYFKFMASNPLSVRTKVRIVKDTTYLEACIENSTKSLLFLDHVRFDPQPPMTVSVLEVESNE
NDESEGPLSGLLKQIKVIKANGGTRHFLYQFHKPAGVPVSTKADGSNTLGKLEIMWRTTLGEPGRLQTQQILGNPSPRKE
VSLRIVEIPSRILLERPFLVRMSVSNHTDRTVGPLQISMSQDDAQGVPRAIVVNGLWSMTVPQLDPLASTDVNLSLVATA
VGVQKITGVGLTDRRDGKPYDALTATEVFVESE*
CDS seq >Pp3c1_28670V3.1
ATGAGTTCGGGGCCAGGAGGGACGGGCCACTCCCTGGCCTTTCGCGTCATGCGGCTATGTCGCCCGGCATTGCAAGTAGA
TCTGGGACTTAGGTTCGACCCTATGGATCTGGTCCAAGGAGAAGATTTACATGACAGCGAGGAGTTGCAAGCGAGTATTG
AATCCCGGGATAAGGAAGGGCCTTATTGGAGAAGAAGTGAATTGGAAAAACCGATCGATGCTTTAGGGCTTCCTGGACTC
TTAGTGCTCCCACAGACCTTCGGCTCCATCTACTTGGGCGAGTCTTTTTGCAGCTACATTAGCGTGGGCAACCACTCAAA
CCACGACGTTCGAGATGTTGGAATCAAGGCGGAGTTGCAAACTGAGAGGCAGCGTGTAACCTTGTATGATAATACGAAGG
CGCCCATGGATTTTATTTGCGCGGGGGGCCGTCATGACTTCATTATTGAGCATGATATCAAAGAGCTCGGACCTCATACG
TTGGTTTGCATGGCGGTTTACACAGACGCTGATGCAGAACGAAAGTACCTGCCACAATATTTTAAGTTTATGGCATCAAA
TCCTCTCTCAGTCCGGACGAAGGTGCGCATAGTGAAGGATACTACATATCTGGAAGCTTGCATAGAGAATAGCACAAAAT
CACTTCTTTTCTTGGACCACGTGCGCTTCGATCCACAGCCTCCTATGACTGTGTCTGTTTTGGAAGTGGAGAGCAATGAA
AATGATGAATCTGAAGGTCCATTAAGTGGCCTCTTGAAACAAATTAAAGTCATAAAAGCAAATGGTGGTACCCGCCATTT
TCTTTACCAGTTCCATAAACCAGCAGGAGTACCTGTGTCAACTAAGGCAGATGGCAGTAACACTCTTGGGAAGTTGGAAA
TCATGTGGCGGACTACTCTTGGAGAACCTGGTCGATTGCAAACACAACAAATTTTGGGCAATCCATCACCTCGAAAGGAA
GTGAGTCTTCGTATTGTGGAGATTCCCTCTCGCATCCTTCTGGAGAGACCCTTTCTGGTCCGAATGAGTGTAAGCAACCA
CACTGACAGGACTGTTGGCCCTCTTCAGATTTCAATGTCTCAAGATGATGCTCAGGGTGTGCCCAGAGCAATTGTTGTGA
ATGGCCTTTGGAGCATGACTGTGCCGCAATTAGATCCATTGGCTTCCACGGATGTTAATCTGAGTTTGGTTGCTACCGCG
GTTGGTGTCCAGAAGATCACCGGCGTAGGGTTGACTGATCGGCGAGATGGCAAGCCCTATGATGCTTTGACAGCTACTGA
GGTGTTTGTGGAGTCCGAGTGA
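The protein and CDS records above make it possible to place the microexon relative to the DUF974 motif. The following is a rough sketch, assuming Motif start and Motif end are 1-based residue positions on the protein (the record does not state the convention). Only the first 240 residues of Pp3c1_28670V3.1 are included, which is enough to contain the microexon-tag peptide.

```python
# First 240 residues of Pp3c1_28670V3.1 (first three lines of the protein record).
protein_prefix = (
    "MSSGPGGTGHSLAFRVMRLCRPALQVDLGLRFDPMDLVQGEDLHDSEELQASIESRDKEGPYWRRSELEKPIDALGLPGL"
    "LVLPQTFGSIYLGESFCSYISVGNHSNHDVRDVGIKAELQTERQRVTLYDNTKAPMDFICAGGRHDFIIEHDIKELGPHT"
    "LVCMAVYTDADAERKYLPQYFKFMASNPLSVRTKVRIVKDTTYLEACIENSTKSLLFLDHVRFDPQPPMTVSVLEVESNE"
)
tag_peptide = "QYFKFMASNPLSVRTKVRIVKDTTYLEACIENSTKS"
microexon_peptide = "VRIVK"
motif_start, motif_end = 81, 311  # assumed 1-based, inclusive

# Anchor on the full tag peptide so a chance match to the 5-mer elsewhere is avoided.
tag_offset = protein_prefix.find(tag_peptide)  # 0-based index, -1 if absent
if tag_offset < 0:
    raise ValueError("microexon-tag peptide not found in protein")

micro_offset = tag_peptide.find(microexon_peptide)
micro_start = tag_offset + micro_offset + 1          # 1-based residue number
micro_end = micro_start + len(microexon_peptide) - 1

inside_motif = motif_start <= micro_start and micro_end <= motif_end

# Corresponding 1-based nucleotide positions in the CDS.
cds_start = (micro_start - 1) * 3 + 1
cds_end = micro_end * 3

print(micro_start, micro_end, inside_motif, cds_start, cds_end)
```

Under these assumptions the microexon encodes residues 195 to 199 (CDS positions 583 to 597), which fall inside the reported motif range of 81 to 311.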
Microexon DNA seq GTGCGCATAGTGAAG
Microexon Amino Acid seq VRIVK
Microexon-tag DNA Seq CAATATTTTAAGTTTATGGCATCAAATCCTCTCTCAGTCCGGACGAAGGTGCGCATAGTGAAGGATACTACATATCTGGAAGCTTGCATAGAGAATAGCACAAAATCA
Microexon-tag Amino Acid Seq QYFKFMASNPLSVRTKVRIVKDTTYLEACIENSTKS
Transcript ID Pp.1093.1
Gene ID Pp.1093
Gene Name NA
Pfam domain motif DUF974
Motif E-value 8.3e-59
Motif start 81
Motif end 311
Protein seq >Pp.1093.1
MSSGPGGTGHSLAFRVMRLCRPALQVDLGLRFDPMDLVQGEDLHDSEELQASIESRDKEGPYWRRSELEKPIDALGLPGL
LVLPQTFGSIYLGESFCSYISVGNHSNHDVRDVGIKAELQTERQRVTLYDNTKAPMDFICAGGRHDFIIEHDIKELGPHT
LVCMAVYTDADAERKYLPQYFKFMASNPLSVRTKVRIVKDTTYLEACIENSTKSLLFLDHVRFDPQPPMTVSVLEVESNE
NDESEGPLSGLLKQIKVIKANGGTRHFLYQFHKPAGVPVSTKADGSNTLGKLEIMWRTTLGEPGRLQTQQILGNPSPRKE
VSLRIVEIPSRILLERPFLVRMSVSNHTDRTVGPLQISMSQDDAQGVPRAIVVNGLWSMTVPQLDPLASTDVNLSLVATA
VGVQKITGVGLTDRRDGKPYDALTATEVFVESE*
CDS seq >Pp.1093.1
ATGAGTTCGGGGCCAGGAGGGACGGGCCACTCCCTGGCCTTTCGCGTCATGCGGCTATGTCGCCCGGCATTGCAAGTAGA
TCTGGGACTTAGGTTCGACCCTATGGATCTGGTCCAAGGAGAAGATTTACATGACAGCGAGGAGTTGCAAGCGAGTATTG
AATCCCGGGATAAGGAAGGGCCTTATTGGAGAAGAAGTGAATTGGAAAAACCGATCGATGCTTTAGGGCTTCCTGGACTC
TTAGTGCTCCCACAGACCTTCGGCTCCATCTACTTGGGCGAGTCTTTTTGCAGCTACATTAGCGTGGGCAACCACTCAAA
CCACGACGTTCGAGATGTTGGAATCAAGGCGGAGTTGCAAACTGAGAGGCAGCGTGTAACCTTGTATGATAATACGAAGG
CGCCCATGGATTTTATTTGCGCGGGGGGCCGTCATGACTTCATTATTGAGCATGATATCAAAGAGCTCGGACCTCATACG
TTGGTTTGCATGGCGGTTTACACAGACGCTGATGCAGAACGAAAGTACCTGCCACAATATTTTAAGTTTATGGCATCAAA
TCCTCTCTCAGTCCGGACGAAGGTGCGCATAGTGAAGGATACTACATATCTGGAAGCTTGCATAGAGAATAGCACAAAAT
CACTTCTTTTCTTGGACCACGTGCGCTTCGATCCACAGCCTCCTATGACTGTGTCTGTTTTGGAAGTGGAGAGCAATGAA
AATGATGAATCTGAAGGTCCATTAAGTGGCCTCTTGAAACAAATTAAAGTCATAAAAGCAAATGGTGGTACCCGCCATTT
TCTTTACCAGTTCCATAAACCAGCAGGAGTACCTGTGTCAACTAAGGCAGATGGCAGTAACACTCTTGGGAAGTTGGAAA
TCATGTGGCGGACTACTCTTGGAGAACCTGGTCGATTGCAAACACAACAAATTTTGGGCAATCCATCACCTCGAAAGGAA
GTGAGTCTTCGTATTGTGGAGATTCCCTCTCGCATCCTTCTGGAGAGACCCTTTCTGGTCCGAATGAGTGTAAGCAACCA
CACTGACAGGACTGTTGGCCCTCTTCAGATTTCAATGTCTCAAGATGATGCTCAGGGTGTGCCCAGAGCAATTGTTGTGA
ATGGCCTTTGGAGCATGACTGTGCCGCAATTAGATCCATTGGCTTCCACGGATGTTAATCTGAGTTTGGTTGCTACCGCG
GTTGGTGTCCAGAAGATCACCGGCGTAGGGTTGACTGATCGGCGAGATGGCAAGCCCTATGATGCTTTGACAGCTACTGA
GGTGTTTGTGGAGTCCGAGTGA