Microexon ID Pp_22:11102770-11102780:+
Species Physcomitrium patens
Coordinates 22:11102770..11102780
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAATTTGAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AAAGAGAATCCGAAAGATCCTAGGTTGTACGTTTGCAGTTTATCACAAGGAAATTTGAAGGTGACTGAAGTGCACAATTTCACTCAAGATGATCTTCTGAGTGATGAT
Microexon-tag Amino Acid Seq KENPKDPRLYVCSLSQGNLKVTEVHNFTQDDLLSDD
Microexon-tag spanning region11102429-11103094
Microexon-tag prediction score0.8846
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c22_17160V3.1x
Reference Transcript ID Pp3c22_17160V3.1
Gene ID Pp3c22_17160
Gene Name NA
Transcript ID Pp3c22_17160V3.1
Protein ID Pp3c22_17160V3.1
Gene ID Pp3c22_17160
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 4.9e-09
Motif start 620
Motif end 694
Protein seq >Pp3c22_17160V3.1
MAVSMKNVDPAFHGVGQKAGMDIWRMENFKPVLLPKESHGKFYSGDSYIVLRSTALKSGGLHYDIHFWLGKDTSQDEAGA
AAIKAVELDAALGGRAVQYRETQEHETELFLSYFKPCIIPMEGGVASGFKKLEVEKVEPRLFVVKGRRSVRVAQVPFSRS
SLNHDDVFVLDTESTIFQFNGETSSIQERGKALEVVQYIKDTYHDGKCDIIIVDDGTLGSEADTGQFWVLFGGFAPLTKK
ATVADDAPELPKPKLLCIVEGSFKGVEISKDALDSSKCYVLDCGTELYLWAGRNTSLDARKAAISTAEGLITEHNTAKYT
KITRVIEGFETLEFRSYFEKWPLNGHHTVSEEGRGKVAGILKQQGVNTKGILKGSPVKEELPSLPTLNGNLEVWRLVGGV
KKEVAAGDVGKFYDHSCYVVIYTVQGEEQKEEYHLYSWTGRYTSPEDKVAATRVMNEKNAELKGRGVQAYIIQGKEPTQF
LALFKCVCILKEHTHPGHKEHSIMLVRVRAAGPHIVVAVQLEPVSASLNSSDCFLLQTSSKLYAWSGNLSTAESQRAVLR
VAEILKPGVIARPVKESLEPPLFWSSLGGKRKYASHCKPKENPKDPRLYVCSLSQGNLKVTEVHNFTQDDLLSDDIMIMD
CHNVLYEWVGQHASSEEKEHSLDVGKKYIERAARLDGMLPDTPIFIITEGNEPTFFTSFFSWDTSKVNVNGDAYAQKIAD
IQARQAPQETLQRRLTPSASAGTKSESTQRAAAMAALSSQLTSEGKLSKAAQTIITQNSASAPVSPKVHRPSAANSQRAA
AMAALSFMFGSKPAPASTVSVDADWVAGSSPFTKAEATGDTDSVTSSKTSEDGGDAGEEIAEFYSYDRLKSTSTNPPPKI
NIKRKEAYLSPEDFEKLFGMSRTQFYEMPKWKQDQRKRQLLLF*
CDS seq >Pp3c22_17160V3.1
ATGGCTGTATCTATGAAGAATGTCGACCCTGCATTTCATGGAGTCGGACAGAAAGCAGGAATGGATATATGGCGCATGGA
AAATTTCAAGCCAGTACTATTGCCTAAGGAATCCCATGGAAAATTTTACTCGGGAGATTCTTACATTGTGCTCAGGTCAA
CAGCATTGAAGTCTGGAGGGCTTCACTATGACATTCATTTTTGGCTTGGGAAGGATACTAGCCAGGATGAGGCTGGTGCA
GCGGCAATTAAGGCCGTAGAGCTGGACGCTGCTTTAGGTGGTCGCGCGGTTCAATACAGAGAAACCCAGGAGCATGAAAC
AGAGCTTTTCTTATCTTACTTCAAGCCATGCATTATTCCTATGGAAGGCGGTGTTGCTTCTGGATTCAAGAAGTTGGAGG
TTGAGAAGGTCGAGCCTCGTTTGTTCGTCGTGAAAGGAAGACGCTCTGTCCGAGTTGCACAGGTGCCATTTTCTCGTTCT
TCACTTAACCATGACGATGTTTTTGTTCTGGATACTGAATCTACAATCTTCCAATTCAATGGAGAAACTTCCAGCATCCA
AGAGAGAGGAAAAGCTCTAGAAGTCGTCCAGTATATCAAGGATACATATCACGATGGCAAATGTGACATTATTATCGTAG
ATGATGGTACTCTTGGATCCGAGGCCGACACGGGTCAGTTCTGGGTGCTGTTCGGGGGCTTTGCTCCGCTTACAAAAAAG
GCCACTGTAGCAGATGATGCTCCTGAGTTACCCAAGCCCAAGCTGCTCTGTATTGTGGAAGGGAGCTTCAAGGGTGTGGA
AATCTCTAAAGATGCGCTGGACAGTAGTAAGTGTTACGTTCTTGATTGCGGTACTGAGCTCTACTTATGGGCAGGTCGTA
ACACTTCACTTGATGCAAGAAAGGCCGCAATTTCAACTGCAGAGGGTTTAATCACTGAACATAATACGGCGAAGTACACT
AAAATCACTCGGGTCATTGAGGGATTCGAAACGCTAGAATTTCGGTCATATTTTGAGAAGTGGCCGTTGAATGGACACCA
CACTGTTTCTGAAGAAGGAAGAGGCAAAGTTGCAGGTATTTTGAAGCAGCAAGGTGTTAACACAAAGGGCATTCTTAAGG
GTTCGCCTGTCAAAGAAGAACTCCCATCACTTCCAACTTTGAATGGCAATCTTGAGGTATGGAGGTTGGTCGGTGGCGTA
AAGAAAGAGGTTGCCGCTGGAGATGTAGGAAAGTTCTATGACCACAGCTGCTATGTCGTGATTTATACAGTTCAGGGAGA
AGAACAGAAAGAAGAATATCATCTTTATAGCTGGACTGGCCGGTACACCTCTCCTGAGGACAAGGTTGCAGCAACGAGAG
TTATGAATGAGAAGAATGCCGAACTTAAAGGGCGGGGAGTTCAGGCATACATTATTCAAGGGAAGGAACCCACTCAGTTC
CTGGCGCTGTTCAAATGCGTTTGCATATTGAAGGAACATACCCACCCAGGTCACAAAGAACATTCAATAATGTTGGTGCG
GGTGCGAGCTGCTGGTCCACACATTGTTGTAGCTGTACAACTAGAGCCGGTTTCAGCTTCACTCAACTCCTCTGATTGCT
TTCTGCTTCAAACCAGCTCAAAGTTGTATGCCTGGTCAGGTAACCTGAGCACTGCCGAAAGCCAGAGGGCTGTCCTGCGA
GTGGCAGAAATCTTGAAGCCTGGCGTAATAGCAAGGCCCGTGAAAGAAAGTTTAGAACCCCCACTCTTTTGGAGTTCTCT
AGGGGGTAAACGGAAATATGCAAGCCACTGTAAACCAAAAGAGAATCCGAAAGATCCTAGGTTGTACGTTTGCAGTTTAT
CACAAGGAAATTTGAAGGTGACTGAAGTGCACAATTTCACTCAAGATGATCTTCTGAGTGATGATATCATGATCATGGAT
TGTCACAATGTCTTGTACGAGTGGGTTGGCCAGCATGCAAGTTCGGAGGAGAAAGAGCATAGTTTAGATGTTGGCAAGAA
ATATATTGAGCGAGCAGCAAGGCTGGATGGGATGCTACCAGATACTCCGATTTTTATTATTACGGAAGGCAACGAGCCGA
CGTTTTTCACCAGTTTCTTCTCGTGGGATACCAGCAAGGTCAATGTCAATGGAGACGCATATGCCCAAAAGATTGCTGAT
ATTCAGGCGCGACAAGCACCTCAAGAGACACTCCAAAGGCGTCTTACACCAAGTGCTTCAGCCGGTACTAAAAGTGAATC
CACTCAGAGGGCAGCTGCTATGGCTGCTCTTTCATCACAATTGACCTCAGAAGGAAAACTTTCTAAGGCTGCTCAAACTA
TAATCACTCAGAATTCTGCCTCCGCTCCAGTAAGTCCGAAGGTTCATCGACCATCAGCTGCCAATTCACAAAGAGCTGCG
GCTATGGCAGCTCTATCCTTCATGTTTGGTTCAAAACCAGCTCCAGCCTCAACGGTTTCAGTGGATGCTGATTGGGTTGC
TGGGAGCTCACCCTTCACAAAAGCGGAAGCAACAGGTGATACTGATTCAGTAACTAGCTCAAAGACTTCCGAGGATGGCG
GTGATGCAGGAGAGGAGATTGCTGAATTTTACAGCTATGATCGCTTGAAGTCGACATCCACAAATCCCCCTCCGAAAATC
AATATAAAAAGAAAAGAGGCCTATTTATCTCCTGAAGATTTCGAGAAGCTATTTGGAATGTCGAGAACTCAGTTTTATGA
GATGCCAAAGTGGAAACAGGATCAACGCAAACGTCAACTTCTGCTCTTTTAG
Microexon DNA seq GAAATTTGAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AAAGAGAATCCGAAAGATCCTAGGTTGTACGTTTGCAGTTTATCACAAGGAAATTTGAAGGTGACTGAAGTGCACAATTTCACTCAAGATGATCTTCTGAGTGATGAT
Microexon-tag Amino Acid seq KENPKDPRLYVCSLSQGNLKVTEVHNFTQDDLLSDD
Transcript ID Pp.14547.1
Gene ID Pp.14547
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 4.9e-09
Motif start 620
Motif end 694
Protein seq >Pp.14547.1
MAVSMKNVDPAFHGVGQKAGMDIWRMENFKPVLLPKESHGKFYSGDSYIVLRSTALKSGGLHYDIHFWLGKDTSQDEAGA
AAIKAVELDAALGGRAVQYRETQEHETELFLSYFKPCIIPMEGGVASGFKKLEVEKVEPRLFVVKGRRSVRVAQVPFSRS
SLNHDDVFVLDTESTIFQFNGETSSIQERGKALEVVQYIKDTYHDGKCDIIIVDDGTLGSEADTGQFWVLFGGFAPLTKK
ATVADDAPELPKPKLLCIVEGSFKGVEISKDALDSSKCYVLDCGTELYLWAGRNTSLDARKAAISTAEGLITEHNTAKYT
KITRVIEGFETLEFRSYFEKWPLNGHHTVSEEGRGKVAGILKQQGVNTKGILKGSPVKEELPSLPTLNGNLEVWRLVGGV
KKEVAAGDVGKFYDHSCYVVIYTVQGEEQKEEYHLYSWTGRYTSPEDKVAATRVMNEKNAELKGRGVQAYIIQGKEPTQF
LALFKCVCILKEHTHPGHKEHSIMLVRVRAAGPHIVVAVQLEPVSASLNSSDCFLLQTSSKLYAWSGNLSTAESQRAVLR
VAEILKPGVIARPVKESLEPPLFWSSLGGKRKYASHCKPKENPKDPRLYVCSLSQGNLKVTEVHNFTQDDLLSDDIMIMD
CHNVLYEWVGQHASSEEKEHSLDVGKKYIERAARLDGMLPDTPIFIITEGNEPTFFTSFFSWDTSKVNVNGDAYAQKIAD
IQARQAPQETLQRRLTPSASAGTKSESTQRAAAMAALSSQLTSEGKLSKAAQTIITQNSASAPVSPKVHRPSAANSQRAA
AMAALSFMFGSKPAPASTVSVDADWVAGSSPFTKAEATGDTDSVTSSKTSEDGGDAGEEIAEFYSYDRLKSTSTNPPPKI
NIKRKEAYLSPEDFEKLFGMSRTQFYEMPKWKQDQRKRQLLLF*
CDS seq >Pp.14547.1
ATGGCTGTATCTATGAAGAATGTCGACCCTGCATTTCATGGAGTCGGACAGAAAGCAGGAATGGATATATGGCGCATGGA
AAATTTCAAGCCAGTACTATTGCCTAAGGAATCCCATGGAAAATTTTACTCGGGAGATTCTTACATTGTGCTCAGGTCAA
CAGCATTGAAGTCTGGAGGGCTTCACTATGACATTCATTTTTGGCTTGGGAAGGATACTAGCCAGGATGAGGCTGGTGCA
GCGGCAATTAAGGCCGTAGAGCTGGACGCTGCTTTAGGTGGTCGCGCGGTTCAATACAGAGAAACCCAGGAGCATGAAAC
AGAGCTTTTCTTATCTTACTTCAAGCCATGCATTATTCCTATGGAAGGCGGTGTTGCTTCTGGATTCAAGAAGTTGGAGG
TTGAGAAGGTCGAGCCTCGTTTGTTCGTCGTGAAAGGAAGACGCTCTGTCCGAGTTGCACAGGTGCCATTTTCTCGTTCT
TCACTTAACCATGACGATGTTTTTGTTCTGGATACTGAATCTACAATCTTCCAATTCAATGGAGAAACTTCCAGCATCCA
AGAGAGAGGAAAAGCTCTAGAAGTCGTCCAGTATATCAAGGATACATATCACGATGGCAAATGTGACATTATTATCGTAG
ATGATGGTACTCTTGGATCCGAGGCCGACACGGGTCAGTTCTGGGTGCTGTTCGGGGGCTTTGCTCCGCTTACAAAAAAG
GCCACTGTAGCAGATGATGCTCCTGAGTTACCCAAGCCCAAGCTGCTCTGTATTGTGGAAGGGAGCTTCAAGGGTGTGGA
AATCTCTAAAGATGCGCTGGACAGTAGTAAGTGTTACGTTCTTGATTGCGGTACTGAGCTCTACTTATGGGCAGGTCGTA
ACACTTCACTTGATGCAAGAAAGGCCGCAATTTCAACTGCAGAGGGTTTAATCACTGAACATAATACGGCGAAGTACACT
AAAATCACTCGGGTCATTGAGGGATTCGAAACGCTAGAATTTCGGTCATATTTTGAGAAGTGGCCGTTGAATGGACACCA
CACTGTTTCTGAAGAAGGAAGAGGCAAAGTTGCAGGTATTTTGAAGCAGCAAGGTGTTAACACAAAGGGCATTCTTAAGG
GTTCGCCTGTCAAAGAAGAACTCCCATCACTTCCAACTTTGAATGGCAATCTTGAGGTATGGAGGTTGGTCGGTGGCGTA
AAGAAAGAGGTTGCCGCTGGAGATGTAGGAAAGTTCTATGACCACAGCTGCTATGTCGTGATTTATACAGTTCAGGGAGA
AGAACAGAAAGAAGAATATCATCTTTATAGCTGGACTGGCCGGTACACCTCTCCTGAGGACAAGGTTGCAGCAACGAGAG
TTATGAATGAGAAGAATGCCGAACTTAAAGGGCGGGGAGTTCAGGCATACATTATTCAAGGGAAGGAACCCACTCAGTTC
CTGGCGCTGTTCAAATGCGTTTGCATATTGAAGGAACATACCCACCCAGGTCACAAAGAACATTCAATAATGTTGGTGCG
GGTGCGAGCTGCTGGTCCACACATTGTTGTAGCTGTACAACTAGAGCCGGTTTCAGCTTCACTCAACTCCTCTGATTGCT
TTCTGCTTCAAACCAGCTCAAAGTTGTATGCCTGGTCAGGTAACCTGAGCACTGCCGAAAGCCAGAGGGCTGTCCTGCGA
GTGGCAGAAATCTTGAAGCCTGGCGTAATAGCAAGGCCCGTGAAAGAAAGTTTAGAACCCCCACTCTTTTGGAGTTCTCT
AGGGGGTAAACGGAAATATGCAAGCCACTGTAAACCAAAAGAGAATCCGAAAGATCCTAGGTTGTACGTTTGCAGTTTAT
CACAAGGAAATTTGAAGGTGACTGAAGTGCACAATTTCACTCAAGATGATCTTCTGAGTGATGATATCATGATCATGGAT
TGTCACAATGTCTTGTACGAGTGGGTTGGCCAGCATGCAAGTTCGGAGGAGAAAGAGCATAGTTTAGATGTTGGCAAGAA
ATATATTGAGCGAGCAGCAAGGCTGGATGGGATGCTACCAGATACTCCGATTTTTATTATTACGGAAGGCAACGAGCCGA
CGTTTTTCACCAGTTTCTTCTCGTGGGATACCAGCAAGGTCAATGTCAATGGAGACGCATATGCCCAAAAGATTGCTGAT
ATTCAGGCGCGACAAGCACCTCAAGAGACACTCCAAAGGCGTCTTACACCAAGTGCTTCAGCCGGTACTAAAAGTGAATC
CACTCAGAGGGCAGCTGCTATGGCTGCTCTTTCATCACAATTGACCTCAGAAGGAAAACTTTCTAAGGCTGCTCAAACTA
TAATCACTCAGAATTCTGCCTCCGCTCCAGTAAGTCCGAAGGTTCATCGACCATCAGCTGCCAATTCACAAAGAGCTGCG
GCTATGGCAGCTCTATCCTTCATGTTTGGTTCAAAACCAGCTCCAGCCTCAACGGTTTCAGTGGATGCTGATTGGGTTGC
TGGGAGCTCACCCTTCACAAAAGCGGAAGCAACAGGTGATACTGATTCAGTAACTAGCTCAAAGACTTCCGAGGATGGCG
GTGATGCAGGAGAGGAGATTGCTGAATTTTACAGCTATGATCGCTTGAAGTCGACATCCACAAATCCCCCTCCGAAAATC
AATATAAAAAGAAAAGAGGCCTATTTATCTCCTGAAGATTTCGAGAAGCTATTTGGAATGTCGAGAACTCAGTTTTATGA
GATGCCAAAGTGGAAACAGGATCAACGCAAACGTCAACTTCTGCTCTTTTAG