Microexon ID Pp_19:10494754-10494764:-
Species Physcomitrium patens
Coordinates 19:10494754..10494764
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AAAATTTGAAG
Microexon Amino Acid seq ENLK
Microexon-tag DNA Seq AAAGAGGGTCCGAAGGATCCAAGGCTGTTCGCTTGCAGTCTTTCACGAGAAAATTTGAAGGTGACTGAAGTGCACAATTTCACACAAGATGATCTTCTGAGTGACGAT
Microexon-tag Amino Acid Seq KEGPKDPRLFACSLSRENLKVTEVHNFTQDDLLSDD
Microexon-tag spanning region10494491-10495110
Microexon-tag prediction score0.8812
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c19_16000V3.1x
Reference Transcript ID Pp3c19_16000V3.1
Gene ID Pp3c19_16000
Gene Name NA
Transcript ID Pp3c19_16000V3.1
Protein ID Pp3c19_16000V3.1
Gene ID Pp3c19_16000
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 5.3e-10
Motif start 620
Motif end 694
Protein seq >Pp3c19_16000V3.1
MAVSMKNVDVAFQGVGQKPGIDIWRIENFKPVPLLKEFHGKFYSGDSYIVLKTTALKTGGFHYDIHFWLGKDTSQDEAGT
AAIKTVELDAALGGRAVQYRETQEHETELFLSYFKPCIVPMEGGIASGFKKVEVGKVEPRLFIVKGRRTVRVTQVPFARS
SLNHDDVFVLDTESTIFQFNGENSSIQERGKALEVVQYIKDTDHDGKCEIVIIDDGTLGTEADTGQFWVLFGGFAPLSKK
PVVADDASGLPKPKLLCIIERSLKEVEMSKDVLDSSKCYVLDCGNEIYTWAGRNTSLDARKAAISIVEDLITNLNRPKHI
QITRIIEGFETLEFRSYFVKWPLNGQHTVSEEGRGKVAALLKQQGVNTKGILKGSPVKEELPPLPSLNGKLEVWRLVGGV
KKEIDAGDVGRFYDHSCYIVLYTYQGEERKEEYLLCNWIGRHTSVEDKASGLRVMNEMSAALKGRAVQAYIAQGKEPIQF
LALFKCMCILKEHVCPGHKDHSILLVRARCVGPQIVLAVQLEPVSASLNSSDCFLLQTNSKLYAWTGNLSTVENQKAVLR
AAEVLKPGVVARPVKEGLEPPLFWSSLGSKRKYASHPKPKEGPKDPRLFACSLSRENLKVTEVHNFTQDDLLSDDIMILD
CHNVIYEWVGQHASTEEKELNLDIAKKYIERAARLDGILQDVPIFMITEGNEPMFFTTFFSWDSSKVHGDSYTKRVAGIQ
GRPVPQEKVQRRLTPSASAGTKSESTQRAAAMAALSSQLTSEGKLSKVAQTLVNQNPSSAPASPRFHRPSTANSQRAAAM
AALSFMLGTKKAPGSAVSVDADWVAGSSPFAKVEATGDTESVTSSKTSEDGGDGGEEIAEFYSYDRLKSSSTNPPKINIK
RKEAYLSPEDFEKLFGMSRTQFYEMPKWKQDQRKRNLLLF*
CDS seq >Pp3c19_16000V3.1
ATGGCTGTGTCTATGAAGAATGTGGACGTCGCATTCCAAGGAGTTGGCCAGAAACCAGGAATTGACATATGGCGCATTGA
GAATTTCAAACCAGTGCCCTTGCTCAAGGAATTTCATGGAAAATTTTATTCAGGAGATTCTTACATTGTGCTCAAGACGA
CCGCACTTAAAACTGGAGGGTTCCACTACGATATTCACTTTTGGCTGGGAAAGGACACGAGCCAGGATGAGGCTGGTACA
GCAGCAATTAAGACCGTTGAGCTGGATGCTGCGTTAGGTGGTCGCGCCGTTCAGTATAGAGAAACTCAGGAGCACGAAAC
AGAACTCTTTCTATCTTATTTCAAACCATGTATTGTTCCTATGGAAGGCGGTATTGCTTCTGGATTCAAGAAAGTGGAAG
TTGGGAAGGTTGAGCCTCGTTTATTCATTGTAAAAGGAAGACGCACTGTCAGAGTTACACAGGTGCCATTTGCTCGTTCC
TCACTGAACCATGACGATGTTTTTGTTCTGGACACGGAATCAACAATATTCCAATTCAATGGAGAAAATTCCAGTATTCA
AGAGAGGGGGAAAGCTCTTGAAGTGGTCCAGTATATCAAGGATACAGATCATGATGGAAAATGCGAAATTGTAATTATAG
ACGATGGTACGCTCGGCACTGAGGCAGACACTGGGCAATTCTGGGTTCTGTTTGGAGGCTTTGCTCCTCTTTCAAAGAAA
CCTGTTGTTGCAGATGATGCCTCTGGGTTACCCAAGCCTAAGTTGCTCTGTATCATAGAAAGGAGCTTGAAGGAAGTGGA
AATGTCTAAGGATGTACTTGACAGCAGCAAGTGTTACGTGCTCGATTGCGGTAATGAGATCTACACTTGGGCAGGTCGCA
ACACATCACTTGATGCTAGAAAGGCTGCAATTTCAATTGTAGAGGATTTAATCACTAACCTGAATAGGCCGAAGCACATC
CAGATCACCCGGATCATTGAAGGATTCGAAACGCTCGAGTTTCGTTCGTACTTTGTTAAGTGGCCATTAAATGGACAACA
CACCGTCTCTGAAGAAGGAAGAGGCAAAGTTGCAGCATTGTTGAAGCAGCAAGGTGTTAACACAAAAGGTATTCTCAAGG
GTTCACCTGTGAAAGAAGAGCTCCCACCACTTCCAAGTTTGAATGGCAAGCTTGAGGTATGGAGGTTGGTCGGTGGTGTA
AAAAAAGAAATTGATGCTGGAGATGTTGGAAGGTTCTATGACCACAGCTGCTATATTGTGCTTTACACTTATCAAGGAGA
AGAGCGTAAAGAGGAATACCTTCTATGCAACTGGATTGGTCGGCACACCTCTGTGGAGGACAAGGCTTCGGGACTGAGGG
TTATGAATGAAATGAGTGCAGCACTGAAAGGACGTGCAGTTCAGGCATACATTGCTCAAGGCAAGGAACCCATTCAGTTT
TTGGCGCTGTTTAAATGCATGTGCATATTGAAGGAACATGTTTGTCCAGGTCACAAGGATCATTCAATATTGTTGGTGCG
GGCGCGGTGTGTTGGTCCACAAATTGTCCTAGCTGTCCAGCTGGAGCCTGTGTCAGCTTCACTAAACTCCTCCGATTGCT
TTCTACTTCAAACCAACTCGAAGTTGTATGCCTGGACAGGCAACCTGAGTACTGTTGAGAATCAGAAGGCTGTTTTGCGA
GCAGCTGAAGTTCTGAAGCCTGGTGTTGTAGCAAGGCCTGTGAAAGAAGGATTAGAGCCTCCACTCTTTTGGAGTTCTCT
GGGGAGTAAACGAAAATATGCAAGCCACCCCAAACCAAAAGAGGGTCCGAAGGATCCAAGGCTGTTCGCTTGCAGTCTTT
CACGAGAAAATTTGAAGGTGACTGAAGTGCACAATTTCACACAAGATGATCTTCTGAGTGACGATATCATGATCCTGGAC
TGTCACAATGTCATCTACGAGTGGGTTGGCCAGCATGCAAGCACAGAGGAGAAAGAGCTAAATTTAGATATTGCCAAGAA
ATACATCGAACGTGCAGCAAGGTTGGATGGGATACTACAGGATGTTCCCATCTTCATGATCACGGAAGGCAATGAGCCAA
TGTTTTTCACCACCTTCTTCTCATGGGATTCCAGCAAGGTCCATGGAGATTCCTACACAAAAAGAGTTGCAGGGATTCAA
GGACGACCAGTTCCTCAAGAGAAAGTCCAAAGACGTCTTACTCCAAGTGCTTCAGCTGGTACCAAAAGTGAATCCACACA
GAGGGCAGCAGCCATGGCAGCTCTCTCTTCACAGTTGACTTCAGAAGGGAAACTGTCGAAGGTTGCCCAAACACTAGTCA
ATCAGAACCCATCCTCTGCTCCAGCGAGTCCAAGGTTTCATCGTCCATCAACTGCGAATTCTCAAAGAGCTGCTGCAATG
GCGGCCCTATCCTTCATGCTTGGCACAAAAAAAGCTCCAGGCTCTGCAGTGTCAGTCGATGCTGATTGGGTTGCTGGGAG
CTCACCATTCGCGAAAGTGGAAGCAACGGGAGATACAGAATCTGTAACAAGCTCAAAGACTTCTGAGGATGGAGGAGATG
GAGGAGAGGAGATCGCTGAATTTTACAGCTATGATCGTTTGAAATCATCATCCACAAATCCTCCAAAAATAAATATAAAA
AGAAAAGAGGCTTATTTATCCCCTGAAGATTTTGAGAAGCTCTTTGGAATGTCGAGAACCCAGTTTTACGAGATGCCCAA
GTGGAAACAGGATCAACGCAAGCGCAATCTCCTACTCTTTTAG
Microexon DNA seq AAAATTTGAAG
Microexon Amino Acid seq ENLK
Microexon-tag DNA Seq AAAGAGGGTCCGAAGGATCCAAGGCTGTTCGCTTGCAGTCTTTCACGAGAAAATTTGAAGGTGACTGAAGTGCACAATTTCACACAAGATGATCTTCTGAGTGACGAT
Microexon-tag Amino Acid seq KEGPKDPRLFACSLSRENLKVTEVHNFTQDDLLSDD
Transcript ID Pp.10539.1
Gene ID Pp.10539
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 5.3e-10
Motif start 620
Motif end 694
Protein seq >Pp.10539.1
MAVSMKNVDVAFQGVGQKPGIDIWRIENFKPVPLLKEFHGKFYSGDSYIVLKTTALKTGGFHYDIHFWLGKDTSQDEAGT
AAIKTVELDAALGGRAVQYRETQEHETELFLSYFKPCIVPMEGGIASGFKKVEVGKVEPRLFIVKGRRTVRVTQVPFARS
SLNHDDVFVLDTESTIFQFNGENSSIQERGKALEVVQYIKDTDHDGKCEIVIIDDGTLGTEADTGQFWVLFGGFAPLSKK
PVVADDASGLPKPKLLCIIERSLKEVEMSKDVLDSSKCYVLDCGNEIYTWAGRNTSLDARKAAISIVEDLITNLNRPKHI
QITRIIEGFETLEFRSYFVKWPLNGQHTVSEEGRGKVAALLKQQGVNTKGILKGSPVKEELPPLPSLNGKLEVWRLVGGV
KKEIDAGDVGRFYDHSCYIVLYTYQGEERKEEYLLCNWIGRHTSVEDKASGLRVMNEMSAALKGRAVQAYIAQGKEPIQF
LALFKCMCILKEHVCPGHKDHSILLVRARCVGPQIVLAVQLEPVSASLNSSDCFLLQTNSKLYAWTGNLSTVENQKAVLR
AAEVLKPGVVARPVKEGLEPPLFWSSLGSKRKYASHPKPKEGPKDPRLFACSLSRENLKVTEVHNFTQDDLLSDDIMILD
CHNVIYEWVGQHASTEEKELNLDIAKKYIERAARLDGILQDVPIFMITEGNEPMFFTTFFSWDSSKVNVHGDSYTKRVAG
IQGRPVPQEKVQRRLTPSASAGTKSESTQRAAAMAALSSQLTSEGKLSKVAQTLVNQNPSSAPASPRFHRPSTANSQRAA
AMAALSFMLGTKKAPGSAVSVDADWVAGSSPFAKVEATGDTESVTSSKTSEDGGDGGEEIAEFYSYDRLKSSSTNPPKIN
IKRKEAYLSPEDFEKLFGMSRTQFYEMPKWKQDQRKRNLLLF*
CDS seq >Pp.10539.1
ATGGCTGTGTCTATGAAGAATGTGGACGTCGCATTCCAAGGAGTTGGCCAGAAACCAGGAATTGACATATGGCGCATTGA
GAATTTCAAACCAGTGCCCTTGCTCAAGGAATTTCATGGAAAATTTTATTCAGGAGATTCTTACATTGTGCTCAAGACGA
CCGCACTTAAAACTGGAGGGTTCCACTACGATATTCACTTTTGGCTGGGAAAGGACACGAGCCAGGATGAGGCTGGTACA
GCAGCAATTAAGACCGTTGAGCTGGATGCTGCGTTAGGTGGTCGCGCCGTTCAGTATAGAGAAACTCAGGAGCACGAAAC
AGAACTCTTTCTATCTTATTTCAAACCATGTATTGTTCCTATGGAAGGCGGTATTGCTTCTGGATTCAAGAAAGTGGAAG
TTGGGAAGGTTGAGCCTCGTTTATTCATTGTAAAAGGAAGACGCACTGTCAGAGTTACACAGGTGCCATTTGCTCGTTCC
TCACTGAACCATGACGATGTTTTTGTTCTGGACACGGAATCAACAATATTCCAATTCAATGGAGAAAATTCCAGTATTCA
AGAGAGGGGGAAAGCTCTTGAAGTGGTCCAGTATATCAAGGATACAGATCATGATGGAAAATGCGAAATTGTAATTATAG
ACGATGGTACGCTCGGCACTGAGGCAGACACTGGGCAATTCTGGGTTCTGTTTGGAGGCTTTGCTCCTCTTTCAAAGAAA
CCTGTTGTTGCAGATGATGCCTCTGGGTTACCCAAGCCTAAGTTGCTCTGTATCATAGAAAGGAGCTTGAAGGAAGTGGA
AATGTCTAAGGATGTACTTGACAGCAGCAAGTGTTACGTGCTCGATTGCGGTAATGAGATCTACACTTGGGCAGGTCGCA
ACACATCACTTGATGCTAGAAAGGCTGCAATTTCAATTGTAGAGGATTTAATCACTAACCTGAATAGGCCGAAGCACATC
CAGATCACCCGGATCATTGAAGGATTCGAAACGCTCGAGTTTCGTTCGTACTTTGTTAAGTGGCCATTAAATGGACAACA
CACCGTCTCTGAAGAAGGAAGAGGCAAAGTTGCAGCATTGTTGAAGCAGCAAGGTGTTAACACAAAAGGTATTCTCAAGG
GTTCACCTGTGAAAGAAGAGCTCCCACCACTTCCAAGTTTGAATGGCAAGCTTGAGGTATGGAGGTTGGTCGGTGGTGTA
AAAAAAGAAATTGATGCTGGAGATGTTGGAAGGTTCTATGACCACAGCTGCTATATTGTGCTTTACACTTATCAAGGAGA
AGAGCGTAAAGAGGAATACCTTCTATGCAACTGGATTGGTCGGCACACCTCTGTGGAGGACAAGGCTTCGGGACTGAGGG
TTATGAATGAAATGAGTGCAGCACTGAAAGGACGTGCAGTTCAGGCATACATTGCTCAAGGCAAGGAACCCATTCAGTTT
TTGGCGCTGTTTAAATGCATGTGCATATTGAAGGAACATGTTTGTCCAGGTCACAAGGATCATTCAATATTGTTGGTGCG
GGCGCGGTGTGTTGGTCCACAAATTGTCCTAGCTGTCCAGCTGGAGCCTGTGTCAGCTTCACTAAACTCCTCCGATTGCT
TTCTACTTCAAACCAACTCGAAGTTGTATGCCTGGACAGGCAACCTGAGTACTGTTGAGAATCAGAAGGCTGTTTTGCGA
GCAGCTGAAGTTCTGAAGCCTGGTGTTGTAGCAAGGCCTGTGAAAGAAGGATTAGAGCCTCCACTCTTTTGGAGTTCTCT
GGGGAGTAAACGAAAATATGCAAGCCACCCCAAACCAAAAGAGGGTCCGAAGGATCCAAGGCTGTTCGCTTGCAGTCTTT
CACGAGAAAATTTGAAGGTGACTGAAGTGCACAATTTCACACAAGATGATCTTCTGAGTGACGATATCATGATCCTGGAC
TGTCACAATGTCATCTACGAGTGGGTTGGCCAGCATGCAAGCACAGAGGAGAAAGAGCTAAATTTAGATATTGCCAAGAA
ATACATCGAACGTGCAGCAAGGTTGGATGGGATACTACAGGATGTTCCCATCTTCATGATCACGGAAGGCAATGAGCCAA
TGTTTTTCACCACCTTCTTCTCATGGGATTCCAGCAAGGTCAATGTCCATGGAGATTCCTACACAAAAAGAGTTGCAGGG
ATTCAAGGACGACCAGTTCCTCAAGAGAAAGTCCAAAGACGTCTTACTCCAAGTGCTTCAGCTGGTACCAAAAGTGAATC
CACACAGAGGGCAGCAGCCATGGCAGCTCTCTCTTCACAGTTGACTTCAGAAGGGAAACTGTCGAAGGTTGCCCAAACAC
TAGTCAATCAGAACCCATCCTCTGCTCCAGCGAGTCCAAGGTTTCATCGTCCATCAACTGCGAATTCTCAAAGAGCTGCT
GCAATGGCGGCCCTATCCTTCATGCTTGGCACAAAAAAAGCTCCAGGCTCTGCAGTGTCAGTCGATGCTGATTGGGTTGC
TGGGAGCTCACCATTCGCGAAAGTGGAAGCAACGGGAGATACAGAATCTGTAACAAGCTCAAAGACTTCTGAGGATGGAG
GAGATGGAGGAGAGGAGATCGCTGAATTTTACAGCTATGATCGTTTGAAATCATCATCCACAAATCCTCCAAAAATAAAT
ATAAAAAGAAAAGAGGCTTATTTATCCCCTGAAGATTTTGAGAAGCTCTTTGGAATGTCGAGAACCCAGTTTTACGAGAT
GCCCAAGTGGAAACAGGATCAACGCAAGCGCAATCTCCTACTCTTTTAG