Microexon ID Sm_GL377581:430241-430255:+
Species Selaginella moellendorffii
Coordinates GL377581:430241..430255
Microexon Cluster ID MEP41
Size 15
Phase 0
Pfam Domain Motif DUF974
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CARTTYTTCAAGTTYATTGTTKCWAAYCCACTTTCWGTTAGRACAAAGGTYCGYRYTRTCAAGGAAACTACMTWTYTRGARGCTTGYATWGARAAYCATACAAAATCA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTTAGAACAGTCAAG
Microexon Amino Acid seq VRTVK
Microexon-tag DNA Seq CAATACTTCAAGTTCACGACCTCTAATCCTGTTTCCGTCAGAACAAAGGTTAGAACAGTCAAGGATACAACATTTCTCGAGGCGTGCATAGAAAACCAAACCAAGTCT
Microexon-tag Amino Acid Seq QYFKFTTSNPVSVRTKVRTVKDTTFLEACIENQTKS
Microexon-tag spanning region430145-430350
Microexon-tag prediction score0.905
Overlapped with the annotated transcript (%) 100
New Transcript ID EFJ27594x
Reference Transcript ID EFJ27594
Gene ID SELMODRAFT_95233
Gene Name NA
Transcript ID EFJ27594
Protein ID EFJ27594
Gene ID SELMODRAFT_95233
Gene Name NA
Pfam domain motif DUF974
Motif E-value 1.4e-61
Motif start 80
Motif end 309
Protein seq >EFJ27594
MTSGAGAAGHSLAFRVMRLCRPSCQVDHPLLVDPSDVCNGEDSVNFKELLPGLVNGNDPGFWKRFELQEPMDAMGLSGQL
VLPQTFGSIYLGETFCSYISVGNHTNHDVRDVIIKAELQTERQRIILSDNSKSPIESIRATGRFDFIIEHDIKELGGHTL
VCMAVYTDPDGDRKYLPQYFKFTTSNPVSVRTKVRTVKDTTFLEACIENQTKSHLFMDQVRFEPAPPWSVTTLENEEEAS
ESDGPISGYIKSLKLINGNGGARHYLFQLKRPPLESSDVKLEGANALGKLEILWRTTLGETGRLQTQQINGSPTPKKPLD
VKMTNLPQRILIERPFLVRMEVTNRSEQFTGPLRVVMSETDDNGTPRTVLMNGLLSLVSSRIHEDLTGTLSQNLVAVAAG
VQRIAGICLVDARDGRQVEFVPPTEVNFEALLFVANRSR*
CDS seq >EFJ27594
ATGACGTCCGGGGCTGGAGCAGCGGGCCACTCGCTCGCATTCCGGGTGATGCGGCTGTGCCGCCCCAGCTGCCAGGTAGA
TCACCCCCTCCTCGTCGATCCCAGCGACGTGTGCAATGGTGAGGACTCGGTGAATTTCAAGGAGCTGCTGCCGGGGCTTG
TCAATGGCAACGATCCGGGCTTCTGGAAGCGATTTGAGCTCCAGGAGCCCATGGATGCGATGGGGCTCTCCGGGCAGCTT
GTTCTTCCACAAACTTTCGGATCGATCTATCTCGGAGAGACGTTTTGTAGCTACATCAGTGTCGGGAATCACACGAATCA
CGATGTGAGAGATGTGATCATCAAAGCAGAGCTCCAGACTGAGCGTCAAAGGATTATTCTAAGCGACAACTCCAAGTCCC
CGATCGAGTCCATCAGAGCGACTGGCCGATTTGATTTTATCATCGAGCACGACATCAAAGAACTTGGAGGACATACGCTT
GTTTGCATGGCAGTGTACACTGATCCAGACGGTGATCGTAAATATCTTCCGCAATACTTCAAGTTCACGACCTCTAATCC
TGTTTCCGTCAGAACAAAGGTTAGAACAGTCAAGGATACAACATTTCTCGAGGCGTGCATAGAAAACCAAACCAAGTCTC
ACCTCTTTATGGACCAAGTTAGATTTGAGCCTGCACCGCCTTGGAGTGTCACGACTTTGGAGAACGAGGAAGAAGCTAGT
GAATCAGATGGTCCCATCAGCGGCTATATCAAGAGCTTGAAACTGATCAATGGTAACGGCGGTGCTCGCCACTATCTCTT
TCAGCTAAAGAGGCCGCCATTGGAATCCTCAGACGTTAAGTTAGAAGGCGCAAATGCCCTTGGGAAGCTTGAAATACTGT
GGAGGACAACTCTTGGCGAAACCGGTCGCCTTCAAACGCAACAAATCAATGGCAGTCCAACGCCGAAAAAGCCTTTGGAC
GTCAAGATGACAAACTTGCCGCAGAGAATTCTAATAGAACGACCATTTCTCGTTCGGATGGAAGTTACAAACCGGTCAGA
ACAATTCACTGGACCGCTAAGAGTTGTCATGTCTGAGACCGATGACAATGGTACACCCAGGACAGTTCTCATGAACGGAC
TCTTGAGCTTGGTAAGTTCTCGTATTCATGAAGATTTAACGGGCACTCTCTCGCAGAACTTGGTGGCGGTAGCCGCAGGT
GTGCAGCGAATAGCGGGGATTTGTCTAGTGGACGCACGAGACGGCAGACAAGTTGAGTTTGTGCCTCCAACCGAGGTAAA
CTTCGAAGCTTTGCTGTTCGTAGCTAATCGGTCACGCTGA
Microexon DNA seq GTTAGAACAGTCAAG
Microexon Amino Acid seq VRTVK
Microexon-tag DNA Seq CAATACTTCAAGTTCACGACCTCTAATCCTGTTTCCGTCAGAACAAAGGTTAGAACAGTCAAGGATACAACATTTCTCGAGGCGTGCATAGAAAACCAAACCAAGTCT
Microexon-tag Amino Acid seq QYFKFTTSNPVSVRTKVRTVKDTTFLEACIENQTKS
Transcript ID Sm.10203.1
Gene ID Sm.10203
Gene Name NA
Pfam domain motif DUF974
Motif E-value 1.4e-61
Motif start 80
Motif end 309
Protein seq >Sm.10203.1
MTSGAGAAGHSLAFRVMRLCRPSCQVDHPLLVDPSDVCNGEDSVNFKELLPGLVNGNDPGFWKRFELQEPMDAMGLSGQL
VLPQTFGSIYLGETFCSYISVGNHTNHDVRDVIIKAELQTERQRIILSDNSKSPIESIRATGRFDFIIEHDIKELGGHTL
VCMAVYTDPDGDRKYLPQYFKFTTSNPVSVRTKVRTVKDTTFLEACIENQTKSHLFMDQVRFEPAPPWSVTTLENEEEAS
ESDGPISGYIKSLKLINGNGGARHYLFQLKRPPLESSDVKLEGANALGKLEILWRTTLGETGRLQTQQINGSPTPKKPLD
VKMTNLPQRILIERPFLVRMEVTNRSEQFTGPLRVVMSETDDNGTPRTVLMNGLLSLMVPPLAPLASTELEVNLVAVAAG
VQRIAGICLVDARDGRQVEFVPPTEVNFEALLFVANRSR*
CDS seq >Sm.10203.1
ATGACGTCCGGGGCTGGAGCAGCGGGCCACTCGCTCGCATTCCGGGTGATGCGGCTGTGCCGCCCCAGCTGCCAGGTAGA
TCACCCCCTCCTCGTCGATCCCAGCGACGTGTGCAATGGTGAGGACTCGGTGAATTTCAAGGAGCTGCTGCCGGGGCTTG
TCAATGGCAACGATCCGGGCTTCTGGAAGCGATTTGAGCTCCAGGAGCCCATGGATGCGATGGGGCTCTCCGGGCAGCTT
GTTCTTCCACAAACTTTCGGATCGATCTATCTCGGAGAGACGTTTTGTAGCTACATCAGTGTCGGGAATCACACGAATCA
CGATGTGAGAGATGTGATCATCAAAGCAGAGCTCCAGACTGAGCGTCAAAGGATTATTCTAAGCGACAACTCCAAGTCCC
CGATCGAGTCCATCAGAGCGACTGGCCGATTTGATTTTATCATCGAGCACGACATCAAAGAACTTGGAGGACATACGCTT
GTTTGCATGGCAGTGTACACTGATCCAGACGGTGATCGTAAATATCTTCCGCAATACTTCAAGTTCACGACCTCTAATCC
TGTTTCCGTCAGAACAAAGGTTAGAACAGTCAAGGATACAACATTTCTCGAGGCGTGCATAGAAAACCAAACCAAGTCTC
ACCTCTTTATGGACCAAGTTAGATTTGAGCCTGCACCGCCTTGGAGTGTCACGACTTTGGAGAACGAGGAAGAAGCTAGT
GAATCAGATGGTCCCATCAGCGGCTATATCAAGAGCTTGAAACTGATCAATGGTAACGGCGGTGCTCGCCACTATCTCTT
TCAGCTAAAGAGGCCGCCATTGGAATCCTCAGACGTTAAGTTAGAAGGCGCAAATGCCCTTGGGAAGCTTGAAATACTGT
GGAGGACAACTCTTGGCGAAACCGGTCGCCTTCAAACGCAACAAATCAATGGCAGTCCAACGCCGAAAAAGCCTTTGGAC
GTCAAGATGACAAACTTGCCGCAGAGAATTCTAATAGAACGACCATTTCTCGTTCGGATGGAAGTTACAAACCGGTCAGA
ACAATTCACTGGACCGCTAAGAGTTGTCATGTCTGAGACCGATGACAATGGTACACCCAGGACAGTTCTCATGAACGGAC
TCTTGAGCTTGATGGTTCCTCCACTGGCCCCTCTCGCATCGACCGAGCTGGAAGTTAACTTGGTGGCGGTAGCCGCAGGT
GTGCAGCGAATAGCGGGGATTTGTCTAGTGGACGCACGAGACGGCAGACAAGTTGAGTTTGTGCCTCCAACCGAGGTAAA
CTTCGAAGCTTTGCTGTTCGTAGCTAATCGGTCACGCTGA