Microexon ID Sm_GL377610:708189-708202:+
Species Selaginella moellendorffii
Coordinates GL377610:708189..708202
Microexon Cluster ID MEP36
Size 14
Phase 1
Pfam Domain Motif EFP_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 15,31,14,48
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq KTYYGCGGCTCYGATGTGARRCYWGGRAAYGTCATTGAAARAAAAGGAMRBATTTAYSAGGTKRTAAARGCASAACAYWCWMMTCAAGGAAGAGGAGGAGCYAYWATA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GGCGCCTTGTGCAG
Microexon Amino Acid seq GRLVQ
Microexon-tag DNA Seq TTCCCCGACACAGAGGTGAGAGTCGGGCAGATTTTGGAGCGCAAAGGGCGCCTTGTGCAGGTTATAAAGATTGCTCATGCTCATGGAGGCCGTGGTGGAGCTACAATT
Microexon-tag Amino Acid Seq FPDTEVRVGQILERKGRLVQVIKIAHAHGGRGGATI
Microexon-tag spanning region706761-708313
Microexon-tag prediction score0.8469
Overlapped with the annotated transcript (%) 12.72
New Transcript ID EFJ18223x
Reference Transcript ID EFJ18223
Gene ID SELMODRAFT_444589
Gene Name NA
Transcript ID EFJ18223
Protein ID EFJ18223
Gene ID SELMODRAFT_444589
Gene Name NA
Pfam domain motif EFP_N
Motif E-value 3e-09
Motif start 384
Motif end 438
Protein seq >EFJ18223
MSSDRRRSSYIWWYDKSLNFFRDRSAKQELTTRSSKTLTDLDCCSVCQEQFTLPCQANCTHWFCGECILRVWEYGSALLP
CKCPICRQPITHLWPSEFSTTEEATNFMPLLTDIATYNLNFDSRVITILRLKFPDTEVRCLHVLHFLFLLVLALYVVCTL
NLILEGKLAIRTVKKEFMELNAAHRWVAGFNVAERKSGLRVLCSTRSLPRLPIMRGRKLGREAILTVQSLKRAKGVREEM
EQVLKSQVSRLLKTDMLEVLRELHRQNDCVLAYEELQGEMLPDGLVYTELMTAYLRCERPNDAIDTFQEMKQLGLTVELK
TFTTLAKWLEKKGLGEEASARMQRSRRVAQQGFKALSGFRVFKEDSSSLWHWQQRRGLKIDGLEVRVGQILERKGRLVQV
IKIAHAHGGRGGATIPIEYKDLISGIKSTERLRTSESVERVAVDSKTFTYLYDEGQMAILMDPKTFDQVTVDKELFGTGA
SYLSDGMSVTVQMHEGKPVTASVPVSLTCKVVEAEPFMKGQTQAAAYKHITLENGKIIMAPCFIVSGDQIVVNTVEDTYV
TSGLHVLLLQSFQRITNMTLFQRRVISGSLKSIKQKSKMGDSEQHRHWEWSRLPAELRHRILHLLPLSSALAARAVCKDW
KDSLSTWSGRKSWFLVKDEQVEQEGSLKWSMVDAETGSWIRLGGLSRFATSVAAGDGLVCLCVGNSARPEFYVGNPLFEK
SWQKLPELPPLKTLTLARLVLSASALTCVLGIVSEEPPGVTVSASVTSNSGSGSSGFKVLVGRCGYIVVYESRAGKWREV
QGDTPALALQLLQESDERSIVPVRRGSQLLWSPLRSSGGAHRSSGSHHYRHLKVWRLDEDDELGQECKPVNRMRDYMVQE
ISPKGFFAAHWCLREAKLMLLTCTLEKFSPQVRDWGFKRLVLWSFDPHAAKWSSNPLRRPLRDVDLSSSVVYSFQPRLDF
SQMAAWCRKCIYLKPQLEKLALEFQHGAKFFCVDVNAVPHALIKRVGVSRMPTIQLWKNKEKRAEIIGGQKAELVVEEVR
QMLQTLEK*
CDS seq >EFJ18223
ATGAGCAGCGATCGTCGACGCTCTTCTTACATCTGGTGGTACGATAAGAGCTTGAATTTCTTCCGTGACCGCAGCGCCAA
GCAAGAACTGACGACTCGATCATCCAAAACTCTCACGGACTTGGATTGCTGCTCGGTTTGCCAGGAGCAATTCACTCTTC
CTTGTCAAGCGAATTGCACACACTGGTTCTGTGGAGAATGTATTCTCAGAGTGTGGGAGTATGGATCAGCGTTACTCCCT
TGTAAATGTCCCATTTGTCGACAACCAATCACTCACTTGTGGCCTAGCGAATTTTCTACGACTGAAGAGGCTACGAATTT
CATGCCACTGCTCACTGACATCGCGACTTACAATCTTAACTTCGATAGCCGAGTGATCACCATTCTGCGACTGAAATTCC
CCGACACAGAGGTTCGGTGTCTCCACGTTCTTCACTTCCTTTTTCTCCTGGTTCTCGCTCTTTACGTCGTTTGCACGCTC
AATCTCATACTCGAAGGTAAGCTGGCTATTCGGACAGTGAAGAAAGAGTTCATGGAACTTAATGCGGCACACAGATGGGT
CGCGGGTTTCAACGTTGCCGAGAGAAAATCCGGACTGCGTGTGCTGTGTTCTACCAGGAGTCTTCCCAGACTACCGATAA
TGAGGGGCCGGAAGCTGGGAAGAGAGGCAATTCTCACTGTACAGAGCTTGAAGAGAGCCAAAGGTGTCCGGGAGGAGATG
GAACAAGTGCTCAAGTCTCAGGTGTCTCGGCTTCTCAAGACTGACATGCTCGAAGTTCTGCGGGAGCTACACAGGCAAAA
TGACTGTGTTCTAGCTTACGAGGAGCTCCAGGGAGAAATGCTGCCGGATGGGCTGGTTTACACTGAGCTCATGACGGCTT
ACTTGAGATGCGAGAGACCAAACGATGCGATCGACACATTCCAGGAGATGAAGCAACTGGGGCTCACGGTGGAGCTCAAA
ACTTTCACCACTCTGGCCAAATGGTTGGAGAAAAAAGGCCTTGGCGAGGAAGCTAGCGCTAGAATGCAGCGCTCGAGGCG
CGTTGCGCAGCAGGGATTTAAGGCGCTGTCTGGATTTCGAGTCTTTAAGGAGGATTCTTCGTCGTTGTGGCATTGGCAGC
AGCGCCGCGGATTGAAGATCGATGGATTGGAGGTGAGAGTCGGGCAGATTTTGGAGCGCAAAGGGCGCCTTGTGCAGGTT
ATAAAGATTGCTCATGCTCATGGAGGCCGTGGTGGAGCTACAATTCCGATCGAGTACAAGGATCTGATCAGCGGGATCAA
GTCAACTGAGCGTTTGCGAACCTCAGAATCGGTGGAAAGAGTGGCTGTCGATTCGAAAACCTTTACATATCTGTACGACG
AGGGACAAATGGCGATTTTGATGGATCCAAAAACGTTCGACCAGGTTACAGTGGACAAAGAGCTCTTTGGAACTGGAGCT
TCATATCTTTCAGATGGAATGTCTGTGACGGTGCAAATGCACGAGGGTAAACCAGTGACTGCTTCGGTACCAGTTTCTCT
GACATGCAAAGTGGTCGAAGCTGAGCCTTTTATGAAAGGCCAGACCCAAGCTGCAGCGTACAAGCACATCACGCTAGAAA
ATGGCAAAATAATCATGGCCCCTTGCTTCATCGTTAGTGGAGACCAGATAGTTGTCAACACAGTGGAAGACACTTACGTG
ACAAGTGGTCTTCATGTTCTGCTTCTACAGAGCTTCCAAAGAATAACAAACATGACTCTCTTCCAAAGAAGAGTAATCTC
GGGGAGTTTAAAGAGCATCAAACAAAAGAGCAAGATGGGGGATAGTGAGCAGCACAGGCATTGGGAATGGTCCAGGCTTC
CAGCTGAGCTCCGACACAGGATTCTTCATCTTCTTCCGCTCTCTTCGGCTCTCGCCGCCCGAGCGGTATGCAAGGATTGG
AAAGACTCGCTCTCGACATGGTCTGGACGGAAGTCGTGGTTTCTGGTGAAGGACGAGCAAGTCGAGCAAGAAGGCAGCTT
GAAGTGGTCAATGGTTGATGCGGAGACAGGCAGCTGGATTCGTCTCGGTGGCCTCTCGCGCTTTGCGACTTCCGTGGCTG
CCGGGGACGGCCTGGTGTGCCTGTGCGTGGGGAATTCCGCGAGACCCGAGTTTTACGTGGGGAATCCATTGTTTGAGAAG
TCATGGCAGAAGCTACCGGAGCTCCCTCCGCTCAAGACTTTGACACTGGCGAGGCTGGTGCTGAGTGCCTCGGCGCTTAC
ATGCGTCCTGGGCATTGTTTCCGAGGAGCCACCGGGTGTTACTGTCAGCGCCAGTGTCACTAGCAACAGCGGCAGTGGAT
CTTCAGGCTTCAAGGTCCTGGTTGGACGCTGCGGGTACATCGTTGTGTACGAGTCGCGAGCTGGAAAGTGGCGAGAGGTA
CAGGGAGACACGCCGGCGCTGGCTCTTCAGCTTCTCCAGGAATCGGACGAGCGCAGCATTGTGCCGGTGAGACGAGGATC
CCAGCTGCTGTGGTCGCCGCTCCGATCCAGCGGAGGAGCTCACCGGAGCAGCGGGTCTCACCACTACAGGCACCTCAAAG
TGTGGAGGCTGGACGAGGACGACGAGCTCGGCCAGGAGTGTAAGCCCGTGAACAGGATGAGAGACTACATGGTCCAGGAG
ATCTCGCCAAAGGGCTTCTTCGCGGCTCACTGGTGCTTGCGCGAAGCCAAGCTCATGCTGCTCACTTGCACGCTCGAGAA
GTTCTCGCCCCAGGTCCGCGACTGGGGATTCAAGCGATTGGTGCTGTGGTCGTTTGATCCTCACGCAGCAAAGTGGTCGT
CGAATCCTCTGAGAAGACCTCTTCGAGATGTGGATCTGAGCAGCAGCGTGGTCTACTCGTTCCAGCCGAGGCTGGATTTC
AGCCAGATGGCAGCGTGGTGTAGGAAATGCATCTACTTGAAGCCTCAATTGGAGAAGCTGGCTTTGGAATTCCAGCACGG
AGCAAAATTCTTTTGTGTGGACGTCAATGCCGTTCCACACGCATTGATTAAACGCGTTGGTGTTTCGAGAATGCCAACTA
TTCAGCTGTGGAAGAACAAGGAAAAGAGAGCAGAGATCATTGGAGGTCAAAAGGCCGAGCTGGTTGTGGAAGAAGTCCGG
CAAATGTTGCAGACCCTCGAAAAGTGA
Microexon DNA seq GGCGCCTTGTGCAG
Microexon Amino Acid seq GRLVQ
Microexon-tag DNA Seq ATCGATGGATTGGAGGTGAGAGTCGGGCAGATTTTGGAGCGCAAAGGGCGCCTTGTGCAGGTTATAAAGATTGCTCATGCTCATGGAGGCCGTGGTGGAGCTACAATT
Microexon-tag Amino Acid seq IDGLEVRVGQILERKGRLVQVIKIAHAHGGRGGATI
Transcript ID EFJ18223
Gene ID Sm.19388
Gene Name NA
Pfam domain motif EFP_N
Motif E-value 3e-09
Motif start 384
Motif end 438
Protein seq >EFJ18223
MSSDRRRSSYIWWYDKSLNFFRDRSAKQELTTRSSKTLTDLDCCSVCQEQFTLPCQANCTHWFCGECILRVWEYGSALLP
CKCPICRQPITHLWPSEFSTTEEATNFMPLLTDIATYNLNFDSRVITILRLKFPDTEVRCLHVLHFLFLLVLALYVVCTL
NLILEGKLAIRTVKKEFMELNAAHRWVAGFNVAERKSGLRVLCSTRSLPRLPIMRGRKLGREAILTVQSLKRAKGVREEM
EQVLKSQVSRLLKTDMLEVLRELHRQNDCVLAYEELQGEMLPDGLVYTELMTAYLRCERPNDAIDTFQEMKQLGLTVELK
TFTTLAKWLEKKGLGEEASARMQRSRRVAQQGFKALSGFRVFKEDSSSLWHWQQRRGLKIDGLEVRVGQILERKGRLVQV
IKIAHAHGGRGGATIPIEYKDLISGIKSTERLRTSESVERVAVDSKTFTYLYDEGQMAILMDPKTFDQVTVDKELFGTGA
SYLSDGMSVTVQMHEGKPVTASVPVSLTCKVVEAEPFMKGQTQAAAYKHITLENGKIIMAPCFIVSGDQIVVNTVEDTYV
TSGLHVLLLQSFQRITNMTLFQRRVISGSLKSIKQKSKMGDSEQHRHWEWSRLPAELRHRILHLLPLSSALAARAVCKDW
KDSLSTWSGRKSWFLVKDEQVEQEGSLKWSMVDAETGSWIRLGGLSRFATSVAAGDGLVCLCVGNSARPEFYVGNPLFEK
SWQKLPELPPLKTLTLARLVLSASALTCVLGIVSEEPPGVTVSASVTSNSGSGSSGFKVLVGRCGYIVVYESRAGKWREV
QGDTPALALQLLQESDERSIVPVRRGSQLLWSPLRSSGGAHRSSGSHHYRHLKVWRLDEDDELGQECKPVNRMRDYMVQE
ISPKGFFAAHWCLREAKLMLLTCTLEKFSPQVRDWGFKRLVLWSFDPHAAKWSSNPLRRPLRDVDLSSSVVYSFQPRLDF
SQMAAWCRKCIYLKPQLEKLALEFQHGAKFFCVDVNAVPHALIKRVGVSRMPTIQLWKNKEKRAEIIGGQKAELVVEEVR
QMLQTLEK*
CDS seq >EFJ18223
ATGAGCAGCGATCGTCGACGCTCTTCTTACATCTGGTGGTACGATAAGAGCTTGAATTTCTTCCGTGACCGCAGCGCCAA
GCAAGAACTGACGACTCGATCATCCAAAACTCTCACGGACTTGGATTGCTGCTCGGTTTGCCAGGAGCAATTCACTCTTC
CTTGTCAAGCGAATTGCACACACTGGTTCTGTGGAGAATGTATTCTCAGAGTGTGGGAGTATGGATCAGCGTTACTCCCT
TGTAAATGTCCCATTTGTCGACAACCAATCACTCACTTGTGGCCTAGCGAATTTTCTACGACTGAAGAGGCTACGAATTT
CATGCCACTGCTCACTGACATCGCGACTTACAATCTTAACTTCGATAGCCGAGTGATCACCATTCTGCGACTGAAATTCC
CCGACACAGAGGTTCGGTGTCTCCACGTTCTTCACTTCCTTTTTCTCCTGGTTCTCGCTCTTTACGTCGTTTGCACGCTC
AATCTCATACTCGAAGGTAAGCTGGCTATTCGGACAGTGAAGAAAGAGTTCATGGAACTTAATGCGGCACACAGATGGGT
CGCGGGTTTCAACGTTGCCGAGAGAAAATCCGGACTGCGTGTGCTGTGTTCTACCAGGAGTCTTCCCAGACTACCGATAA
TGAGGGGCCGGAAGCTGGGAAGAGAGGCAATTCTCACTGTACAGAGCTTGAAGAGAGCCAAAGGTGTCCGGGAGGAGATG
GAACAAGTGCTCAAGTCTCAGGTGTCTCGGCTTCTCAAGACTGACATGCTCGAAGTTCTGCGGGAGCTACACAGGCAAAA
TGACTGTGTTCTAGCTTACGAGGAGCTCCAGGGAGAAATGCTGCCGGATGGGCTGGTTTACACTGAGCTCATGACGGCTT
ACTTGAGATGCGAGAGACCAAACGATGCGATCGACACATTCCAGGAGATGAAGCAACTGGGGCTCACGGTGGAGCTCAAA
ACTTTCACCACTCTGGCCAAATGGTTGGAGAAAAAAGGCCTTGGCGAGGAAGCTAGCGCTAGAATGCAGCGCTCGAGGCG
CGTTGCGCAGCAGGGATTTAAGGCGCTGTCTGGATTTCGAGTCTTTAAGGAGGATTCTTCGTCGTTGTGGCATTGGCAGC
AGCGCCGCGGATTGAAGATCGATGGATTGGAGGTGAGAGTCGGGCAGATTTTGGAGCGCAAAGGGCGCCTTGTGCAGGTT
ATAAAGATTGCTCATGCTCATGGAGGCCGTGGTGGAGCTACAATTCCGATCGAGTACAAGGATCTGATCAGCGGGATCAA
GTCAACTGAGCGTTTGCGAACCTCAGAATCGGTGGAAAGAGTGGCTGTCGATTCGAAAACCTTTACATATCTGTACGACG
AGGGACAAATGGCGATTTTGATGGATCCAAAAACGTTCGACCAGGTTACAGTGGACAAAGAGCTCTTTGGAACTGGAGCT
TCATATCTTTCAGATGGAATGTCTGTGACGGTGCAAATGCACGAGGGTAAACCAGTGACTGCTTCGGTACCAGTTTCTCT
GACATGCAAAGTGGTCGAAGCTGAGCCTTTTATGAAAGGCCAGACCCAAGCTGCAGCGTACAAGCACATCACGCTAGAAA
ATGGCAAAATAATCATGGCCCCTTGCTTCATCGTTAGTGGAGACCAGATAGTTGTCAACACAGTGGAAGACACTTACGTG
ACAAGTGGTCTTCATGTTCTGCTTCTACAGAGCTTCCAAAGAATAACAAACATGACTCTCTTCCAAAGAAGAGTAATCTC
GGGGAGTTTAAAGAGCATCAAACAAAAGAGCAAGATGGGGGATAGTGAGCAGCACAGGCATTGGGAATGGTCCAGGCTTC
CAGCTGAGCTCCGACACAGGATTCTTCATCTTCTTCCGCTCTCTTCGGCTCTCGCCGCCCGAGCGGTATGCAAGGATTGG
AAAGACTCGCTCTCGACATGGTCTGGACGGAAGTCGTGGTTTCTGGTGAAGGACGAGCAAGTCGAGCAAGAAGGCAGCTT
GAAGTGGTCAATGGTTGATGCGGAGACAGGCAGCTGGATTCGTCTCGGTGGCCTCTCGCGCTTTGCGACTTCCGTGGCTG
CCGGGGACGGCCTGGTGTGCCTGTGCGTGGGGAATTCCGCGAGACCCGAGTTTTACGTGGGGAATCCATTGTTTGAGAAG
TCATGGCAGAAGCTACCGGAGCTCCCTCCGCTCAAGACTTTGACACTGGCGAGGCTGGTGCTGAGTGCCTCGGCGCTTAC
ATGCGTCCTGGGCATTGTTTCCGAGGAGCCACCGGGTGTTACTGTCAGCGCCAGTGTCACTAGCAACAGCGGCAGTGGAT
CTTCAGGCTTCAAGGTCCTGGTTGGACGCTGCGGGTACATCGTTGTGTACGAGTCGCGAGCTGGAAAGTGGCGAGAGGTA
CAGGGAGACACGCCGGCGCTGGCTCTTCAGCTTCTCCAGGAATCGGACGAGCGCAGCATTGTGCCGGTGAGACGAGGATC
CCAGCTGCTGTGGTCGCCGCTCCGATCCAGCGGAGGAGCTCACCGGAGCAGCGGGTCTCACCACTACAGGCACCTCAAAG
TGTGGAGGCTGGACGAGGACGACGAGCTCGGCCAGGAGTGTAAGCCCGTGAACAGGATGAGAGACTACATGGTCCAGGAG
ATCTCGCCAAAGGGCTTCTTCGCGGCTCACTGGTGCTTGCGCGAAGCCAAGCTCATGCTGCTCACTTGCACGCTCGAGAA
GTTCTCGCCCCAGGTCCGCGACTGGGGATTCAAGCGATTGGTGCTGTGGTCGTTTGATCCTCACGCAGCAAAGTGGTCGT
CGAATCCTCTGAGAAGACCTCTTCGAGATGTGGATCTGAGCAGCAGCGTGGTCTACTCGTTCCAGCCGAGGCTGGATTTC
AGCCAGATGGCAGCGTGGTGTAGGAAATGCATCTACTTGAAGCCTCAATTGGAGAAGCTGGCTTTGGAATTCCAGCACGG
AGCAAAATTCTTTTGTGTGGACGTCAATGCCGTTCCACACGCATTGATTAAACGCGTTGGTGTTTCGAGAATGCCAACTA
TTCAGCTGTGGAAGAACAAGGAAAAGAGAGCAGAGATCATTGGAGGTCAAAAGGCCGAGCTGGTTGTGGAAGAAGTCCGG
CAAATGTTGCAGACCCTCGAAAAGTGA