Microexon ID Sm_GL377565:3470997-3471011:-
Species Selaginella moellendorffii
Coordinates GL377565:3470997..3471011
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ACTGCGTGCATTCAG
Microexon Amino Acid seq TACIQ
Microexon-tag DNA Seq GCTAATGAAGCGACTACCAAGGTTTTCTCGAGAGCGTTACTGGCAAAGACTGCGTGCATTCAGACAGTTGTGTGCATTCCCATGGCCGAGGGTGTTTTGGAGCTTGGC
Microexon-tag Amino Acid Seq ANEATTKVFSRALLAKTACIQTVVCIPMAEGVLELG
Microexon-tag spanning region3470890-3471138
Microexon-tag prediction score0.919
Overlapped with the annotated transcript (%) 86.11
New Transcript ID EFJ38534x
Reference Transcript ID EFJ38534
Gene ID SELMODRAFT_73360
Gene Name NA
Sm_GL377565:3470997-3471011:- does not have available information here.
Microexon DNA seq ACTGCGTGCATTCAG
Microexon Amino Acid seq TACIQ
Microexon-tag DNA Seq GCTAATGAAGCGACTACCAAGGTTTTCTCGAGAGCGTTACTGGCAAAGACTGCGTGCATTCAGACAGTTGTGTGCATTCCCATGGCCGAGGGTGTTTTGGAGCTTGGC
Microexon-tag Amino Acid seq ANEATTKVFSRALLAKTACIQTVVCIPMAEGVLELG
Transcript ID Sm.557.1
Gene ID Sm.557
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 1.3e-40
Motif start 41
Motif end 209
Protein seq >Sm.557.1
MDLCCFLEAMSTSTRVKKLFVLMARLSLSFLIPFVDHEPHWESLHDHRELVWSDGYYNGSVKTRKTIIVSRERSPEEHGL
QRSDQLRELFENLSASGDGSQSSTATRRPTAALSPEDLTDTEWFYLVCMSCTFDPGTGIPGQAFSKGRPVWLCKANEATT
KVFSRALLAKTACIQTVVCIPMAEGVLELGSTELVREDTSIVQHVVSFFVELAKPTVSEPSISSSQSGENGHQDLRPSSQ
DGPSSEQPRQRQKMERCSAFQSWKEFQRQASISSSQVKSQCQAMLKNVLFRVPHMYTAAEKSEAAKMNDSGGDSQSKSVS
RKEDDVNTAHAMLERRRREKLNDRFLMLRNMVPFVTKMDKVSILGDAIEYLRQLQRQVADLEQRNKVLEARLRGQDVPVE
AEKAQSSPPQDHPQEKPAPQDMAVDMEPEDSFPMSTTYKLGPDSSSYKAEIQMQDDFTALEIECSFRQGILLDILAALDK
LNLDVSTVEARTPDQRTFCASLKAEAKDLPRASEQDISEALQRVTRPLSS*
CDS seq >Sm.557.1
ATGGACTTATGCTGTTTTCTGGAAGCCATGTCCACCTCCACAAGGGTAAAGAAACTCTTTGTCCTTATGGCTAGACTATC
ACTCTCTTTTCTTATACCTTTTGTGGATCACGAACCTCATTGGGAAAGTTTGCATGATCACAGGGAGCTTGTTTGGTCGG
ATGGATACTACAACGGGAGTGTGAAAACGAGGAAGACGATCATAGTTTCCAGAGAACGGAGCCCGGAAGAACACGGCCTA
CAGAGAAGTGACCAGCTGCGAGAACTGTTTGAGAACTTGTCAGCGTCCGGAGATGGTAGCCAGTCCTCGACAGCCACCCG
GCGCCCGACTGCTGCGCTCTCGCCCGAAGACTTGACGGACACCGAGTGGTTCTACTTGGTTTGCATGTCGTGTACGTTTG
ATCCAGGCACCGGAATACCTGGACAAGCTTTCTCGAAAGGACGTCCTGTCTGGCTTTGCAAAGCTAATGAAGCGACTACC
AAGGTTTTCTCGAGAGCGTTACTGGCAAAGACTGCGTGCATTCAGACAGTTGTGTGCATTCCCATGGCCGAGGGTGTTTT
GGAGCTTGGCTCGACTGAGCTTGTCCGAGAGGATACGTCCATTGTACAACACGTTGTCAGCTTCTTTGTTGAGCTTGCAA
AGCCAACTGTTTCGGAGCCGTCCATCTCGAGCTCTCAAAGCGGTGAGAATGGACACCAAGACCTTCGCCCCTCCTCACAG
GATGGTCCTTCCAGCGAGCAGCCTAGGCAACGACAGAAAATGGAGAGATGCAGTGCTTTCCAGAGCTGGAAAGAGTTTCA
AAGACAGGCTTCAATATCATCGAGCCAAGTGAAAAGCCAATGCCAGGCAATGCTTAAAAACGTACTCTTCCGGGTTCCTC
ACATGTACACAGCAGCTGAAAAGTCGGAGGCTGCCAAAATGAACGACAGTGGAGGAGATAGTCAGTCGAAATCAGTGTCA
AGAAAGGAGGACGATGTCAACACCGCTCACGCGATGCTGGAGCGCAGGAGAAGGGAGAAGCTCAACGACAGATTTTTGAT
GCTGCGAAACATGGTTCCTTTTGTCACCAAGATGGACAAGGTGTCGATCTTGGGAGACGCGATTGAGTACCTGAGGCAGC
TGCAAAGGCAAGTCGCTGATCTCGAGCAACGGAACAAAGTTCTCGAGGCAAGGCTCAGAGGACAGGACGTACCTGTTGAA
GCCGAGAAAGCTCAAAGCAGTCCTCCACAGGATCATCCGCAAGAGAAGCCAGCGCCGCAGGACATGGCAGTGGACATGGA
GCCAGAGGATAGCTTTCCAATGTCCACAACTTACAAGCTGGGACCGGACAGCTCCAGCTACAAAGCCGAGATCCAGATGC
AAGACGACTTCACAGCTCTCGAAATCGAGTGCTCCTTCCGGCAAGGAATCCTTCTGGACATCCTCGCGGCCCTGGACAAG
CTCAACCTCGACGTCTCCACGGTCGAAGCACGGACACCCGACCAACGAACGTTTTGTGCCAGCCTCAAGGCAGAGGCGAA
AGATTTGCCGAGAGCAAGCGAGCAAGATATAAGCGAAGCTCTCCAGAGAGTAACGCGGCCTCTGTCATCCTAG