Microexon ID Sm_GL377573:2040600-2040614:+
Species Selaginella moellendorffii
Coordinates GL377573:2040600..2040614
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ACAGCGTGCATTCAG
Microexon Amino Acid seq TACIQ
Microexon-tag DNA Seq GCTAATGAAGCGACTACCAAGGTTTTCTCGAGAGCCTTACTGGCAAAGACAGCGTGCATTCAGACAGTTGTGTGCATTCCCCTGGCCGAGGGTGTTTTGGAGCTTGGC
Microexon-tag Amino Acid Seq ANEATTKVFSRALLAKTACIQTVVCIPLAEGVLELG
Microexon-tag spanning region2040473-2040721
Microexon-tag prediction score0.9131
Overlapped with the annotated transcript (%) 86.11
New Transcript ID EFJ31612x
Reference Transcript ID EFJ31612
Gene ID SELMODRAFT_87033
Gene Name NA
Sm_GL377573:2040600-2040614:+ does not have available information here.
Microexon DNA seq ACAGCGTGCATTCAG
Microexon Amino Acid seq TACIQ
Microexon-tag DNA Seq GCTAATGAAGCGACTACCAAGGTTTTCTCGAGAGCCTTACTGGCAAAGACAGCGTGCATTCAGACAGTTGTGTGCATTCCCCTGGCCGAGGGTGTTTTGGAGCTTGGC
Microexon-tag Amino Acid seq ANEATTKVFSRALLAKTACIQTVVCIPLAEGVLELG
Transcript ID Sm.6724.2
Gene ID Sm.6724
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 5.4e-52
Motif start 16
Motif end 204
Protein seq >Sm.6724.2
MAVKRKEMGEKSRITLRQRLQAAVQSIQWTYAVFWKPCPPPQGELVWSDGYYNGSVKTRKTIIVSRERSPEEHGLQRSDQ
LRELFENLSASGDGSQSSTATRRPTAALSPEDLTDTEWFYLVCMSCTFDPGTGIPGQAFAKGRPVWLCKANEATTKVFSR
ALLAKTACIQTVVCIPLAEGVLELGSTELVREDTSIVQHVVSFFVELAKPTVSEPSISSSQSGENGHQDLRPSSQDGPSS
EQPRQRQKMERCSAFQSWKEFQRQASISSSQVKSQCQAMLKNVLFRVPHMYTAAEKSEAAKMNDSGGDSQSKSVSRKEDD
VNTAHAMLERRRREKLNDRFLMLRNMVPFVTKMDKVSILGDAIEYLRQLQKQVADLEQRNKVLEARLRGQDVPVEAEKAQ
SSPPQDHPQEKPAPQDMAVDTEPEDSFPMSTTYKLGPDSSSYKAEIQMQDDFTALEIECSFRQGILLDILAALDKLNLDV
STVEARTPDQRTFCASLKAEAKDLPRASEQDISEALQRVTRPLSS*
CDS seq >Sm.6724.2
ATGGCTGTTAAAAGGAAGGAAATGGGAGAGAAGTCGAGGATAACACTGAGACAAAGGCTACAAGCGGCAGTACAGAGTAT
ACAATGGACTTATGCTGTTTTCTGGAAGCCATGTCCACCTCCACAAGGGGAGCTTGTTTGGTCGGATGGATACTACAACG
GGAGTGTGAAAACGAGGAAGACGATCATAGTTTCCAGAGAACGGAGCCCGGAAGAACACGGCCTACAGAGAAGTGACCAG
CTGCGAGAACTGTTTGAGAACTTGTCAGCGTCCGGAGATGGTAGCCAGTCCTCGACAGCCACCCGGCGCCCGACTGCTGC
CCTCTCGCCCGAAGACTTGACGGACACCGAGTGGTTCTACTTGGTTTGCATGTCGTGTACGTTTGATCCAGGCACCGGAA
TACCTGGACAAGCTTTCGCGAAAGGACGTCCTGTCTGGCTTTGCAAAGCTAATGAAGCGACTACCAAGGTTTTCTCGAGA
GCCTTACTGGCAAAGACAGCGTGCATTCAGACAGTTGTGTGCATTCCCCTGGCCGAGGGTGTTTTGGAGCTTGGCTCGAC
TGAGCTTGTCCGAGAGGATACGTCCATTGTACAACACGTTGTCAGCTTCTTCGTGGAGCTTGCAAAGCCAACTGTTTCGG
AGCCGTCCATCTCGAGCTCTCAAAGCGGTGAGAATGGACACCAAGACCTTCGCCCCTCCTCACAGGATGGTCCTTCCAGC
GAGCAGCCTAGGCAACGACAGAAAATGGAGAGATGCAGTGCTTTCCAGAGCTGGAAAGAGTTTCAAAGACAGGCTTCAAT
ATCGTCGAGCCAAGTGAAAAGCCAATGCCAGGCAATGCTTAAAAACGTACTCTTCCGGGTTCCTCACATGTACACAGCAG
CTGAAAAGTCGGAGGCTGCCAAAATGAACGACAGTGGAGGAGATAGTCAGTCCAAATCAGTGTCAAGAAAGGAGGACGAT
GTCAACACCGCTCACGCGATGCTGGAGCGCAGGAGAAGGGAGAAGCTCAACGACAGATTTTTGATGCTGCGAAACATGGT
TCCTTTTGTCACCAAGATGGACAAGGTGTCGATCTTGGGAGACGCGATTGAGTACCTGAGGCAGCTGCAAAAGCAAGTCG
CCGATCTCGAGCAACGGAACAAAGTTCTCGAGGCAAGGCTCAGAGGACAGGACGTACCTGTTGAAGCCGAGAAAGCTCAA
AGCAGTCCTCCACAGGATCATCCGCAAGAGAAGCCAGCGCCGCAGGACATGGCAGTGGACACGGAGCCAGAGGATAGCTT
TCCAATGTCGACAACTTACAAGCTGGGACCGGACAGCTCCAGCTACAAAGCCGAGATCCAGATGCAAGACGACTTCACAG
CTCTCGAAATCGAGTGCTCCTTCCGGCAAGGAATCCTTCTGGACATCCTCGCGGCCCTGGACAAGCTCAACCTCGACGTC
TCCACGGTCGAAGCACGGACACCCGACCAACGAACGTTTTGTGCCAGCCTCAAGGCAGAGGCGAAAGATTTGCCGAGAGC
AAGCGAGCAAGATATAAGTGAAGCTCTCCAGAGAGTAACGCGGCCTCTGTCATCCTAG