Microexon ID Sm_GL377575:52995-52999:-
Species Selaginella moellendorffii
Coordinates GL377575:52995..52999
Microexon Cluster ID MEP07
Size 5
Phase 1
Pfam Domain Motif Peptidase_M1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 33,19,5,12,39
Microexon location in the Microexon-tag 3
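Note: the structure and location fields above can be checked against the tag sequence. The following is a minimal sketch, not part of the original record; it assumes the five sizes are listed in 5'-to-3' order along the tag and that the location is a 1-based index into that list. The helper name split_tag is hypothetical. The sequence is the Microexon-tag DNA Seq given for this entry.

```python
# Illustrative sketch: split a microexon-tag into its exon segments.
# Assumption: sizes are in 5'->3' order and "location" is a 1-based index.

def split_tag(tag_seq, sizes):
    """Cut tag_seq into consecutive segments with the given lengths."""
    if sum(sizes) != len(tag_seq):
        raise ValueError("exon sizes do not add up to the tag length")
    segments = []
    pos = 0
    for size in sizes:
        segments.append(tag_seq[pos:pos + size])
        pos += size
    return segments

sizes = [33, 19, 5, 12, 39]   # "Structure of Microexon-tag" field
location = 3                  # "Microexon location in the Microexon-tag" field
tag = (
    "GCGCACCCTGTCCGTCCGCATTCTTACATTAAGATGGACAACTTCTATACGGTGACGGTG"
    "TATAAAAAGGGTGCGGAAGTGGTTCGCATGTACCAAACATTGCTTGGG"
)

segments = split_tag(tag, sizes)
microexon = segments[location - 1]
print(microexon)  # TGACG
```

Under these assumptions the third segment is the 5-nt microexon, matching the Size and Microexon DNA seq fields.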
Microexon-tag DNA Seq GCWCAYCCTGTTCGRCCWCAYTCTTAYATYAAGATGGACAACTTCTAYACAGTRACGGTKTATGARAAGGGWGCTGAAGTTGTCMGRATGTACAARACMTTRYTKGGR
Logo of Microexon-tag DNA Seq [image: NT60 Logo]
Alignment of exons [link: MSA]
Microexon DNA seq TGACG
Microexon Amino Acid seq VT
Microexon-tag DNA Seq GCGCACCCTGTCCGTCCGCATTCTTACATTAAGATGGACAACTTCTATACGGTGACGGTGTATAAAAAGGGTGCGGAAGTGGTTCGCATGTACCAAACATTGCTTGGG
Microexon-tag Amino Acid Seq AHPVRPHSYIKMDNFYTVTVYKKGAEVVRMYQTLLG
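Note: the relationship between the tag DNA, the tag peptide, and the two microexon residues can be reproduced with a short translation. This is a sketch under stated assumptions, not the database's own pipeline: it assumes the standard genetic code, that the tag is translated from its first nucleotide, and that the microexon occupies nucleotides 53 to 57 of the tag (33 + 19 nt of upstream exons, per the structure field). The names CODON_TABLE and translate are illustrative.

```python
# Illustrative sketch: translate the microexon-tag and locate the
# residues whose codons overlap the microexon (standard genetic code).
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate dna in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

tag = (
    "GCGCACCCTGTCCGTCCGCATTCTTACATTAAGATGGACAACTTCTATACGGTGACGGTG"
    "TATAAAAAGGGTGCGGAAGTGGTTCGCATGTACCAAACATTGCTTGGG"
)
peptide = translate(tag)

# Microexon span within the tag (0-based, end-exclusive), assuming the
# upstream exons are 33 and 19 nt long and the microexon is 5 nt.
start, end = 33 + 19, 33 + 19 + 5
first_codon = start // 3
last_codon = (end - 1) // 3
microexon_residues = peptide[first_codon:last_codon + 1]
upstream_bases_in_codon = start % 3   # bases of the split codon before the microexon

print(peptide)
print(microexon_residues, upstream_bases_in_codon)
```

With these inputs the translation reproduces the tag peptide listed above, and the codons overlapping the microexon give VT. The offset of 1 base appears consistent with the Phase 1 field, assuming phase counts the bases of the split codon that lie upstream of the microexon.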
Microexon-tag spanning region 52887-53167
Microexon-tag prediction score 0.9601
Overlapped with the annotated transcript (%) 100
New Transcript ID EFJ30603x
Reference Transcript ID EFJ30603
Gene ID SELMODRAFT_440241
Gene Name NA
Transcript ID EFJ30603
Protein ID EFJ30603
Gene ID SELMODRAFT_440241
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 1.1e-48
Motif start 331
Motif end 543
Protein seq >EFJ30603
MKVFKNCIGLPFTDVDFELSVESIENHQARCFTRYCKSSSAFALSSKFKVDVSSIVYGSHHVGTWLRARKLIEIAGSSQF
STASVSPVAEEMAVKEAPKEIFLKDYKPTGYHFDTVELKFVLSEKKTTVVSKIRVLPRNSSESPPLVLDGRDVKLLFVKI
NGEERKLEEVELTSRHLTLKSLPVQPFDLEIGSEIHPETNLSLEGLYKSSSGMFCTQCEAEGFRKITFYQDRPDVMAKFT
TRIEADKAQCPVLLSNGNLIDSGDLESLTCCGQNGFHYAVWEDPFKKPCYLFALVAGQLTSRDDTFKTRSGRDVQLRIWT
PAQDVPKTEHAMHALKLSMKWDEDVYGLEYDLDLFNIVAVPDFDTGAMENKSLNIFNSNLVLASAETATDVDYATILRVI
GHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGSRPVKRIGDVALLRTAQFSEDAGPMAHPVRPHSYIKMDNF
YTVTVYKKGAEVVRMYQTLLGKDGFRKGMDLYFERHDGQAVTCEDFFAAMRDANSANLSVFLRWYSQAGTPVLTVSTAYD
AAARSYTVKCKQEIPSTPGQPVKEPTLIPLAVGLLNSKGEDMVLTELLDGGSLKSLKDTAGGPAKTAVVVVDKEEQEFTF
LNLPEKPVPSFLRNFSAPVRLVTPTVTDDDLLFLLAHDSDEFNRWEAGQTMGRKLMLDLIPKAQRNEQLAEFVAKALTLP
GESELMDLMEVADPDAVHTVRKFVIRELASKLKQDFLKTVTENRSSDPHYTFDHKNKARRSLKNLALDYLCSLDEADTTE
LALNEYKSATNMTDQFAALSSLCQNPGDVRDEALANFYEQWKDETLVVNKWLVLQSMSDIPGNVKNMRRLLDHPSFDMKN
PNKVYSLIGPFCRTAVNFHAKDGSGYEFLAEVTLQLDKMNPQMASRMVSSLSRWKRFDEGRQSLAKAQLERIAKTDGLSE
NVFEIASKSLAAYTSYLLGKNHTFSGRASICHKIVYSFTTQLLKSHLNSSTFYKPLVPRQRRNHHRFV*
CDS seq >EFJ30603
ATGAAAGTCTTCAAAAACTGCATTGGACTTCCATTCACTGACGTTGACTTTGAGCTCTCAGTGGAAAGCATAGAAAATCA
TCAGGCAAGGTGCTTCACTCGGTACTGTAAATCCAGTTCCGCGTTTGCCTTGTCTTCGAAATTCAAGGTAGATGTTTCAA
GCATTGTTTATGGTTCTCATCATGTAGGAACTTGGTTGCGTGCGCGGAAGCTGATTGAAATTGCCGGGTCTAGCCAATTC
TCCACAGCGAGTGTTTCTCCTGTAGCTGAAGAGATGGCTGTCAAGGAGGCTCCCAAGGAGATTTTTTTGAAGGATTACAA
GCCCACTGGCTATCACTTTGACACGGTCGAGCTGAAATTTGTTCTGTCTGAGAAAAAGACAACCGTGGTGTCTAAGATCC
GGGTTTTGCCGAGAAACTCAAGTGAAAGCCCGCCATTGGTTTTGGATGGTCGCGACGTGAAACTTCTCTTTGTTAAAATC
AACGGCGAAGAACGGAAGCTGGAAGAAGTTGAGCTGACCAGCCGTCATCTGACGTTGAAGTCACTTCCTGTTCAGCCTTT
TGATTTAGAGATTGGCTCTGAGATACACCCGGAGACTAACTTGTCTCTCGAAGGGCTCTACAAGTCTTCCAGCGGCATGT
TTTGTACACAATGCGAAGCTGAAGGTTTTAGAAAAATAACCTTTTATCAGGATCGGCCAGACGTTATGGCAAAGTTCACT
ACCAGGATAGAAGCGGACAAAGCACAGTGTCCCGTTCTTTTGTCGAATGGCAACCTTATAGACTCCGGCGACCTCGAGAG
CCTGACGTGCTGTGGTCAGAATGGATTTCATTATGCTGTGTGGGAGGATCCTTTCAAAAAGCCTTGCTACCTCTTTGCCT
TGGTAGCGGGGCAACTCACGTCCAGAGACGATACTTTTAAAACGCGTTCAGGAAGGGATGTTCAACTGAGGATTTGGACT
CCTGCACAAGACGTTCCAAAGACAGAGCATGCTATGCATGCTCTCAAGCTTTCAATGAAGTGGGATGAGGATGTGTATGG
GTTGGAGTATGATCTTGACCTGTTTAACATTGTCGCTGTTCCCGATTTCGACACGGGTGCTATGGAGAACAAGAGCTTGA
ATATATTCAACTCGAACCTTGTTTTGGCGTCTGCTGAAACTGCTACAGATGTTGACTACGCGACAATTTTGCGTGTGATC
GGTCATGAGTACTTCCACAATTGGACTGGGAATAGGGTGACATGCCGTGATTGGTTTCAATTGAGTCTGAAGGAGGGTCT
TACAGTTTTCAGGGATCAGGAGTTTTCTTCTGACATGGGCTCTCGACCTGTCAAACGTATCGGTGATGTCGCTCTTCTCA
GAACAGCACAATTCTCAGAGGACGCTGGTCCCATGGCGCACCCTGTCCGTCCGCATTCTTACATTAAGATGGACAACTTC
TATACGGTGACGGTGTATAAAAAGGGTGCGGAAGTGGTTCGCATGTACCAAACATTGCTTGGGAAAGATGGGTTTCGGAA
GGGCATGGACTTGTACTTTGAACGTCACGACGGACAAGCTGTAACTTGCGAGGATTTCTTTGCTGCTATGCGAGACGCTA
ATTCCGCCAATCTTTCAGTTTTCTTGAGATGGTATTCTCAAGCTGGAACACCCGTTCTTACTGTTTCGACAGCGTATGAT
GCTGCAGCTCGTAGCTATACTGTAAAGTGCAAGCAAGAAATTCCTTCTACACCTGGGCAACCTGTGAAGGAGCCGACGTT
AATACCATTGGCTGTGGGTCTGCTAAATTCAAAAGGCGAGGATATGGTGTTGACCGAGTTGCTTGATGGAGGATCACTCA
AGTCATTGAAAGACACAGCGGGGGGACCTGCAAAGACAGCAGTAGTTGTTGTTGATAAGGAAGAGCAAGAGTTCACTTTT
CTCAATTTGCCTGAGAAACCTGTGCCATCTTTCCTAAGAAACTTTAGTGCTCCCGTGCGCCTTGTGACGCCGACTGTGAC
GGATGACGATTTGCTCTTTTTGCTCGCGCATGACTCCGACGAGTTTAACAGATGGGAGGCTGGACAGACGATGGGAAGAA
AACTTATGCTGGATCTTATTCCGAAAGCACAAAGAAACGAACAGCTCGCGGAATTTGTGGCAAAGGCACTAACACTTCCT
GGAGAAAGCGAACTGATGGACTTGATGGAGGTTGCAGATCCAGATGCAGTACATACTGTCCGAAAATTCGTGATAAGAGA
GCTTGCGTCGAAACTGAAGCAAGATTTCCTGAAAACTGTGACTGAAAACCGAAGTTCTGATCCGCACTACACATTCGATC
ACAAAAACAAAGCCAGGAGATCCTTAAAAAACCTCGCTCTCGACTACCTCTGCTCTCTGGACGAGGCGGACACTACTGAG
CTTGCTTTGAACGAATACAAAAGCGCAACAAATATGACAGATCAATTTGCAGCCCTCTCTTCACTTTGCCAGAATCCTGG
GGACGTGCGTGATGAAGCTTTGGCAAACTTCTACGAGCAGTGGAAGGACGAAACTTTGGTTGTGAACAAATGGCTGGTGT
TACAATCTATGTCTGATATCCCTGGAAATGTTAAGAACATGAGGCGATTGCTAGATCATCCGTCCTTCGATATGAAGAAC
CCAAACAAGGTGTATTCTCTGATCGGTCCTTTCTGTCGGACCGCTGTCAACTTTCACGCTAAAGATGGCTCTGGCTACGA
GTTTTTGGCAGAGGTGACTTTGCAGTTGGACAAAATGAATCCTCAGATGGCATCGCGAATGGTATCGTCATTGTCGAGGT
GGAAACGCTTTGATGAAGGCAGACAATCGCTGGCTAAGGCTCAACTGGAACGCATTGCTAAAACCGATGGCCTGTCAGAG
AATGTGTTTGAGATCGCCTCCAAGAGCCTGGCAGCATATACAAGCTATTTGCTTGGGAAAAATCACACCTTTTCAGGTCG
AGCAAGCATTTGCCATAAGATTGTATACAGCTTCACCACCCAGCTCCTCAAAAGCCACCTCAATTCTTCCACTTTCTACA
AGCCGCTGGTGCCACGCCAACGGAGAAATCACCACCGCTTCGTCTAG
Microexon DNA seq TGACG
Microexon Amino Acid seq VT
Microexon-tag DNA Seq GCGCACCCTGTCCGTCCGCATTCTTACATTAAGATGGACAACTTCTATACGGTGACGGTGTATAAAAAGGGTGCGGAAGTGGTTCGCATGTACCAAACATTGCTTGGG
Microexon-tag Amino Acid seq AHPVRPHSYIKMDNFYTVTVYKKGAEVVRMYQTLLG
Transcript ID Sm.7561.19
Gene ID Sm.7561
Gene Name NA
Pfam domain motif Peptidase_M1
Motif E-value 2.9e-49
Motif start 112
Motif end 324
Protein seq >Sm.7561.19
MFCTQCEAEGFRKITFYQDRPDVMAKFTTRIEADKAQYPVLLSNGNLIDSGDLENGFHYAVWEDPFKKPCYLFALVAGQL
TSRDDTFKTRSGRDVQLRIWTPAQDVPKTEHAMHALKLSMKWDEDVYGLEYDLDLFNIVAVPDFDTGAMENKSLNIFNSN
LVLASAETATDVDYATILRVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDMGSRPVKRIGDVALLRTAQFS
EDAGPMAHPVRPHSYIKMDNFYTVTVYKKGAEVVRMYQTLLGKDGFRKGMDLYFERHDGQAVTCEDFFAAMRDANSANLS
VFLRWYSQAGTPVLTVSTAYDAAARSYTVKCKQEIPSTPGQPVKEPTLIPLAVGLLNSKGEDMVLTELLDGGSLKSLKDT
AGGPAKTAVVVVDKEEQEFTFLNLPEKPVPSFLRNFSAPVRLVTPTVTDDDLLFLLAHDSDEFNRWEAGQTMGRKLMLDL
IPKAQRNEQLAVPSAFVEGMRSILNDSSLDKACFSHASRL*
CDS seq >Sm.7561.19
ATGTTTTGTACACAATGCGAAGCTGAAGGTTTTAGAAAAATAACCTTTTATCAGGATCGGCCAGACGTTATGGCAAAGTT
CACTACCAGGATAGAAGCGGACAAAGCACAGTATCCCGTTCTTTTGTCGAATGGCAACCTTATAGACTCCGGAGACCTCG
AGAATGGATTTCATTATGCTGTGTGGGAGGATCCTTTCAAAAAGCCTTGCTACCTCTTTGCCTTGGTAGCGGGGCAACTC
ACGTCCAGAGACGATACTTTTAAAACGCGTTCAGGAAGGGATGTTCAACTGAGGATTTGGACTCCTGCACAAGACGTTCC
AAAGACAGAGCATGCTATGCATGCTCTCAAGCTTTCAATGAAGTGGGATGAGGATGTGTATGGGTTGGAGTATGATCTTG
ACCTGTTTAACATTGTCGCTGTTCCCGATTTCGACACGGGTGCTATGGAGAACAAGAGCTTGAATATATTCAACTCGAAC
CTTGTTTTGGCGTCTGCTGAAACTGCTACAGATGTTGACTACGCGACAATTTTGCGTGTGATCGGTCATGAGTACTTCCA
CAATTGGACTGGGAATAGGGTGACATGCCGTGATTGGTTTCAATTGAGTCTGAAGGAGGGTCTTACAGTTTTCAGGGATC
AGGAGTTTTCTTCTGACATGGGCTCTCGACCTGTCAAACGTATCGGTGATGTCGCTCTTCTCAGAACAGCACAATTCTCA
GAGGACGCTGGTCCCATGGCGCACCCTGTCCGTCCGCATTCTTACATTAAGATGGACAACTTCTATACGGTGACGGTGTA
TAAAAAGGGTGCGGAAGTGGTTCGCATGTACCAAACATTGCTTGGGAAAGATGGGTTTCGGAAGGGCATGGACTTGTACT
TTGAACGTCACGACGGACAAGCTGTAACTTGCGAGGATTTCTTTGCTGCTATGCGAGACGCTAATTCCGCCAATCTTTCA
GTTTTCTTGAGATGGTATTCTCAAGCTGGAACACCCGTTCTTACTGTTTCGACAGCGTATGATGCTGCAGCTCGTAGCTA
TACTGTAAAGTGCAAGCAAGAAATTCCTTCTACACCTGGGCAACCTGTGAAGGAGCCGACGTTAATACCATTGGCTGTGG
GTCTGCTAAATTCAAAAGGCGAGGATATGGTGTTGACCGAGTTGCTTGATGGAGGATCACTCAAGTCATTGAAAGACACA
GCGGGGGGACCTGCAAAGACAGCAGTAGTTGTTGTTGATAAGGAAGAGCAAGAGTTCACTTTTCTCAATTTGCCTGAGAA
ACCTGTGCCATCTTTCCTAAGAAACTTTAGTGCTCCCGTGCGCCTTGTGACGCCGACTGTGACGGATGACGATTTGCTCT
TTTTGCTCGCGCATGACTCCGACGAGTTTAACAGATGGGAGGCTGGACAGACGATGGGAAGAAAACTTATGCTGGATCTT
ATTCCGAAAGCACAAAGAAACGAACAGCTCGCGGTTCCATCCGCATTTGTGGAAGGCATGCGCAGTATTCTCAACGATTC
CTCGTTAGATAAGGCATGTTTTTCTCATGCTTCTAGGCTCTAG