Microexon ID Gm_17:4815209-4815217:-
Species Glycine max
Coordinates 17:4815209..4815217
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAATAGTTGCAGAAGGGAAGGTCAAAGCAGGAAAGGAAGGCAAGTTTACTTGGGTGGTTATGACAAGGAGGATAAGGCAGCCAGAGCTTATGATCTCGCAGCT
Microexon-tag Amino Acid Seq WDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAA
Microexon-tag spanning region4814945-4815820
Microexon-tag prediction score0.9812
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH02857x
Reference Transcript ID KRH02857
Gene ID GLYMA_17G062600
Gene Name NA
Transcript ID KRH02857
Protein ID KRH02857
Gene ID GLYMA_17G062600
Gene Name NA
Pfam domain motif AP2
Motif E-value 2.6e-12
Motif start 151
Motif end 209
Protein seq >KRH02857
MDSSSSSPPNSTNNNSLAFSLSNHFPNPSSSPLSLFHSFTYPSLSLTGSNTVDAPPEPTAGAGPTNLSIFTGGPKFEDFL
GGSAATATTVACAPPQLPQFSTDNNNHLYDSELKSTIAACFPRALAAEQSTEPQKPSPKKTVDTFGQRTSIYRGVTRHRW
TGRYEAHLWDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAALKYWGPTTTTNFPISNYEKELEEMKNMTRQEFVASL
RRKSSGFSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDMSRYDVKSI
ANSTLPIGGLSGKNKNSTDSASESKSHEPSQSDGDPSSASSVTFASQQQPSSSNLSFAIPIKQDPSDYWSILGYHNTPLD
NSGIRNTTSTVTTTTFPSSNNGTASSLTPFNMEFSSAPSSTGSDNNAAFFSGGGIFVQQQTSHGHGNASSGSSSSSLSCS
IPFATPIFSLNSNTSYESSAGYGNWIGPTLHTFQSHAKPSLFQTPIFGME*
CDS seq >KRH02857
ATGGACTCTTCTTCTTCATCACCGCCAAACAGCACCAACAACAACTCCCTCGCTTTCTCTCTTTCCAATCACTTTCCCAA
CCCTTCTTCCTCTCCCCTTTCTCTCTTCCACTCCTTCACCTATCCATCTCTCTCTCTCACAGGCAGCAACACGGTGGACG
CACCGCCTGAGCCCACCGCTGGAGCAGGACCGACCAACCTCTCCATATTCACCGGCGGCCCCAAGTTCGAGGACTTTCTG
GGCGGTTCCGCCGCAACAGCCACCACCGTCGCGTGTGCACCGCCACAGCTTCCGCAGTTCTCCACCGACAACAACAACCA
CCTATACGATTCGGAGCTGAAGTCAACAATAGCCGCGTGCTTCCCTCGCGCCTTGGCCGCCGAACAAAGCACCGAACCGC
AAAAACCATCCCCCAAGAAAACCGTCGACACCTTCGGGCAACGCACCTCCATCTACCGCGGCGTGACCCGACATAGATGG
ACTGGGAGATACGAAGCTCATCTATGGGACAATAGTTGCAGAAGGGAAGGTCAAAGCAGGAAAGGAAGGCAAGTTTACTT
GGGTGGTTATGACAAGGAGGATAAGGCAGCCAGAGCTTATGATCTCGCAGCTCTCAAGTACTGGGGTCCAACTACCACCA
CTAACTTTCCTATTTCCAACTATGAGAAGGAACTGGAGGAGATGAAGAACATGACTAGGCAAGAGTTTGTTGCTTCTCTT
CGTAGGAAGAGCAGTGGTTTCTCTAGAGGGGCCTCTATATACAGAGGAGTAACGAGACACCACCAGCATGGCCGATGGCA
GGCGAGAATAGGCAGAGTTGCCGGAAACAAAGACCTCTACCTTGGCACTTTCAGCACCCAAGAAGAAGCTGCTGAGGCCT
ATGACATTGCTGCTATCAAATTCAGGGGATTAAATGCAGTAACAAACTTTGACATGAGTCGCTACGACGTGAAGAGCATT
GCAAATAGTACTCTTCCTATTGGTGGTTTATCTGGCAAGAACAAGAACTCCACAGATTCTGCATCTGAGAGCAAAAGCCA
TGAGCCAAGCCAATCCGATGGAGATCCATCATCGGCTTCATCGGTGACCTTTGCATCACAGCAACAACCTTCAAGCTCCA
ACTTAAGCTTTGCCATACCCATTAAGCAAGACCCTTCAGATTACTGGTCCATCTTGGGGTACCATAATACTCCCCTTGAC
AACAGTGGCATCAGGAACACTACTAGTACTGTTACTACAACTACTTTTCCATCCTCCAACAATGGCACTGCTAGTAGTTT
GACACCCTTCAACATGGAGTTCTCAAGTGCCCCCTCAAGTACCGGCAGCGATAACAATGCCGCGTTTTTCAGTGGAGGAG
GCATCTTTGTTCAGCAACAAACTAGTCATGGTCATGGAAATGCAAGCAGTGGTTCCTCCTCTTCTTCTTTAAGCTGTTCA
ATCCCATTCGCCACGCCCATATTTTCTCTAAATAGCAATACTAGTTATGAGAGCAGTGCTGGTTATGGAAACTGGATTGG
ACCTACCCTGCACACATTCCAATCCCATGCAAAACCAAGTCTCTTTCAAACGCCAATATTTGGAATGGAATGA
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAATAGTTGCAGAAGGGAAGGTCAAAGCAGGAAAGGAAGGCAAGTTTACTTGGGTGGTTATGACAAGGAGGATAAGGCAGCCAGAGCTTATGATCTCGCAGCT
Microexon-tag Amino Acid seq WDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAA
Transcript ID KRH02857
Gene ID Gm.22006
Gene Name NA
Pfam domain motif AP2
Motif E-value 2.6e-12
Motif start 151
Motif end 209
Protein seq >KRH02857
MDSSSSSPPNSTNNNSLAFSLSNHFPNPSSSPLSLFHSFTYPSLSLTGSNTVDAPPEPTAGAGPTNLSIFTGGPKFEDFL
GGSAATATTVACAPPQLPQFSTDNNNHLYDSELKSTIAACFPRALAAEQSTEPQKPSPKKTVDTFGQRTSIYRGVTRHRW
TGRYEAHLWDNSCRREGQSRKGRQVYLGGYDKEDKAARAYDLAALKYWGPTTTTNFPISNYEKELEEMKNMTRQEFVASL
RRKSSGFSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDMSRYDVKSI
ANSTLPIGGLSGKNKNSTDSASESKSHEPSQSDGDPSSASSVTFASQQQPSSSNLSFAIPIKQDPSDYWSILGYHNTPLD
NSGIRNTTSTVTTTTFPSSNNGTASSLTPFNMEFSSAPSSTGSDNNAAFFSGGGIFVQQQTSHGHGNASSGSSSSSLSCS
IPFATPIFSLNSNTSYESSAGYGNWIGPTLHTFQSHAKPSLFQTPIFGME*
CDS seq >KRH02857
ATGGACTCTTCTTCTTCATCACCGCCAAACAGCACCAACAACAACTCCCTCGCTTTCTCTCTTTCCAATCACTTTCCCAA
CCCTTCTTCCTCTCCCCTTTCTCTCTTCCACTCCTTCACCTATCCATCTCTCTCTCTCACAGGCAGCAACACGGTGGACG
CACCGCCTGAGCCCACCGCTGGAGCAGGACCGACCAACCTCTCCATATTCACCGGCGGCCCCAAGTTCGAGGACTTTCTG
GGCGGTTCCGCCGCAACAGCCACCACCGTCGCGTGTGCACCGCCACAGCTTCCGCAGTTCTCCACCGACAACAACAACCA
CCTATACGATTCGGAGCTGAAGTCAACAATAGCCGCGTGCTTCCCTCGCGCCTTGGCCGCCGAACAAAGCACCGAACCGC
AAAAACCATCCCCCAAGAAAACCGTCGACACCTTCGGGCAACGCACCTCCATCTACCGCGGCGTGACCCGACATAGATGG
ACTGGGAGATACGAAGCTCATCTATGGGACAATAGTTGCAGAAGGGAAGGTCAAAGCAGGAAAGGAAGGCAAGTTTACTT
GGGTGGTTATGACAAGGAGGATAAGGCAGCCAGAGCTTATGATCTCGCAGCTCTCAAGTACTGGGGTCCAACTACCACCA
CTAACTTTCCTATTTCCAACTATGAGAAGGAACTGGAGGAGATGAAGAACATGACTAGGCAAGAGTTTGTTGCTTCTCTT
CGTAGGAAGAGCAGTGGTTTCTCTAGAGGGGCCTCTATATACAGAGGAGTAACGAGACACCACCAGCATGGCCGATGGCA
GGCGAGAATAGGCAGAGTTGCCGGAAACAAAGACCTCTACCTTGGCACTTTCAGCACCCAAGAAGAAGCTGCTGAGGCCT
ATGACATTGCTGCTATCAAATTCAGGGGATTAAATGCAGTAACAAACTTTGACATGAGTCGCTACGACGTGAAGAGCATT
GCAAATAGTACTCTTCCTATTGGTGGTTTATCTGGCAAGAACAAGAACTCCACAGATTCTGCATCTGAGAGCAAAAGCCA
TGAGCCAAGCCAATCCGATGGAGATCCATCATCGGCTTCATCGGTGACCTTTGCATCACAGCAACAACCTTCAAGCTCCA
ACTTAAGCTTTGCCATACCCATTAAGCAAGACCCTTCAGATTACTGGTCCATCTTGGGGTACCATAATACTCCCCTTGAC
AACAGTGGCATCAGGAACACTACTAGTACTGTTACTACAACTACTTTTCCATCCTCCAACAATGGCACTGCTAGTAGTTT
GACACCCTTCAACATGGAGTTCTCAAGTGCCCCCTCAAGTACCGGCAGCGATAACAATGCCGCGTTTTTCAGTGGAGGAG
GCATCTTTGTTCAGCAACAAACTAGTCATGGTCATGGAAATGCAAGCAGTGGTTCCTCCTCTTCTTCTTTAAGCTGTTCA
ATCCCATTCGCCACGCCCATATTTTCTCTAAATAGCAATACTAGTTATGAGAGCAGTGCTGGTTATGGAAACTGGATTGG
ACCTACCCTGCACACATTCCAATCCCATGCAAAACCAAGTCTCTTTCAAACGCCAATATTTGGAATGGAATGA