Microexon ID Gm_8:41327348-41327356:+
Species Glycine max
Coordinates 8:41327348..41327356
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TATATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAAAGGGACCTGGAATCCCACTCAGAAGAAGAAAGGAAAGCAAGTATATTTGGGAGCTTATAATGATGAAGAAGCTGCAGCTAGAGCTTATGATTTGGCTGCA
Microexon-tag Amino Acid Seq WDKGTWNPTQKKKGKQVYLGAYNDEEAAARAYDLAA
Microexon-tag spanning region41327174-41327499
Microexon-tag prediction score0.9428
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH45852x
Reference Transcript ID KRH45852
Gene ID GLYMA_08G297000
Gene Name NA
Transcript ID KRH45852
Protein ID KRH45852
Gene ID GLYMA_08G297000
Gene Name NA
Pfam domain motif AP2
Motif E-value 5.2e-12
Motif start 72
Motif end 131
Protein seq >KRH45852
MELAPVKSELSPRSHRLLMIDGSEVIGTKCVKRRRRDSSTAVLGGNGQQGEQLEEQKQLGGQSTATTVKRSSRFRGVSRH
RWTGRFEAHLWDKGTWNPTQKKKGKQVYLGAYNDEEAAARAYDLAALKYWGISTFTNFPVSDYEKEIEIMKTVTKEEYLA
SLRRRSSGFSRGVSKYRGVARHHHNGRWEARIGRVFGNKYLYLGTYSTQEEAARAYDIAAIEYRGINAVTNFDLSTYIRW
LRPGTHPTASHDQKPSTDAQPFATSNSMQARGNIEVSNSNKNSFPSGKLDSTKKRDFSKYMNPLSPCNKPSSPTALGLLL
KSSVFRELMQRNLNSSSEEAEEVELKYPHEGNDGVGGIYDNENTNNSYFCSSNISRLPNLESSEESPLPMYHGTVQSLWN
SAFNMSN*
CDS seq >KRH45852
ATGGAGCTTGCACCTGTGAAGTCTGAACTAAGTCCAAGGAGCCATAGGTTGCTCATGATAGATGGTAGTGAGGTTATTGG
CACTAAGTGTGTCAAAAGGCGGCGAAGAGATTCATCTACGGCCGTGTTAGGTGGCAATGGACAACAAGGTGAACAGTTAG
AAGAACAGAAGCAACTTGGTGGCCAATCAACTGCCACCACTGTGAAGAGAAGCTCAAGGTTCAGGGGTGTTAGCAGACAC
AGGTGGACTGGAAGGTTTGAAGCACATCTATGGGATAAAGGGACCTGGAATCCCACTCAGAAGAAGAAAGGAAAGCAAGT
ATATTTGGGAGCTTATAATGATGAAGAAGCTGCAGCTAGAGCTTATGATTTGGCTGCACTCAAGTACTGGGGAATATCAA
CTTTCACAAATTTTCCTGTATCTGATTATGAGAAAGAAATTGAGATAATGAAAACTGTAACCAAAGAAGAATATCTTGCT
TCATTGAGAAGGAGGAGCAGTGGTTTTTCCAGAGGTGTATCGAAGTATAGAGGAGTAGCAAGGCATCATCACAATGGAAG
ATGGGAAGCAAGGATTGGGAGAGTGTTCGGTAACAAGTACCTCTACCTTGGGACTTACAGTACACAAGAAGAAGCAGCTC
GCGCATATGACATTGCAGCAATTGAATACAGAGGCATAAATGCAGTGACAAACTTTGACTTGAGCACCTACATAAGATGG
CTAAGACCGGGAACACATCCTACTGCTTCTCATGATCAAAAGCCTAGCACTGATGCTCAACCTTTTGCAACCTCTAACTC
CATGCAAGCAAGAGGGAACATTGAGGTATCCAACTCCAACAAGAATTCATTCCCCTCAGGTAAATTGGACAGTACCAAGA
AGCGAGACTTTTCCAAGTACATGAACCCTTTGAGTCCATGCAACAAGCCATCTTCCCCAACAGCATTAGGACTTCTCCTA
AAATCCTCAGTGTTTAGAGAACTGATGCAGAGAAATCTGAACTCTTCTAGTGAAGAAGCTGAAGAAGTTGAATTGAAATA
TCCACATGAGGGCAATGATGGAGTTGGAGGGATTTATGATAATGAAAACACCAATAACTCTTACTTTTGCTCTTCTAATA
TCAGCAGATTACCTAACTTGGAGTCATCAGAAGAGAGTCCATTGCCTATGTATCATGGAACTGTGCAATCACTATGGAAT
AGTGCTTTCAACATGTCTAACTGA
Microexon DNA seq TATATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAAAGGGACCTGGAATCCCACTCAGAAGAAGAAAGGAAAGCAAGTATATTTGGGAGCTTATAATGATGAAGAAGCTGCAGCTAGAGCTTATGATTTGGCTGCA
Microexon-tag Amino Acid seq WDKGTWNPTQKKKGKQVYLGAYNDEEAAARAYDLAA
Transcript ID KRH45852
Gene ID Gm.51062
Gene Name NA
Pfam domain motif AP2
Motif E-value 5.2e-12
Motif start 72
Motif end 131
Protein seq >KRH45852
MELAPVKSELSPRSHRLLMIDGSEVIGTKCVKRRRRDSSTAVLGGNGQQGEQLEEQKQLGGQSTATTVKRSSRFRGVSRH
RWTGRFEAHLWDKGTWNPTQKKKGKQVYLGAYNDEEAAARAYDLAALKYWGISTFTNFPVSDYEKEIEIMKTVTKEEYLA
SLRRRSSGFSRGVSKYRGVARHHHNGRWEARIGRVFGNKYLYLGTYSTQEEAARAYDIAAIEYRGINAVTNFDLSTYIRW
LRPGTHPTASHDQKPSTDAQPFATSNSMQARGNIEVSNSNKNSFPSGKLDSTKKRDFSKYMNPLSPCNKPSSPTALGLLL
KSSVFRELMQRNLNSSSEEAEEVELKYPHEGNDGVGGIYDNENTNNSYFCSSNISRLPNLESSEESPLPMYHGTVQSLWN
SAFNMSN*
CDS seq >KRH45852
ATGGAGCTTGCACCTGTGAAGTCTGAACTAAGTCCAAGGAGCCATAGGTTGCTCATGATAGATGGTAGTGAGGTTATTGG
CACTAAGTGTGTCAAAAGGCGGCGAAGAGATTCATCTACGGCCGTGTTAGGTGGCAATGGACAACAAGGTGAACAGTTAG
AAGAACAGAAGCAACTTGGTGGCCAATCAACTGCCACCACTGTGAAGAGAAGCTCAAGGTTCAGGGGTGTTAGCAGACAC
AGGTGGACTGGAAGGTTTGAAGCACATCTATGGGATAAAGGGACCTGGAATCCCACTCAGAAGAAGAAAGGAAAGCAAGT
ATATTTGGGAGCTTATAATGATGAAGAAGCTGCAGCTAGAGCTTATGATTTGGCTGCACTCAAGTACTGGGGAATATCAA
CTTTCACAAATTTTCCTGTATCTGATTATGAGAAAGAAATTGAGATAATGAAAACTGTAACCAAAGAAGAATATCTTGCT
TCATTGAGAAGGAGGAGCAGTGGTTTTTCCAGAGGTGTATCGAAGTATAGAGGAGTAGCAAGGCATCATCACAATGGAAG
ATGGGAAGCAAGGATTGGGAGAGTGTTCGGTAACAAGTACCTCTACCTTGGGACTTACAGTACACAAGAAGAAGCAGCTC
GCGCATATGACATTGCAGCAATTGAATACAGAGGCATAAATGCAGTGACAAACTTTGACTTGAGCACCTACATAAGATGG
CTAAGACCGGGAACACATCCTACTGCTTCTCATGATCAAAAGCCTAGCACTGATGCTCAACCTTTTGCAACCTCTAACTC
CATGCAAGCAAGAGGGAACATTGAGGTATCCAACTCCAACAAGAATTCATTCCCCTCAGGTAAATTGGACAGTACCAAGA
AGCGAGACTTTTCCAAGTACATGAACCCTTTGAGTCCATGCAACAAGCCATCTTCCCCAACAGCATTAGGACTTCTCCTA
AAATCCTCAGTGTTTAGAGAACTGATGCAGAGAAATCTGAACTCTTCTAGTGAAGAAGCTGAAGAAGTTGAATTGAAATA
TCCACATGAGGGCAATGATGGAGTTGGAGGGATTTATGATAATGAAAACACCAATAACTCTTACTTTTGCTCTTCTAATA
TCAGCAGATTACCTAACTTGGAGTCATCAGAAGAGAGTCCATTGCCTATGTATCATGGAACTGTGCAATCACTATGGAAT
AGTGCTTTCAACATGTCTAACTGA