Microexon ID Gm_9:7211644-7211651:-
Species Glycine max
Coordinates 9:7211644..7211651
Microexon Cluster ID MEP18
Size 8
Phase 1
Pfam Domain Motif DEAD
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,8,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CCTGCWCTRGCAAARMCAGGMATTGTKCTTGTTGTTTSTCCYTTRATAGCHYTRATGGARAAYCARGTTAYGRSMTTGAARGARAAAGGRRTTSCWGCTGAATWTCTC
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq CCTTAATG
Microexon Amino Acid seq ALM
Microexon-tag DNA Seq CCTGCATTGGCAAAAGCAGGCATTGTGCTTGTCGTTTGCCCTTTAATAGCCTTAATGGAAAACCAAGTAATGGCACTAAAGGAGAAAGGCATAGCGGCAGAATTTCTC
Microexon-tag Amino Acid Seq PALAKAGIVLVVCPLIALMENQVMALKEKGIAAEFL
Microexon-tag spanning region7211486-7211820
Microexon-tag prediction score0.9639
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH37506x
Reference Transcript ID KRH37506
Gene ID GLYMA_09G070600
Gene Name NA
Transcript ID KRH37506
Protein ID KRH37506
Gene ID GLYMA_09G070600
Gene Name NA
Pfam domain motif DEAD
Motif E-value 4.4e-19
Motif start 41
Motif end 208
Protein seq >KRH37506
MQKSALPLSDANANKKREELRRKETLVKLLRWHFGYPDFRDMQLDAIQAVLSGKDCFCLMPTGGGKSMCYQIPALAKAGI
VLVVCPLIALMENQVMALKEKGIAAEFLSSTKTTDAKVKIHEDLDSGKPSTRLLYVTPELITTPGFMTKLTKIYTRGLLN
LIAIDEAHCISSWGHDFRPSYRKLSSLRSHLPDVPILALTATAVPKVQKDVVESLQMQNPLMLKSSFNRPNIYYEVRYKD
LLDDAYADLSNTLKSLGDVCAIVYCLERSMCDDLSTNLSQNGISCAAYHAGLNNKMRTSVLDDWISSKIKVVVATVAFGL
WSSFRMGIDRKDVRIVCHFNIPKSMEAFYQESGRAGRDQLPSRSLLYYGVDDRKRMEFILRKSVSKKSQSSSSQEESSKM
SLIAFNLMVEYCEGSGCRRKRVLESFGEQVTASLCGKTCDGCRHPNLVARYLEDLTTACALRQKNGSSRVFMTSSTDAIN
GEQLSEFWNQDEEASGSEEDISDSDGNGNEVVNNLTRSKLQSKLGVSEKLAMLQRAEENFYRNNNAYKQSNKVDKNAISD
PMRGSSRQRLQNALKQVQQRLDNFKIEMETSASFLEEECYKKYGKVGKSFYYSQVASTVRWLTTASSSELINRLSAINAS
TSMNVLSEAENLLIPANQPLTPAEQPLTSPPALDPYARDTSNEHSGTARSETSACVLPMEGSFSTNLPQIPSFSEFVNSR
KAKGDQLHDTKRHSSRVEKKMRIQ*
CDS seq >KRH37506
ATGCAGAAGTCGGCATTGCCACTGAGCGACGCGAATGCGAACAAGAAGAGGGAGGAATTGCGCCGCAAGGAAACGTTGGT
GAAGCTTCTGAGATGGCATTTTGGGTACCCCGATTTCAGGGACATGCAATTGGACGCTATTCAAGCTGTGCTCTCAGGGA
AAGATTGTTTTTGTCTTATGCCAACTGGAGGAGGCAAGTCGATGTGTTATCAGATCCCTGCATTGGCAAAAGCAGGCATT
GTGCTTGTCGTTTGCCCTTTAATAGCCTTAATGGAAAACCAAGTAATGGCACTAAAGGAGAAAGGCATAGCGGCAGAATT
TCTCTCCTCAACGAAAACAACAGATGCAAAAGTAAAGATTCATGAGGACCTTGATTCTGGAAAACCTTCTACGAGGCTGC
TATATGTGACTCCAGAGTTGATAACAACACCAGGGTTTATGACTAAGCTGACAAAGATTTATACCAGGGGGTTGCTAAAT
CTGATTGCGATAGATGAGGCGCATTGCATCTCATCTTGGGGTCATGATTTCAGACCTAGCTACCGTAAGCTATCCTCTTT
GAGAAGCCACCTACCAGATGTACCAATATTAGCTTTGACTGCTACTGCTGTGCCTAAGGTTCAGAAGGATGTAGTTGAAT
CCTTGCAGATGCAAAATCCATTAATGCTCAAGTCTTCATTTAATCGTCCTAATATATATTATGAAGTTAGATACAAAGAT
CTGTTGGATGATGCTTATGCTGATTTATCTAATACACTCAAATCTCTGGGAGATGTCTGTGCAATAGTGTACTGCCTTGA
ACGTTCAATGTGTGATGACTTGTCAACTAATCTATCCCAAAATGGCATTTCATGTGCTGCTTATCATGCAGGATTGAATA
ATAAAATGCGAACTTCGGTGTTGGATGATTGGATATCTTCCAAGATAAAAGTTGTTGTTGCTACTGTTGCTTTTGGCCTT
TGGTCTTCTTTCAGAATGGGCATTGATAGAAAGGATGTCAGAATTGTATGCCACTTCAACATTCCCAAGTCAATGGAAGC
ATTCTATCAAGAGTCAGGCAGAGCTGGTCGTGATCAATTGCCATCTAGAAGTCTGCTGTACTATGGGGTAGATGATCGCA
AAAGAATGGAATTTATATTACGTAAATCAGTGAGCAAGAAGTCACAGTCATCTAGTTCACAAGAAGAATCATCCAAAATG
TCCCTGATTGCTTTCAATCTGATGGTTGAATATTGTGAAGGGTCTGGATGTCGCAGGAAAAGGGTTCTCGAGAGTTTTGG
GGAACAGGTAACTGCATCACTATGTGGAAAAACATGTGATGGCTGCAGACATCCAAACTTAGTTGCCCGATATTTGGAGG
ATCTCACAACTGCTTGTGCTCTACGCCAGAAAAATGGTTCTTCTCGAGTTTTTATGACCAGTTCCACTGATGCAATTAAC
GGAGAACAGTTATCTGAATTCTGGAATCAGGATGAGGAAGCTAGTGGATCAGAGGAAGATATATCTGATTCAGATGGTAA
TGGTAATGAGGTTGTCAACAACCTAACCCGGTCAAAGCTTCAATCCAAATTGGGAGTGAGTGAAAAGCTTGCTATGTTAC
AACGGGCAGAAGAAAACTTCTATCGAAATAATAATGCTTACAAACAGAGCAACAAAGTTGACAAAAATGCTATTTCTGAT
CCAATGCGAGGATCAAGCAGGCAAAGGTTACAAAATGCTCTAAAACAGGTTCAGCAACGGCTTGACAACTTCAAGATTGA
AATGGAAACATCAGCATCTTTCCTTGAAGAAGAATGCTACAAGAAATATGGCAAGGTTGGTAAATCATTCTATTATTCGC
AAGTGGCAAGTACTGTAAGATGGCTGACAACTGCAAGTTCTAGCGAACTGATCAACCGACTTAGTGCAATTAATGCTTCT
ACCTCAATGAATGTCTTGTCTGAAGCTGAAAACCTCCTCATCCCAGCTAATCAGCCCCTCACACCAGCTGAACAGCCCCT
CACATCGCCACCTGCTTTAGATCCTTATGCAAGAGATACCAGCAATGAACACTCAGGCACTGCTAGATCAGAAACTTCTG
CATGTGTCTTGCCAATGGAAGGTTCTTTTAGTACCAATTTGCCACAGATACCATCCTTCTCTGAATTCGTAAATAGTAGG
AAAGCAAAAGGGGACCAATTACATGACACCAAAAGGCATTCATCAAGAGTTGAAAAGAAGATGAGGATACAGTAG
Microexon DNA seq CCTTAATG
Microexon Amino Acid seq ALM
Microexon-tag DNA Seq CCTGCATTGGCAAAAGCAGGCATTGTGCTTGTCGTTTGCCCTTTAATAGCCTTAATGGAAAACCAAGTAATGGCACTAAAGGAGAAAGGCATAGCGGCAGAATTTCTC
Microexon-tag Amino Acid seq PALAKAGIVLVVCPLIALMENQVMALKEKGIAAEFL
Transcript ID Gm.52479.1
Gene ID Gm.52479
Gene Name NA
Pfam domain motif DEAD
Motif E-value 4.4e-19
Motif start 41
Motif end 208
Protein seq >Gm.52479.1
MQKSALPLSDANANKKREELRRKETLVKLLRWHFGYPDFRDMQLDAIQAVLSGKDCFCLMPTGGGKSMCYQIPALAKAGI
VLVVCPLIALMENQVMALKEKGIAAEFLSSTKTTDAKVKIHEDLDSGKPSTRLLYVTPELITTPGFMTKLTKIYTRGLLN
LIAIDEAHCISSWGHDFRPSYRKLSSLRSHLPDVPILALTATAVPKVQKDVVESLQMQNPLMLKSSFNRPNIYYEVRYKD
LLDDAYADLSNTLKSLGDVCAIVYCLERSMCDDLSTNLSQNGISCAAYHAGLNNKMRTSVLDDWISSKIKVVVATVAFGL
WSSFRMGIDRKDVRIVCHFNIPKSMEAFYQESGRAGRDQLPSRSLLYYGVDDRKRMEFILRKSVSKKSQSSSSQEESSKM
SLIAFNLMVEYCEGSGCRRKRVLESFGEQVTASLCGKTCDGCRHPNLVARYLEDLTTACALRQKNGSSRVFMTSSTDAIN
GEQLSEFWNQDEEASGSEEDISDSDGNGNEVVNNLTRSKLQSKLGVSEKLAMLQRAEENFYRNNNAYKQSNKVDKNAISD
PMRGSSRQRLQNALKQVQQRLDNFKIEMETSASFLEEECYKKYGKVGKSFYYSQVASTVRWLTTASSSELINRLSAINAS
TSMNVLSEAENLLIPANQPLTPAEQPLTSPPALDPYARDTSNEHSGTARSETSACVLPMEGSFSTNLPQIPSFSEFVNSR
KAKGDQLHDTKRHSSRVEKKMRIQ*
CDS seq >Gm.52479.1
ATGCAGAAGTCGGCATTGCCACTGAGCGACGCGAATGCGAACAAGAAGAGGGAGGAATTGCGCCGCAAGGAAACGTTGGT
GAAGCTTCTGAGATGGCATTTTGGGTACCCCGATTTCAGGGACATGCAATTGGACGCTATTCAAGCTGTGCTCTCAGGGA
AAGATTGTTTTTGTCTTATGCCAACTGGAGGAGGCAAGTCGATGTGTTATCAGATCCCTGCATTGGCAAAAGCAGGCATT
GTGCTTGTCGTTTGCCCTTTAATAGCCTTAATGGAAAACCAAGTAATGGCACTAAAGGAGAAAGGCATAGCGGCAGAATT
TCTCTCCTCAACGAAAACAACAGATGCAAAAGTAAAGATTCATGAGGACCTTGATTCTGGAAAACCTTCTACGAGGCTGC
TATATGTGACTCCAGAGTTGATAACAACACCAGGGTTTATGACTAAGCTGACAAAGATTTATACCAGGGGGTTGCTAAAT
CTGATTGCGATAGATGAGGCGCATTGCATCTCATCTTGGGGTCATGATTTCAGACCTAGCTACCGTAAGCTATCCTCTTT
GAGAAGCCACCTACCAGATGTACCAATATTAGCTTTGACTGCTACTGCTGTGCCTAAGGTTCAGAAGGATGTAGTTGAAT
CCTTGCAGATGCAAAATCCATTAATGCTCAAGTCTTCATTTAATCGTCCTAATATATATTATGAAGTTAGATACAAAGAT
CTGTTGGATGATGCTTATGCTGATTTATCTAATACACTCAAATCTCTGGGAGATGTCTGTGCAATAGTGTACTGCCTTGA
ACGTTCAATGTGTGATGACTTGTCAACTAATCTATCCCAAAATGGCATTTCATGTGCTGCTTATCATGCAGGATTGAATA
ATAAAATGCGAACTTCGGTGTTGGATGATTGGATATCTTCCAAGATAAAAGTTGTTGTTGCTACTGTTGCTTTTGGCCTT
TGGTCTTCTTTCAGAATGGGCATTGATAGAAAGGATGTCAGAATTGTATGCCACTTCAACATTCCCAAGTCAATGGAAGC
ATTCTATCAAGAGTCAGGCAGAGCTGGTCGTGATCAATTGCCATCTAGAAGTCTGCTGTACTATGGGGTAGATGATCGCA
AAAGAATGGAATTTATATTACGTAAATCAGTGAGCAAGAAGTCACAGTCATCTAGTTCACAAGAAGAATCATCCAAAATG
TCCCTGATTGCTTTCAATCTGATGGTTGAATATTGTGAAGGGTCTGGATGTCGCAGGAAAAGGGTTCTCGAGAGTTTTGG
GGAACAGGTAACTGCATCACTATGTGGAAAAACATGTGATGGCTGCAGACATCCAAACTTAGTTGCCCGATATTTGGAGG
ATCTCACAACTGCTTGTGCTCTACGCCAGAAAAATGGTTCTTCTCGAGTTTTTATGACCAGTTCCACTGATGCAATTAAC
GGAGAACAGTTATCTGAATTCTGGAATCAGGATGAGGAAGCTAGTGGATCAGAGGAAGATATATCTGATTCAGATGGTAA
TGGTAATGAGGTTGTCAACAACCTAACCCGGTCAAAGCTTCAATCCAAATTGGGAGTGAGTGAAAAGCTTGCTATGTTAC
AACGGGCAGAAGAAAACTTCTATCGAAATAATAATGCTTACAAACAGAGCAACAAAGTTGACAAAAATGCTATTTCTGAT
CCAATGCGAGGATCAAGCAGGCAAAGGTTACAAAATGCTCTAAAACAGGTTCAGCAACGGCTTGACAACTTCAAGATTGA
AATGGAAACATCAGCATCTTTCCTTGAAGAAGAATGCTACAAGAAATATGGCAAGGTTGGTAAATCATTCTATTATTCGC
AAGTGGCAAGTACTGTAAGATGGCTGACAACTGCAAGTTCTAGCGAACTGATCAACCGACTTAGTGCAATTAATGCTTCT
ACCTCAATGAATGTCTTGTCTGAAGCTGAAAACCTCCTCATCCCAGCTAATCAGCCCCTCACACCAGCTGAACAGCCCCT
CACATCGCCACCTGCTTTAGATCCTTATGCAAGAGATACCAGCAATGAACACTCAGGCACTGCTAGATCAGAAACTTCTG
CATGTGTCTTGCCAATGGAAGGTTCTTTTAGTACCAATTTGCCACAGATACCATCCTTCTCTGAATTCGTAAATAGTAGG
AAAGCAAAAGGGGACCAATTACATGACACCAAAAGGCATTCATCAAGAGTTGAAAAGAAGATGAGGATACAGTAG