Microexon ID Gm_9:7202001-7202004:-
Species Glycine max
Coordinates 9:7202001..7202004
Microexon Cluster ID MEP05
Size 4
Phase 2
Pfam Domain Motif Helicase_C
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 53,4,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (IUPAC ambiguity codes) GATGAYTGGMTTTCWKCSARRAYWCAAGTTGTWGTKGCHACWGTRGCWTTTGGRATGGGWATWGATARRMARGATGTYMGDATTGTKTGYCAYTTYAAYWTKCCWAAR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AATG
Microexon Amino Acid seq GM
Microexon-tag DNA Seq GATGATTGGATATCTTCCAAGATAAAAGTTGTTGTTGCTACTGTTGCTTTTGGAATGGGCATTGATAGAAAGGATGTCAGAATTGTATGCCACTTCAACATTCCCAAG
Microexon-tag Amino Acid Seq DDWISSKIKVVVATVAFGMGIDRKDVRIVCHFNIPK
Microexon-tag spanning region 7201832-7207379
Microexon-tag prediction score 0.9635
Overlapped with the annotated transcript (%) 85.71
New Transcript ID KRH37506x
Reference Transcript ID KRH37506
Gene ID GLYMA_09G070600
Gene Name NA
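The fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it splits the microexon-tag DNA sequence using the listed sizes (53,4,51), translates it with the standard genetic code, and recovers the residues whose codons overlap the microexon. The helper names (`split_tag`, `translate`) are hypothetical, and the sketch assumes the tag starts in reading frame 0, which holds for this record.

```python
from itertools import product

# Standard genetic code, codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Values copied from the record above.
TAG_DNA = (
    "GATGATTGGATATCTTCCAAGATAAAAGTTGTTGTTGCTACTGTTGCTTTTGGAATGGGCATTGATAGAAAG"
    "GATGTCAGAATTGTATGCCACTTCAACATTCCCAAG"
)
SIZES = (53, 4, 51)  # flanking exon, microexon, flanking exon


def split_tag(seq, sizes):
    """Cut seq into consecutive pieces of the given sizes."""
    if sum(sizes) != len(seq):
        raise ValueError("sizes do not add up to the sequence length")
    parts, pos = [], 0
    for size in sizes:
        parts.append(seq[pos:pos + size])
        pos += size
    return parts


def translate(seq):
    """Translate seq in frame 0 (assumed), ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


upstream, microexon, downstream = split_tag(TAG_DNA, SIZES)
peptide = translate(TAG_DNA)

# 0-based indices of the first and last codon that contain a microexon base.
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
microexon_aa = peptide[first_codon:last_codon + 1]

print(microexon, microexon_aa)
print(peptide)
```

For this record the microexon begins at tag offset 53, and 53 % 3 = 2, so its first base completes a codon started in the upstream exon. That agrees with the Phase 2 field if the database counts phase as the number of codon bases carried over from the previous exon, which is an assumption about its convention.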
Transcript ID Gm.52479.2
Gene ID Gm.52479
Gene Name NA
Pfam domain motif Helicase_C
Motif E-value 9.8e-16
Motif start 253
Motif end 351
Protein seq
>Gm.52479.2
MQKSALPLSDANANKKREELRRKETLVKLLRWHFGYPDFRDMQLDAIQAVLSGKDCFCLMPTGGGKSMCYQIPALAKAGI
VLVVCPLIALMENQVMALKEKGIAAEFLSSTKTTDAKVKIHEDLDSGKPSTRLLYVTPELITTPGFMTKLTKIYTRGLLN
LIAIDEAHCISSWGHDFRPSYRKLSSLRSHLPDVPILALTATAVPKVQKDVVESLQMQNPLMLKSSFNRPNIYYEVRYKD
LLDDAYADLSNTLKSLGDVCAIVYCLERSMCDDLSTNLSQNGISCAAYHAGLNNKMRTSVLDDWISSKIKVVVATVAFGM
GIDRKDVRIVCHFNIPKSMEAFYQESGRAGRDQLPSRSLLYYGVDDRKRMEFILRKSVSKKSQSSSSQEESSKMSLIAFN
LMVEYCEGSGCRRKRVLESFGEQVTASLCGKTCDGCRHPNLVARYLEDLTTACALRQKNGSSRVFMTSSTDAINGEQLSE
FWNQDEEASGSEEDISDSDDGNEVVNNLTRSKLQSKLGVSEKLAMLQRAEENFYRNNNAYKQSNKVDKNAISDPMRGSSR
QRLQNALKQVQQRLDNFKIEMETSASFLEEECYKKYGKVGKSFYYSQVASTVRWLTTASSSELINRLSAINASTSMNVLS
EAENLLIPANQPLTPAEQPLTSPPALDPYARDTSNEHSGTARSETSACVLPMEGSFSTNLPQIPSFSEFVNSRKAKGDQL
HDTKRHSSRVEKKMRIQ*
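The motif coordinates and the microexon-tag peptide can be related with a simple substring search. The sketch below is illustrative only: it locates the tag peptide in the first 400 residues of the protein above and checks whether the match lies inside the Helicase_C motif. It assumes that "Motif start" and "Motif end" are 1-based, inclusive protein positions; the function names `locate` and `within` are hypothetical.

```python
# First 400 residues of the Gm.52479.2 protein from this record
# (five 80-residue FASTA lines; the rest is not needed to cover the motif).
PROTEIN_PREFIX = (
    "MQKSALPLSDANANKKREELRRKETLVKLLRWHFGYPDFRDMQLDAIQAVLSGKDCFCLMPTGGGKSMCYQIPALAKAGI"
    "VLVVCPLIALMENQVMALKEKGIAAEFLSSTKTTDAKVKIHEDLDSGKPSTRLLYVTPELITTPGFMTKLTKIYTRGLLN"
    "LIAIDEAHCISSWGHDFRPSYRKLSSLRSHLPDVPILALTATAVPKVQKDVVESLQMQNPLMLKSSFNRPNIYYEVRYKD"
    "LLDDAYADLSNTLKSLGDVCAIVYCLERSMCDDLSTNLSQNGISCAAYHAGLNNKMRTSVLDDWISSKIKVVVATVAFGM"
    "GIDRKDVRIVCHFNIPKSMEAFYQESGRAGRDQLPSRSLLYYGVDDRKRMEFILRKSVSKKSQSSSSQEESSKMSLIAFN"
)
TAG_PEPTIDE = "DDWISSKIKVVVATVAFGMGIDRKDVRIVCHFNIPK"
MOTIF_START, MOTIF_END = 253, 351  # assumed 1-based, inclusive


def locate(protein, peptide):
    """Return the 1-based inclusive (start, end) of the first match, or None."""
    idx = protein.find(peptide)
    if idx < 0:
        return None
    return idx + 1, idx + len(peptide)


def within(span, start, end):
    """True if the inclusive span lies entirely inside [start, end]."""
    return start <= span[0] and span[1] <= end


tag_span = locate(PROTEIN_PREFIX, TAG_PEPTIDE)
tag_in_motif = tag_span is not None and within(tag_span, MOTIF_START, MOTIF_END)
print(tag_span, tag_in_motif)
```

Under these assumptions the tag peptide falls inside the annotated motif range, which is consistent with the Helicase_C assignment listed for the microexon.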
CDS seq
>Gm.52479.2
ATGCAGAAGTCGGCATTGCCACTGAGCGACGCGAATGCGAACAAGAAGAGGGAGGAATTGCGCCGCAAGGAAACGTTGGT
GAAGCTTCTGAGATGGCATTTTGGGTACCCCGATTTCAGGGACATGCAATTGGACGCTATTCAAGCTGTGCTCTCAGGGA
AAGATTGTTTTTGTCTTATGCCAACTGGAGGAGGCAAGTCGATGTGTTATCAGATCCCTGCATTGGCAAAAGCAGGCATT
GTGCTTGTCGTTTGCCCTTTAATAGCCTTAATGGAAAACCAAGTAATGGCACTAAAGGAGAAAGGCATAGCGGCAGAATT
TCTCTCCTCAACGAAAACAACAGATGCAAAAGTAAAGATTCATGAGGACCTTGATTCTGGAAAACCTTCTACGAGGCTGC
TATATGTGACTCCAGAGTTGATAACAACACCAGGGTTTATGACTAAGCTGACAAAGATTTATACCAGGGGGTTGCTAAAT
CTGATTGCGATAGATGAGGCGCATTGCATCTCATCTTGGGGTCATGATTTCAGACCTAGCTACCGTAAGCTATCCTCTTT
GAGAAGCCACCTACCAGATGTACCAATATTAGCTTTGACTGCTACTGCTGTGCCTAAGGTTCAGAAGGATGTAGTTGAAT
CCTTGCAGATGCAAAATCCATTAATGCTCAAGTCTTCATTTAATCGTCCTAATATATATTATGAAGTTAGATACAAAGAT
CTGTTGGATGATGCTTATGCTGATTTATCTAATACACTCAAATCTCTGGGAGATGTCTGTGCAATAGTGTACTGCCTTGA
ACGTTCAATGTGTGATGACTTGTCAACTAATCTATCCCAAAATGGCATTTCATGTGCTGCTTATCATGCAGGATTGAATA
ATAAAATGCGAACTTCGGTGTTGGATGATTGGATATCTTCCAAGATAAAAGTTGTTGTTGCTACTGTTGCTTTTGGAATG
GGCATTGATAGAAAGGATGTCAGAATTGTATGCCACTTCAACATTCCCAAGTCAATGGAAGCATTCTATCAAGAGTCAGG
CAGAGCTGGTCGTGATCAATTGCCATCTAGAAGTCTGCTGTACTATGGGGTAGATGATCGCAAAAGAATGGAATTTATAT
TACGTAAATCAGTGAGCAAGAAGTCACAGTCATCTAGTTCACAAGAAGAATCATCCAAAATGTCCCTGATTGCTTTCAAT
CTGATGGTTGAATATTGTGAAGGGTCTGGATGTCGCAGGAAAAGGGTTCTCGAGAGTTTTGGGGAACAGGTAACTGCATC
ACTATGTGGAAAAACATGTGATGGCTGCAGACATCCAAACTTAGTTGCCCGATATTTGGAGGATCTCACAACTGCTTGTG
CTCTACGCCAGAAAAATGGTTCTTCTCGAGTTTTTATGACCAGTTCCACTGATGCAATTAACGGAGAACAGTTATCTGAA
TTCTGGAATCAGGATGAGGAAGCTAGTGGATCAGAGGAAGATATATCTGATTCAGATGATGGTAATGAGGTTGTCAACAA
CCTAACCCGGTCAAAGCTTCAATCCAAATTGGGAGTGAGTGAAAAGCTTGCTATGTTACAACGGGCAGAAGAAAACTTCT
ATCGAAATAATAATGCTTACAAACAGAGCAACAAAGTTGACAAAAATGCTATTTCTGATCCAATGCGAGGATCAAGCAGG
CAAAGGTTACAAAATGCTCTAAAACAGGTTCAGCAACGGCTTGACAACTTCAAGATTGAAATGGAAACATCAGCATCTTT
CCTTGAAGAAGAATGCTACAAGAAATATGGCAAGGTTGGTAAATCATTCTATTATTCGCAAGTGGCAAGTACTGTAAGAT
GGCTGACAACTGCAAGTTCTAGCGAACTGATCAACCGACTTAGTGCAATTAATGCTTCTACCTCAATGAATGTCTTGTCT
GAAGCTGAAAACCTCCTCATCCCAGCTAATCAGCCCCTCACACCAGCTGAACAGCCCCTCACATCGCCACCTGCTTTAGA
TCCTTATGCAAGAGATACCAGCAATGAACACTCAGGCACTGCTAGATCAGAAACTTCTGCATGTGTCTTGCCAATGGAAG
GTTCTTTTAGTACCAATTTGCCACAGATACCATCCTTCTCTGAATTCGTAAATAGTAGGAAAGCAAAAGGGGACCAATTA
CATGACACCAAAAGGCATTCATCAAGAGTTGAAAAGAAGATGAGGATACAGTAG