Microexon ID Gm_1:54263461-54263469:+
Species Glycine max
Coordinates 1:54263461..54263469
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGAGAACAGCTTACCATTTCCAACCTGAGAAGAATTGGATGAACGATCCTAATGGTCCAATGTACTACAAAGGGTGGTATCACTTCTTCTACCAGTACAATCCA
Microexon-tag Amino Acid Seq WQRTAYHFQPEKNWMNDPNGPMYYKGWYHFFYQYNP
Microexon-tag spanning region54262439-54264338
Microexon-tag prediction score0.9763
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH77397x
Reference Transcript ID KRH77397
Gene ID GLYMA_01G211000
Gene Name NA
Transcript ID KRH77397
Protein ID KRH77397
Gene ID GLYMA_01G211000
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.7e-108
Motif start 146
Motif end 464
Protein seq >KRH77397
MGSPNTRTSSSSDSELPCSNYIPVPDGPHSPAEPARREVIVIFCGLLVLVSLVAFNGYSWAHSNGHDGDHHHASSSLPTT
LLTVTPNELELSPDTVPWSTWQTTLSRGVSAGVSEKSSWLFNSNNGNGESYPWNNSMLSWQRTAYHFQPEKNWMNDPNGP
MYYKGWYHFFYQYNPNGAVWGDIVWGHAVSRDMIHWFHLPLAMVADQWYDKNGVWTGSATILPDGQVIMLYTGSTNESMQ
VQNLAYPADPSDPLLVDWIKYPANPVLFPPPGIDAKDFRDPTTAWITSEGKWRISIGSKLNKTGIALVYDTNDFKTFERV
EGVLHVVPGTGMWECVDFFPVSSKGENGLDTSINGENVKHVVKVSLDDDRHDYYALGTYDEKNVKFTPDDFNNDVGIGLR
YDYGIFYASKTFYDQSKGRRVLWGWIGESDSEYADVAKGWASVQGIPRTVALDKKTGSNLIQWPVAEVESLRLRSDEFQN
LKVKPGSVVPLEIGTAAQLDIVAEFEIDKKALEKTGQSNKEYKCSTSGGSTERGTIGPFGLLVLADDDLSEYTPTYFYVV
KGSHGQLKTSFCSDQSRSSLATDVSKKIFGSFVPVLKDEKLSVRILVDHSIVESFAQGGRTCVTSRVYPTKAIYGAARLF
LFNNATEATVTASVKVWQMNSAFIRPFHPDQSNSF*
CDS seq >KRH77397
ATGGGAAGCCCCAACACAAGAACCAGCTCCTCGTCTGATTCCGAACTCCCTTGCTCTAATTACATTCCGGTACCCGATGG
ACCCCACTCTCCCGCAGAACCAGCTCGAAGAGAGGTGATTGTAATCTTTTGTGGGTTGCTGGTGCTTGTGTCCTTGGTGG
CTTTCAATGGTTACAGCTGGGCGCATTCAAATGGACATGATGGAGATCATCATCATGCGTCATCATCATTGCCAACAACG
TTGTTAACAGTAACACCCAATGAATTGGAACTGAGCCCCGACACAGTGCCATGGTCAACATGGCAGACGACGCTTTCACG
TGGCGTGTCTGCAGGGGTCTCGGAGAAGTCCAGCTGGCTATTCAACAGCAACAATGGGAACGGGGAATCGTATCCTTGGA
ACAATAGCATGCTGTCATGGCAGAGAACAGCTTACCATTTCCAACCTGAGAAGAATTGGATGAACGATCCTAATGGTCCA
ATGTACTACAAAGGGTGGTATCACTTCTTCTACCAGTACAATCCAAACGGTGCAGTTTGGGGTGACATAGTTTGGGGACA
CGCTGTGTCAAGGGACATGATTCACTGGTTTCACCTTCCACTTGCAATGGTGGCTGATCAATGGTATGACAAAAACGGTG
TGTGGACAGGCTCTGCCACCATCTTACCCGATGGTCAAGTCATCATGCTATACACTGGTTCCACCAACGAGTCAATGCAA
GTGCAAAACCTTGCATACCCTGCAGACCCCTCTGATCCCCTCCTTGTTGATTGGATCAAATACCCTGCAAATCCTGTTCT
ATTCCCACCACCTGGCATTGATGCCAAAGATTTTCGTGACCCCACCACTGCATGGATCACCTCTGAAGGAAAGTGGCGCA
TCAGTATTGGTTCCAAGCTTAACAAAACTGGCATTGCCTTGGTTTATGATACCAATGACTTCAAGACCTTTGAGCGTGTG
GAGGGAGTGCTTCATGTTGTCCCTGGCACTGGCATGTGGGAGTGTGTTGACTTTTTCCCTGTATCTAGCAAGGGTGAAAA
TGGCCTTGATACTTCAATCAATGGGGAGAATGTGAAGCATGTGGTAAAGGTTAGCTTGGATGATGATAGACATGATTACT
ATGCACTCGGGACTTACGATGAGAAGAATGTTAAGTTCACACCTGATGACTTCAACAACGATGTTGGTATTGGACTCAGA
TATGACTATGGTATATTTTATGCATCCAAGACATTTTATGATCAGAGTAAGGGGAGGAGAGTGTTGTGGGGTTGGATTGG
AGAGTCTGATAGCGAATACGCTGACGTGGCCAAAGGTTGGGCATCAGTTCAGGGTATTCCAAGAACAGTGGCACTTGATA
AGAAAACTGGTAGCAACTTAATTCAATGGCCTGTGGCAGAGGTAGAGAGTTTGAGATTGAGAAGCGACGAGTTTCAAAAT
TTGAAGGTGAAGCCAGGGTCAGTGGTGCCACTAGAAATTGGAACAGCTGCACAGTTGGACATTGTAGCTGAGTTTGAGAT
AGATAAGAAGGCCTTGGAGAAGACAGGCCAGTCCAATAAAGAGTATAAGTGTAGCACTAGTGGTGGATCAACTGAGCGTG
GTACCATAGGACCTTTTGGTCTTCTAGTTTTGGCAGATGATGATCTTTCTGAATACACTCCCACTTACTTTTATGTCGTC
AAAGGAAGCCATGGACAACTTAAAACTTCCTTCTGCTCTGATCAATCAAGGTCTTCTCTAGCAACTGATGTTAGTAAGAA
AATCTTTGGAAGCTTTGTTCCAGTACTAAAAGACGAAAAGTTGTCCGTAAGGATATTGGTGGACCATTCTATAGTGGAAA
GCTTTGCTCAAGGTGGAAGGACGTGTGTAACATCTCGAGTTTATCCAACGAAGGCAATCTATGGAGCTGCTAGATTGTTT
TTATTCAATAATGCTACTGAGGCCACTGTGACTGCTTCAGTTAAGGTTTGGCAAATGAATTCTGCATTCATACGCCCATT
CCACCCTGACCAAAGCAATTCATTTTAA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGAGAACAGCTTACCATTTCCAACCTGAGAAGAATTGGATGAACGATCCTAATGGTCCAATGTACTACAAAGGGTGGTATCACTTCTTCTACCAGTACAATCCA
Microexon-tag Amino Acid seq WQRTAYHFQPEKNWMNDPNGPMYYKGWYHFFYQYNP
Transcript ID Gm.2040.1
Gene ID Gm.2040
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.3e-108
Motif start 146
Motif end 464
Protein seq >Gm.2040.1
MGSPNTRTSSSSDSELPCSNYIPVPDGPHSPAEPARREVIVIFCGLLVLVSLVAFNGYSWAHSNGHDGDHHHASSSLPTT
LLTVTPNELELSPDTVPWSTWQTTLSRGVSAGVSEKSSWLFNSNNGNGESYPWNNSMLSWQRTAYHFQPEKNWMNDPNGP
MYYKGWYHFFYQYNPNGAVWGDIVWGHAVSRDMIHWFHLPLAMVADQWYDKNGVWTGSATILPDGQVIMLYTGSTNESMQ
VQNLAYPADPSDPLLVDWIKYPANPVLFPPPGIDAKDFRDPTTAWITSEGKWRISIGSKLNKTGIALVYDTNDFKTFERV
EGVLHVVPGTGMWECVDFFPVSSKGENGLDTSINGENVKHVVKVSLDDDRHDYYALGTYDEKNVKFTPDDFNNDVGIGLR
YDYGIFYASKTFYDQSKGRRVLWGWIGESDSEYADVAKGWASVQGIPRTVALDKKTGSNLIQWPVAEVESLRLRSDEFQN
LKVKPGSVVPLEIGTAAQVLVHVIYFNWIVVRRTSSFM*
CDS seq >Gm.2040.1
ATGGGAAGCCCCAACACAAGAACCAGCTCCTCGTCTGATTCCGAACTCCCTTGCTCTAATTACATTCCGGTACCCGATGG
ACCCCACTCTCCCGCAGAACCAGCTCGAAGAGAGGTGATTGTAATCTTTTGTGGGTTGCTGGTGCTTGTGTCCTTGGTGG
CTTTCAATGGTTACAGCTGGGCGCATTCAAATGGACATGATGGAGATCATCATCATGCGTCATCATCATTGCCAACAACG
TTGTTAACAGTAACACCCAATGAATTGGAACTGAGCCCCGACACAGTGCCATGGTCAACATGGCAGACGACGCTTTCACG
TGGCGTGTCTGCAGGGGTCTCGGAGAAGTCCAGCTGGCTATTCAACAGCAACAATGGGAACGGGGAATCGTATCCTTGGA
ACAATAGCATGCTGTCATGGCAGAGAACAGCTTACCATTTCCAACCTGAGAAGAATTGGATGAACGATCCTAATGGTCCA
ATGTACTACAAAGGGTGGTATCACTTCTTCTACCAGTACAATCCAAACGGTGCAGTTTGGGGTGACATAGTTTGGGGACA
CGCTGTGTCAAGGGACATGATTCACTGGTTTCACCTTCCACTTGCAATGGTGGCTGATCAATGGTATGACAAAAACGGTG
TGTGGACAGGCTCTGCCACCATCTTACCCGATGGTCAAGTCATCATGCTATACACTGGTTCCACCAACGAGTCAATGCAA
GTGCAAAACCTTGCATACCCTGCAGACCCCTCTGATCCCCTCCTTGTTGATTGGATCAAATACCCTGCAAATCCTGTTCT
ATTCCCACCACCTGGCATTGATGCCAAAGATTTTCGTGACCCCACCACTGCATGGATCACCTCTGAAGGAAAGTGGCGCA
TCAGTATTGGTTCCAAGCTTAACAAAACTGGCATTGCCTTGGTTTATGATACCAATGACTTCAAGACCTTTGAGCGTGTG
GAGGGAGTGCTTCATGTTGTCCCTGGCACTGGCATGTGGGAGTGTGTTGACTTTTTCCCTGTATCTAGCAAGGGTGAAAA
TGGCCTTGATACTTCAATCAATGGGGAGAATGTGAAGCATGTGGTAAAGGTTAGCTTGGATGATGATAGACATGATTACT
ATGCACTCGGGACTTACGATGAGAAGAATGTTAAGTTCACACCTGATGACTTCAACAACGATGTTGGTATTGGACTCAGA
TATGACTATGGTATATTTTATGCATCCAAGACATTTTATGATCAGAGTAAGGGGAGGAGAGTGTTGTGGGGTTGGATTGG
AGAGTCTGATAGCGAATACGCTGACGTGGCCAAAGGTTGGGCATCAGTTCAGGGTATTCCAAGAACAGTGGCACTTGATA
AGAAAACTGGTAGCAACTTAATTCAATGGCCTGTGGCAGAGGTAGAGAGTTTGAGATTGAGAAGCGACGAGTTTCAAAAT
TTGAAGGTGAAGCCAGGGTCAGTGGTGCCACTAGAAATTGGAACAGCTGCACAGGTTTTGGTTCATGTTATTTATTTTAA
TTGGATAGTAGTTAGGAGAACATCAAGTTTTATGTAA