Microexon ID Gm_17:11226020-11226028:-
Species Glycine max
Coordinates 17:11226020..11226028
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (cluster consensus, IUPAC ambiguity codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCTAACGGTCCTATGTACTACAAGGGATGGTATCACTTCTTCTACCAATACAACCCG
Microexon-tag Amino Acid Seq WQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNP
Microexon-tag spanning region 11225077-11226747
Microexon-tag prediction score 0.9758
Overlapped with the annotated transcript (%) 100
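
The fields above fit together arithmetically, and a short script can check this. The sketch below is not part of the database. It splits the microexon-tag DNA sequence using the listed structure (49,9,50), translates it with the standard genetic code, and recovers the microexon peptide. It assumes that "Phase 1" means the upstream flanking exon ends one base into a codon (the intron-phase convention); the record does not state this. All function and variable names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate complete codons of dna in frame 0; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_by_sizes(seq, sizes):
    """Cut seq into consecutive pieces of the given lengths."""
    parts, pos = [], 0
    for n in sizes:
        parts.append(seq[pos:pos + n])
        pos += n
    return parts


# Values copied from the record.
TAG = ("TGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACG"
       "ATCCTAACG"
       "GTCCTATGTACTACAAGGGATGGTATCACTTCTTCTACCAATACAACCCG")
SIZES = (49, 9, 50)  # flanking exon, microexon, flanking exon

upstream, microexon, downstream = split_by_sizes(TAG, SIZES)

# Bases of the codon already used when the upstream exon ends
# (assumed to correspond to the "Phase" field).
phase = len(upstream) % 3

# Codons touched by the microexon: round its start down and its end up
# to the nearest codon boundary within the tag.
codon_start = 3 * (len(upstream) // 3)
codon_end = 3 * -(-(len(upstream) + len(microexon)) // 3)

tag_peptide = translate(TAG)
microexon_peptide = translate(TAG[codon_start:codon_end])

print(phase, microexon, microexon_peptide)
print(tag_peptide)
```

Under these assumptions the 9-nt microexon spans four codons, because it borrows one base from the upstream exon and two from the downstream exon. This is why the "Microexon Amino Acid seq" field lists four residues rather than three.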
New Transcript ID KRH04082x
Reference Transcript ID KRH04082
Gene ID GLYMA_17G138500
Gene Name NA
Transcript ID KRH04082
Protein ID KRH04082
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.6e-110
Motif start 118
Motif end 436
Protein seq >KRH04082
MDHRKPLLPTSSDDAPNPRTRKDLVLMICGLFLLSSLVAFGGYRASNAPHADVSSPASNDEQPSPTSVPSPKWYPVSRGV
SSGVSEKSSSMLFAVKDGASEAFPWDNSMLSWQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNPNGAVWGDIVWGHA
VSRDMIHWLHLPLAMVADQWYDMQGVWTGSATILPNGEIIMLYTGSTNESVQVQNLAYPADPSDPLLVDWIKYPGNPVLV
PPPGIGTKDFRDPTTAWLTSEGKWRITIGSKLNKTGIALVYDTEDFKSYELKEGLLRAVDGTGMWECVDFFPVSKKNENG
LDTSVNGDEVKHVMKVSLDDDRHDYYAIGTYDEKSVLFTPDDAKNDVGVGLRYDYGIFYASKTFYDQNKERRLLWGWIGE
SDSEYADVAKGWASVQSIPRTVELDRKTGSNLLQWPVAEVESLRLRSDEFKNLKAKPGSVVSIDIETATQLDIVAEFEID
KETLEKTPESNEEYTCGNSGGSKQRGALGPFGLLVLADEGLFEYTPQYFYVIKGSNGNLKTSFCADQSRSSQANDVRKQI
VGSAVPVLKDEKFSLRILVDHSIVESFAQGGRTVVTSRVYPTKAIYGAARLFLFNNATEATVTASLNVWQMNSAFIRPFH
PDQKN*
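
To see where the microexon falls relative to the Pfam motif, the tag peptide can be located in the protein sequence. This is a minimal sketch that assumes "Motif start" and "Motif end" are 1-based, inclusive residue positions, which the record does not state. Only the first 160 residues of KRH04082 are pasted in, since that is enough to contain the tag. Variable names are illustrative.

```python
# First 160 residues of KRH04082 (first two lines of the protein sequence).
PROTEIN_PREFIX = (
    "MDHRKPLLPTSSDDAPNPRTRKDLVLMICGLFLLSSLVAFGGYRASNAPHADVSSPASNDEQPSPTSVPSPKWYPVSRGV"
    "SSGVSEKSSSMLFAVKDGASEAFPWDNSMLSWQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNPNGAVWGDIVWGHA"
)
TAG_PEPTIDE = "WQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNP"
SIZES = (49, 9, 50)                  # flanking exon, microexon, flanking exon (nt)
MOTIF_START, MOTIF_END = 118, 436    # assumed 1-based, inclusive

idx = PROTEIN_PREFIX.find(TAG_PEPTIDE)   # 0-based offset, -1 if absent
tag_start = idx + 1                      # convert to 1-based
tag_end = idx + len(TAG_PEPTIDE)

# 0-based codon indices (within the tag) of the first and last codon
# that contain at least one microexon base.
first_codon = SIZES[0] // 3
last_codon = (SIZES[0] + SIZES[1] - 1) // 3

micro_start = tag_start + first_codon
micro_end = tag_start + last_codon
inside_motif = MOTIF_START <= micro_start and micro_end <= MOTIF_END

print(f"tag residues {tag_start}-{tag_end}")
print(f"microexon residues {micro_start}-{micro_end}, inside motif: {inside_motif}")
```

With these coordinates, the microexon-encoded residues lie within the annotated Glyco_hydro_32N range, while the tag itself begins a few residues upstream of the motif start.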
CDS seq >KRH04082
ATGGATCATCGCAAACCATTGTTACCCACTTCCTCTGACGATGCTCCAAATCCACGCACTCGGAAGGACCTTGTTCTAAT
GATTTGCGGTTTGTTTTTGCTCTCTTCCCTTGTTGCCTTTGGAGGGTACAGGGCCTCCAACGCGCCACATGCTGACGTGT
CATCGCCAGCATCGAACGATGAACAACCTAGTCCCACATCGGTGCCTTCTCCGAAATGGTACCCGGTTTCACGGGGTGTT
TCTTCTGGGGTGTCGGAGAAGTCATCAAGCATGTTGTTTGCGGTTAAAGATGGAGCTTCGGAAGCATTTCCTTGGGACAA
TAGCATGCTGTCGTGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCTAACGGTCCTATGT
ACTACAAGGGATGGTATCACTTCTTCTACCAATACAACCCGAACGGTGCAGTTTGGGGTGACATAGTTTGGGGACACGCA
GTGTCAAGGGACATGATCCACTGGCTTCACCTTCCACTAGCAATGGTGGCTGATCAATGGTATGACATGCAAGGTGTGTG
GACAGGCTCAGCCACGATCTTACCAAACGGTGAAATCATCATGTTATACACGGGTTCCACCAACGAGTCAGTGCAGGTTC
AAAACCTTGCATACCCTGCAGACCCCTCTGACCCTCTCCTTGTGGATTGGATCAAATACCCTGGAAACCCCGTTTTGGTG
CCACCACCAGGCATTGGTACTAAGGACTTTCGCGACCCGACAACCGCGTGGCTCACCTCCGAAGGGAAGTGGCGAATCAC
CATAGGATCCAAGCTCAACAAAACAGGCATTGCATTGGTTTATGACACTGAGGACTTCAAGAGCTATGAGCTCAAGGAGG
GTTTGCTTCGCGCTGTCGATGGCACCGGCATGTGGGAGTGTGTGGACTTCTTCCCTGTGTCCAAGAAGAATGAGAATGGA
TTGGATACATCTGTTAATGGGGATGAGGTGAAGCATGTGATGAAGGTGAGCCTGGATGATGATAGACATGATTACTATGC
AATTGGGACTTATGATGAGAAGAGTGTCTTGTTCACACCAGATGATGCTAAGAATGATGTTGGTGTTGGATTAAGGTATG
ACTACGGGATATTCTATGCATCCAAGACGTTTTATGATCAGAATAAGGAGAGGAGACTTTTGTGGGGTTGGATTGGAGAG
TCTGACAGTGAATATGCTGATGTAGCCAAAGGTTGGGCTTCAGTTCAGAGTATTCCTAGAACTGTGGAGCTTGATAGGAA
GACTGGCAGCAACTTGCTTCAGTGGCCTGTTGCTGAGGTGGAGAGTTTGAGATTGAGAAGTGATGAATTCAAAAATTTGA
AGGCTAAACCAGGGTCAGTCGTGTCAATAGATATTGAAACAGCCACACAGTTGGACATTGTTGCCGAGTTTGAGATAGAC
AAGGAAACCCTTGAGAAAACACCTGAATCCAACGAGGAGTACACGTGTGGCAACAGTGGTGGATCTAAACAACGTGGTGC
CTTAGGACCTTTTGGTCTTTTGGTTTTGGCAGATGAGGGCCTTTTTGAGTATACTCCTCAGTATTTTTATGTCATTAAAG
GGAGCAATGGAAATCTTAAGACTTCCTTTTGCGCTGATCAATCAAGGTCTTCTCAGGCAAATGATGTTCGCAAGCAAATC
GTTGGCAGCGCTGTTCCAGTACTTAAAGATGAAAAGTTTTCCTTGCGGATACTGGTGGACCATTCTATTGTTGAAAGCTT
TGCTCAAGGTGGAAGGACGGTTGTGACATCTCGGGTTTATCCAACAAAGGCAATCTATGGAGCTGCTAGGTTGTTCTTGT
TCAACAATGCTACTGAGGCCACTGTGACAGCCTCACTCAATGTTTGGCAAATGAATTCTGCATTTATACGCCCATTCCAC
CCTGATCAGAAGAATTAA
Transcript ID KRH04082
Gene ID Gm.22782
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.6e-110
Motif start 118
Motif end 436