Microexon ID Gm_5:5152585-5152593:-
Species Glycine max
Coordinates 5:5152585..5152593
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
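Note: the Microexon ID appears to pack the locus into one string. The sketch below splits it into fields and recomputes Size from the coordinates. The layout (species prefix, chromosome, start-end, strand) and the 1-based inclusive coordinate convention are assumptions inferred from this record, and `parse_microexon_id` is a hypothetical helper, not part of any database API.

```python
import re

def parse_microexon_id(microexon_id):
    """Split an ID such as 'Gm_5:5152585-5152593:-' into its parts.

    Assumed layout: <species prefix>_<chromosome>:<start>-<end>:<strand>.
    """
    m = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if m is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "prefix": prefix,
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes 1-based inclusive coordinates, which agrees with Size 9 here.
        "size": end - start + 1,
    }

rec = parse_microexon_id("Gm_5:5152585-5152593:-")
print(rec["chrom"], rec["start"], rec["end"], rec["strand"], rec["size"])
```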
Microexon DNA seq ATCCCAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCCAATGGTCCTATGTACTACAAGGGATGGTATCACTTTTTCTACCAATACAACCCG
Microexon-tag Amino Acid Seq WQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNP
Microexon-tag spanning region 5151470-5153291
Microexon-tag prediction score 0.9746
Overlapped with the annotated transcript (%) 100
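Note: the structure field (49,9,50) and the Microexon-tag sequences above can be cross-checked with a short script. This is a minimal sketch, assuming the tag starts in reading frame 0 and is translated with the standard genetic code. The variable names are illustrative only. Under these assumptions the 49 nt upstream exon leaves one base of a split codon before the microexon, which agrees with Phase 1, and the four codons touching the microexon give the listed microexon amino acid sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

TAG = (
    "TGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACG"    # upstream flanking exon, 49 nt
    "ATCCCAATG"                                            # microexon, 9 nt
    "GTCCTATGTACTACAAGGGATGGTATCACTTTTTCTACCAATACAACCCG"   # downstream flanking exon, 50 nt
)
SIZES = (49, 9, 50)  # flanking exon, microexon, flanking exon

# Split the tag according to the structure field.
up_len, micro_len, down_len = SIZES
upstream = TAG[:up_len]
microexon = TAG[up_len:up_len + micro_len]
downstream = TAG[up_len + micro_len:]

# Translate the whole tag, assuming it starts in frame 0.
peptide = "".join(CODON_TABLE[TAG[i:i + 3]] for i in range(0, len(TAG), 3))

# Codons that overlap the microexon, including codons split across exon junctions.
first_codon = up_len // 3
last_codon = (up_len + micro_len - 1) // 3
microexon_peptide = peptide[first_codon:last_codon + 1]

print(microexon, peptide, microexon_peptide, up_len % 3)
```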
New Transcript ID KRH57351x
Reference Transcript ID KRH57351
Gene ID GLYMA_05G056300
Gene Name NA
Transcript ID KRH57351
Protein ID KRH57351
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.1e-109
Motif start 118
Motif end 436
Protein seq >KRH57351
MDHRKPLLPTSSDAAPNPHTRKDLLLVICGLLLLSSLVAFGGYRASNAPHADVASATSNDEQRSPTSVPSPKWYPVSRGV
SSGVSEKSSSMLFAVKDGASKAFPWDNSMLSWQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNPNGAVWGDIVWGHA
VSRDMIHWLHLPLAMVADQWYDMQGVWTGSATILPNGEIIMLYTGSTNESVQVQNLAYPANPSDPLLVDWIKYPGNPVLV
PPPGIGAKDFRDPTTAWLTSEGKWRITIGSKLNKTGIALVYDTEDFKNYELKEGLLRAVAGTGMWECVDFFPVSKENENG
LDTSINGAEVKHVMKVSLDDDRHDYYSIGTYDEKNVLFTPDDAKNDVGVGLRYDYGIFYASKTFYDQNKERRVLWGWIGE
SDSEYADVAKGWASVQSIPRTVELDRKTGSNLLQWPVAEIESLRLRSDEFKNLKAKPGSVVSVDIETATQLDIVAEFEID
KETLDKIPQSNEEYTCSTSGGSKQRGALGPFGLLVLADEGLSEYTPQYFYVIKGSNGNLKTSFCADQSRSSQANDVRKQI
VGSAVPVLKGEKFSLRILVDHSIVESFAQGGRTVVTSRVYPTKAIYGAARLFLFNNATEATVTASLKVWQMNSAFIRPFH
PDQKS*
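Note: a quick way to see where the microexon falls relative to the Glyco_hydro_32N motif is to locate the Microexon-tag peptide in the protein. The sketch below uses only the first 160 residues of the protein sequence above and assumes that Motif start and Motif end are 1-based residue positions on this protein. That convention is an assumption, not something the record states.

```python
# First 160 residues of KRH57351, copied from the protein sequence above.
PROTEIN_PREFIX = (
    "MDHRKPLLPTSSDAAPNPHTRKDLLLVICGLLLLSSLVAFGGYRASNAPHADVASATSNDEQRSPTSVPSPKWYPVSRGV"
    "SSGVSEKSSSMLFAVKDGASKAFPWDNSMLSWQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNPNGAVWGDIVWGHA"
)
TAG_PEPTIDE = "WQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNP"
MICROEXON_PEPTIDE = "DPNG"
MOTIF_START, MOTIF_END = 118, 436  # assumed to be 1-based protein coordinates

# Locate the tag peptide; str.find returns a 0-based index or -1.
idx = PROTEIN_PREFIX.find(TAG_PEPTIDE)
if idx < 0:
    raise ValueError("tag peptide not found in the protein prefix")
tag_start = idx + 1                        # convert to 1-based
tag_end = tag_start + len(TAG_PEPTIDE) - 1

# Position of the microexon-encoded residues within the protein.
micro_start = tag_start + TAG_PEPTIDE.find(MICROEXON_PEPTIDE)
micro_end = micro_start + len(MICROEXON_PEPTIDE) - 1

# True when the microexon residues lie entirely inside the motif span.
inside_motif = MOTIF_START <= micro_start and micro_end <= MOTIF_END

print(tag_start, tag_end, micro_start, micro_end, inside_motif)
```

Under these assumptions the tag peptide begins a few residues upstream of the motif start, while the microexon-encoded residues lie inside the motif span.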
CDS seq >KRH57351
ATGGATCATCGCAAACCATTGCTACCCACTTCCTCTGATGCTGCTCCAAATCCACACACTCGCAAGGACCTTCTTCTAGT
TATTTGCGGCTTGCTATTACTCTCTTCCCTTGTTGCCTTTGGAGGGTACAGGGCCTCCAACGCGCCGCATGCCGACGTGG
CATCGGCAACATCGAACGATGAGCAACGTAGTCCGACATCGGTGCCTTCTCCGAAATGGTACCCGGTTTCGCGAGGTGTT
TCTTCCGGGGTGTCAGAGAAGTCATCAAGCATGTTGTTTGCGGTTAAAGATGGAGCTTCGAAGGCATTTCCATGGGACAA
CAGCATGTTGTCTTGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCCAATGGTCCTATGT
ACTACAAGGGATGGTATCACTTTTTCTACCAATACAACCCGAACGGTGCAGTTTGGGGTGACATAGTTTGGGGACATGCA
GTTTCAAGGGACATGATCCATTGGCTTCACCTTCCACTGGCAATGGTGGCTGATCAATGGTATGACATGCAAGGTGTGTG
GACAGGCTCAGCCACAATCTTACCAAATGGTGAAATCATCATGTTATACACAGGTTCCACCAACGAGTCAGTGCAGGTTC
AAAACCTTGCATACCCTGCAAACCCCTCTGACCCTCTCCTTGTGGATTGGATCAAATACCCTGGGAACCCCGTTTTGGTG
CCACCACCAGGCATTGGGGCCAAGGACTTTCGTGACCCAACAACAGCGTGGCTCACCTCAGAAGGGAAGTGGCGAATCAC
CATAGGTTCCAAGCTCAACAAAACCGGCATTGCGTTGGTTTATGACACTGAGGACTTCAAGAACTATGAGCTCAAGGAGG
GGCTGCTTCGTGCTGTCGCTGGCACCGGCATGTGGGAGTGTGTGGACTTCTTCCCTGTGTCCAAGGAGAATGAGAATGGA
TTGGATACTTCTATTAATGGGGCTGAGGTGAAGCATGTGATGAAGGTGAGCCTGGATGATGATAGACATGATTACTATTC
AATTGGGACTTATGATGAGAAGAATGTTCTGTTCACACCAGATGATGCTAAGAATGATGTTGGCGTTGGATTGAGGTATG
ACTATGGGATATTCTATGCATCCAAGACGTTTTATGATCAGAATAAGGAGAGGAGAGTTTTGTGGGGTTGGATTGGGGAG
TCTGACAGTGAATATGCTGATGTGGCCAAAGGTTGGGCTTCAGTTCAGAGTATTCCTAGAACTGTGGAGCTTGATAGGAA
GACTGGCAGCAACTTACTTCAGTGGCCTGTTGCTGAGATCGAGAGTTTGAGATTGAGAAGTGATGAATTTAAAAATTTGA
AGGCTAAACCAGGGTCAGTGGTGTCAGTAGATATTGAAACAGCCACGCAGTTGGACATTGTTGCCGAGTTTGAGATAGAC
AAGGAAACCCTTGACAAAATTCCTCAATCCAACGAGGAATACACGTGCAGCACCAGTGGTGGATCTAAACAACGTGGTGC
CTTAGGACCTTTTGGTCTTTTGGTTTTGGCAGATGAGGGGCTTTCTGAGTATACTCCTCAGTATTTTTATGTCATTAAAG
GGAGCAATGGAAATCTTAAGACTTCCTTTTGTGCTGATCAATCAAGGTCTTCTCAGGCAAATGATGTCCGCAAGCAAATC
GTTGGCAGCGCTGTTCCAGTTCTTAAAGGCGAAAAGTTTTCCTTGCGGATACTGGTGGACCATTCTATTGTTGAAAGCTT
TGCTCAAGGTGGAAGGACGGTTGTGACATCTCGGGTTTATCCAACAAAGGCAATCTATGGAGCTGCTAGGTTGTTCTTGT
TCAACAATGCTACCGAGGCCACTGTGACAGCCTCACTAAAAGTTTGGCAAATGAATTCTGCATTTATACGCCCATTCCAC
CCTGATCAGAAGAGTTAA
Microexon DNA seq ATCCCAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCCAATGGTCCTATGTACTACAAGGGATGGTATCACTTTTTCTACCAATACAACCCG
Microexon-tag Amino Acid seq WQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNP
Transcript ID KRH57351
Gene ID Gm.40420
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.1e-109
Motif start 118
Motif end 436
Protein seq >KRH57351
MDHRKPLLPTSSDAAPNPHTRKDLLLVICGLLLLSSLVAFGGYRASNAPHADVASATSNDEQRSPTSVPSPKWYPVSRGV
SSGVSEKSSSMLFAVKDGASKAFPWDNSMLSWQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNPNGAVWGDIVWGHA
VSRDMIHWLHLPLAMVADQWYDMQGVWTGSATILPNGEIIMLYTGSTNESVQVQNLAYPANPSDPLLVDWIKYPGNPVLV
PPPGIGAKDFRDPTTAWLTSEGKWRITIGSKLNKTGIALVYDTEDFKNYELKEGLLRAVAGTGMWECVDFFPVSKENENG
LDTSINGAEVKHVMKVSLDDDRHDYYSIGTYDEKNVLFTPDDAKNDVGVGLRYDYGIFYASKTFYDQNKERRVLWGWIGE
SDSEYADVAKGWASVQSIPRTVELDRKTGSNLLQWPVAEIESLRLRSDEFKNLKAKPGSVVSVDIETATQLDIVAEFEID
KETLDKIPQSNEEYTCSTSGGSKQRGALGPFGLLVLADEGLSEYTPQYFYVIKGSNGNLKTSFCADQSRSSQANDVRKQI
VGSAVPVLKGEKFSLRILVDHSIVESFAQGGRTVVTSRVYPTKAIYGAARLFLFNNATEATVTASLKVWQMNSAFIRPFH
PDQKS*
CDS seq >KRH57351
ATGGATCATCGCAAACCATTGCTACCCACTTCCTCTGATGCTGCTCCAAATCCACACACTCGCAAGGACCTTCTTCTAGT
TATTTGCGGCTTGCTATTACTCTCTTCCCTTGTTGCCTTTGGAGGGTACAGGGCCTCCAACGCGCCGCATGCCGACGTGG
CATCGGCAACATCGAACGATGAGCAACGTAGTCCGACATCGGTGCCTTCTCCGAAATGGTACCCGGTTTCGCGAGGTGTT
TCTTCCGGGGTGTCAGAGAAGTCATCAAGCATGTTGTTTGCGGTTAAAGATGGAGCTTCGAAGGCATTTCCATGGGACAA
CAGCATGTTGTCTTGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCCAATGGTCCTATGT
ACTACAAGGGATGGTATCACTTTTTCTACCAATACAACCCGAACGGTGCAGTTTGGGGTGACATAGTTTGGGGACATGCA
GTTTCAAGGGACATGATCCATTGGCTTCACCTTCCACTGGCAATGGTGGCTGATCAATGGTATGACATGCAAGGTGTGTG
GACAGGCTCAGCCACAATCTTACCAAATGGTGAAATCATCATGTTATACACAGGTTCCACCAACGAGTCAGTGCAGGTTC
AAAACCTTGCATACCCTGCAAACCCCTCTGACCCTCTCCTTGTGGATTGGATCAAATACCCTGGGAACCCCGTTTTGGTG
CCACCACCAGGCATTGGGGCCAAGGACTTTCGTGACCCAACAACAGCGTGGCTCACCTCAGAAGGGAAGTGGCGAATCAC
CATAGGTTCCAAGCTCAACAAAACCGGCATTGCGTTGGTTTATGACACTGAGGACTTCAAGAACTATGAGCTCAAGGAGG
GGCTGCTTCGTGCTGTCGCTGGCACCGGCATGTGGGAGTGTGTGGACTTCTTCCCTGTGTCCAAGGAGAATGAGAATGGA
TTGGATACTTCTATTAATGGGGCTGAGGTGAAGCATGTGATGAAGGTGAGCCTGGATGATGATAGACATGATTACTATTC
AATTGGGACTTATGATGAGAAGAATGTTCTGTTCACACCAGATGATGCTAAGAATGATGTTGGCGTTGGATTGAGGTATG
ACTATGGGATATTCTATGCATCCAAGACGTTTTATGATCAGAATAAGGAGAGGAGAGTTTTGTGGGGTTGGATTGGGGAG
TCTGACAGTGAATATGCTGATGTGGCCAAAGGTTGGGCTTCAGTTCAGAGTATTCCTAGAACTGTGGAGCTTGATAGGAA
GACTGGCAGCAACTTACTTCAGTGGCCTGTTGCTGAGATCGAGAGTTTGAGATTGAGAAGTGATGAATTTAAAAATTTGA
AGGCTAAACCAGGGTCAGTGGTGTCAGTAGATATTGAAACAGCCACGCAGTTGGACATTGTTGCCGAGTTTGAGATAGAC
AAGGAAACCCTTGACAAAATTCCTCAATCCAACGAGGAATACACGTGCAGCACCAGTGGTGGATCTAAACAACGTGGTGC
CTTAGGACCTTTTGGTCTTTTGGTTTTGGCAGATGAGGGGCTTTCTGAGTATACTCCTCAGTATTTTTATGTCATTAAAG
GGAGCAATGGAAATCTTAAGACTTCCTTTTGTGCTGATCAATCAAGGTCTTCTCAGGCAAATGATGTCCGCAAGCAAATC
GTTGGCAGCGCTGTTCCAGTTCTTAAAGGCGAAAAGTTTTCCTTGCGGATACTGGTGGACCATTCTATTGTTGAAAGCTT
TGCTCAAGGTGGAAGGACGGTTGTGACATCTCGGGTTTATCCAACAAAGGCAATCTATGGAGCTGCTAGGTTGTTCTTGT
TCAACAATGCTACCGAGGCCACTGTGACAGCCTCACTAAAAGTTTGGCAAATGAATTCTGCATTTATACGCCCATTCCAC
CCTGATCAGAAGAGTTAA