
Microexon ID | Gm_5:5152585-5152593:- |
Species | Glycine max | Coordinates | 5:5152585..5152593 |
Microexon Cluster ID | MEP22 |
Size | 9 |
Phase | 1 |
Pfam Domain Motif | Glyco_hydro_32N |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 49,9,50 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq (cluster consensus, IUPAC ambiguity codes) | YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV |
Logo of Microexon-tag DNA Seq | [figure: sequence logo] |
Alignment of exons | [figure: exon alignment] |
Microexon DNA seq | ATCCCAATG |
Microexon Amino Acid seq | DPNG |
Microexon-tag DNA Seq | TGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCCAATGGTCCTATGTACTACAAGGGATGGTATCACTTTTTCTACCAATACAACCCG |
Microexon-tag Amino Acid Seq | WQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNP |
Microexon-tag spanning region | 5151470-5153291 |
Microexon-tag prediction score | 0.9746 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | KRH57351x |
Reference Transcript ID | KRH57351 |
Gene ID | GLYMA_05G056300 |
Gene Name | NA |
Transcript ID | KRH57351 |
Protein ID | KRH57351 |
Pfam domain motif | Glyco_hydro_32N |
Motif E-value | 1.1e-109 |
Motif start | 118 |
Motif end | 436 |
Protein seq | >KRH57351 MDHRKPLLPTSSDAAPNPHTRKDLLLVICGLLLLSSLVAFGGYRASNAPHADVASATSNDEQRSPTSVPSPKWYPVSRGV SSGVSEKSSSMLFAVKDGASKAFPWDNSMLSWQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNPNGAVWGDIVWGHA VSRDMIHWLHLPLAMVADQWYDMQGVWTGSATILPNGEIIMLYTGSTNESVQVQNLAYPANPSDPLLVDWIKYPGNPVLV PPPGIGAKDFRDPTTAWLTSEGKWRITIGSKLNKTGIALVYDTEDFKNYELKEGLLRAVAGTGMWECVDFFPVSKENENG LDTSINGAEVKHVMKVSLDDDRHDYYSIGTYDEKNVLFTPDDAKNDVGVGLRYDYGIFYASKTFYDQNKERRVLWGWIGE SDSEYADVAKGWASVQSIPRTVELDRKTGSNLLQWPVAEIESLRLRSDEFKNLKAKPGSVVSVDIETATQLDIVAEFEID KETLDKIPQSNEEYTCSTSGGSKQRGALGPFGLLVLADEGLSEYTPQYFYVIKGSNGNLKTSFCADQSRSSQANDVRKQI VGSAVPVLKGEKFSLRILVDHSIVESFAQGGRTVVTSRVYPTKAIYGAARLFLFNNATEATVTASLKVWQMNSAFIRPFH PDQKS* |
CDS seq | >KRH57351 ATGGATCATCGCAAACCATTGCTACCCACTTCCTCTGATGCTGCTCCAAATCCACACACTCGCAAGGACCTTCTTCTAGT TATTTGCGGCTTGCTATTACTCTCTTCCCTTGTTGCCTTTGGAGGGTACAGGGCCTCCAACGCGCCGCATGCCGACGTGG CATCGGCAACATCGAACGATGAGCAACGTAGTCCGACATCGGTGCCTTCTCCGAAATGGTACCCGGTTTCGCGAGGTGTT TCTTCCGGGGTGTCAGAGAAGTCATCAAGCATGTTGTTTGCGGTTAAAGATGGAGCTTCGAAGGCATTTCCATGGGACAA CAGCATGTTGTCTTGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCCAATGGTCCTATGT ACTACAAGGGATGGTATCACTTTTTCTACCAATACAACCCGAACGGTGCAGTTTGGGGTGACATAGTTTGGGGACATGCA GTTTCAAGGGACATGATCCATTGGCTTCACCTTCCACTGGCAATGGTGGCTGATCAATGGTATGACATGCAAGGTGTGTG GACAGGCTCAGCCACAATCTTACCAAATGGTGAAATCATCATGTTATACACAGGTTCCACCAACGAGTCAGTGCAGGTTC AAAACCTTGCATACCCTGCAAACCCCTCTGACCCTCTCCTTGTGGATTGGATCAAATACCCTGGGAACCCCGTTTTGGTG CCACCACCAGGCATTGGGGCCAAGGACTTTCGTGACCCAACAACAGCGTGGCTCACCTCAGAAGGGAAGTGGCGAATCAC CATAGGTTCCAAGCTCAACAAAACCGGCATTGCGTTGGTTTATGACACTGAGGACTTCAAGAACTATGAGCTCAAGGAGG GGCTGCTTCGTGCTGTCGCTGGCACCGGCATGTGGGAGTGTGTGGACTTCTTCCCTGTGTCCAAGGAGAATGAGAATGGA TTGGATACTTCTATTAATGGGGCTGAGGTGAAGCATGTGATGAAGGTGAGCCTGGATGATGATAGACATGATTACTATTC AATTGGGACTTATGATGAGAAGAATGTTCTGTTCACACCAGATGATGCTAAGAATGATGTTGGCGTTGGATTGAGGTATG ACTATGGGATATTCTATGCATCCAAGACGTTTTATGATCAGAATAAGGAGAGGAGAGTTTTGTGGGGTTGGATTGGGGAG TCTGACAGTGAATATGCTGATGTGGCCAAAGGTTGGGCTTCAGTTCAGAGTATTCCTAGAACTGTGGAGCTTGATAGGAA GACTGGCAGCAACTTACTTCAGTGGCCTGTTGCTGAGATCGAGAGTTTGAGATTGAGAAGTGATGAATTTAAAAATTTGA AGGCTAAACCAGGGTCAGTGGTGTCAGTAGATATTGAAACAGCCACGCAGTTGGACATTGTTGCCGAGTTTGAGATAGAC AAGGAAACCCTTGACAAAATTCCTCAATCCAACGAGGAATACACGTGCAGCACCAGTGGTGGATCTAAACAACGTGGTGC CTTAGGACCTTTTGGTCTTTTGGTTTTGGCAGATGAGGGGCTTTCTGAGTATACTCCTCAGTATTTTTATGTCATTAAAG GGAGCAATGGAAATCTTAAGACTTCCTTTTGTGCTGATCAATCAAGGTCTTCTCAGGCAAATGATGTCCGCAAGCAAATC GTTGGCAGCGCTGTTCCAGTTCTTAAAGGCGAAAAGTTTTCCTTGCGGATACTGGTGGACCATTCTATTGTTGAAAGCTT TGCTCAAGGTGGAAGGACGGTTGTGACATCTCGGGTTTATCCAACAAAGGCAATCTATGGAGCTGCTAGGTTGTTCTTGT TCAACAATGCTACCGAGGCCACTGTGACAGCCTCACTAAAAGTTTGGCAAATGAATTCTGCATTTATACGCCCATTCCAC CCTGATCAGAAGAGTTAA |
Transcript ID | KRH57351 |
Gene ID | Gm.40420 |
Gene Name | NA |
Pfam domain motif | Glyco_hydro_32N |
Motif E-value | 1.1e-109 |
Motif start | 118 |
Motif end | 436 |
Protein seq | >KRH57351 MDHRKPLLPTSSDAAPNPHTRKDLLLVICGLLLLSSLVAFGGYRASNAPHADVASATSNDEQRSPTSVPSPKWYPVSRGV SSGVSEKSSSMLFAVKDGASKAFPWDNSMLSWQRTAFHFQPEKNWMNDPNGPMYYKGWYHFFYQYNPNGAVWGDIVWGHA VSRDMIHWLHLPLAMVADQWYDMQGVWTGSATILPNGEIIMLYTGSTNESVQVQNLAYPANPSDPLLVDWIKYPGNPVLV PPPGIGAKDFRDPTTAWLTSEGKWRITIGSKLNKTGIALVYDTEDFKNYELKEGLLRAVAGTGMWECVDFFPVSKENENG LDTSINGAEVKHVMKVSLDDDRHDYYSIGTYDEKNVLFTPDDAKNDVGVGLRYDYGIFYASKTFYDQNKERRVLWGWIGE SDSEYADVAKGWASVQSIPRTVELDRKTGSNLLQWPVAEIESLRLRSDEFKNLKAKPGSVVSVDIETATQLDIVAEFEID KETLDKIPQSNEEYTCSTSGGSKQRGALGPFGLLVLADEGLSEYTPQYFYVIKGSNGNLKTSFCADQSRSSQANDVRKQI VGSAVPVLKGEKFSLRILVDHSIVESFAQGGRTVVTSRVYPTKAIYGAARLFLFNNATEATVTASLKVWQMNSAFIRPFH PDQKS* |
CDS seq | >KRH57351 ATGGATCATCGCAAACCATTGCTACCCACTTCCTCTGATGCTGCTCCAAATCCACACACTCGCAAGGACCTTCTTCTAGT TATTTGCGGCTTGCTATTACTCTCTTCCCTTGTTGCCTTTGGAGGGTACAGGGCCTCCAACGCGCCGCATGCCGACGTGG CATCGGCAACATCGAACGATGAGCAACGTAGTCCGACATCGGTGCCTTCTCCGAAATGGTACCCGGTTTCGCGAGGTGTT TCTTCCGGGGTGTCAGAGAAGTCATCAAGCATGTTGTTTGCGGTTAAAGATGGAGCTTCGAAGGCATTTCCATGGGACAA CAGCATGTTGTCTTGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCCAATGGTCCTATGT ACTACAAGGGATGGTATCACTTTTTCTACCAATACAACCCGAACGGTGCAGTTTGGGGTGACATAGTTTGGGGACATGCA GTTTCAAGGGACATGATCCATTGGCTTCACCTTCCACTGGCAATGGTGGCTGATCAATGGTATGACATGCAAGGTGTGTG GACAGGCTCAGCCACAATCTTACCAAATGGTGAAATCATCATGTTATACACAGGTTCCACCAACGAGTCAGTGCAGGTTC AAAACCTTGCATACCCTGCAAACCCCTCTGACCCTCTCCTTGTGGATTGGATCAAATACCCTGGGAACCCCGTTTTGGTG CCACCACCAGGCATTGGGGCCAAGGACTTTCGTGACCCAACAACAGCGTGGCTCACCTCAGAAGGGAAGTGGCGAATCAC CATAGGTTCCAAGCTCAACAAAACCGGCATTGCGTTGGTTTATGACACTGAGGACTTCAAGAACTATGAGCTCAAGGAGG GGCTGCTTCGTGCTGTCGCTGGCACCGGCATGTGGGAGTGTGTGGACTTCTTCCCTGTGTCCAAGGAGAATGAGAATGGA TTGGATACTTCTATTAATGGGGCTGAGGTGAAGCATGTGATGAAGGTGAGCCTGGATGATGATAGACATGATTACTATTC AATTGGGACTTATGATGAGAAGAATGTTCTGTTCACACCAGATGATGCTAAGAATGATGTTGGCGTTGGATTGAGGTATG ACTATGGGATATTCTATGCATCCAAGACGTTTTATGATCAGAATAAGGAGAGGAGAGTTTTGTGGGGTTGGATTGGGGAG TCTGACAGTGAATATGCTGATGTGGCCAAAGGTTGGGCTTCAGTTCAGAGTATTCCTAGAACTGTGGAGCTTGATAGGAA GACTGGCAGCAACTTACTTCAGTGGCCTGTTGCTGAGATCGAGAGTTTGAGATTGAGAAGTGATGAATTTAAAAATTTGA AGGCTAAACCAGGGTCAGTGGTGTCAGTAGATATTGAAACAGCCACGCAGTTGGACATTGTTGCCGAGTTTGAGATAGAC AAGGAAACCCTTGACAAAATTCCTCAATCCAACGAGGAATACACGTGCAGCACCAGTGGTGGATCTAAACAACGTGGTGC CTTAGGACCTTTTGGTCTTTTGGTTTTGGCAGATGAGGGGCTTTCTGAGTATACTCCTCAGTATTTTTATGTCATTAAAG GGAGCAATGGAAATCTTAAGACTTCCTTTTGTGCTGATCAATCAAGGTCTTCTCAGGCAAATGATGTCCGCAAGCAAATC GTTGGCAGCGCTGTTCCAGTTCTTAAAGGCGAAAAGTTTTCCTTGCGGATACTGGTGGACCATTCTATTGTTGAAAGCTT TGCTCAAGGTGGAAGGACGGTTGTGACATCTCGGGTTTATCCAACAAAGGCAATCTATGGAGCTGCTAGGTTGTTCTTGT TCAACAATGCTACCGAGGCCACTGTGACAGCCTCACTAAAAGTTTGGCAAATGAATTCTGCATTTATACGCCCATTCCAC CCTGATCAGAAGAGTTAA |
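The fields above fit together in a way that can be checked by hand: the 49,9,50 structure cuts the 108-nt microexon-tag into three exons, the middle one is the 9-nt microexon, and because the upstream flank ends one base into a codon (phase 1), the microexon touches four codons, giving the four-residue peptide DPNG. The following is a minimal Python sketch of that check, not the database's own pipeline. The helper names (`split_tag`, `translate`, `spanning_codons`) are hypothetical, and the sketch assumes that the tag starts in reading frame 0, that phase is the number of codon bases contributed by the upstream exon, and that the standard genetic code applies.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def split_tag(tag, sizes):
    """Cut a microexon-tag into its exons, given the exon sizes in order."""
    if sum(sizes) != len(tag):
        raise ValueError("exon sizes do not add up to the tag length")
    exons, pos = [], 0
    for size in sizes:
        exons.append(tag[pos:pos + size])
        pos += size
    return exons


def translate(dna):
    """Translate an in-frame DNA string; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def spanning_codons(tag, start, size):
    """Return the whole codons of `tag` that overlap tag[start:start + size]."""
    first = (start // 3) * 3               # round down to a codon boundary
    last = -(-(start + size) // 3) * 3     # round up to a codon boundary
    return tag[first:last]


# Values copied from the record above.
TAG = ("TGGCAGAGAACTGCTTTTCATTTCCAACCAGAGAAAAACTGGATGAACGATCCCAATGGTCCTATG"
       "TACTACAAGGGATGGTATCACTTTTTCTACCAATACAACCCG")
SIZES = (49, 9, 50)                        # flanking exon, microexon, flanking exon
START, END = 5152585, 5152593              # inclusive genomic coordinates

exons = split_tag(TAG, SIZES)
microexon = exons[1]
phase = len(exons[0]) % 3                  # codon bases supplied by the upstream exon
peptide = translate(spanning_codons(TAG, len(exons[0]), SIZES[1]))
genomic_size = END - START + 1

print(microexon, phase, peptide, genomic_size)
```

With the values from this record, the middle exon comes out as ATCCCAATG, the computed phase is 1, and the codons overlapping the microexon (GAT CCC AAT GGT) translate to DPNG, in agreement with the Size, Phase, and Microexon Amino Acid seq fields.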