Microexon ID Gm_9:45480013-45480021:+
Species Glycine max
Coordinates 9:45480013..45480021
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAAGAACAGCTTTCCATTTTCAACCTCAAAGGAACTGGATGAACGATCCTAACGGTCCATTGTTTTACATGGGGTGGTACCATATATTTTACCAATACAATCCT
Microexon-tag Amino Acid Seq WQRTAFHFQPQRNWMNDPNGPLFYMGWYHIFYQYNP
Microexon-tag spanning region45479304-45481080
Microexon-tag prediction score0.9584
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH39981x
Reference Transcript ID KRH39981
Gene ID GLYMA_09G231500
Gene Name NA
Transcript ID KRH39981
Protein ID KRH39981
Gene ID GLYMA_09G231500
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.1e-105
Motif start 106
Motif end 424
Protein seq >KRH39981
MAAMNPDLEHAPQIPLLNPPTGESGRRTQKGTLVFIVSIVFLLSFIIINLQSHEPSFENNITTVPLLPIARGVAEGVSAK
SNPYLSQKASYNWTNAMLSWQRTAFHFQPQRNWMNDPNGPLFYMGWYHIFYQYNPDSAVWGNITWGHAVSRDLIHWLYLP
IALVPDKWFDISGVWSGSATLLPDGKILMLYTGNTDRNVQVQNLAYPANLSDPLLLDWVKYANNPVLVPPPGIGPKDFRD
PTTAWIGPDEKWRITIGSKLNKTGLSLLYKTQDFIHYEQSDRYLHQVPGTGMWECVDFYPVSVNGPNGLDTSENGPDVKH
VLKASLDDTKVDHYAIGTYFIENDTWVPDNPNEDVGIGLKLDYGRYYASKTFYDQQKQRRILWGWINESDSETADLKKGW
ASLQTIPRTVVFDKKTRTNLLHWPVEEVESLRLSNSEFEGVVVKPGSVVPLDIGPATQLDIFAEFEIEDLASKGIGKDNV
DCGNGAVDRSAFGPFGILAIADDQLSELTPIYFHLSSTTKDGSLTTSFCVDETRSSKAPDVSKLIFGSKAPVLSDEKLSM
RVLVDHSIIESFAQGGRTVITSRVYPTEAIYGAARLFLFNNATDINIKASLKIWQLNSAFIRPFPFDQKL*
CDS seq >KRH39981
ATGGCAGCCATGAACCCTGATCTTGAGCATGCTCCACAAATTCCCTTGCTGAATCCTCCAACAGGGGAAAGTGGAAGGAG
AACGCAAAAGGGTACCCTTGTTTTCATTGTCTCGATTGTTTTCCTATTGTCATTTATCATCATAAACCTTCAGAGCCATG
AGCCCAGCTTTGAAAACAATATCACAACGGTGCCATTACTACCTATAGCTAGAGGGGTCGCTGAGGGTGTATCAGCCAAG
TCAAATCCATATCTATCACAGAAAGCTTCATATAATTGGACAAATGCCATGTTATCTTGGCAAAGAACAGCTTTCCATTT
TCAACCTCAAAGGAACTGGATGAACGATCCTAACGGTCCATTGTTTTACATGGGGTGGTACCATATATTTTACCAATACA
ATCCTGATTCAGCAGTGTGGGGCAACATAACATGGGGCCACGCTGTATCCAGAGACTTGATTCACTGGCTCTACCTTCCC
ATTGCGTTGGTTCCAGATAAGTGGTTTGACATCAGTGGTGTATGGTCAGGGTCAGCAACCCTTTTGCCAGATGGCAAAAT
CCTAATGCTCTACACGGGTAACACCGATCGAAACGTGCAAGTCCAAAATCTTGCGTACCCCGCCAACCTATCTGATCCCC
TCCTCCTTGATTGGGTCAAATACGCTAATAACCCCGTCTTGGTGCCCCCACCAGGCATTGGGCCAAAGGATTTTCGTGAC
CCAACCACTGCGTGGATTGGGCCAGATGAGAAGTGGAGGATCACTATTGGGTCAAAGCTCAACAAGACAGGTCTTTCGTT
GCTTTATAAAACCCAAGATTTCATCCACTATGAGCAAAGTGATCGCTATTTGCATCAAGTCCCGGGTACCGGCATGTGGG
AGTGCGTTGACTTTTACCCGGTTTCAGTAAACGGGCCTAACGGTTTGGATACATCTGAAAATGGGCCAGATGTGAAGCAT
GTGTTGAAGGCTAGTTTGGATGACACTAAGGTGGATCATTATGCAATTGGAACCTACTTCATTGAAAATGATACATGGGT
GCCCGATAACCCGAATGAGGATGTGGGTATCGGGTTGAAATTGGACTATGGGAGATACTATGCGTCAAAGACATTCTATG
ACCAACAAAAACAAAGAAGGATCCTGTGGGGTTGGATTAATGAATCAGATAGTGAAACCGCTGACTTGAAAAAGGGTTGG
GCATCTCTCCAGACTATTCCAAGAACAGTTGTGTTTGACAAGAAGACTAGAACTAATTTGCTTCACTGGCCAGTGGAAGA
AGTAGAAAGCTTAAGACTTAGCAACTCCGAATTTGAAGGAGTTGTGGTTAAACCTGGCTCAGTTGTGCCACTGGACATAG
GCCCGGCCACACAGTTGGACATATTTGCTGAATTTGAAATCGAAGATTTAGCATCCAAAGGAATTGGCAAGGACAATGTA
GACTGTGGAAATGGAGCTGTTGACAGAAGTGCTTTTGGACCATTCGGTATTTTGGCTATTGCAGATGACCAACTTTCTGA
ACTAACACCAATTTATTTCCATCTTTCTAGTACTACTAAAGATGGCAGTTTAACCACTTCCTTCTGTGTTGATGAAACTA
GGTCATCAAAGGCTCCTGATGTTTCAAAGCTCATTTTTGGAAGCAAAGCTCCAGTTCTCAGTGATGAAAAATTATCAATG
AGGGTATTGGTCGACCATTCTATCATTGAGAGCTTTGCTCAGGGAGGGAGAACAGTGATCACATCTAGAGTTTATCCAAC
AGAAGCAATATATGGAGCTGCAAGATTGTTTCTGTTCAACAACGCGACGGACATAAACATCAAGGCCTCGCTCAAGATTT
GGCAATTGAACTCAGCTTTCATACGCCCCTTCCCCTTTGACCAGAAGTTGTGA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAAGAACAGCTTTCCATTTTCAACCTCAAAGGAACTGGATGAACGATCCTAACGGTCCATTGTTTTACATGGGGTGGTACCATATATTTTACCAATACAATCCT
Microexon-tag Amino Acid seq WQRTAFHFQPQRNWMNDPNGPLFYMGWYHIFYQYNP
Transcript ID KRH39981
Gene ID Gm.54031
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.1e-105
Motif start 106
Motif end 424
Protein seq >KRH39981
MAAMNPDLEHAPQIPLLNPPTGESGRRTQKGTLVFIVSIVFLLSFIIINLQSHEPSFENNITTVPLLPIARGVAEGVSAK
SNPYLSQKASYNWTNAMLSWQRTAFHFQPQRNWMNDPNGPLFYMGWYHIFYQYNPDSAVWGNITWGHAVSRDLIHWLYLP
IALVPDKWFDISGVWSGSATLLPDGKILMLYTGNTDRNVQVQNLAYPANLSDPLLLDWVKYANNPVLVPPPGIGPKDFRD
PTTAWIGPDEKWRITIGSKLNKTGLSLLYKTQDFIHYEQSDRYLHQVPGTGMWECVDFYPVSVNGPNGLDTSENGPDVKH
VLKASLDDTKVDHYAIGTYFIENDTWVPDNPNEDVGIGLKLDYGRYYASKTFYDQQKQRRILWGWINESDSETADLKKGW
ASLQTIPRTVVFDKKTRTNLLHWPVEEVESLRLSNSEFEGVVVKPGSVVPLDIGPATQLDIFAEFEIEDLASKGIGKDNV
DCGNGAVDRSAFGPFGILAIADDQLSELTPIYFHLSSTTKDGSLTTSFCVDETRSSKAPDVSKLIFGSKAPVLSDEKLSM
RVLVDHSIIESFAQGGRTVITSRVYPTEAIYGAARLFLFNNATDINIKASLKIWQLNSAFIRPFPFDQKL*
CDS seq >KRH39981
ATGGCAGCCATGAACCCTGATCTTGAGCATGCTCCACAAATTCCCTTGCTGAATCCTCCAACAGGGGAAAGTGGAAGGAG
AACGCAAAAGGGTACCCTTGTTTTCATTGTCTCGATTGTTTTCCTATTGTCATTTATCATCATAAACCTTCAGAGCCATG
AGCCCAGCTTTGAAAACAATATCACAACGGTGCCATTACTACCTATAGCTAGAGGGGTCGCTGAGGGTGTATCAGCCAAG
TCAAATCCATATCTATCACAGAAAGCTTCATATAATTGGACAAATGCCATGTTATCTTGGCAAAGAACAGCTTTCCATTT
TCAACCTCAAAGGAACTGGATGAACGATCCTAACGGTCCATTGTTTTACATGGGGTGGTACCATATATTTTACCAATACA
ATCCTGATTCAGCAGTGTGGGGCAACATAACATGGGGCCACGCTGTATCCAGAGACTTGATTCACTGGCTCTACCTTCCC
ATTGCGTTGGTTCCAGATAAGTGGTTTGACATCAGTGGTGTATGGTCAGGGTCAGCAACCCTTTTGCCAGATGGCAAAAT
CCTAATGCTCTACACGGGTAACACCGATCGAAACGTGCAAGTCCAAAATCTTGCGTACCCCGCCAACCTATCTGATCCCC
TCCTCCTTGATTGGGTCAAATACGCTAATAACCCCGTCTTGGTGCCCCCACCAGGCATTGGGCCAAAGGATTTTCGTGAC
CCAACCACTGCGTGGATTGGGCCAGATGAGAAGTGGAGGATCACTATTGGGTCAAAGCTCAACAAGACAGGTCTTTCGTT
GCTTTATAAAACCCAAGATTTCATCCACTATGAGCAAAGTGATCGCTATTTGCATCAAGTCCCGGGTACCGGCATGTGGG
AGTGCGTTGACTTTTACCCGGTTTCAGTAAACGGGCCTAACGGTTTGGATACATCTGAAAATGGGCCAGATGTGAAGCAT
GTGTTGAAGGCTAGTTTGGATGACACTAAGGTGGATCATTATGCAATTGGAACCTACTTCATTGAAAATGATACATGGGT
GCCCGATAACCCGAATGAGGATGTGGGTATCGGGTTGAAATTGGACTATGGGAGATACTATGCGTCAAAGACATTCTATG
ACCAACAAAAACAAAGAAGGATCCTGTGGGGTTGGATTAATGAATCAGATAGTGAAACCGCTGACTTGAAAAAGGGTTGG
GCATCTCTCCAGACTATTCCAAGAACAGTTGTGTTTGACAAGAAGACTAGAACTAATTTGCTTCACTGGCCAGTGGAAGA
AGTAGAAAGCTTAAGACTTAGCAACTCCGAATTTGAAGGAGTTGTGGTTAAACCTGGCTCAGTTGTGCCACTGGACATAG
GCCCGGCCACACAGTTGGACATATTTGCTGAATTTGAAATCGAAGATTTAGCATCCAAAGGAATTGGCAAGGACAATGTA
GACTGTGGAAATGGAGCTGTTGACAGAAGTGCTTTTGGACCATTCGGTATTTTGGCTATTGCAGATGACCAACTTTCTGA
ACTAACACCAATTTATTTCCATCTTTCTAGTACTACTAAAGATGGCAGTTTAACCACTTCCTTCTGTGTTGATGAAACTA
GGTCATCAAAGGCTCCTGATGTTTCAAAGCTCATTTTTGGAAGCAAAGCTCCAGTTCTCAGTGATGAAAAATTATCAATG
AGGGTATTGGTCGACCATTCTATCATTGAGAGCTTTGCTCAGGGAGGGAGAACAGTGATCACATCTAGAGTTTATCCAAC
AGAAGCAATATATGGAGCTGCAAGATTGTTTCTGTTCAACAACGCGACGGACATAAACATCAAGGCCTCGCTCAAGATTT
GGCAATTGAACTCAGCTTTCATACGCCCCTTCCCCTTTGACCAGAAGTTGTGA