Microexon ID Gm_12:399494-399502:-
Species Glycine max
Coordinates 12:399494..399502
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAAGAACAGCTTTCCATTTTCAACCTCAAAGGAACTGGATGAACGATCCTAACGGTCCATTGTTTTACATGGGGTGGTACCATGTATTTTACCAATACAATCCG
Microexon-tag Amino Acid Seq WQRTAFHFQPQRNWMNDPNGPLFYMGWYHVFYQYNP
Microexon-tag spanning region398476-400192
Microexon-tag prediction score0.9605
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH23803x
Reference Transcript ID KRH23803
Gene ID GLYMA_12G005100
Gene Name NA
Transcript ID KRH23803
Protein ID KRH23803
Gene ID GLYMA_12G005100
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.7e-104
Motif start 99
Motif end 408
Protein seq >KRH23803
MNPDLEHAPQIPLLNPPTGESGRRTQKGTLVFIVLMSLFALIIVNLQSHEPSLENNITLIPKARGVAEGVSAKSNQYLSH
KASYNWTNAMLSWQRTAFHFQPQRNWMNDPNGPLFYMGWYHVFYQYNPDSAVWGNITWGHAVSRDLIHWLYLPIALFPDK
WFDVNGVWSGSATLLPDGKILMLYTGSTDQNVQVQNLAYPANLSDPLLLDWVKYADNPVLAPPPGIGPKDFRDPTTAWFG
PDEKWRITIGSKLNGTGLSLVYKTQDFIHYEQNDHYLHQVPGTGMWECVDFYPVSVNGPNDVKHVLKASLDDTKVDHYAI
GTYFIENDTWVPDNPHEDVGIGFKLDYGRYYASKTFYDQHKNRRILWGWINESDSETADLKKGWASLQTIPRTVVFDKKT
RTNLVHWPVEEVESLRLGSSEFEGVVVKPGSVVPLDIGPATQLDVFAEFEIEFLASKGSGKDNIGCGNGAVDRSALGPFG
ILAIADDHLSELTPIYFHLSSTTKDGSSTTSFCVDETRSSKAPDVSKLVFGSKVPVLSDEKLSMRVLVDHSIIESFAQGG
RTVISSRVYPTEAIYGAARLFLFNNATDINIKVSLKIWQLNSAFIRPFPFDQKL*
CDS seq >KRH23803
ATGAACCCTGATCTTGAGCATGCTCCGCAAATTCCCTTGCTGAATCCTCCAACAGGGGAAAGTGGAAGGAGAACGCAAAA
GGGTACCCTTGTTTTCATTGTCCTGATGTCATTGTTTGCACTTATTATCGTAAACCTTCAGAGCCATGAGCCCAGCTTAG
AAAACAATATCACATTAATACCTAAAGCTAGAGGGGTTGCTGAGGGTGTATCAGCCAAGTCAAACCAATATCTGTCACAC
AAAGCTTCATATAACTGGACAAATGCAATGTTATCTTGGCAAAGAACAGCTTTCCATTTTCAACCTCAAAGGAACTGGAT
GAACGATCCTAACGGTCCATTGTTTTACATGGGGTGGTACCATGTATTTTACCAATACAATCCGGATTCAGCCGTGTGGG
GCAACATAACATGGGGCCACGCTGTATCCAGAGACTTGATTCACTGGCTCTACCTTCCCATTGCCTTGTTTCCAGATAAG
TGGTTTGACGTCAACGGTGTATGGTCAGGTTCAGCAACCCTTTTGCCAGATGGCAAAATCCTAATGCTCTACACGGGTAG
CACCGATCAAAACGTGCAAGTCCAAAATCTTGCCTACCCCGCCAACCTATCTGATCCCCTCCTCCTTGATTGGGTCAAAT
ATGCTGATAACCCTGTCTTGGCCCCTCCACCAGGCATTGGGCCAAAGGATTTTCGTGACCCAACCACTGCATGGTTTGGG
CCAGATGAAAAGTGGAGGATCACCATTGGCTCAAAGCTCAACGGGACAGGTCTTTCATTGGTTTATAAAACCCAAGATTT
CATCCACTACGAGCAAAATGATCACTATTTGCACCAAGTCCCGGGAACGGGTATGTGGGAGTGCGTGGACTTTTACCCGG
TTTCAGTAAACGGGCCTAACGATGTGAAACATGTGTTGAAGGCTAGTTTGGATGACACCAAGGTGGATCATTATGCAATT
GGGACATACTTCATTGAAAATGATACATGGGTGCCCGATAACCCGCATGAGGATGTGGGTATCGGGTTCAAATTGGACTA
TGGCAGATACTATGCGTCAAAGACATTCTATGACCAACACAAAAACAGAAGGATCCTGTGGGGTTGGATTAATGAATCAG
ATAGTGAAACCGCTGACTTGAAAAAGGGTTGGGCATCTCTCCAGACTATTCCAAGAACAGTAGTGTTTGACAAGAAGACT
AGAACTAATTTGGTTCACTGGCCAGTGGAAGAAGTAGAAAGCTTAAGACTTGGCAGTTCCGAATTTGAAGGAGTTGTGGT
TAAACCTGGCTCAGTTGTGCCACTGGACATAGGCCCAGCCACACAGTTGGACGTATTTGCTGAATTTGAAATCGAATTTT
TAGCATCCAAAGGGAGTGGCAAGGACAATATAGGATGTGGAAATGGAGCTGTTGATAGAAGTGCTTTGGGACCATTCGGT
ATTCTGGCTATTGCAGATGACCATCTTTCTGAACTAACACCAATTTATTTCCATCTTTCTAGTACTACTAAAGATGGCAG
TTCAACCACTTCCTTCTGTGTTGATGAAACTAGGTCATCAAAGGCTCCTGATGTTTCAAAGCTCGTTTTTGGAAGCAAAG
TTCCTGTTCTCAGTGATGAAAAATTATCAATGAGGGTATTGGTGGACCATTCAATCATTGAGAGCTTTGCTCAGGGAGGG
AGAACAGTGATCTCATCTAGAGTTTATCCAACAGAAGCAATATATGGAGCTGCAAGATTGTTTCTGTTCAACAACGCAAC
GGACATAAACATCAAGGTCTCGCTCAAGATTTGGCAATTGAACTCAGCTTTCATACGCCCCTTTCCCTTTGACCAGAAGT
TGTGA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAAGAACAGCTTTCCATTTTCAACCTCAAAGGAACTGGATGAACGATCCTAACGGTCCATTGTTTTACATGGGGTGGTACCATGTATTTTACCAATACAATCCG
Microexon-tag Amino Acid seq WQRTAFHFQPQRNWMNDPNGPLFYMGWYHVFYQYNP
Transcript ID Gm.7899.3
Gene ID Gm.7899
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.7e-104
Motif start 99
Motif end 408
Protein seq >Gm.7899.3
MNPDLEHAPQIPLLNPPTGESGRRTQKGTLVFIVLMSLFALIIVNLQSHEPSLENNITLIPKARGVAEGVSAKSNQYLSH
KASYNWTNAMLSWQRTAFHFQPQRNWMNDPNGPLFYMGWYHVFYQYNPDSAVWGNITWGHAVSRDLIHWLYLPIALFPDK
WFDVNGVWSGSATLLPDGKILMLYTGSTDQNVQVQNLAYPANLSDPLLLDWVKYADNPVLAPPPGIGPKDFRDPTTAWFG
PDEKWRITIGSKLNGTGLSLVYKTQDFIHYEQNDHYLHQVPGTGMWECVDFYPVSVNGPNDVKHVLKASLDDTKVDHYAI
GTYFIENDTWVPDNPHEDVGIGFKLDYGRYYASKTFYDQHKNRRILWGWINESDSETADLKKGWASLQTIPRTVVFDKKT
RTNLVHWPVEEVESLRLGSSEFEGVVVKPGSVVPLDIGPATQLDVFAEFEIEFLASKGSGKDNIGCGNGAVDRSALGPFG
ILAIADDHLSELTPIYFHLSSTTKDGSSTTSFCVDETRSSKAPDVSKLVFGSKVPVLSDEKLSMRVLVDHSIIESFAQGG
RTVISSRVYPTEAIYGAARLFLFNNATDINIKVSLKIWQLNSAFIRPFPFDQKL*
CDS seq >Gm.7899.3
ATGAACCCTGATCTTGAGCATGCTCCGCAAATTCCCTTGCTGAATCCTCCAACAGGGGAAAGTGGAAGGAGAACGCAAAA
GGGTACCCTTGTTTTCATTGTCCTGATGTCATTGTTTGCACTTATTATCGTAAACCTTCAGAGCCATGAGCCCAGCTTAG
AAAACAATATCACATTAATACCTAAAGCTAGAGGGGTTGCTGAGGGTGTATCAGCCAAGTCAAACCAATATCTGTCACAC
AAAGCTTCATATAACTGGACAAATGCAATGTTATCTTGGCAAAGAACAGCTTTCCATTTTCAACCTCAAAGGAACTGGAT
GAACGATCCTAACGGTCCATTGTTTTACATGGGGTGGTACCATGTATTTTACCAATACAATCCGGATTCAGCCGTGTGGG
GCAACATAACATGGGGCCACGCTGTATCCAGAGACTTGATTCACTGGCTCTACCTTCCCATTGCCTTGTTTCCAGATAAG
TGGTTTGACGTCAACGGTGTATGGTCAGGTTCAGCAACCCTTTTGCCAGATGGCAAAATCCTAATGCTCTACACGGGTAG
CACCGATCAAAACGTGCAAGTCCAAAATCTTGCCTACCCCGCCAACCTATCTGATCCCCTCCTCCTTGATTGGGTCAAAT
ATGCTGATAACCCTGTCTTGGCCCCTCCACCAGGCATTGGGCCAAAGGATTTTCGTGACCCAACCACTGCATGGTTTGGG
CCAGATGAAAAGTGGAGGATCACCATTGGCTCAAAGCTCAACGGGACAGGTCTTTCATTGGTTTATAAAACCCAAGATTT
CATCCACTACGAGCAAAATGATCACTATTTGCACCAAGTCCCGGGAACGGGTATGTGGGAGTGCGTGGACTTTTACCCGG
TTTCAGTAAACGGGCCTAACGATGTGAAACATGTGTTGAAGGCTAGTTTGGATGACACCAAGGTGGATCATTATGCAATT
GGGACATACTTCATTGAAAATGATACATGGGTGCCCGATAACCCGCATGAGGATGTGGGTATCGGGTTCAAATTGGACTA
TGGCAGATACTATGCGTCAAAGACATTCTATGACCAACACAAAAACAGAAGGATCCTGTGGGGTTGGATTAATGAATCAG
ATAGTGAAACCGCTGACTTGAAAAAGGGTTGGGCATCTCTCCAGACTATTCCAAGAACAGTAGTGTTTGACAAGAAGACT
AGAACTAATTTGGTTCACTGGCCAGTGGAAGAAGTAGAAAGCTTAAGACTTGGCAGTTCCGAATTTGAAGGAGTTGTGGT
TAAACCTGGCTCAGTTGTGCCACTGGACATAGGCCCAGCCACACAGTTGGACGTATTTGCTGAATTTGAAATCGAATTTT
TAGCATCCAAAGGGAGTGGCAAGGACAATATAGGATGTGGAAATGGAGCTGTTGATAGAAGTGCTTTGGGACCATTCGGT
ATTCTGGCTATTGCAGATGACCATCTTTCTGAACTAACACCAATTTATTTCCATCTTTCTAGTACTACTAAAGATGGCAG
TTCAACCACTTCCTTCTGTGTTGATGAAACTAGGTCATCAAAGGCTCCTGATGTTTCAAAGCTCGTTTTTGGAAGCAAAG
TTCCTGTTCTCAGTGATGAAAAATTATCAATGAGGGTATTGGTGGACCATTCAATCATTGAGAGCTTTGCTCAGGGAGGG
AGAACAGTGATCTCATCTAGAGTTTATCCAACAGAAGCAATATATGGAGCTGCAAGATTGTTTCTGTTCAACAACGCAAC
GGACATAAACATCAAGGTCTCGCTCAAGATTTGGCAATTGAACTCAGCTTTCATACGCCCCTTTCCCTTTGACCAGAAGT
TGTGA