Microexon ID Gm_6:50711832-50711840:+
Species Glycine max
Coordinates 6:50711832..50711840
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Microexon DNA seq ATCCTGATG
Microexon Amino Acid seq DPDG
Microexon-tag DNA Seq TGGCAAAGAACAGCTTTTCATTTTCAACCACAAAATAATTGGATGAATGATCCTGATGGTCCATTGTTTCACATGGGGTGGTACCATTTATTTTACCAATATAATCCT
Microexon-tag Amino Acid Seq WQRTAFHFQPQNNWMNDPDGPLFHMGWYHLFYQYNP
Microexon-tag spanning region 50711532-50713467
Microexon-tag prediction score 0.9443
Overlapped with the annotated transcript (%) 100
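
The fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it assumes the coordinates are 1-based and inclusive, that the microexon-tag starts in reading frame 0, and that the standard genetic code applies. All function and variable names are hypothetical.

```python
# Values copied from this record; helper names are illustrative assumptions.
COORDINATES = "6:50711832..50711840"
TAG_STRUCTURE = (49, 9, 50)  # flanking exon, microexon, flanking exon sizes
TAG_DNA = (
    "TGGCAAAGAACAGCTTTTCATTTTCAACCACAAAATAATTGGATGAATG"
    "ATCCTGATG"
    "GTCCATTGTTTCACATGGGGTGGTACCATTTATTTTACCAATATAATCCT"
)

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(coordinates):
    """Length of a 'chrom:start..end' span, assuming inclusive coordinates."""
    _, span = coordinates.split(":")
    start, end = (int(x) for x in span.split(".."))
    return end - start + 1


def split_tag(tag, structure):
    """Split the tag into upstream flank, microexon, and downstream flank."""
    if len(tag) != sum(structure):
        raise ValueError("tag length does not match the stated structure")
    upstream_len, micro_len, _ = structure
    micro_end = upstream_len + micro_len
    return tag[:upstream_len], tag[upstream_len:micro_end], tag[micro_end:]


def translate(dna):
    """Translate complete codons from frame 0; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def residues_overlapping(protein, start, end):
    """Residues whose codons overlap the 0-based half-open range [start, end)."""
    return protein[start // 3:(end - 1) // 3 + 1]


upstream, microexon, downstream = split_tag(TAG_DNA, TAG_STRUCTURE)
tag_protein = translate(TAG_DNA)
microexon_residues = residues_overlapping(
    tag_protein, len(upstream), len(upstream) + len(microexon)
)
# Bases of a codon carried over from the upstream flank into the microexon.
carried_over = len(upstream) % 3
```

Under these assumptions the coordinate span has length 9, the middle slice of the tag is ATCCTGATG, and the four residues whose codons overlap the microexon are DPDG, which agrees with the Size, Microexon DNA seq, and Microexon Amino Acid seq fields. One base of the first overlapping codon comes from the upstream flank, which is consistent with Phase 1 if phase is read as the number of codon bases carried over; that reading is an assumption about the database's convention.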
New Transcript ID KRH56348x
Reference Transcript ID KRH56348
Gene ID GLYMA_06G318500
Gene Name NA
Transcript ID KRH56348
Protein ID KRH56348
Gene ID GLYMA_06G318500
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.1e-101
Motif start 122
Motif end 440
Protein seq >KRH56348
MEANPSHTTTPHLDHALQTPLLNHPSRVNNISHRPSRGIFVILLSIVFLVSLVALIMIQGQNYMENLENSNIEITLFSNI
SKQELPRGAAQGVSAKSNPPLFHKVSYNWTNAMFSWQRTAFHFQPQNNWMNDPDGPLFHMGWYHLFYQYNPDSAIWGNIS
WGHAVSRDMIHWFYLPIAMGPDTWYDINGVWTGSATILPGGKIIILYTGDTNEYVQVQNLAYPANLSDPLLLDWVKYAGN
PVLVPPPGIGPKDFRDPTTGWIGPDGKWRVAIGSKKGKKGISLVYTTTDFVNFESNDHYLHAVPGTGMWECVDFYPVSIS
GSRGLDTSENEPNVKHVLKASMDETRVDHYALGTYFIENDTWVPDNPLEDVGIGLVLDYGRYYASKTFYDPEKERRILWG
WINETDTESDDLRKGWASLQTIPRTVLFDSKTGTNLLLWPVEEVESLRLSSDEFEGVVVKPGSVVPLNISLATQLDMFAE
FEIETLESKSIGKNNIGCGSGGATNRSAFGPFGLLAIADDTLSEQTPIYFRLSNTTLGSSTTFFCVDETRSSKAADVAKP
IYGSKVPVLSDEKLSMRVLVDHSIIESFAQGGRTVITSRVYPTEAIYGAARLFLFNNATGINIKATLKIWQLSSAFIRPF
PFDQSQ*
CDS seq >KRH56348
ATGGAGGCCAATCCTTCTCACACTACAACCCCTCATCTTGATCATGCTCTACAAACTCCCTTGCTGAACCATCCATCAAG
AGTGAACAATATTAGTCATAGACCATCAAGGGGTATCTTTGTGATACTTCTTTCCATTGTTTTTCTAGTGTCATTGGTTG
CATTGATCATGATTCAGGGCCAAAACTACATGGAAAATTTGGAGAATAGCAACATAGAAATTACCCTTTTTTCTAACATC
TCTAAGCAAGAATTGCCTAGAGGAGCGGCTCAAGGGGTTTCAGCCAAGTCCAACCCACCCCTTTTCCACAAAGTTTCATA
TAATTGGACCAACGCCATGTTTTCTTGGCAAAGAACAGCTTTTCATTTTCAACCACAAAATAATTGGATGAATGATCCTG
ATGGTCCATTGTTTCACATGGGGTGGTACCATTTATTTTACCAATATAATCCTGATTCAGCTATATGGGGCAACATTTCA
TGGGGTCATGCTGTATCAAGGGACATGATTCACTGGTTCTACCTTCCCATTGCCATGGGACCTGACACGTGGTACGATAT
CAACGGTGTATGGACCGGGTCCGCCACGATTCTTCCAGGTGGCAAAATCATAATACTCTACACAGGTGACACCAATGAAT
ATGTGCAAGTGCAAAACCTTGCATACCCTGCCAATCTATCTGATCCCCTTCTCCTTGATTGGGTCAAGTATGCGGGTAAC
CCGGTCCTAGTGCCCCCACCCGGTATCGGCCCGAAGGATTTTCGTGACCCAACCACGGGTTGGATCGGGCCGGATGGAAA
GTGGAGGGTCGCAATTGGGTCAAAGAAAGGAAAAAAAGGCATTTCATTGGTTTACACAACCACAGATTTTGTCAATTTTG
AGTCCAATGATCACTACTTACATGCGGTTCCGGGTACGGGTATGTGGGAGTGTGTGGACTTTTACCCAGTTTCAATAAGC
GGGTCAAGGGGTTTGGATACATCAGAAAATGAGCCAAATGTTAAGCATGTGCTAAAGGCTAGCATGGATGAAACAAGGGT
GGATCATTATGCACTTGGGACCTATTTTATTGAAAATGATACATGGGTGCCCGATAACCCACTTGAGGATGTGGGTATTG
GGTTGGTTTTGGACTATGGGAGATACTATGCTTCAAAGACTTTCTATGATCCAGAGAAAGAGAGGAGGATCCTGTGGGGT
TGGATTAATGAAACGGATACAGAAAGTGATGACTTGAGAAAGGGTTGGGCTTCTCTTCAGACAATTCCGAGAACAGTGCT
GTTTGACAGCAAGACTGGGACTAATTTGCTTCTGTGGCCAGTAGAGGAAGTAGAAAGCTTAAGACTAAGCAGTGATGAAT
TTGAAGGAGTGGTGGTTAAGCCTGGATCTGTTGTGCCACTGAACATAAGCCTAGCAACACAGTTGGACATGTTTGCTGAA
TTTGAGATTGAAACATTGGAATCCAAAAGCATTGGCAAGAACAACATAGGTTGTGGAAGTGGTGGTGCCACAAACAGAAG
TGCTTTTGGACCATTTGGTCTTTTAGCCATTGCAGATGACACACTTTCAGAACAAACCCCAATTTATTTTCGCCTTTCTA
ATACTACCCTTGGTAGTTCAACCACTTTTTTTTGTGTTGATGAAACAAGATCATCCAAGGCTGCTGATGTTGCAAAGCCA
ATTTATGGAAGCAAAGTTCCAGTCCTTAGTGATGAAAAATTATCAATGAGGGTGTTGGTTGACCATTCAATTATTGAGAG
CTTTGCTCAAGGAGGGAGAACTGTGATCACAAGTAGAGTTTACCCAACGGAAGCAATATATGGAGCTGCAAGATTATTTC
TATTCAACAATGCAACTGGCATAAACATTAAGGCCACCCTAAAGATTTGGCAATTGAGCTCTGCTTTTATACGCCCCTTT
CCCTTTGATCAAAGTCAATAA
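
To see where the microexon-tag sits relative to the Pfam motif, the tag peptide can be located in the protein sequence. This is a rough sketch that uses only the first 160 residues of KRH56348 (the first two lines of the protein record above) and assumes the Motif start and Motif end values are 1-based residue positions. The helper names are hypothetical.

```python
# First 160 residues of KRH56348, copied from the protein record.
PROTEIN_PREFIX = (
    "MEANPSHTTTPHLDHALQTPLLNHPSRVNNISHRPSRGIFVILLSIVFLVSLVALIMIQGQNYMENLENSNIEITLFSNI"
    "SKQELPRGAAQGVSAKSNPPLFHKVSYNWTNAMFSWQRTAFHFQPQNNWMNDPDGPLFHMGWYHLFYQYNPDSAIWGNIS"
)
TAG_PEPTIDE = "WQRTAFHFQPQNNWMNDPDGPLFHMGWYHLFYQYNP"
MICROEXON_PEPTIDE = "DPDG"
MOTIF_START, MOTIF_END = 122, 440  # assumed to be 1-based residue positions


def locate(protein, peptide):
    """Return the 1-based inclusive (start, end) of the first peptide match."""
    index = protein.find(peptide)
    if index < 0:
        raise ValueError("peptide not found in protein")
    return index + 1, index + len(peptide)


def within(span, start, end):
    """True if the inclusive span lies entirely inside [start, end]."""
    return start <= span[0] and span[1] <= end


tag_span = locate(PROTEIN_PREFIX, TAG_PEPTIDE)

# Position of the microexon residues, derived from their offset in the tag.
offset = TAG_PEPTIDE.find(MICROEXON_PEPTIDE)
microexon_span = (
    tag_span[0] + offset,
    tag_span[0] + offset + len(MICROEXON_PEPTIDE) - 1,
)

tag_in_motif = within(tag_span, MOTIF_START, MOTIF_END)
microexon_in_motif = within(microexon_span, MOTIF_START, MOTIF_END)
```

With the values in this record, the tag peptide maps to residues 116-151 and the microexon residues to 132-135. The tag therefore begins a few residues upstream of the reported motif start (122), while the microexon residues fall inside the reported Glyco_hydro_32N range, provided the positional assumption above holds.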
Microexon DNA seq ATCCTGATG
Microexon Amino Acid seq DPDG
Microexon-tag DNA Seq TGGCAAAGAACAGCTTTTCATTTTCAACCACAAAATAATTGGATGAATGATCCTGATGGTCCATTGTTTCACATGGGGTGGTACCATTTATTTTACCAATATAATCCT
Microexon-tag Amino Acid seq WQRTAFHFQPQNNWMNDPDGPLFHMGWYHLFYQYNP
Transcript ID KRH56348
Gene ID Gm.45384
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.1e-101
Motif start 122
Motif end 440
Protein seq >KRH56348
MEANPSHTTTPHLDHALQTPLLNHPSRVNNISHRPSRGIFVILLSIVFLVSLVALIMIQGQNYMENLENSNIEITLFSNI
SKQELPRGAAQGVSAKSNPPLFHKVSYNWTNAMFSWQRTAFHFQPQNNWMNDPDGPLFHMGWYHLFYQYNPDSAIWGNIS
WGHAVSRDMIHWFYLPIAMGPDTWYDINGVWTGSATILPGGKIIILYTGDTNEYVQVQNLAYPANLSDPLLLDWVKYAGN
PVLVPPPGIGPKDFRDPTTGWIGPDGKWRVAIGSKKGKKGISLVYTTTDFVNFESNDHYLHAVPGTGMWECVDFYPVSIS
GSRGLDTSENEPNVKHVLKASMDETRVDHYALGTYFIENDTWVPDNPLEDVGIGLVLDYGRYYASKTFYDPEKERRILWG
WINETDTESDDLRKGWASLQTIPRTVLFDSKTGTNLLLWPVEEVESLRLSSDEFEGVVVKPGSVVPLNISLATQLDMFAE
FEIETLESKSIGKNNIGCGSGGATNRSAFGPFGLLAIADDTLSEQTPIYFRLSNTTLGSSTTFFCVDETRSSKAADVAKP
IYGSKVPVLSDEKLSMRVLVDHSIIESFAQGGRTVITSRVYPTEAIYGAARLFLFNNATGINIKATLKIWQLSSAFIRPF
PFDQSQ*
CDS seq >KRH56348
ATGGAGGCCAATCCTTCTCACACTACAACCCCTCATCTTGATCATGCTCTACAAACTCCCTTGCTGAACCATCCATCAAG
AGTGAACAATATTAGTCATAGACCATCAAGGGGTATCTTTGTGATACTTCTTTCCATTGTTTTTCTAGTGTCATTGGTTG
CATTGATCATGATTCAGGGCCAAAACTACATGGAAAATTTGGAGAATAGCAACATAGAAATTACCCTTTTTTCTAACATC
TCTAAGCAAGAATTGCCTAGAGGAGCGGCTCAAGGGGTTTCAGCCAAGTCCAACCCACCCCTTTTCCACAAAGTTTCATA
TAATTGGACCAACGCCATGTTTTCTTGGCAAAGAACAGCTTTTCATTTTCAACCACAAAATAATTGGATGAATGATCCTG
ATGGTCCATTGTTTCACATGGGGTGGTACCATTTATTTTACCAATATAATCCTGATTCAGCTATATGGGGCAACATTTCA
TGGGGTCATGCTGTATCAAGGGACATGATTCACTGGTTCTACCTTCCCATTGCCATGGGACCTGACACGTGGTACGATAT
CAACGGTGTATGGACCGGGTCCGCCACGATTCTTCCAGGTGGCAAAATCATAATACTCTACACAGGTGACACCAATGAAT
ATGTGCAAGTGCAAAACCTTGCATACCCTGCCAATCTATCTGATCCCCTTCTCCTTGATTGGGTCAAGTATGCGGGTAAC
CCGGTCCTAGTGCCCCCACCCGGTATCGGCCCGAAGGATTTTCGTGACCCAACCACGGGTTGGATCGGGCCGGATGGAAA
GTGGAGGGTCGCAATTGGGTCAAAGAAAGGAAAAAAAGGCATTTCATTGGTTTACACAACCACAGATTTTGTCAATTTTG
AGTCCAATGATCACTACTTACATGCGGTTCCGGGTACGGGTATGTGGGAGTGTGTGGACTTTTACCCAGTTTCAATAAGC
GGGTCAAGGGGTTTGGATACATCAGAAAATGAGCCAAATGTTAAGCATGTGCTAAAGGCTAGCATGGATGAAACAAGGGT
GGATCATTATGCACTTGGGACCTATTTTATTGAAAATGATACATGGGTGCCCGATAACCCACTTGAGGATGTGGGTATTG
GGTTGGTTTTGGACTATGGGAGATACTATGCTTCAAAGACTTTCTATGATCCAGAGAAAGAGAGGAGGATCCTGTGGGGT
TGGATTAATGAAACGGATACAGAAAGTGATGACTTGAGAAAGGGTTGGGCTTCTCTTCAGACAATTCCGAGAACAGTGCT
GTTTGACAGCAAGACTGGGACTAATTTGCTTCTGTGGCCAGTAGAGGAAGTAGAAAGCTTAAGACTAAGCAGTGATGAAT
TTGAAGGAGTGGTGGTTAAGCCTGGATCTGTTGTGCCACTGAACATAAGCCTAGCAACACAGTTGGACATGTTTGCTGAA
TTTGAGATTGAAACATTGGAATCCAAAAGCATTGGCAAGAACAACATAGGTTGTGGAAGTGGTGGTGCCACAAACAGAAG
TGCTTTTGGACCATTTGGTCTTTTAGCCATTGCAGATGACACACTTTCAGAACAAACCCCAATTTATTTTCGCCTTTCTA
ATACTACCCTTGGTAGTTCAACCACTTTTTTTTGTGTTGATGAAACAAGATCATCCAAGGCTGCTGATGTTGCAAAGCCA
ATTTATGGAAGCAAAGTTCCAGTCCTTAGTGATGAAAAATTATCAATGAGGGTGTTGGTTGACCATTCAATTATTGAGAG
CTTTGCTCAAGGAGGGAGAACTGTGATCACAAGTAGAGTTTACCCAACGGAAGCAATATATGGAGCTGCAAGATTATTTC
TATTCAACAATGCAACTGGCATAAACATTAAGGCCACCCTAAAGATTTGGCAATTGAGCTCTGCTTTTATACGCCCCTTT
CCCTTTGATCAAAGTCAATAA