Microexon ID Gm_20:3461773-3461781:+
Species Glycine max
Coordinates 20:3461773..3461781
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (cluster consensus, IUPAC ambiguity codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TATCACAGAACTGGATTCCATTTCCAGCCCCTTAAAAACTGGATGAACGATCCAAATGGGCCAATGTACTACAATGGAGTATACCATCTGTTCTACCAGTACAATCCC
Microexon-tag Amino Acid Seq YHRTGFHFQPLKNWMNDPNGPMYYNGVYHLFYQYNP
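The fields above can be cross-checked against one another. The following is a minimal sketch, not the database's own pipeline: it splits the microexon-tag by the 49,9,50 structure, translates it with the standard genetic code, and recovers the residues whose codons overlap the microexon. The function names (split_tag, translate, microexon_residues) are hypothetical, and reading "Phase 1" as "one base of the split codon lies in the upstream exon" is an assumption.

```python
from itertools import product

# Values copied from the record above.
TAG_DNA = ("TATCACAGAACTGGATTCCATTTCCAGCCCCTTAAAAACTGGATGAACGATCCAAATG"
           "GGCCAATGTACTACAATGGAGTATACCATCTGTTCTACCAGTACAATCCC")
STRUCTURE = (49, 9, 50)          # flanking exon, microexon, flanking exon
MICROEXON_DNA = "ATCCAAATG"
START, END = 3461773, 3461781    # 1-based, inclusive genomic coordinates

# Standard genetic code, codons enumerated in T, C, A, G order.
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), _AMINO)}


def split_tag(tag, structure):
    """Cut the tag into (upstream exon, microexon, downstream exon)."""
    up_len, micro_len, _ = structure
    return (tag[:up_len],
            tag[up_len:up_len + micro_len],
            tag[up_len + micro_len:])


def translate(dna):
    """Translate in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def microexon_residues(tag, structure):
    """Residues whose codons overlap the microexon (tag assumed in frame)."""
    up_len, micro_len, _ = structure
    first = up_len // 3
    last = (up_len + micro_len - 1) // 3
    return translate(tag)[first:last + 1]


if __name__ == "__main__":
    upstream, microexon, downstream = split_tag(TAG_DNA, STRUCTURE)
    print("microexon:", microexon)
    print("size from coordinates:", END - START + 1)
    print("bases of split codon in upstream exon:", STRUCTURE[0] % 3)
    print("tag protein:", translate(TAG_DNA))
    print("microexon residues:", microexon_residues(TAG_DNA, STRUCTURE))
```

Because the 9-nt microexon starts one base into a codon, it overlaps four codons rather than three, which is why the microexon amino acid sequence lists four residues.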
Microexon-tag spanning region 3461016-3461939
Microexon-tag prediction score 0.9653
Overlapped with the annotated transcript (%) 100
New Transcript ID KRG89527x
Reference Transcript ID KRG89527
Gene ID GLYMA_20G029100
Gene Name NA
Transcript ID KRG89527
Protein ID KRG89527
Gene ID GLYMA_20G029100
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.4e-103
Motif start 39
Motif end 351
Protein seq >KRG89527
MALPSTKMPVVFYSMVLLIINNCIEAVSVRGDYHRTGFHFQPLKNWMNDPNGPMYYNGVYHLFYQYNPNGTVWGNIVWAH
SVSKDLINWNGIEHAIYPSKPFDKFGCWSGSATIIPGKGPVILYTGVIDENNTQVQCYAEPEDPNDPLLRRWVKPDKLNP
AVVDKDVNHTEFRDPTTAWWGKDGHWRMLVGSVRKRRGIAYLYRSKDFKTWVRAKHPIHSKGGTGMWECPDFYPVSVIGN
VVGNPVKHVLKNSLDDTKFDYYTVGTYLEDKDRYVPDNTSVDGWGGLRYDYGNFYASKSFFDPSKNRRILWGWANECDKP
IDNFRKGWAGIQAIPRTVWLDFTGRQLVQWPVEELNSLRGKEVNIDNQRLEKGDYSEVKGITAAQADVEVTFSFSSLDKA
EAYDPKWVKAQDLCAQKGSKLQGGVGPFGLLTLASQNLEEFTPVFFRVFKSPNKHIVLLCSDARSSSLKSDLYKPQFAGF
VDVDLAADKKISLRSLIDHSVVESFGAGGKTNILSRVYPELAVMNQAHLFVFNNGTEPIVVQNLKAWSMISADIK*
CDS seq >KRG89527
ATGGCTCTCCCAAGCACCAAGATGCCTGTGGTGTTTTACTCCATGGTGTTGCTGATCATCAACAATTGCATTGAAGCAGT
TTCGGTGAGAGGAGATTATCACAGAACTGGATTCCATTTCCAGCCCCTTAAAAACTGGATGAACGATCCAAATGGGCCAA
TGTACTACAATGGAGTATACCATCTGTTCTACCAGTACAATCCCAATGGCACAGTGTGGGGTAACATAGTGTGGGCGCAC
TCAGTGTCCAAGGATCTGATAAACTGGAATGGCATTGAACATGCTATTTATCCATCAAAGCCCTTTGACAAATTTGGCTG
CTGGTCAGGGTCTGCTACCATAATCCCTGGTAAGGGGCCCGTGATCCTCTACACTGGAGTTATAGACGAAAATAACACTC
AAGTGCAATGCTATGCCGAACCAGAAGACCCAAATGACCCACTTCTTCGGAGATGGGTGAAACCAGACAAGCTCAACCCA
GCTGTGGTAGATAAAGATGTTAACCACACTGAATTTCGTGACCCCACAACAGCTTGGTGGGGCAAGGACGGTCACTGGAG
GATGCTGGTGGGCAGTGTAAGGAAGCGCAGAGGAATAGCTTATCTATACAGGAGCAAGGACTTCAAGACATGGGTTCGGG
CCAAACACCCTATCCATTCCAAGGGTGGTACGGGTATGTGGGAGTGCCCCGACTTTTACCCAGTTTCAGTTATAGGAAAT
GTAGTGGGGAATCCAGTGAAACATGTGTTGAAGAACAGCCTTGATGATACTAAGTTCGATTACTATACTGTGGGGACCTA
TCTGGAGGATAAGGATAGGTATGTGCCTGACAACACTTCAGTGGATGGTTGGGGTGGACTTAGGTATGATTATGGCAACT
TCTATGCTTCCAAATCATTTTTTGACCCCAGCAAGAACAGGAGGATCTTATGGGGTTGGGCAAACGAGTGTGATAAACCG
ATAGACAATTTTCGGAAAGGATGGGCAGGAATTCAGGCAATTCCACGAACTGTGTGGCTCGATTTTACTGGGAGACAATT
GGTGCAATGGCCTGTTGAAGAGTTAAACAGTCTCAGAGGCAAAGAAGTTAACATAGACAATCAAAGGCTTGAGAAGGGAG
ATTATAGTGAAGTAAAAGGAATCACTGCTGCCCAGGCAGATGTTGAAGTTACGTTCTCCTTTTCAAGCTTGGACAAGGCA
GAGGCATATGATCCTAAGTGGGTAAAGGCGCAGGATCTATGTGCCCAAAAGGGTTCAAAACTACAGGGTGGGGTTGGACC
GTTTGGACTTCTGACTTTGGCTTCTCAAAATCTCGAAGAGTTCACTCCTGTGTTTTTCAGAGTTTTCAAAAGTCCAAATA
AGCATATTGTTCTCTTATGCTCTGATGCGAGAAGTTCATCTTTGAAGAGTGATCTGTACAAACCACAATTTGCTGGGTTT
GTAGATGTGGATTTGGCCGCGGATAAGAAGATTTCTCTTAGGAGTTTGATTGATCACTCAGTTGTGGAGAGTTTTGGAGC
AGGAGGGAAGACAAACATTTTGTCCCGAGTCTATCCCGAGCTAGCAGTGATGAACCAAGCGCACTTGTTTGTGTTCAACA
ATGGTACTGAACCAATCGTAGTGCAAAACCTCAAAGCTTGGAGCATGATATCTGCTGATATAAAATGA
Transcript ID KRG89527
Gene ID Gm.32715
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.4e-103
Motif start 39
Motif end 351