Microexon ID Gm_3:40682914-40682922:+
Species Glycine max
Coordinates 3:40682914..40682922
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CAACACAGAACTGCGTACCATTTTCAACCTCCTAAGAACTGGATTAACGATCCAAATGGACCCATGTATTACAAGGGAATCTATCATCTATTCTACCAATACAACCCC
Microexon-tag Amino Acid Seq QHRTAYHFQPPKNWINDPNGPMYYKGIYHLFYQYNP
Microexon-tag spanning region40681353-40683050
Microexon-tag prediction score0.96
Overlapped with the annotated transcript (%) 91.67
New Transcript ID KRH67945x
Reference Transcript ID KRH67945
Gene ID GLYMA_03G197400
Gene Name NA
Gm_3:40682914-40682922:+ does not have available information here.
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CAACACAGAACTGCGTACCATTTTCAACCTCCTAAGAACTGGATTAACGATCCAAATGGACCCATGTATTACAAGGGAATCTATCATCTATTCTACCAATACAACCCC
Microexon-tag Amino Acid seq QHRTAYHFQPPKNWINDPNGPMYYKGIYHLFYQYNP
Transcript ID Gm.36649.1
Gene ID Gm.36649
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.4e-104
Motif start 69
Motif end 388
Protein seq >Gm.36649.1
MLNHRTQQIHLFSSCFLMVLPKCRYISVVFFAFVVLLINNGVEAFHKVYPHLQSVSTISVSGQHRTAYHFQPPKNWINDP
NGPMYYKGIYHLFYQYNPKGSVWGNIVWAHSVSKDLINWRSLEHALYPSKPFDKFGCWSGSATIVPGKGPVILYTGVVDD
KQTQVQCYAIPEDLNDPLLQKWVKPDKFNPILVANKGVNGSAFRDPTTAWLSKDGHWKILVGSRKNLTGIAYLYRSKDFM
NWVQAKHPIHSKGETGMWECPDFYPVLLRGNAGLETSEEGNHVKYVFKNSLDITRFDYYTVGTYFKDKDRYAPDNTSEDG
WGGLRYDYGNFYASKSFFDPSKNRRILWGWANESDTKEDDVRKGWAGIQAIPRTVWLDSTGRQLVQWPVEE*
CDS seq >Gm.36649.1
ATGCTAAACCACCGCACGCAACAAATCCATTTATTTTCATCTTGTTTTCTGATGGTTCTCCCAAAGTGTCGCTATATCTC
TGTAGTTTTTTTCGCCTTTGTTGTGTTACTGATCAACAATGGCGTTGAAGCTTTTCATAAAGTATATCCTCATCTTCAAT
CTGTTTCTACGATATCCGTGAGCGGACAACACAGAACTGCGTACCATTTTCAACCTCCTAAGAACTGGATTAACGATCCA
AATGGACCCATGTATTACAAGGGAATCTATCATCTATTCTACCAATACAACCCCAAAGGGTCAGTGTGGGGTAACATTGT
GTGGGCTCACTCAGTGTCAAAGGATCTCATCAATTGGAGGTCCCTTGAACATGCACTTTACCCATCCAAACCATTTGACA
AGTTCGGGTGTTGGTCTGGGTCAGCCACCATAGTCCCAGGTAAAGGACCAGTGATCCTCTACACCGGAGTTGTTGACGAC
AAACAAACTCAGGTTCAATGCTATGCTATACCTGAAGACCTAAACGACCCACTCCTCCAAAAATGGGTTAAACCTGACAA
ATTCAACCCAATCTTGGTTGCTAACAAGGGTGTCAACGGTAGTGCGTTTCGCGACCCAACGACGGCATGGTTGAGCAAGG
ACGGTCACTGGAAGATATTGGTGGGTAGTAGAAAGAATCTTACAGGTATAGCTTATTTGTATAGGAGCAAGGACTTTATG
AATTGGGTGCAAGCCAAACATCCGATCCATTCCAAGGGTGAAACTGGTATGTGGGAGTGTCCTGATTTTTATCCAGTTTT
GCTTAGAGGCAATGCAGGGTTGGAGACGTCCGAGGAGGGGAATCATGTGAAGTACGTGTTTAAGAATAGTCTTGACATTA
CAAGGTTCGACTACTATACAGTGGGAACATATTTTAAAGATAAGGATAGATATGCCCCCGATAACACCTCAGAGGATGGT
TGGGGTGGACTTAGGTATGACTATGGTAATTTTTATGCTTCCAAGTCATTTTTTGACCCCAGTAAAAATCGAAGAATCTT
GTGGGGTTGGGCAAATGAGTCTGATACCAAGGAAGATGATGTTCGCAAAGGATGGGCGGGAATTCAGGCGATTCCGCGAA
CTGTGTGGCTAGATTCTACTGGGAGACAATTGGTGCAATGGCCTGTTGAAGAATAA