Microexon ID Gm_3:42420938-42420943:+
Species Glycine max
Coordinates 3:42420938..42420943
Microexon Cluster ID MEP12
Size 6
Phase 1
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 52,6,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GAAGAMCRTGTKCGKGAGGATKTKCARAAGTWYYCWAGRGGWTCYCCACAAGCWAGAGCTTATSGKAATGATGGMRCWMRRRGYCGWTCAASMCATTCAAAATCTCCM
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq CAAGAG
Microexon Amino Acid seq ARA
Microexon-tag DNA Seq GAAGACCGTATCCGGGATGACGTGCAAAAGTTTTCTAGAGGTTCTCCACAAGCAAGAGCTTATGGAAATAGTGGAGCTAGAGGTCGTTCAAGCCAGTCAAGATCTCCA
Microexon-tag Amino Acid Seq EDRIRDDVQKFSRGSPQARAYGNSGARGRSSQSRSP
Microexon-tag spanning region42420736-42421230
Microexon-tag prediction score0.9626
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH68287x
Reference Transcript ID KRH68287
Gene ID GLYMA_03G221000
Gene Name NA
Transcript ID KRH68287
Protein ID KRH68287
Gene ID GLYMA_03G221000
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRH68287
MANRSDPDIDDDFLELYKEYTGPLGSATTNTQERAKSNKRSNAGSDEEEEARDPNAVPTDFTSREAKVWEAKSKATERNW
KKRKEEEMICKLCGESGHFTQGCPSTLGANRKSQDFFERIPARDKNVRALFTEKVLSKIEKDVGCKIKMDEKFIIVSGKD
RLILAKGVDSVHKIREEGDQRGTSSSQMTRSRSPERSPVSARFQRSEPQRSHSGPRNTSQFQQRFGRQERAVEDRIRDDV
QKFSRGSPQARAYGNSGARGRSSQSRSPRHAPYTGNSYNSFEGRNQNMGAYRNDGWDSHRRESGIRPGHQFDYSASPQTL
EELELEYKNEAAELMKIRDREEDEENFKHREAIRDLREKYMSKVSLVRDTHAKQWEGFLQLDAQRRQQQAVQQMPSGYRG
FKQQSFPEYDGSTANPPPYAGSNLPLESRNRFSDNMETYPNRPHDNFGEFHRRGDFAKAYNRY*
CDS seq >KRH68287
ATGGCTAATAGATCCGATCCGGATATTGATGATGATTTTCTTGAGCTTTATAAGGAGTACACTGGCCCCCTGGGGTCTGC
TACTACAAACACGCAAGAGAGGGCAAAATCAAATAAGAGGTCTAATGCAGGGTCTGATGAGGAGGAGGAAGCCCGGGACC
CTAATGCTGTTCCAACCGATTTCACCAGCCGAGAAGCTAAGGTTTGGGAGGCTAAGTCAAAGGCTACGGAAAGGAATTGG
AAGAAAAGGAAAGAGGAGGAAATGATCTGCAAGCTTTGTGGAGAATCAGGGCATTTCACTCAGGGCTGCCCATCTACTCT
TGGAGCAAATCGCAAGTCTCAAGATTTCTTTGAAAGGATACCAGCCAGAGACAAAAATGTGCGGGCACTTTTCACGGAGA
AAGTTTTAAGCAAGATTGAAAAGGATGTTGGCTGCAAAATTAAGATGGATGAGAAGTTTATTATTGTCAGTGGTAAGGAT
AGATTAATTTTGGCCAAAGGTGTTGATTCTGTGCACAAGATTCGAGAGGAGGGTGATCAAAGGGGAACATCTAGTTCTCA
AATGACCCGATCAAGATCACCTGAAAGAAGTCCTGTTAGTGCTCGGTTTCAACGCTCTGAGCCCCAAAGGTCTCATTCTG
GACCTCGAAATACATCTCAGTTTCAACAAAGATTTGGTAGGCAAGAGCGGGCGGTTGAAGACCGTATCCGGGATGACGTG
CAAAAGTTTTCTAGAGGTTCTCCACAAGCAAGAGCTTATGGAAATAGTGGAGCTAGAGGTCGTTCAAGCCAGTCAAGATC
TCCAAGACATGCCCCTTATACAGGAAACTCATATAACTCATTTGAGGGTCGTAATCAAAACATGGGTGCTTATAGGAATG
ATGGGTGGGATTCTCATAGAAGAGAATCTGGTATCCGGCCTGGTCATCAGTTTGATTACAGCGCCTCCCCACAGACTTTA
GAAGAATTAGAGTTGGAGTACAAGAATGAGGCAGCAGAGCTAATGAAAATCCGTGACAGAGAAGAAGATGAAGAAAATTT
CAAGCATCGTGAGGCTATTAGAGATTTGAGGGAGAAGTACATGAGCAAAGTTTCCTTGGTAAGGGACACACATGCAAAAC
AGTGGGAAGGATTTCTTCAGCTTGATGCGCAGAGGCGTCAACAGCAGGCAGTTCAACAGATGCCTTCTGGTTATCGGGGT
TTTAAACAGCAGAGCTTTCCTGAATATGATGGATCCACTGCCAATCCTCCTCCTTATGCTGGTTCTAATCTACCATTGGA
ATCGAGGAACAGGTTCTCAGACAACATGGAAACTTATCCTAATAGGCCTCATGATAATTTTGGTGAATTTCATAGGCGTG
GAGATTTCGCAAAAGCTTACAACAGATATTAA
Microexon DNA seq CAAGAG
Microexon Amino Acid seq ARA
Microexon-tag DNA Seq GAAGACCGTATCCGGGATGACGTGCAAAAGTTTTCTAGAGGTTCTCCACAAGCAAGAGCTTATGGAAATAGTGGAGCTAGAGGTCGTTCAAGCCAGTCAAGATCTCCA
Microexon-tag Amino Acid seq EDRIRDDVQKFSRGSPQARAYGNSGARGRSSQSRSP
Transcript ID Gm.36897.3
Gene ID Gm.36897
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Gm.36897.3
MANRSDPDIDDDFLELYKEYTGPLGSATTNTQERAKSNKRSNAGSDEEEEARDPNAVPTDFTSREAKVWEAKSKATERNW
KKRKEEEMICKLCGESGHFTQGCPSTLGANRKSQDFFERIPARDKNVRALFTEKVLSKIEKDVGCKIKMDEKFIIVSGKD
RLILAKGVDSVHKIREEGDQRGTSSSQMTRSRSPERSPVSARFQRSEPQRSHSGPRNTSQFQQRFGRQERAVEDRIRDDV
QKFSRGSPQARAYGNSGARGRSSQSRSPRHAPYTGNSYNSFEGRNQNMGAYRNDGWDSHRRESGIRPGHQFDYSASPQTL
EELELEYKNEAAELMKIRDREEDEENFKHREAIRDLREKYMSKVSLVRDTHAKQWEGFLQLDAQRRQQQAVQQMPSGYRG
FKQQSFPEYDGSTANPPPYAGSNLPLESRNRFSDNMETYPNRPHDNFGEFHRRGDFAKAYNRY*
CDS seq >Gm.36897.3
ATGGCTAATAGATCCGATCCGGATATTGATGATGATTTTCTTGAGCTTTATAAGGAGTACACTGGCCCCCTGGGGTCTGC
TACTACAAACACGCAAGAGAGGGCAAAATCAAATAAGAGGTCTAATGCAGGGTCTGATGAGGAGGAGGAAGCCCGGGACC
CTAATGCTGTTCCAACCGATTTCACCAGCCGAGAAGCTAAGGTTTGGGAGGCTAAGTCAAAGGCTACGGAAAGGAATTGG
AAGAAAAGGAAAGAGGAGGAAATGATCTGCAAGCTTTGTGGAGAATCAGGGCATTTCACTCAGGGCTGCCCATCTACTCT
TGGAGCAAATCGCAAGTCTCAAGATTTCTTTGAAAGGATACCAGCCAGAGACAAAAATGTGCGGGCACTTTTCACGGAGA
AAGTTTTAAGCAAGATTGAAAAGGATGTTGGCTGCAAAATTAAGATGGATGAGAAGTTTATTATTGTCAGTGGTAAGGAT
AGATTAATTTTGGCCAAAGGTGTTGATTCTGTGCACAAGATTCGAGAGGAGGGTGATCAAAGGGGAACATCTAGTTCTCA
AATGACCCGATCAAGATCACCTGAAAGAAGTCCTGTTAGTGCTCGGTTTCAACGCTCTGAGCCCCAAAGGTCTCATTCTG
GACCTCGAAATACATCTCAGTTTCAACAAAGATTTGGTAGGCAAGAGCGGGCGGTTGAAGACCGTATCCGGGATGACGTG
CAAAAGTTTTCTAGAGGTTCTCCACAAGCAAGAGCTTATGGAAATAGTGGAGCTAGAGGTCGTTCAAGCCAGTCAAGATC
TCCAAGACATGCCCCTTATACAGGAAACTCATATAACTCATTTGAGGGTCGTAATCAAAACATGGGTGCTTATAGGAATG
ATGGGTGGGATTCTCATAGAAGAGAATCTGGTATCCGGCCTGGTCATCAGTTTGATTACAGCGCCTCCCCACAGACTTTA
GAAGAATTAGAGTTGGAGTACAAGAATGAGGCAGCAGAGCTAATGAAAATCCGTGACAGAGAAGAAGATGAAGAAAATTT
CAAGCATCGTGAGGCTATTAGAGATTTGAGGGAGAAGTACATGAGCAAAGTTTCCTTGGTAAGGGACACACATGCAAAAC
AGTGGGAAGGATTTCTTCAGCTTGATGCGCAGAGGCGTCAACAGCAGGCAGTTCAACAGATGCCTTCTGGTTATCGGGGT
TTTAAACAGCAGAGCTTTCCTGAATATGATGGATCCACTGCCAATCCTCCTCCTTATGCTGGTTCTAATCTACCATTGGA
ATCGAGGAACAGGTTCTCAGACAACATGGAAACTTATCCTAATAGGCCTCATGATAATTTTGGTGAATTTCATAGGCGTG
GAGATTTCGCAAAAGCTTACAACAGATATTAA