Microexon ID Gm_19:47071981-47071986:+
Species Glycine max
Coordinates 19:47071981..47071986
Microexon Cluster ID MEP12
Size 6
Phase 1
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 52,6,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GAAGAMCRTGTKCGKGAGGATKTKCARAAGTWYYCWAGRGGWTCYCCACAAGCWAGAGCTTATSGKAATGATGGMRCWMRRRGYCGWTCAASMCATTCAAAATCTCCM
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq CAAGAG
Microexon Amino Acid seq ARA
Microexon-tag DNA Seq GAAGACCGTATCAGGGATGATGTTCAAAAGTTTTCTAGAGGTTCTCCACAAGCAAGAGCTTATGGAAATAGTGGAGCTAGAGGTCGTTCAAGCCAGTCAAGATCTCCA
Microexon-tag Amino Acid Seq EDRIRDDVQKFSRGSPQARAYGNSGARGRSSQSRSP
Microexon-tag spanning region 47071776-47072265
Microexon-tag prediction score 0.955
Overlapped with the annotated transcript (%) 100
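The fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own method. It assumes the Microexon-tag DNA sequence starts in reading frame 0, that the structure field 52,6,50 lists exon sizes in 5' to 3' order, and that the standard genetic code applies. The function names `translate` and `microexon_codons` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed applicable here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def microexon_codons(tag, sizes):
    """Return the amino acids of every codon that overlaps the microexon.

    `sizes` is (upstream exon, microexon, downstream exon) in nucleotides.
    """
    upstream, micro, _downstream = sizes
    start, end = upstream, upstream + micro      # 0-based, half-open interval
    first_codon = start // 3                      # codon containing the first microexon base
    last_codon = (end - 1) // 3                   # codon containing the last microexon base
    return translate(tag[first_codon * 3:(last_codon + 1) * 3])


# Values copied from the record above.
tag = ("GAAGACCGTATCAGGGATGATGTTCAAAAGTTTTCTAGAGGTTCTCCACAAG"
       "CAAGAG"
       "CTTATGGAAATAGTGGAGCTAGAGGTCGTTCAAGCCAGTCAAGATCTCCA")
sizes = (52, 6, 50)

microexon_dna = tag[sizes[0]:sizes[0] + sizes[1]]
phase = sizes[0] % 3          # bases of a split codon left in the upstream exon
tag_protein = translate(tag)
microexon_aa = microexon_codons(tag, sizes)

print(microexon_dna, phase, microexon_aa)
print(tag_protein)
```

Under these assumptions, the 52-nt upstream exon ends one base into a codon, which agrees with the Phase 1 annotation, and the 6-nt microexon therefore touches three codons rather than two. That is why the Microexon Amino Acid seq field holds three residues.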
New Transcript ID KRG96549x
Reference Transcript ID KRG96549
Gene ID GLYMA_19G218000
Gene Name NA
Transcript ID KRG96549
Protein ID KRG96549
Gene ID GLYMA_19G218000
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRG96549
MANRPDPDIDDDFRELYKEYTGPLGTATTNMQERAKSNKRSNAGSDEEEEARDPNAVPTDFTSREAKVWEAKSKATERNW
KKRKEEEMICKLCGESGHFTQGCPSTLGANRKSQDFFERIPARDKNVRALFTEKVLSKIEKDVGCKIKMDEKFIIVSGKD
RLILAKGVDAVHKIREEGDQRGSSSSQMTQSRSPERSPVSARFQRSEPQRSHSGPRNTSQFQQRFGRQERAVEDRIRDDV
QKFSRGSPQARAYGNSGARGRSSQSRSPRHAPYTGNSYNSFDDRNQNMGAYRNEGWDSHRRESGIQPGHQFDYNASPQTL
EELELEYKNEATELMKIRDREEDEENFKHREAIRDLREKYMSKVSLVRVTHAKQLEEFLQLDAQRRRQQVGQQMSSGYRG
FKQQSFPEYDGSTANPPTYAGSNIPLESRNRFSGNMETYPNRPHDNFGEFHRRGDFAKAYNRY*
CDS seq >KRG96549
ATGGCTAATAGACCGGATCCGGATATTGATGATGATTTTCGTGAGCTTTATAAGGAGTACACTGGCCCCCTGGGGACTGC
TACTACAAACATGCAAGAGAGGGCAAAATCAAATAAGAGGTCTAATGCAGGGTCTGACGAGGAGGAGGAAGCCCGGGACC
CTAATGCTGTTCCAACTGATTTCACCAGCCGAGAAGCTAAGGTTTGGGAGGCTAAGTCAAAGGCTACGGAAAGGAATTGG
AAGAAAAGGAAAGAGGAGGAAATGATCTGCAAGCTTTGTGGAGAATCAGGGCATTTCACTCAGGGCTGTCCATCTACTCT
CGGAGCAAATCGAAAGTCTCAAGATTTCTTTGAAAGGATACCTGCCAGAGACAAAAATGTACGGGCACTTTTCACAGAGA
AAGTTTTAAGCAAGATTGAAAAGGATGTTGGCTGCAAAATTAAGATGGATGAGAAGTTTATTATTGTCAGTGGTAAGGAT
AGATTAATTTTGGCCAAAGGTGTTGATGCTGTGCACAAGATTCGAGAGGAGGGTGATCAAAGGGGATCATCTAGTTCTCA
AATGACCCAATCAAGATCACCTGAAAGAAGTCCTGTTAGTGCTCGGTTTCAACGCTCTGAGCCCCAAAGGTCTCATTCTG
GACCACGAAATACATCTCAGTTTCAACAAAGGTTTGGTAGGCAAGAGAGGGCTGTTGAAGACCGTATCAGGGATGATGTT
CAAAAGTTTTCTAGAGGTTCTCCACAAGCAAGAGCTTATGGAAATAGTGGAGCTAGAGGTCGTTCAAGCCAGTCAAGATC
TCCAAGGCATGCCCCTTATACAGGAAACTCATATAATTCATTTGATGATCGTAATCAAAACATGGGTGCTTATAGGAATG
AAGGATGGGATTCTCATAGAAGAGAATCTGGTATCCAGCCTGGTCATCAGTTTGATTACAACGCCTCCCCACAGACTTTA
GAAGAATTAGAGTTGGAGTATAAGAACGAGGCAACAGAGCTAATGAAAATTCGTGACAGAGAAGAAGATGAAGAAAATTT
CAAGCATCGTGAGGCTATTAGAGATTTGAGGGAGAAGTACATGAGCAAAGTTTCCTTGGTAAGGGTCACACATGCAAAAC
AGTTGGAAGAATTTCTTCAGCTTGATGCGCAGAGGCGTCGACAGCAAGTGGGTCAACAGATGTCTTCTGGTTATCGGGGT
TTTAAACAGCAGAGTTTTCCTGAATATGATGGGTCCACTGCCAATCCTCCTACTTATGCTGGTTCTAATATACCATTGGA
ATCGAGGAACAGGTTCTCAGGCAACATGGAAACTTATCCTAATAGGCCTCATGATAATTTTGGTGAATTTCATAGGCGTG
GAGATTTTGCAAAAGCTTACAACAGATATTAA
Microexon DNA seq CAAGAG
Microexon Amino Acid seq ARA
Microexon-tag DNA Seq GAAGACCGTATCAGGGATGATGTTCAAAAGTTTTCTAGAGGTTCTCCACAAGCAAGAGCTTATGGAAATAGTGGAGCTAGAGGTCGTTCAAGCCAGTCAAGATCTCCA
Microexon-tag Amino Acid seq EDRIRDDVQKFSRGSPQARAYGNSGARGRSSQSRSP
Transcript ID Gm.28914.2
Gene ID Gm.28914
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Gm.28914.2
MANRPDPDIDDDFRELYKEYTGPLGTATTNMQERAKSNKRSNAGSDEEEEARDPNAVPTDFTSREAKVWEAKSKATERNW
KKRKEEEMICKLCGESGHFTQGCPSTLGANRKSQDFFERIPARDKNVRALFTEKVLSKIEKDVGCKIKMDEKFIIVSGKD
RLILAKGVDAVHKIREEGDQRGSSSSQMTQSRSPERSPVSARFQRSEPQRSHSGPRNTSQFQQRFGRQERAVEDRIRDDV
QKFSRGSPQARAYGNSGARGRSSQSRSPRHAPYTGNSYNSFDDRNQNMGAYRNEGWDSHRRESGIQPGHQFDYNASPQTL
EELELEYKNEATELMKIRDREEDEENFKHREAIRDLREKYMSKVSLVRVTHAKQLEEFLQLDAQRRRQQVGQQMSSGYRG
FKQQSFPEYDGSTANPPTYAGSNIPLESRNRFSGNMETYPNRPHDNFGEFHRRGDFAKAYNRY*
CDS seq >Gm.28914.2
ATGGCTAATAGACCGGATCCGGATATTGATGATGATTTTCGTGAGCTTTATAAGGAGTACACTGGCCCCCTGGGGACTGC
TACTACAAACATGCAAGAGAGGGCAAAATCAAATAAGAGGTCTAATGCAGGGTCTGACGAGGAGGAGGAAGCCCGGGACC
CTAATGCTGTTCCAACTGATTTCACCAGCCGAGAAGCTAAGGTTTGGGAGGCTAAGTCAAAGGCTACGGAAAGGAATTGG
AAGAAAAGGAAAGAGGAGGAAATGATCTGCAAGCTTTGTGGAGAATCAGGGCATTTCACTCAGGGCTGTCCATCTACTCT
CGGAGCAAATCGAAAGTCTCAAGATTTCTTTGAAAGGATACCTGCCAGAGACAAAAATGTACGGGCACTTTTCACAGAGA
AAGTTTTAAGCAAGATTGAAAAGGATGTTGGCTGCAAAATTAAGATGGATGAGAAGTTTATTATTGTCAGTGGTAAGGAT
AGATTAATTTTGGCCAAAGGTGTTGATGCTGTGCACAAGATTCGAGAGGAGGGTGATCAAAGGGGATCATCTAGTTCTCA
AATGACCCAATCAAGATCACCTGAAAGAAGTCCTGTTAGTGCTCGGTTTCAACGCTCTGAGCCCCAAAGGTCTCATTCTG
GACCACGAAATACATCTCAGTTTCAACAAAGGTTTGGTAGGCAAGAGAGGGCTGTTGAAGACCGTATCAGGGATGATGTT
CAAAAGTTTTCTAGAGGTTCTCCACAAGCAAGAGCTTATGGAAATAGTGGAGCTAGAGGTCGTTCAAGCCAGTCAAGATC
TCCAAGGCATGCCCCTTATACAGGAAACTCATATAATTCATTTGATGATCGTAATCAAAACATGGGTGCTTATAGGAATG
AAGGATGGGATTCTCATAGAAGAGAATCTGGTATCCAGCCTGGTCATCAGTTTGATTACAACGCCTCCCCACAGACTTTA
GAAGAATTAGAGTTGGAGTATAAGAACGAGGCAACAGAGCTAATGAAAATTCGTGACAGAGAAGAAGATGAAGAAAATTT
CAAGCATCGTGAGGCTATTAGAGATTTGAGGGAGAAGTACATGAGCAAAGTTTCCTTGGTAAGGGTCACACATGCAAAAC
AGTTGGAAGAATTTCTTCAGCTTGATGCGCAGAGGCGTCGACAGCAAGTGGGTCAACAGATGTCTTCTGGTTATCGGGGT
TTTAAACAGCAGAGTTTTCCTGAATATGATGGGTCCACTGCCAATCCTCCTACTTATGCTGGTTCTAATATACCATTGGA
ATCGAGGAACAGGTTCTCAGGCAACATGGAAACTTATCCTAATAGGCCTCATGATAATTTTGGTGAATTTCATAGGCGTG
GAGATTTTGCAAAAGCTTACAACAGATATTAA