Microexon ID Gm_2:39192079-39192087:+
Species Glycine max
Coordinates 2:39192079..39192087
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TCTATCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAAGCATTGCTGGAATGAATCACAGAATAAAAAAGGACGACAAGTCTATCTTGGCGCTTATGATAATGAAGAGGCAGCAGCACATGCTTATGATCTAGCAGCA
Microexon-tag Amino Acid Seq WDKHCWNESQNKKGRQVYLGAYDNEEAAAHAYDLAA
Microexon-tag spanning region39191871-39192219
Microexon-tag prediction score0.959
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH72352x
Reference Transcript ID KRH72352
Gene ID GLYMA_02G207100
Gene Name NA
Transcript ID KRH72352
Protein ID KRH72352
Gene ID GLYMA_02G207100
Gene Name NA
Pfam domain motif AP2
Motif E-value 4.5e-13
Motif start 49
Motif end 107
Protein seq >KRH72352
MAKKSQLRTQKNNATNDDINLNATNTVITKVKRTRRSVPRDSPPQRSSIYRGVTRHRWTGRYEAHLWDKHCWNESQNKKG
RQVYLGAYDNEEAAAHAYDLAALKYWGQDTILNFPLSNYLNELKEMEGQSREEYIGSLRRKSSGFSRGISKYRGVARHHH
NGRWEARIGKVFGNKYLYLGTYATQEEAATAYDLAAIEYRGLNAVTNFDLSRYIKWLKPNNTNSNNDQISINLTNINNNC
TNNFIPNPDQEQEVSFFHNQDSLNNTIVEEATLVPHQPRPASATLALELLLQSSKFKEMVEMTSVANLSTQMESDQLPQC
TFPDHIQTYFEYEDSNKYEEGDDLLFKFSEFSSIVPFYHCDEFES*
CDS seq >KRH72352
ATGGCCAAAAAATCACAGCTGCGTACCCAGAAAAACAATGCTACTAATGACGATATTAATCTTAACGCAACCAACACTGT
AATCACCAAGGTGAAACGAACAAGGAGAAGTGTCCCTAGAGACTCCCCACCTCAACGCAGCTCAATATACCGAGGAGTCA
CTAGGCACCGATGGACTGGCCGATACGAAGCTCATTTGTGGGACAAGCATTGCTGGAATGAATCACAGAATAAAAAAGGA
CGACAAGTCTATCTTGGCGCTTATGATAATGAAGAGGCAGCAGCACATGCTTATGATCTAGCAGCACTGAAATACTGGGG
TCAAGATACCATTCTTAATTTTCCGTTATCAAACTACCTGAATGAACTGAAAGAAATGGAGGGTCAATCACGGGAAGAGT
ATATCGGATCGCTGAGGAGGAAAAGCAGTGGTTTTTCTAGGGGGATTTCTAAATACAGAGGTGTTGCAAGGCATCATCAT
AACGGAAGGTGGGAGGCTCGGATTGGCAAAGTTTTTGGCAATAAATATCTTTATCTCGGAACTTACGCTACCCAAGAAGA
AGCTGCTACTGCCTATGACCTGGCAGCTATAGAATACCGTGGACTCAATGCTGTCACCAATTTCGATCTCAGCCGTTACA
TTAAGTGGCTTAAGCCCAACAACACCAACAGCAACAATGACCAGATTAGTATTAATCTCACTAACATAAATAATAATTGC
ACTAACAACTTCATCCCAAACCCTGATCAAGAACAAGAAGTTAGTTTCTTCCACAACCAGGATTCACTCAATAATACTAT
TGTAGAGGAAGCCACGTTGGTGCCACATCAGCCTCGTCCAGCAAGTGCCACGTTAGCATTGGAGCTTCTACTTCAGTCAT
CCAAGTTCAAGGAAATGGTGGAAATGACATCCGTGGCCAATCTTTCAACACAGATGGAATCTGATCAGTTGCCACAGTGC
ACATTTCCTGATCACATTCAGACATACTTTGAGTATGAAGATTCCAATAAATATGAGGAAGGGGATGATCTCCTGTTCAA
GTTCAGCGAGTTCAGCTCCATTGTGCCGTTTTACCATTGTGACGAGTTCGAGAGTTGA
Microexon DNA seq TCTATCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAAGCATTGCTGGAATGAATCACAGAATAAAAAAGGACGACAAGTCTATCTTGGCGCTTATGATAATGAAGAGGCAGCAGCACATGCTTATGATCTAGCAGCA
Microexon-tag Amino Acid seq WDKHCWNESQNKKGRQVYLGAYDNEEAAAHAYDLAA
Transcript ID KRH72352
Gene ID GLYMA_02G207100
Gene Name NA
Pfam domain motif AP2
Motif E-value 4.5e-13
Motif start 49
Motif end 107
Protein seq >KRH72352
MAKKSQLRTQKNNATNDDINLNATNTVITKVKRTRRSVPRDSPPQRSSIYRGVTRHRWTGRYEAHLWDKHCWNESQNKKG
RQVYLGAYDNEEAAAHAYDLAALKYWGQDTILNFPLSNYLNELKEMEGQSREEYIGSLRRKSSGFSRGISKYRGVARHHH
NGRWEARIGKVFGNKYLYLGTYATQEEAATAYDLAAIEYRGLNAVTNFDLSRYIKWLKPNNTNSNNDQISINLTNINNNC
TNNFIPNPDQEQEVSFFHNQDSLNNTIVEEATLVPHQPRPASATLALELLLQSSKFKEMVEMTSVANLSTQMESDQLPQC
TFPDHIQTYFEYEDSNKYEEGDDLLFKFSEFSSIVPFYHCDEFES*
CDS seq >KRH72352
ATGGCCAAAAAATCACAGCTGCGTACCCAGAAAAACAATGCTACTAATGACGATATTAATCTTAACGCAACCAACACTGT
AATCACCAAGGTGAAACGAACAAGGAGAAGTGTCCCTAGAGACTCCCCACCTCAACGCAGCTCAATATACCGAGGAGTCA
CTAGGCACCGATGGACTGGCCGATACGAAGCTCATTTGTGGGACAAGCATTGCTGGAATGAATCACAGAATAAAAAAGGA
CGACAAGTCTATCTTGGCGCTTATGATAATGAAGAGGCAGCAGCACATGCTTATGATCTAGCAGCACTGAAATACTGGGG
TCAAGATACCATTCTTAATTTTCCGTTATCAAACTACCTGAATGAACTGAAAGAAATGGAGGGTCAATCACGGGAAGAGT
ATATCGGATCGCTGAGGAGGAAAAGCAGTGGTTTTTCTAGGGGGATTTCTAAATACAGAGGTGTTGCAAGGCATCATCAT
AACGGAAGGTGGGAGGCTCGGATTGGCAAAGTTTTTGGCAATAAATATCTTTATCTCGGAACTTACGCTACCCAAGAAGA
AGCTGCTACTGCCTATGACCTGGCAGCTATAGAATACCGTGGACTCAATGCTGTCACCAATTTCGATCTCAGCCGTTACA
TTAAGTGGCTTAAGCCCAACAACACCAACAGCAACAATGACCAGATTAGTATTAATCTCACTAACATAAATAATAATTGC
ACTAACAACTTCATCCCAAACCCTGATCAAGAACAAGAAGTTAGTTTCTTCCACAACCAGGATTCACTCAATAATACTAT
TGTAGAGGAAGCCACGTTGGTGCCACATCAGCCTCGTCCAGCAAGTGCCACGTTAGCATTGGAGCTTCTACTTCAGTCAT
CCAAGTTCAAGGAAATGGTGGAAATGACATCCGTGGCCAATCTTTCAACACAGATGGAATCTGATCAGTTGCCACAGTGC
ACATTTCCTGATCACATTCAGACATACTTTGAGTATGAAGATTCCAATAAATATGAGGAAGGGGATGATCTCCTGTTCAA
GTTCAGCGAGTTCAGCTCCATTGTGCCGTTTTACCATTGTGACGAGTTCGAGAGTTGA