Microexon ID Gm_1:52991999-52992007:+
Species Glycine max
Coordinates 1:52991999..52992007
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
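The coordinate, size, and structure fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the reported Size of 9) and that "Microexon location" is a 1-based index into the structure triple. The function name `parse_microexon_id` and the ID pattern are illustrative assumptions, not part of the database.

```python
import re

def parse_microexon_id(microexon_id):
    """Split an ID such as 'Gm_1:52991999-52992007:+' into its parts.

    The pattern (species prefix, chromosome, start, end, strand) is inferred
    from this single record and is an assumption, not a documented format.
    """
    match = re.fullmatch(r"([A-Za-z]+)_(\w+):(\d+)-(\d+):([+-])", microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "prefix": prefix,
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes 1-based inclusive coordinates.
        "size": end - start + 1,
    }

record = parse_microexon_id("Gm_1:52991999-52992007:+")

# Structure of the microexon-tag: flanking exon, microexon, flanking exon.
structure = (49, 9, 50)
location = 2  # assumed to be a 1-based index into `structure`

tag_length = sum(structure)
microexon_size = structure[location - 1]
```

Under these assumptions, `record["size"]` and `microexon_size` both come out as 9, and `tag_length` is 108, which is three times the 36 residues in the microexon-tag amino acid sequence.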
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTATCTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAACAGCTGCAAGAAAGAGGGCCAAGGAAGGAAAGGAAGGCAAGTTTATCTAGGGGGTTATGATATGGAAGAAAAAGCAGCAAGAGCTTATGATATGGCCGCA
Microexon-tag Amino Acid Seq WDNSCKKEGQGRKGRQVYLGGYDMEEKAARAYDMAA
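The relationship between the tag structure (49,9,50), the microexon DNA sequence, and the amino acid sequences above can be reproduced with a short script. This is a minimal sketch, assuming the standard genetic code and assuming that Phase 1 means one codon base precedes the microexon. The helper names `translate` and `split_tag` are hypothetical and are not taken from the database.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    if pos != len(tag):
        raise ValueError("sizes do not add up to the tag length")
    return parts

# Microexon-tag DNA sequence copied from the record.
TAG = "TGGGACAACAGCTGCAAGAAAGAGGGCCAAGGAAGGAAAGGAAGGCAAGTTTATCTAGGGGGTTATGATATGGAAGAAAAAGCAGCAAGAGCTTATGATATGGCCGCA"

upstream, microexon, downstream = split_tag(TAG, (49, 9, 50))

# The upstream flank ends one base into a codon (49 % 3 == 1), so the
# codons overlapping the microexon run from offset 48 to offset 60.
overlapping_codons = TAG[48:60]

tag_protein = translate(TAG)
microexon_protein = translate(overlapping_codons)
```

Here `microexon` is `TTTATCTAG`, `tag_protein` matches the microexon-tag amino acid sequence listed above, and `microexon_protein` is `VYLG`. Four residues are reported for a 9-nt exon because the microexon starts and ends inside codons, so the first and last codons borrow bases from the flanking exons.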
Microexon-tag spanning region 52991761-52992222
Microexon-tag prediction score 0.9721
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH77164x
Reference Transcript ID KRH77164
Gene ID GLYMA_01G195900
Gene Name NA
Transcript ID KRH77164
Protein ID KRH77164
Gene ID GLYMA_01G195900
Gene Name NA
Pfam domain motif AP2
Motif E-value 5.3e-13
Motif start 275
Motif end 333
Protein seq >KRH77164
MKPIENGNTVSINYQNSWLGFSLSPQMNIGVPSHLHQTQPSSAAVEAVPPNFYHHTPLHNYGLYYELEGEHVGMSSSLPI
MPLKSNASLSGIEALSRSQAQATTTISAMHSSLNSMLINELLCHGLNNPNNLNHVQEDISQQQFSYYSTLRNQDVILEGS
KHQLPCIAEDENPGLKSWFSRDFHARHAEESRMIVPLECNGGESGSIGSITYGDLHSSNLSVSPTSGSSSVTSSPALTNT
VATNTKKRWLEMVDQNQKQIVHRKSIDTFGQRTSQYRGVTRHRWTGRYEAHLWDNSCKKEGQGRKGRQVYLGGYDMEEKA
ARAYDMAALKYWGPSSHINFPLENYQNELEEMKNMTRQEYVAHLRRKSSGFSRGASMYRGVTRSQRHHQHGRWQARIGRV
AGNKDLYLGTFSTQEEAAEAYDIAAIKFRGVNAVTNFDITRYDVEKIMESNNLLSSEQAKRKREMDDGTRSEATVNQKPS
TYDHTQETILMQKRCKNQSEWKMVQFPCPQQLDQNQRIESCRTQPFSTDLDNMFRHQVEERSNMGTHLSNPSSLVTSLSS
SREESPDKTSMPMLFGMPSTVSKLLANVDSWDLSSNLRTALSMPQMPIFAAWTDA*
CDS seq >KRH77164
ATGAAGCCTATAGAAAATGGCAATACCGTTAGCATCAACTATCAAAATAGCTGGTTAGGCTTCTCCCTCTCCCCACAAAT
GAATATAGGTGTTCCTTCACACCTTCATCAGACACAGCCTTCATCAGCAGCTGTTGAAGCAGTTCCACCAAACTTTTACC
ATCATACACCTCTCCATAATTATGGCCTTTACTATGAACTGGAAGGTGAACACGTTGGAATGAGTTCATCCTTGCCTATC
ATGCCCCTTAAGTCCAATGCTTCTCTCTCTGGAATAGAAGCTCTGAGTAGATCACAAGCACAAGCAACGACAACTATTTC
AGCAATGCATTCAAGCTTAAATAGTATGCTTATCAACGAACTCTTATGTCATGGACTAAACAATCCAAATAACTTAAATC
ACGTTCAAGAGGATATCAGCCAGCAACAATTCTCGTATTACTCTACCTTAAGGAACCAAGATGTGATTTTAGAAGGTTCC
AAACACCAGCTCCCATGTATAGCTGAGGATGAAAATCCAGGTTTGAAGAGTTGGTTCTCAAGGGATTTTCATGCTAGACA
TGCAGAAGAATCAAGGATGATTGTTCCTTTGGAGTGTAATGGAGGCGAATCTGGATCCATTGGATCAATAACATATGGGG
ATTTGCATTCATCGAATTTGTCTGTGAGTCCTACCTCAGGATCAAGCAGTGTTACAAGTTCACCTGCTCTCACTAATACT
GTTGCCACTAATACTAAGAAAAGGTGGCTTGAAATGGTGGACCAGAATCAGAAGCAAATAGTTCATAGGAAATCCATAGA
TACCTTTGGGCAAAGAACATCTCAATATAGAGGTGTAACAAGGCATAGGTGGACTGGTAGATATGAAGCTCATCTATGGG
ACAACAGCTGCAAGAAAGAGGGCCAAGGAAGGAAAGGAAGGCAAGTTTATCTAGGGGGTTATGATATGGAAGAAAAAGCA
GCAAGAGCTTATGATATGGCCGCACTCAAGTATTGGGGACCCTCCTCTCATATAAATTTCCCATTGGAAAATTATCAAAA
CGAACTTGAGGAAATGAAGAACATGACGAGACAAGAATATGTTGCTCATTTACGAAGGAAAAGCAGCGGATTCTCAAGAG
GGGCCTCCATGTACAGAGGAGTAACAAGGTCTCAAAGGCACCACCAACATGGAAGGTGGCAAGCTCGAATTGGAAGGGTA
GCCGGAAACAAAGATCTATATCTTGGAACCTTTAGTACCCAAGAGGAAGCAGCTGAAGCCTATGACATTGCTGCTATTAA
ATTCAGAGGAGTTAATGCTGTCACTAACTTTGATATAACAAGATATGACGTGGAAAAAATTATGGAGAGCAATAACCTTC
TTAGCAGTGAACAAGCTAAGCGGAAAAGAGAGATGGATGATGGAACTAGAAGCGAGGCTACCGTTAACCAAAAACCTTCT
ACATATGACCACACTCAAGAAACCATTCTAATGCAGAAAAGATGCAAAAACCAATCAGAATGGAAGATGGTTCAGTTTCC
ATGCCCCCAACAGCTTGATCAGAATCAAAGAATCGAGAGTTGTAGAACTCAGCCCTTCTCAACGGACTTAGATAACATGT
TTCGTCACCAAGTTGAGGAACGGAGCAACATGGGAACACACTTGTCAAATCCTTCTTCTCTGGTGACAAGTTTGAGTAGC
TCAAGAGAAGAGAGCCCAGATAAGACAAGCATGCCAATGCTCTTTGGAATGCCTTCAACAGTGTCCAAATTATTGGCTAA
CGTGGATTCTTGGGATCTATCTTCCAATCTCAGGACTGCGCTTTCTATGCCTCAGATGCCAATTTTTGCTGCTTGGACAG
ATGCATAA
Transcript ID KRH77164
Gene ID Gm.1876
Gene Name NA
Pfam domain motif AP2
Motif E-value 5.3e-13
Motif start 275
Motif end 333