Microexon ID Gm_18:54213539-54213547:+
Species Glycine max
Coordinates 18:54213539..54213547
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTACCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAAACTCTCTTGGAACATAACACAGAAGAAGAAAGGGAAGCAAGTTTACCTTGGGGCTTATGATGAAGAAGAATCTGCTGCCAGAGCATATGATTTAGCTGCA
Microexon-tag Amino Acid Seq WDKLSWNITQKKKGKQVYLGAYDEEESAARAYDLAA
Microexon-tag spanning region54213336-54213692
Microexon-tag prediction score0.9338
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH01131x
Reference Transcript ID KRH01131
Gene ID GLYMA_18G256000
Gene Name NA
Transcript ID KRH01131
Protein ID KRH01131
Gene ID GLYMA_18G256000
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.3e-13
Motif start 71
Motif end 130
Protein seq >KRH01131
MVMMKENIIEEKLGRSQMSMVEGEFQGTWGVKRRRREVAAAASSGDDNHHQQLPQQEVGENSSISTTKRSSRFRGVSRHR
WTGRYEAHLWDKLSWNITQKKKGKQVYLGAYDEEESAARAYDLAALKYWGNSTFTNFPISDYEKEIEIMQTMTKEEYLAT
LRRKSSGFSRGVSKYRGVARHHHNGRWEARIGRVFGNKYLYLGTYSTQEEAARAYDIAAIEYRGIHAVTNFDLSTYIKWL
KPSGGGTPEENLESHAVLEHQKLASPSNYALTEESKSLVLPNSFISPDSLDSPVKHESFGNKTYQFSRNKSSSPTALGLL
LRSSLFRELVEKNSNVSGEEADGEVTKDQQPQLASDDDLDGIFFDSFGDIPFVCDPTRYNLELQERDLHSIF*
CDS seq >KRH01131
ATGGTAATGATGAAAGAGAATATTATTGAAGAGAAGCTTGGAAGGAGCCAAATGTCTATGGTAGAAGGTGAGTTCCAAGG
AACATGGGGTGTCAAGAGACGACGAAGAGAGGTTGCTGCAGCAGCAAGCAGCGGTGATGATAACCACCACCAGCAGTTAC
CACAGCAAGAAGTTGGTGAAAATTCTTCAATCAGCACAACAAAGAGAAGCTCAAGATTTCGCGGTGTTAGCAGACATAGA
TGGACGGGTCGGTATGAAGCTCACTTGTGGGACAAACTCTCTTGGAACATAACACAGAAGAAGAAAGGGAAGCAAGTTTA
CCTTGGGGCTTATGATGAAGAAGAATCTGCTGCCAGAGCATATGATTTAGCTGCACTTAAGTATTGGGGAAATTCAACTT
TCACCAATTTTCCAATATCAGATTATGAGAAAGAGATAGAAATAATGCAGACTATGACTAAAGAGGAGTATTTGGCCACT
TTAAGGAGAAAGAGCAGTGGCTTTTCAAGAGGTGTATCAAAGTATCGGGGCGTTGCAAGGCACCACCACAATGGTAGATG
GGAAGCAAGAATAGGAAGGGTTTTTGGAAACAAATATCTCTACCTTGGCACCTACAGCACCCAAGAAGAAGCTGCGCGTG
CTTATGACATAGCAGCCATTGAGTACAGAGGAATTCATGCTGTAACAAACTTTGATTTGAGCACTTACATAAAATGGTTA
AAGCCTTCAGGAGGAGGCACCCCAGAAGAAAATCTTGAATCACATGCAGTACTAGAGCATCAAAAGTTGGCATCCCCATC
TAATTATGCTCTAACAGAAGAGTCTAAGTCTTTGGTCCTTCCCAACAGTTTTATCAGTCCAGATTCTCTGGATTCACCTG
TAAAGCATGAAAGTTTTGGAAACAAAACCTACCAATTCTCAAGAAATAAGTCATCTTCTCCCACTGCACTTGGTCTCCTT
CTGCGCTCTTCATTATTTAGAGAATTGGTTGAAAAGAATTCAAATGTTTCTGGAGAAGAAGCTGATGGGGAAGTCACAAA
AGATCAACAGCCACAACTAGCTAGCGATGATGATCTGGATGGAATCTTCTTCGATAGCTTTGGCGATATTCCATTTGTGT
GTGATCCCACTAGATACAACTTGGAATTGCAGGAAAGAGACCTGCACTCAATATTTTGA
Microexon DNA seq TTTACCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAAACTCTCTTGGAACATAACACAGAAGAAGAAAGGGAAGCAAGTTTACCTTGGGGCTTATGATGAAGAAGAATCTGCTGCCAGAGCATATGATTTAGCTGCA
Microexon-tag Amino Acid seq WDKLSWNITQKKKGKQVYLGAYDEEESAARAYDLAA
Transcript ID KRH01131
Gene ID Gm.26307
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.3e-13
Motif start 71
Motif end 130
Protein seq >KRH01131
MVMMKENIIEEKLGRSQMSMVEGEFQGTWGVKRRRREVAAAASSGDDNHHQQLPQQEVGENSSISTTKRSSRFRGVSRHR
WTGRYEAHLWDKLSWNITQKKKGKQVYLGAYDEEESAARAYDLAALKYWGNSTFTNFPISDYEKEIEIMQTMTKEEYLAT
LRRKSSGFSRGVSKYRGVARHHHNGRWEARIGRVFGNKYLYLGTYSTQEEAARAYDIAAIEYRGIHAVTNFDLSTYIKWL
KPSGGGTPEENLESHAVLEHQKLASPSNYALTEESKSLVLPNSFISPDSLDSPVKHESFGNKTYQFSRNKSSSPTALGLL
LRSSLFRELVEKNSNVSGEEADGEVTKDQQPQLASDDDLDGIFFDSFGDIPFVCDPTRYNLELQERDLHSIF*
CDS seq >KRH01131
ATGGTAATGATGAAAGAGAATATTATTGAAGAGAAGCTTGGAAGGAGCCAAATGTCTATGGTAGAAGGTGAGTTCCAAGG
AACATGGGGTGTCAAGAGACGACGAAGAGAGGTTGCTGCAGCAGCAAGCAGCGGTGATGATAACCACCACCAGCAGTTAC
CACAGCAAGAAGTTGGTGAAAATTCTTCAATCAGCACAACAAAGAGAAGCTCAAGATTTCGCGGTGTTAGCAGACATAGA
TGGACGGGTCGGTATGAAGCTCACTTGTGGGACAAACTCTCTTGGAACATAACACAGAAGAAGAAAGGGAAGCAAGTTTA
CCTTGGGGCTTATGATGAAGAAGAATCTGCTGCCAGAGCATATGATTTAGCTGCACTTAAGTATTGGGGAAATTCAACTT
TCACCAATTTTCCAATATCAGATTATGAGAAAGAGATAGAAATAATGCAGACTATGACTAAAGAGGAGTATTTGGCCACT
TTAAGGAGAAAGAGCAGTGGCTTTTCAAGAGGTGTATCAAAGTATCGGGGCGTTGCAAGGCACCACCACAATGGTAGATG
GGAAGCAAGAATAGGAAGGGTTTTTGGAAACAAATATCTCTACCTTGGCACCTACAGCACCCAAGAAGAAGCTGCGCGTG
CTTATGACATAGCAGCCATTGAGTACAGAGGAATTCATGCTGTAACAAACTTTGATTTGAGCACTTACATAAAATGGTTA
AAGCCTTCAGGAGGAGGCACCCCAGAAGAAAATCTTGAATCACATGCAGTACTAGAGCATCAAAAGTTGGCATCCCCATC
TAATTATGCTCTAACAGAAGAGTCTAAGTCTTTGGTCCTTCCCAACAGTTTTATCAGTCCAGATTCTCTGGATTCACCTG
TAAAGCATGAAAGTTTTGGAAACAAAACCTACCAATTCTCAAGAAATAAGTCATCTTCTCCCACTGCACTTGGTCTCCTT
CTGCGCTCTTCATTATTTAGAGAATTGGTTGAAAAGAATTCAAATGTTTCTGGAGAAGAAGCTGATGGGGAAGTCACAAA
AGATCAACAGCCACAACTAGCTAGCGATGATGATCTGGATGGAATCTTCTTCGATAGCTTTGGCGATATTCCATTTGTGT
GTGATCCCACTAGATACAACTTGGAATTGCAGGAAAGAGACCTGCACTCAATATTTTGA