Microexon ID Gm_19:39937654-39937662:-
Species Glycine max
Coordinates 19:39937654..39937662
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAAGAGTACGTGGAACCAAAATCAGAATAAGAAGGGAAAGCAAGTTTACTTGGGGGCGTATGATGACGAGGAAGCAGCGGCTAGAGCATATGATCTTGCTGCT
Microexon-tag Amino Acid Seq WDKSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAA
Microexon-tag spanning region39937252-39937861
Microexon-tag prediction score0.9653
Overlapped with the annotated transcript (%) 100
New Transcript ID KRG95239x
Reference Transcript ID KRG95239
Gene ID GLYMA_19G138000
Gene Name NA
Transcript ID KRG95239
Protein ID KRG95239
Gene ID GLYMA_19G138000
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.1e-13
Motif start 61
Motif end 119
Protein seq >KRG95239
MASSSSDPGKSAETSEAAVVVANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTRHRWTGRYEAHLWD
KSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAALKYWGPGTLINFPVTDYTRDLEEMQNVSREEYLASLRRKSSGFSRG
LAKYRGLSSRWDPSYGRMSGSDYFNSMHYGDDSAAESEYVSGFCIERKIDLTSHIKWWGSNKSRHSDAGTRLSEEKKLGS
AGDICIEIKQLEQKVQPTAPYQMPQLGRPQNEKKHRSSSVSALSILSQSAAYKSLQEKASKTLENIIDNDENENKNTINK
LDHGKAVEKPSNHDGGNDRLDIATGMSGTMSLQRNVYPLTPFLSAPLLTAYNTVDPLVDPVLWTSLVPMLPAGLSHTAEV
TKTETSSTYTMFRPEE*
CDS seq >KRG95239
ATGGCTTCGTCTTCCTCCGATCCAGGAAAGTCCGCGGAGACGTCGGAGGCGGCGGTGGTGGTGGCGAACGACCAGCTCTT
GCTCTACAGAGGATTGAAGAAGGCCAAGAAGGAGAGAGGTTGCACGGCCAAAGAGCGCATCAGCAAAATGCCTCCGTGCG
CCGCCGGCAAACGCAGTTCCATATACCGCGGCGTCACGAGGCACAGGTGGACTGGTCGTTATGAAGCGCATCTTTGGGAT
AAGAGTACGTGGAACCAAAATCAGAATAAGAAGGGAAAGCAAGTTTACTTGGGGGCGTATGATGACGAGGAAGCAGCGGC
TAGAGCATATGATCTTGCTGCTCTCAAATATTGGGGTCCTGGGACTCTCATTAACTTTCCTGTGACGGATTATACAAGAG
ATCTTGAAGAAATGCAGAATGTGTCAAGAGAAGAGTACCTTGCATCTTTACGGCGGAAGAGCAGTGGTTTTTCAAGAGGA
CTAGCAAAATATCGTGGACTCTCCAGTCGATGGGATCCATCGTATGGCCGTATGTCTGGATCTGATTACTTCAATAGCAT
GCATTATGGGGATGATTCTGCAGCAGAAAGTGAATATGTAAGTGGTTTCTGCATAGAAAGAAAGATTGATTTAACAAGTC
ACATCAAATGGTGGGGATCTAATAAGAGTAGACATTCTGATGCTGGGACAAGATTATCAGAAGAAAAGAAACTTGGTTCT
GCTGGAGATATTTGCATTGAAATTAAGCAATTGGAGCAGAAAGTTCAACCTACAGCACCTTACCAGATGCCACAGTTAGG
AAGGCCCCAAAATGAGAAAAAGCATAGAAGTTCTTCAGTTTCTGCCTTAAGTATCTTGTCACAATCTGCTGCTTACAAGA
GCTTGCAAGAGAAAGCGTCGAAGACACTGGAAAATATCATAGATAATGATGAGAATGAAAACAAAAATACAATCAACAAG
TTGGATCATGGCAAGGCAGTTGAGAAACCATCAAATCATGACGGAGGCAATGACCGGCTTGACATTGCAACGGGAATGAG
TGGAACAATGTCTCTTCAAAGAAATGTTTACCCATTGACACCATTCTTGTCTGCACCACTTTTAACAGCCTACAATACTG
TTGATCCATTGGTAGATCCTGTTCTCTGGACATCTCTTGTTCCCATGCTTCCTGCTGGCCTTTCTCATACAGCTGAGGTT
ACAAAGACAGAGACCAGTTCAACGTACACGATGTTTCGGCCAGAGGAATGA
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAAGAGTACGTGGAACCAAAATCAGAATAAGAAGGGAAAGCAAGTTTACTTGGGGGCGTATGATGACGAGGAAGCAGCGGCTAGAGCATATGATCTTGCTGCT
Microexon-tag Amino Acid seq WDKSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAA
Transcript ID KRG95239
Gene ID Gm.28084
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.1e-13
Motif start 61
Motif end 119
Protein seq >KRG95239
MASSSSDPGKSAETSEAAVVVANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTRHRWTGRYEAHLWD
KSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAALKYWGPGTLINFPVTDYTRDLEEMQNVSREEYLASLRRKSSGFSRG
LAKYRGLSSRWDPSYGRMSGSDYFNSMHYGDDSAAESEYVSGFCIERKIDLTSHIKWWGSNKSRHSDAGTRLSEEKKLGS
AGDICIEIKQLEQKVQPTAPYQMPQLGRPQNEKKHRSSSVSALSILSQSAAYKSLQEKASKTLENIIDNDENENKNTINK
LDHGKAVEKPSNHDGGNDRLDIATGMSGTMSLQRNVYPLTPFLSAPLLTAYNTVDPLVDPVLWTSLVPMLPAGLSHTAEV
TKTETSSTYTMFRPEE*
CDS seq >KRG95239
ATGGCTTCGTCTTCCTCCGATCCAGGAAAGTCCGCGGAGACGTCGGAGGCGGCGGTGGTGGTGGCGAACGACCAGCTCTT
GCTCTACAGAGGATTGAAGAAGGCCAAGAAGGAGAGAGGTTGCACGGCCAAAGAGCGCATCAGCAAAATGCCTCCGTGCG
CCGCCGGCAAACGCAGTTCCATATACCGCGGCGTCACGAGGCACAGGTGGACTGGTCGTTATGAAGCGCATCTTTGGGAT
AAGAGTACGTGGAACCAAAATCAGAATAAGAAGGGAAAGCAAGTTTACTTGGGGGCGTATGATGACGAGGAAGCAGCGGC
TAGAGCATATGATCTTGCTGCTCTCAAATATTGGGGTCCTGGGACTCTCATTAACTTTCCTGTGACGGATTATACAAGAG
ATCTTGAAGAAATGCAGAATGTGTCAAGAGAAGAGTACCTTGCATCTTTACGGCGGAAGAGCAGTGGTTTTTCAAGAGGA
CTAGCAAAATATCGTGGACTCTCCAGTCGATGGGATCCATCGTATGGCCGTATGTCTGGATCTGATTACTTCAATAGCAT
GCATTATGGGGATGATTCTGCAGCAGAAAGTGAATATGTAAGTGGTTTCTGCATAGAAAGAAAGATTGATTTAACAAGTC
ACATCAAATGGTGGGGATCTAATAAGAGTAGACATTCTGATGCTGGGACAAGATTATCAGAAGAAAAGAAACTTGGTTCT
GCTGGAGATATTTGCATTGAAATTAAGCAATTGGAGCAGAAAGTTCAACCTACAGCACCTTACCAGATGCCACAGTTAGG
AAGGCCCCAAAATGAGAAAAAGCATAGAAGTTCTTCAGTTTCTGCCTTAAGTATCTTGTCACAATCTGCTGCTTACAAGA
GCTTGCAAGAGAAAGCGTCGAAGACACTGGAAAATATCATAGATAATGATGAGAATGAAAACAAAAATACAATCAACAAG
TTGGATCATGGCAAGGCAGTTGAGAAACCATCAAATCATGACGGAGGCAATGACCGGCTTGACATTGCAACGGGAATGAG
TGGAACAATGTCTCTTCAAAGAAATGTTTACCCATTGACACCATTCTTGTCTGCACCACTTTTAACAGCCTACAATACTG
TTGATCCATTGGTAGATCCTGTTCTCTGGACATCTCTTGTTCCCATGCTTCCTGCTGGCCTTTCTCATACAGCTGAGGTT
ACAAAGACAGAGACCAGTTCAACGTACACGATGTTTCGGCCAGAGGAATGA