
Microexon ID | Gm_19:39937654-39937662:- |
Species | Glycine max |
Coordinates | 19:39937654..39937662 |
Microexon Cluster ID | MEP21 |
Size | 9 |
Phase | 1 |
Pfam Domain Motif | AP2 |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 49,9,50 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq (IUPAC ambiguity codes) | TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW |
Microexon DNA seq | TTTACTTGG |
Microexon Amino Acid seq | VYLG |
Microexon-tag DNA Seq | TGGGATAAGAGTACGTGGAACCAAAATCAGAATAAGAAGGGAAAGCAAGTTTACTTGGGGGCGTATGATGACGAGGAAGCAGCGGCTAGAGCATATGATCTTGCTGCT |
Microexon-tag Amino Acid Seq | WDKSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAA |
Microexon-tag spanning region | 39937252-39937861 |
Microexon-tag prediction score | 0.9653 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | KRG95239x |
Reference Transcript ID | KRG95239 |
Gene ID | GLYMA_19G138000 |
Gene Name | NA |
Transcript ID | KRG95239 |
Protein ID | KRG95239 |
Pfam domain motif | AP2 |
Motif E-value | 1.1e-13 |
Motif start | 61 |
Motif end | 119 |
Protein seq | >KRG95239 MASSSSDPGKSAETSEAAVVVANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTRHRWTGRYEAHLWD KSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAALKYWGPGTLINFPVTDYTRDLEEMQNVSREEYLASLRRKSSGFSRG LAKYRGLSSRWDPSYGRMSGSDYFNSMHYGDDSAAESEYVSGFCIERKIDLTSHIKWWGSNKSRHSDAGTRLSEEKKLGS AGDICIEIKQLEQKVQPTAPYQMPQLGRPQNEKKHRSSSVSALSILSQSAAYKSLQEKASKTLENIIDNDENENKNTINK LDHGKAVEKPSNHDGGNDRLDIATGMSGTMSLQRNVYPLTPFLSAPLLTAYNTVDPLVDPVLWTSLVPMLPAGLSHTAEV TKTETSSTYTMFRPEE* |
CDS seq | >KRG95239 ATGGCTTCGTCTTCCTCCGATCCAGGAAAGTCCGCGGAGACGTCGGAGGCGGCGGTGGTGGTGGCGAACGACCAGCTCTT GCTCTACAGAGGATTGAAGAAGGCCAAGAAGGAGAGAGGTTGCACGGCCAAAGAGCGCATCAGCAAAATGCCTCCGTGCG CCGCCGGCAAACGCAGTTCCATATACCGCGGCGTCACGAGGCACAGGTGGACTGGTCGTTATGAAGCGCATCTTTGGGAT AAGAGTACGTGGAACCAAAATCAGAATAAGAAGGGAAAGCAAGTTTACTTGGGGGCGTATGATGACGAGGAAGCAGCGGC TAGAGCATATGATCTTGCTGCTCTCAAATATTGGGGTCCTGGGACTCTCATTAACTTTCCTGTGACGGATTATACAAGAG ATCTTGAAGAAATGCAGAATGTGTCAAGAGAAGAGTACCTTGCATCTTTACGGCGGAAGAGCAGTGGTTTTTCAAGAGGA CTAGCAAAATATCGTGGACTCTCCAGTCGATGGGATCCATCGTATGGCCGTATGTCTGGATCTGATTACTTCAATAGCAT GCATTATGGGGATGATTCTGCAGCAGAAAGTGAATATGTAAGTGGTTTCTGCATAGAAAGAAAGATTGATTTAACAAGTC ACATCAAATGGTGGGGATCTAATAAGAGTAGACATTCTGATGCTGGGACAAGATTATCAGAAGAAAAGAAACTTGGTTCT GCTGGAGATATTTGCATTGAAATTAAGCAATTGGAGCAGAAAGTTCAACCTACAGCACCTTACCAGATGCCACAGTTAGG AAGGCCCCAAAATGAGAAAAAGCATAGAAGTTCTTCAGTTTCTGCCTTAAGTATCTTGTCACAATCTGCTGCTTACAAGA GCTTGCAAGAGAAAGCGTCGAAGACACTGGAAAATATCATAGATAATGATGAGAATGAAAACAAAAATACAATCAACAAG TTGGATCATGGCAAGGCAGTTGAGAAACCATCAAATCATGACGGAGGCAATGACCGGCTTGACATTGCAACGGGAATGAG TGGAACAATGTCTCTTCAAAGAAATGTTTACCCATTGACACCATTCTTGTCTGCACCACTTTTAACAGCCTACAATACTG TTGATCCATTGGTAGATCCTGTTCTCTGGACATCTCTTGTTCCCATGCTTCCTGCTGGCCTTTCTCATACAGCTGAGGTT ACAAAGACAGAGACCAGTTCAACGTACACGATGTTTCGGCCAGAGGAATGA |
Transcript ID | KRG95239 |
Gene ID | Gm.28084 |
Gene Name | NA |
Pfam domain motif | AP2 |
Motif E-value | 1.1e-13 |
Motif start | 61 |
Motif end | 119 |
Protein seq | >KRG95239 MASSSSDPGKSAETSEAAVVVANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTRHRWTGRYEAHLWD KSTWNQNQNKKGKQVYLGAYDDEEAAARAYDLAALKYWGPGTLINFPVTDYTRDLEEMQNVSREEYLASLRRKSSGFSRG LAKYRGLSSRWDPSYGRMSGSDYFNSMHYGDDSAAESEYVSGFCIERKIDLTSHIKWWGSNKSRHSDAGTRLSEEKKLGS AGDICIEIKQLEQKVQPTAPYQMPQLGRPQNEKKHRSSSVSALSILSQSAAYKSLQEKASKTLENIIDNDENENKNTINK LDHGKAVEKPSNHDGGNDRLDIATGMSGTMSLQRNVYPLTPFLSAPLLTAYNTVDPLVDPVLWTSLVPMLPAGLSHTAEV TKTETSSTYTMFRPEE* |
CDS seq | >KRG95239 ATGGCTTCGTCTTCCTCCGATCCAGGAAAGTCCGCGGAGACGTCGGAGGCGGCGGTGGTGGTGGCGAACGACCAGCTCTT GCTCTACAGAGGATTGAAGAAGGCCAAGAAGGAGAGAGGTTGCACGGCCAAAGAGCGCATCAGCAAAATGCCTCCGTGCG CCGCCGGCAAACGCAGTTCCATATACCGCGGCGTCACGAGGCACAGGTGGACTGGTCGTTATGAAGCGCATCTTTGGGAT AAGAGTACGTGGAACCAAAATCAGAATAAGAAGGGAAAGCAAGTTTACTTGGGGGCGTATGATGACGAGGAAGCAGCGGC TAGAGCATATGATCTTGCTGCTCTCAAATATTGGGGTCCTGGGACTCTCATTAACTTTCCTGTGACGGATTATACAAGAG ATCTTGAAGAAATGCAGAATGTGTCAAGAGAAGAGTACCTTGCATCTTTACGGCGGAAGAGCAGTGGTTTTTCAAGAGGA CTAGCAAAATATCGTGGACTCTCCAGTCGATGGGATCCATCGTATGGCCGTATGTCTGGATCTGATTACTTCAATAGCAT GCATTATGGGGATGATTCTGCAGCAGAAAGTGAATATGTAAGTGGTTTCTGCATAGAAAGAAAGATTGATTTAACAAGTC ACATCAAATGGTGGGGATCTAATAAGAGTAGACATTCTGATGCTGGGACAAGATTATCAGAAGAAAAGAAACTTGGTTCT GCTGGAGATATTTGCATTGAAATTAAGCAATTGGAGCAGAAAGTTCAACCTACAGCACCTTACCAGATGCCACAGTTAGG AAGGCCCCAAAATGAGAAAAAGCATAGAAGTTCTTCAGTTTCTGCCTTAAGTATCTTGTCACAATCTGCTGCTTACAAGA GCTTGCAAGAGAAAGCGTCGAAGACACTGGAAAATATCATAGATAATGATGAGAATGAAAACAAAAATACAATCAACAAG TTGGATCATGGCAAGGCAGTTGAGAAACCATCAAATCATGACGGAGGCAATGACCGGCTTGACATTGCAACGGGAATGAG TGGAACAATGTCTCTTCAAAGAAATGTTTACCCATTGACACCATTCTTGTCTGCACCACTTTTAACAGCCTACAATACTG TTGATCCATTGGTAGATCCTGTTCTCTGGACATCTCTTGTTCCCATGCTTCCTGCTGGCCTTTCTCATACAGCTGAGGTT ACAAAGACAGAGACCAGTTCAACGTACACGATGTTTCGGCCAGAGGAATGA |
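
The tag structure (49,9,50), the phase (1), and the microexon amino acid sequence (VYLG) reported in this record are mutually consistent, and that relationship can be checked with a few lines of Python. The sketch below is illustrative only and is not the database's own pipeline: the helper names `translate`, `split_tag`, and `microexon_residues` are hypothetical, and it assumes the standard genetic code, a tag that starts in frame, and a phase defined as the number of codon bases contributed by the upstream exon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Values copied from the record above.
TAG = ("TGGGATAAGAGTACGTGGAACCAAAATCAGAATAAGAAGGGAAAGCAAG"  # upstream exon part, 49 nt
       "TTTACTTGG"                                          # microexon, 9 nt
       "GGGCGTATGATGACGAGGAAGCAGCGGCTAGAGCATATGATCTTGCTGCT")  # downstream exon part, 50 nt
SIZES = (49, 9, 50)


def translate(dna):
    """Translate complete codons from the first base; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for n in sizes:
        parts.append(tag[pos:pos + n])
        pos += n
    if pos != len(tag):
        raise ValueError("piece sizes do not add up to the tag length")
    return parts


def microexon_residues(tag, sizes):
    """Return the residues whose codons overlap the middle piece (the microexon)."""
    start = sizes[0]               # 0-based offset of the first microexon base
    end = start + sizes[1]         # exclusive end offset
    first_codon = start // 3
    last_codon = (end - 1) // 3
    return translate(tag)[first_codon:last_codon + 1]


if __name__ == "__main__":
    upstream, microexon, downstream = split_tag(TAG, SIZES)
    print("microexon:", microexon)
    print("phase (upstream length mod 3):", len(upstream) % 3)
    print("tag protein:", translate(TAG))
    print("residues overlapping the microexon:", microexon_residues(TAG, SIZES))
```

With sizes 49,9,50 the microexon occupies tag bases 50 to 58. Because the upstream part ends one base into codon 17, the 9 nt microexon overlaps codons 17 to 20, which is why four residues are listed for a microexon of only three codons' length.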