Microexon ID Gm_9:42431050-42431058:+
Species Glycine max
Coordinates 9:42431050..42431058
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
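The first Microexon-tag DNA Seq entry above contains IUPAC ambiguity codes (Y, R, W, S and so on), so it reads as a degenerate, cluster-level sequence rather than a single genomic sequence. That reading is an assumption, not something the record states. Under that assumption, the minimal Python sketch below compares the degenerate sequence with the soybean Microexon-tag DNA Seq listed later in this record and reports the positions where the soybean base is not covered by the ambiguity code. The names `IUPAC`, `mismatch_positions`, `CLUSTER_TAG` and `SOYBEAN_TAG` are hypothetical and chosen only for illustration.

```python
# IUPAC nucleotide codes mapped to the concrete bases each one allows.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def mismatch_positions(degenerate, concrete):
    """Return 1-based positions where `concrete` is not allowed by `degenerate`."""
    if len(degenerate) != len(concrete):
        raise ValueError("sequences must have the same length")
    return [
        pos
        for pos, (code, base) in enumerate(zip(degenerate, concrete), start=1)
        if base not in IUPAC[code]
    ]


# Both sequences are copied from this record; treating the first one as a
# cluster-level degenerate sequence is an assumption.
CLUSTER_TAG = "TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW"
SOYBEAN_TAG = "TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCTAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGAAAAGGCCGCTAGATCTTATGATTTGGCAGCT"

positions = mismatch_positions(CLUSTER_TAG, SOYBEAN_TAG)

# With flanking sizes 49,9,50 the microexon occupies tag positions 50..58.
microexon_hits = [p for p in positions if 50 <= p <= 58]

print("tag length:", len(SOYBEAN_TAG))
print("positions not covered by the degenerate sequence:", positions)
print("of which inside the microexon:", microexon_hits)
```

Some flanking positions fall outside the ambiguity codes, which is not surprising if the degenerate sequence summarizes many cluster members rather than this gene alone.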
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCTAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGAAAAGGCCGCTAGATCTTATGATTTGGCAGCT
Microexon-tag Amino Acid Seq WDNSCRREGQARKGRQVYLGGYDKEEKAARSYDLAA
Microexon-tag spanning region 42429656-42431224
Microexon-tag prediction score 0.9634
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH39470x
Reference Transcript ID KRH39470
Gene ID GLYMA_09G199800
Gene Name NA
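The size, phase, tag structure and sequence fields above can be cross-checked against each other. The Python sketch below splits the soybean Microexon-tag DNA Seq by the listed sizes 49,9,50, translates it with the standard genetic code, and recovers the residues whose codons overlap the microexon. It assumes that the tag starts in reading frame 0 (consistent with the listed tag amino acid sequence) and that "Phase 1" means one base of a split codon sits in the upstream flanking exon. All function and variable names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate complete codons of `dna`, starting at its first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut `tag` into consecutive pieces of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("sizes do not add up to the tag length")
    pieces, start = [], 0
    for size in sizes:
        pieces.append(tag[start:start + size])
        start += size
    return pieces


# Values copied from this record.
TAG = "TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCTAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGAAAAGGCCGCTAGATCTTATGATTTGGCAGCT"
SIZES = (49, 9, 50)
COORD_START, COORD_END = 42431050, 42431058

upstream, microexon, downstream = split_tag(TAG, SIZES)

# Assumed phase definition: bases of a split codon left in the upstream exon.
phase = len(upstream) % 3

# Residues whose codons overlap the microexon (0-based codon indices).
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
tag_protein = translate(TAG)
microexon_residues = tag_protein[first_codon:last_codon + 1]

print("microexon:", microexon)
print("size from coordinates:", COORD_END - COORD_START + 1)
print("phase:", phase)
print("tag protein:", tag_protein)
print("residues overlapping the microexon:", microexon_residues)
```

Because the microexon starts one base into a codon, its 9 nucleotides touch four codons rather than three, which matches the four-residue Microexon Amino Acid seq in the record.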
Transcript ID KRH39470
Protein ID KRH39470
Gene ID GLYMA_09G199800
Gene Name NA
Pfam domain motif AP2
Motif E-value 6e-12
Motif start 223
Motif end 281
Protein seq >KRH39470
MARASTNWLSFSLSPMDMLRTPEPQFVQYDAASDTSSHHYYLDNLYTNGWGNGSLKFEQNLNHSDVSFVQSSSQSVSHAP
PKLEDFLGDSSAVMRYSDSQTETQDSSLTHIYDHHHHHHHGSSAYFGGDHQDLKAITGFQAFSTNSGSEVDDSASIGKAQ
GSEFGTHSIESSVNEFAAFSGGTNTGGTLSLAVAQSSEKAVAAAAESDRSKKVVDTFGQRTSIYRGVTRHRWTGRYEAHL
WDNSCRREGQARKGRQVYLGGYDKEEKAARSYDLAALKYWGPTATTNFPVSNYSKEVEEMKHVTKQEFIASLRRKSSGFS
RGASIYRGVTRHHQQGRWQARIGRVAGNKDLYLGTFATEEEAAEAYDIAAIKFRGANAVTNFEMNRYDVEAIMKSSLPVG
GAAKRLKLSLESEQKALPVSSSSSSSQQQNPQCGNVSASINFSSIHQPIASIPCGIPFDSTTAYYHHNLFQHFHPTNAGT
AASAVTSANANALTALPPTAAAEFFIWPHQSY*
CDS seq >KRH39470
ATGGCTCGTGCTTCGACCAACTGGCTATCGTTCTCTCTCTCCCCCATGGATATGCTCCGAACCCCCGAACCTCAGTTCGT
TCAATACGACGCCGCTTCCGACACTTCCTCGCATCACTACTACCTCGACAACTTGTACACCAACGGGTGGGGGAACGGGA
GCCTCAAGTTTGAGCAGAATCTGAACCACAGCGACGTGAGTTTCGTTCAATCGTCGTCGCAGAGCGTCAGCCACGCGCCG
CCGAAGCTGGAGGATTTTCTCGGCGACTCCTCCGCTGTTATGCGTTACTCCGACAGCCAGACGGAGACGCAGGACTCGTC
GCTGACGCACATCTACGACCACCACCACCACCACCACCACGGTTCTTCTGCGTACTTCGGCGGTGACCACCAGGATCTCA
AGGCCATTACTGGATTCCAAGCTTTTTCGACTAACTCTGGCTCCGAGGTTGATGATTCTGCATCGATCGGAAAGGCGCAG
GGCAGCGAGTTCGGGACTCACTCTATTGAGTCCTCCGTCAACGAGTTCGCCGCGTTCTCCGGTGGCACCAACACCGGTGG
AACCTTGTCGCTCGCCGTCGCGCAGAGCTCCGAGAAGGCCGTCGCTGCTGCGGCGGAGTCCGATCGCTCGAAGAAGGTTG
TGGATACCTTCGGCCAGCGGACTTCTATATACAGAGGTGTCACTAGGCACCGATGGACAGGAAGATATGAAGCGCATCTA
TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCTAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGAAAA
GGCCGCTAGATCTTATGATTTGGCAGCTCTGAAGTACTGGGGTCCCACTGCTACCACCAACTTCCCTGTTTCCAATTATT
CAAAGGAAGTGGAGGAGATGAAACATGTAACAAAGCAGGAATTTATCGCATCATTGCGAAGGAAAAGTAGTGGTTTCTCC
AGGGGAGCTTCCATATACAGAGGTGTTACAAGGCATCATCAACAGGGTAGGTGGCAAGCAAGAATTGGCCGTGTAGCTGG
AAACAAAGATCTTTACTTGGGAACATTCGCAACCGAGGAGGAAGCAGCAGAGGCATATGATATTGCAGCCATTAAGTTCA
GAGGTGCAAACGCAGTAACCAACTTTGAGATGAATAGATATGATGTGGAAGCTATAATGAAGAGTTCTCTTCCAGTGGGT
GGGGCAGCAAAGCGCTTGAAGCTTTCCCTTGAATCAGAGCAGAAAGCTCTTCCTGTGAGCAGCAGCAGCAGCAGCAGTCA
GCAGCAGAATCCACAGTGTGGAAACGTGAGTGCCAGCATCAATTTCTCATCCATTCATCAGCCAATTGCTTCTATCCCTT
GTGGAATTCCCTTTGATTCAACAACAGCATATTATCATCACAACCTTTTCCAACATTTTCACCCTACCAACGCTGGCACA
GCAGCGTCTGCTGTTACTTCTGCCAATGCAAATGCACTAACTGCACTGCCACCAACAGCAGCAGCTGAGTTCTTTATTTG
GCCTCATCAGTCTTATTGA
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCTAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGAAAAGGCCGCTAGATCTTATGATTTGGCAGCT
Microexon-tag Amino Acid seq WDNSCRREGQARKGRQVYLGGYDKEEKAARSYDLAA
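To relate the microexon to the AP2 motif coordinates listed for this transcript, the sketch below locates the Microexon-tag Amino Acid seq in the KRH39470 protein and checks whether the tag and the microexon residues fall inside the motif range. It assumes that Motif start and Motif end are 1-based, inclusive protein positions, which the record does not state explicitly. The trailing stop symbol is omitted from the protein, and the helper names `locate` and `inside_motif` are hypothetical.

```python
# KRH39470 protein from this record, without the trailing "*".
PROTEIN = (
    "MARASTNWLSFSLSPMDMLRTPEPQFVQYDAASDTSSHHYYLDNLYTNGWGNGSLKFEQNLNHSDVSFVQSSSQSVSHAP"
    "PKLEDFLGDSSAVMRYSDSQTETQDSSLTHIYDHHHHHHHGSSAYFGGDHQDLKAITGFQAFSTNSGSEVDDSASIGKAQ"
    "GSEFGTHSIESSVNEFAAFSGGTNTGGTLSLAVAQSSEKAVAAAAESDRSKKVVDTFGQRTSIYRGVTRHRWTGRYEAHL"
    "WDNSCRREGQARKGRQVYLGGYDKEEKAARSYDLAALKYWGPTATTNFPVSNYSKEVEEMKHVTKQEFIASLRRKSSGFS"
    "RGASIYRGVTRHHQQGRWQARIGRVAGNKDLYLGTFATEEEAAEAYDIAAIKFRGANAVTNFEMNRYDVEAIMKSSLPVG"
    "GAAKRLKLSLESEQKALPVSSSSSSSQQQNPQCGNVSASINFSSIHQPIASIPCGIPFDSTTAYYHHNLFQHFHPTNAGT"
    "AASAVTSANANALTALPPTAAAEFFIWPHQSY"
)
TAG_AA = "WDNSCRREGQARKGRQVYLGGYDKEEKAARSYDLAA"
MICROEXON_AA = "VYLG"

# Assumed to be 1-based, inclusive protein coordinates.
MOTIF_START, MOTIF_END = 223, 281


def locate(protein, fragment):
    """Return the 1-based inclusive (start, end) of the first match."""
    index = protein.find(fragment)
    if index < 0:
        raise ValueError("fragment not found in protein")
    return index + 1, index + len(fragment)


def inside_motif(span):
    """True if the span lies entirely within the motif range."""
    return MOTIF_START <= span[0] and span[1] <= MOTIF_END


tag_span = locate(PROTEIN, TAG_AA)

# Locate the microexon residues relative to the tag, not the whole protein,
# so that an unrelated earlier match cannot be picked up.
micro_start = tag_span[0] + TAG_AA.find(MICROEXON_AA)
micro_span = (micro_start, micro_start + len(MICROEXON_AA) - 1)

print("tag residues:", tag_span, "inside motif:", inside_motif(tag_span))
print("microexon residues:", micro_span, "inside motif:", inside_motif(micro_span))
```

The same check can be repeated for the Gm.53686.1 entry, which lists identical motif coordinates.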
Transcript ID Gm.53686.1
Gene ID Gm.53686
Gene Name NA
Pfam domain motif AP2
Motif E-value 6e-12
Motif start 223
Motif end 281
Protein seq >Gm.53686.1
MARASTNWLSFSLSPMDMLRTPEPQFVQYDAASDTSSHHYYLDNLYTNGWGNGSLKFEQNLNHSDVSFVQSSSQSVSHAP
PKLEDFLGDSSAVMRYSDSQTETQDSSLTHIYDHHHHHHHGSSAYFGGDHQDLKAITGFQAFSTNSGSEVDDSASIGKAQ
GSEFGTHSIESSVNEFAAFSGGTNTGGTLSLAVAQSSEKAVAAAAESDRSKKVVDTFGQRTSIYRGVTRHRWTGRYEAHL
WDNSCRREGQARKGRQVYLGGYDKEEKAARSYDLAALKYWGPTATTNFPVSNYSKEVEEMKHVTKQEFIASLRRKSSGFS
RGASIYRGVTRHHQQGRWQARIGRVAGNKDLYLGTFATEEEAAEAYDIAAIKFRGANAVTNFEMNRYDVEAIMKSSLPVG
GAAKRLKLSLESEQKALPVSSSSSSSQQQNPQCGNVSASINFSSIHQPIASIPCGIPFDSTTAYYHHNLFQHFHPTNAGT
AASAVTSANANALTALPPTAAAEFFIWPHQSY*
CDS seq >Gm.53686.1
ATGGCTCGTGCTTCGACCAACTGGCTATCGTTCTCTCTCTCCCCCATGGATATGCTCCGAACCCCCGAACCTCAGTTCGT
TCAATACGACGCCGCTTCCGACACTTCCTCGCATCACTACTACCTCGACAACTTGTACACCAACGGGTGGGGGAACGGGA
GCCTCAAGTTTGAGCAGAATCTGAACCACAGCGACGTGAGTTTCGTTCAATCGTCGTCGCAGAGCGTCAGCCACGCGCCG
CCGAAGCTGGAGGATTTTCTCGGCGACTCCTCCGCTGTTATGCGTTACTCCGACAGCCAGACGGAGACGCAGGACTCGTC
GCTGACGCACATCTACGACCACCACCACCACCACCACCACGGTTCTTCTGCGTACTTCGGCGGTGACCACCAGGATCTCA
AGGCCATTACTGGATTCCAAGCTTTTTCGACTAACTCTGGCTCCGAGGTTGATGATTCTGCATCGATCGGAAAGGCGCAG
GGCAGCGAGTTCGGGACTCACTCTATTGAGTCCTCCGTCAACGAGTTCGCCGCGTTCTCCGGTGGCACCAACACCGGTGG
AACCTTGTCGCTCGCCGTCGCGCAGAGCTCCGAGAAGGCCGTCGCTGCTGCGGCGGAGTCCGATCGCTCGAAGAAGGTTG
TGGATACCTTCGGCCAGCGGACTTCTATATACAGAGGTGTCACTAGGCACCGATGGACAGGAAGATATGAAGCGCATCTA
TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCTAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGAAAA
GGCCGCTAGATCTTATGATTTGGCAGCTCTGAAGTACTGGGGTCCCACTGCTACCACCAACTTCCCTGTTTCCAATTATT
CAAAGGAAGTGGAGGAGATGAAACATGTAACAAAGCAGGAATTTATCGCATCATTGCGAAGGAAAAGTAGTGGTTTCTCC
AGGGGAGCTTCCATATACAGAGGTGTTACAAGGCATCATCAACAGGGTAGGTGGCAAGCAAGAATTGGCCGTGTAGCTGG
AAACAAAGATCTTTACTTGGGAACATTCGCAACCGAGGAGGAAGCAGCAGAGGCATATGATATTGCAGCCATTAAGTTCA
GAGGTGCAAACGCAGTAACCAACTTTGAGATGAATAGATATGATGTGGAAGCTATAATGAAGAGTTCTCTTCCAGTGGGT
GGGGCAGCAAAGCGCTTGAAGCTTTCCCTTGAATCAGAGCAGAAAGCTCTTCCTGTGAGCAGCAGCAGCAGCAGCAGTCA
GCAGCAGAATCCACAGTGTGGAAACGTGAGTGCCAGCATCAATTTCTCATCCATTCATCAGCCAATTGCTTCTATCCCTT
GTGGAATTCCCTTTGATTCAACAACAGCATATTATCATCACAACCTTTTCCAACATTTTCACCCTACCAACGCTGGCACA
GCAGCGTCTGCTGTTACTTCTGCCAATGCAAATGCACTAACTGCACTGCCACCAACAGCAGCAGCTGAGTTCTTTATTTG
GCCTCATCAGTCTTATTGA