Microexon ID Gm_17:13597688-13597696:-
Species Glycine max
Coordinates 17:13597688..13597696
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
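The identifier, coordinates, and size fields above can be cross-checked against each other. The following is a minimal sketch, assuming the ID follows the layout `<prefix>_<chromosome>:<start>-<end>:<strand>` and that the coordinates are inclusive; the pattern and the `MicroexonId` type are illustrative names, not part of the database.

```python
import re
from typing import NamedTuple


class MicroexonId(NamedTuple):
    """Parsed form of an ID such as 'Gm_17:13597688-13597696:-' (hypothetical type)."""
    prefix: str
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def size(self) -> int:
        # Assumption: both coordinates are inclusive.
        return self.end - self.start + 1


# Assumed layout: <prefix>_<chromosome>:<start>-<end>:<strand>
ID_PATTERN = re.compile(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])")


def parse_microexon_id(text: str) -> MicroexonId:
    match = ID_PATTERN.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {text!r}")
    prefix, chrom, start, end, strand = match.groups()
    return MicroexonId(prefix, chrom, int(start), int(end), strand)


parsed = parse_microexon_id("Gm_17:13597688-13597696:-")
print(parsed.chrom, parsed.start, parsed.end, parsed.strand, parsed.size)
```

Under the inclusive-coordinate assumption the computed size agrees with the Size field (9), and the trailing `-` indicates the reverse strand.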
Microexon-tag DNA Seq (degenerate, IUPAC ambiguity codes) TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Microexon DNA seq TTTATTTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAACAGCTGCAAGAAAGAGGGGCAAAGCAGGAAAGGAAGACAAGTTTATTTAGGGGGTTATGATATGGAAGAAAAAGCTGCAAGAGCTTATGATCTAGCTGCA
Microexon-tag Amino Acid Seq WDNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAA
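The first Microexon-tag DNA Seq entry uses IUPAC ambiguity codes (Y, R, W, S, K, M, D, V), while the second is a concrete sequence. The following sketch shows one way to compare a concrete sequence with a degenerate one, position by position. The helper name `mismatches` is hypothetical, and the slice assumes the microexon occupies positions 50 to 58 of the tag, as implied by the 49,9,50 structure.

```python
# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def mismatches(pattern: str, seq: str) -> list:
    """Return (1-based position, pattern code, observed base) for each disagreement."""
    if len(pattern) != len(seq):
        raise ValueError("pattern and sequence must have the same length")
    return [
        (i + 1, code, base)
        for i, (code, base) in enumerate(zip(pattern, seq))
        if base not in IUPAC[code]
    ]


DEGENERATE_TAG = (
    "TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAG"   # upstream flank, 49 nt
    "TTTAYYTRG"                                           # microexon, 9 nt
    "GKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW"  # downstream flank, 50 nt
)
MICROEXON = "TTTATTTAG"

# Assumption: the microexon sits at 0-based slice [49:58] of the tag.
degenerate_microexon = DEGENERATE_TAG[49:58]
print(degenerate_microexon, mismatches(degenerate_microexon, MICROEXON))
```

Passing the full 108-nt tag instead of the 9-nt slice lists every position where this particular sequence departs from the degenerate string.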
Microexon-tag spanning region 13597463-13597953
Microexon-tag prediction score 0.9839
Overlapped with the annotated transcript (%) 100
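The structure (49,9,50), phase, and amino acid fields above are mutually dependent. The sketch below shows how they relate, assuming the standard genetic code and assuming that phase means the number of nucleotides of a split codon contributed by the upstream exon. The function and constant names are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring any trailing incomplete codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


TAG = (
    "TGGGACAACAGCTGCAAGAAAGAGGGGCAAAGCAGGAAAGGAAGACAAG"   # upstream flank, 49 nt
    "TTTATTTAG"                                           # microexon, 9 nt
    "GGGGTTATGATATGGAAGAAAAAGCTGCAAGAGCTTATGATCTAGCTGCA"  # downstream flank, 50 nt
)
UPSTREAM, MICRO, DOWNSTREAM = 49, 9, 50

microexon = TAG[UPSTREAM:UPSTREAM + MICRO]
phase = UPSTREAM % 3  # assumed convention, see lead-in

# Every codon that overlaps the microexon contributes one residue.
first_codon = UPSTREAM // 3
last_codon = (UPSTREAM + MICRO - 1) // 3
tag_protein = translate(TAG)
microexon_residues = tag_protein[first_codon:last_codon + 1]

print(microexon, phase, microexon_residues)
print(tag_protein)
```

Because the microexon starts one nucleotide into a codon, its 9 nt overlap four codons rather than three, which is why the Microexon Amino Acid seq field lists four residues.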
New Transcript ID KRH04378x
Reference Transcript ID KRH04378
Gene ID GLYMA_17G158300
Gene Name NA
Transcript ID KRH04378
Protein ID KRH04378
Pfam domain motif AP2
Motif E-value 1.3e-13
Motif start 327
Motif end 385
Protein seq >KRH04378
MKSMENDDTVDLNNQNNNWLGFSLSPQMNNNNIGVSTHTQPSSAAAEVVPTSFYHTTPLSSYGFYYGLEAENVGLYSALP
IMPLKSDGSLYGMEAVSRSQAQAMATTSTPKLENFLGGEAMGTPHHHYECSATETMPLSLDSVFYNQPSRRDQNNNQTYQ
NHVQHISTQQQQQQELQAYYSTLRNHDMMLEGSKQSQTSENNNNNLQVQNMGDDAVSVPVAGLKSWGVRNFQASHAHESK
MIVPHHVEENGGESGSIGSMAYGDLQSLSLSMSPSSQSSCVTSSHRASSAVIDSAAMDTKKRGSEKVDQKQIVHRKSIDT
FGQRTSQYRGVTRHRWTGRYEAHLWDNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAALKYWGPSTHINFPLENYQNE
LEEMKNMTRQEYVAHLRRKSSGFSRGASMYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGA
NAVTNFDITRYDVEKIMASSNLLSSELARRNQETTDNGTQYIDQNHNKPSAYEDTQEAAILMHQKSCETQNDQWKMVLYQ
QSSQQLEQNIPPRIESDRTNQSFSVALENMFHQEVEESSKVRTHVSNPSSLATSLSSSRECSPDRTSLPMLSGVPSTSSK
LLATNPNNVNSWDPSPHLRPALTLPQMPVFAAWTDA*
CDS seq >KRH04378
ATGAAGAGTATGGAAAATGATGACACTGTTGACCTTAATAATCAAAACAACAACTGGTTGGGTTTCTCACTCTCTCCTCA
AATGAATAATAATAATATAGGAGTTTCTACACACACACAACCTTCCTCTGCTGCCGCTGAAGTGGTTCCTACAAGCTTCT
ACCACACTACTCCACTTAGTAGCTATGGTTTCTACTATGGACTTGAAGCGGAAAATGTTGGATTGTATTCAGCTTTGCCT
ATCATGCCCCTCAAATCTGATGGCTCTCTCTATGGAATGGAAGCTGTGAGCAGGTCACAAGCACAAGCAATGGCTACTAC
TTCAACACCAAAACTGGAGAACTTCTTAGGTGGGGAAGCAATGGGGACCCCTCATCATCACTATGAATGTAGTGCCACAG
AAACAATGCCTCTGAGCTTAGACAGTGTTTTTTACAACCAACCCTCACGCCGTGACCAAAATAATAACCAAACTTACCAA
AACCATGTCCAACACATTAGCACTCAACAACAACAGCAACAAGAACTTCAAGCATATTACTCTACCTTGAGAAACCATGA
TATGATGTTAGAAGGGTCCAAGCAAAGCCAAACTTCAGAAAACAACAACAACAATCTTCAGGTTCAAAACATGGGTGATG
ATGCTGTTTCTGTTCCTGTTGCTGGCCTTAAGAGTTGGGGAGTGAGGAACTTCCAAGCTAGCCATGCACATGAGTCAAAG
ATGATTGTTCCTCATCATGTGGAGGAAAATGGTGGTGAATCAGGGTCCATTGGATCAATGGCTTATGGTGACTTGCAATC
TTTGAGCTTGTCCATGAGTCCTAGCTCTCAGTCTAGCTGCGTCACAAGTTCACACCGTGCTTCATCTGCTGTCATTGATT
CTGCTGCCATGGATACAAAGAAGAGGGGATCTGAAAAAGTTGACCAGAAGCAAATTGTTCATAGGAAGTCCATTGACACC
TTTGGACAAAGAACCTCTCAATATAGAGGTGTAACACGGCATAGGTGGACTGGGAGATATGAAGCTCATCTTTGGGACAA
CAGCTGCAAGAAAGAGGGGCAAAGCAGGAAAGGAAGACAAGTTTATTTAGGGGGTTATGATATGGAAGAAAAAGCTGCAA
GAGCTTATGATCTAGCTGCACTCAAGTATTGGGGACCCTCCACTCACATAAACTTTCCTTTGGAAAATTATCAAAATGAA
CTTGAGGAAATGAAGAACATGACTAGACAAGAGTATGTTGCTCATTTGAGAAGAAAAAGCAGTGGATTCTCAAGAGGGGC
TTCCATGTACAGAGGAGTAACAAGACACCACCAACATGGAAGGTGGCAAGCTCGAATTGGTAGAGTGGCTGGAAACAAAG
ATCTATATCTTGGAACCTTCAGTACACAAGAGGAAGCAGCTGAAGCCTATGACATTGCTGCTATAAAATTCCGAGGAGCG
AATGCTGTAACCAACTTTGACATAACAAGATATGATGTTGAGAAAATCATGGCAAGCAGCAACCTTCTTAGCAGCGAGCT
CGCTAGGCGCAACCAAGAGACGACGGACAATGGAACTCAGTACATTGATCAAAATCACAATAAGCCTTCTGCATATGAGG
ACACTCAAGAAGCTGCAATTCTGATGCATCAGAAGAGCTGTGAGACCCAAAATGATCAGTGGAAGATGGTTCTCTACCAA
CAATCCTCTCAGCAACTTGAGCAGAATATTCCACCGAGAATTGAGAGTGACAGAACTAACCAGTCCTTCTCAGTGGCTTT
GGAAAACATGTTTCATCAAGAAGTAGAGGAATCAAGTAAGGTGAGAACACATGTGTCAAATCCTTCTTCATTGGCCACAA
GTTTGAGCAGCTCAAGAGAATGTAGCCCTGATAGGACAAGCTTGCCAATGCTCTCTGGAGTGCCTTCAACTTCATCAAAA
CTATTGGCTACTAATCCAAATAACGTGAATTCTTGGGACCCTTCACCCCATTTGAGGCCAGCACTTACTTTGCCTCAAAT
GCCAGTTTTTGCAGCTTGGACAGATGCATAG
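The motif coordinates (327 to 385) and the microexon-tag residues can be related by locating the tag within the protein. The sketch below works on an excerpt only: it assumes the protein record is wrapped at 80 residues per line, so that the fifth line begins at residue 321. That offset, and the helper name `locate`, are assumptions made for illustration.

```python
# Fifth line of the protein record; assumed to cover residues 321-400.
PROTEIN_EXCERPT = (
    "FGQRTSQYRGVTRHRWTGRYEAHLWDNSCKKEGQSRKGRQ"
    "VYLGGYDMEEKAARAYDLAALKYWGPSTHINFPLENYQNE"
)
EXCERPT_OFFSET = 320  # assumption: four preceding lines of 80 residues each

TAG_AA = "WDNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAA"
MICRO_AA = "VYLG"
MOTIF_START, MOTIF_END = 327, 385  # Pfam AP2 motif bounds from the record


def locate(sub: str, seq: str, offset: int = 0) -> tuple:
    """Return the 1-based inclusive (start, end) of the first match of sub in seq."""
    index = seq.find(sub)
    if index < 0:
        raise ValueError(f"{sub!r} not found")
    start = offset + index + 1
    return start, start + len(sub) - 1


tag_start, tag_end = locate(TAG_AA, PROTEIN_EXCERPT, EXCERPT_OFFSET)
micro_start = tag_start + TAG_AA.index(MICRO_AA)
micro_end = micro_start + len(MICRO_AA) - 1
inside_motif = MOTIF_START <= micro_start and micro_end <= MOTIF_END

print((tag_start, tag_end), (micro_start, micro_end), inside_motif)
```

With the assumed offset, the microexon-encoded residues fall inside the listed AP2 motif range, consistent with the Pfam Domain Motif field.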
Transcript ID KRH04378
Gene ID Gm.22977
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.3e-13
Motif start 327
Motif end 385