Microexon ID Gm_5:28775465-28775473:+
Species Glycine max
Coordinates 5:28775465..28775473
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
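The tag sequence above contains IUPAC ambiguity codes (Y, R, W, S, K, M, V, D), so it reads as a degenerate, cluster-level sequence rather than a single genomic sequence. The record does not state how it was derived. The sketch below compares it position by position with the Glycine max tag sequence listed in this record and reports 1-based positions where the concrete base falls outside the ambiguity code. The function name `iupac_mismatches` is illustrative and not part of any database tooling.

```python
# Standard IUPAC nucleotide ambiguity codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def iupac_mismatches(degenerate: str, concrete: str) -> list[int]:
    """Return 1-based positions where `concrete` is not allowed by `degenerate`."""
    if len(degenerate) != len(concrete):
        raise ValueError("sequences must have the same length")
    return [
        i + 1
        for i, (code, base) in enumerate(zip(degenerate, concrete))
        if base not in IUPAC[code]
    ]

# Degenerate tag and species-specific tag, both copied from this record.
CLUSTER_TAG = (
    "TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAY"
    "RAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW"
)
GM_TAG = (
    "TGGGACAACAGCTGCAAGAAAGAGGGGCAAAGCAGGAAAGGAAGACAAGTTTATCTAGGGGGTTATGAT"
    "ATGGAAGAAAAAGCTGCGAGAGCTTATGATCTAGCGGCA"
)

diffs = iupac_mismatches(CLUSTER_TAG, GM_TAG)
# With exon sizes 49,9,50 the microexon occupies positions 50..58 of the tag.
microexon_conserved = not any(50 <= p <= 58 for p in diffs)
print(diffs, microexon_conserved)
```

A few off-code positions in the flanking exons would be expected if the degenerate sequence summarizes the majority of cluster members rather than all of them; that interpretation is an assumption.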
Microexon DNA seq TTTATCTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAACAGCTGCAAGAAAGAGGGGCAAAGCAGGAAAGGAAGACAAGTTTATCTAGGGGGTTATGATATGGAAGAAAAAGCTGCGAGAGCTTATGATCTAGCGGCA
Microexon-tag Amino Acid Seq WDNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAA
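The fields above can be cross-checked against each other. The following sketch splits the 108-nt tag by the listed exon sizes (49, 9, 50), translates it with the standard genetic code, and extracts the residues whose codons overlap the microexon. It assumes the tag starts in reading frame 0, which the listed amino acid sequence suggests; the helper names are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate complete codons of `dna`, starting at the first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def split_tag(tag: str, sizes: tuple[int, ...]) -> list[str]:
    """Cut `tag` into consecutive pieces of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("exon sizes do not add up to the tag length")
    pieces, pos = [], 0
    for size in sizes:
        pieces.append(tag[pos:pos + size])
        pos += size
    return pieces

# Values copied from this record.
TAG_DNA = (
    "TGGGACAACAGCTGCAAGAAAGAGGGGCAAAGCAGGAAAGGAAGACAAGTTTATCTAGGGGGTTATGAT"
    "ATGGAAGAAAAAGCTGCGAGAGCTTATGATCTAGCGGCA"
)
EXON_SIZES = (49, 9, 50)  # flanking exon, microexon, flanking exon

upstream, microexon, downstream = split_tag(TAG_DNA, EXON_SIZES)
tag_protein = translate(TAG_DNA)

# Codons that overlap the microexon, including those split across exon borders.
start = len(upstream)            # 0-based offset of the microexon in the tag
end = start + len(microexon)     # exclusive end
microexon_aa = tag_protein[start // 3:(end - 1) // 3 + 1]
phase = start % 3                # bases of the first codon left in the upstream exon

print(microexon, tag_protein, microexon_aa, phase)
```

Because the microexon begins one base into a codon, its 9 nt touch four codons, which is why the listed microexon amino acid sequence has four residues rather than three.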
Microexon-tag spanning region 28775217-28775699
Microexon-tag prediction score 0.9818
Overlapped with the annotated transcript (%) 100
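The coordinate fields are consistent with 1-based, inclusive positions: 28775473 - 28775465 + 1 = 9, the listed size. The sketch below parses the microexon ID into its parts and checks that the microexon lies inside the tag spanning region. The ID layout (`prefix_chrom:start-end:strand`) is inferred from this single record, and the `Locus` type and function name are hypothetical.

```python
import re
from typing import NamedTuple

class Locus(NamedTuple):
    prefix: str   # species prefix, e.g. "Gm"
    chrom: str
    start: int    # 1-based, inclusive (assumed)
    end: int      # 1-based, inclusive (assumed)
    strand: str

    @property
    def size(self) -> int:
        return self.end - self.start + 1

def parse_microexon_id(microexon_id: str) -> Locus:
    """Parse an ID shaped like 'Gm_5:28775465-28775473:+'."""
    match = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if match is None:
        raise ValueError(f"unrecognized microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = match.groups()
    return Locus(prefix, chrom, int(start), int(end), strand)

locus = parse_microexon_id("Gm_5:28775465-28775473:+")

# Tag spanning region from this record; it is longer than the 108-nt tag,
# presumably because it includes the introns between the three exons.
SPAN_START, SPAN_END = 28775217, 28775699
inside_span = SPAN_START <= locus.start and locus.end <= SPAN_END

print(locus, locus.size, inside_span)
```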
New Transcript ID KRH58159x
Reference Transcript ID KRH58159
Gene ID GLYMA_05G108600
Gene Name NA
Transcript ID KRH58159
Protein ID KRH58159
Gene ID GLYMA_05G108600
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.2e-13
Motif start 320
Motif end 378
Protein seq >KRH58159
MKSMENDDNADLNNQNNWLGFSLSPQMHNIGVSSHSQPSSAAEVVPTSFYHHTAPLSSYGFYYGLEAENVGLYSALPIMP
LKSDGSLYGLETLSRSQAQAMATTSTPKLENFLGGEAMGTPHHYECSATETMPLSLDSVFYIQPSRRDPNNNQTYQNHVQ
HISTNQQQQQQELQAYYSTLRNHDMILEGSKQSQTSDNNNLHVQNMGGDDAVPVPGLKSWEVRNFQASHAHESKMIVPHV
EENAGESGSIGSMAYGDLQSLSLSMSPSSQSSSVTSSHRASPAVVDSVAMDTKKRGPEKVDQKQIVHRKSIDTFGQRTSQ
YRGVTRHRWTGRYEAHLWDNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAALKYWGPSTHINFPLENYQNELEEMKNM
TRQEYVAHLRRKSSGFSRGASMYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGANAVTNFD
ITRYDVEKIMASSNLLSSELARRNRETDNETQCIDQNHNKPSAYEDTQEAILMHQKSCESENDQWKMVLYQSSQQLEQNP
PTIESDRTNQSFAVALDNMFHQEVEESSKARTHVSNPSSLATSLSSSREGSPDRTSLPMLSGMPSTASKLLATNPNNVNS
WDPSPHLRPALTLPQMPVFAAWTDA*
CDS seq >KRH58159
ATGAAGAGTATGGAAAATGATGACAATGCTGACCTTAATAATCAAAACAATTGGTTGGGTTTCTCACTCTCTCCTCAAAT
GCATAATATAGGAGTTTCTTCACACTCACAACCTTCCTCTGCTGCTGAAGTGGTTCCTACAAGCTTTTACCACCACACTG
CTCCACTTAGTAGCTATGGTTTCTACTATGGACTTGAAGCTGAAAATGTTGGATTGTATTCAGCTTTGCCAATCATGCCC
CTCAAATCTGATGGCTCTCTCTATGGATTGGAAACTTTAAGCAGGTCACAAGCACAAGCAATGGCTACTACTTCAACACC
AAAACTGGAGAACTTCTTAGGTGGGGAAGCCATGGGGACCCCTCATCACTACGAATGTAGTGCCACAGAAACAATGCCTC
TGAGCTTAGACAGTGTTTTTTACATCCAACCCTCACGCCGTGACCCAAATAATAACCAAACCTACCAAAACCATGTTCAA
CACATTAGCACCAACCAACAACAACAACAGCAAGAGCTTCAAGCATATTACTCTACCTTGAGAAACCATGATATGATATT
AGAAGGGTCAAAGCAAAGCCAAACTTCTGACAACAACAATCTTCATGTTCAAAACATGGGTGGTGATGATGCCGTTCCTG
TTCCTGGCCTCAAGAGTTGGGAAGTGAGGAACTTCCAAGCTAGCCATGCACATGAGTCAAAGATGATTGTTCCTCATGTG
GAGGAAAATGCTGGTGAATCAGGGTCCATTGGATCAATGGCTTATGGTGACTTGCAATCGTTGAGCTTGTCCATGAGTCC
TAGCTCTCAGTCTAGCAGTGTCACAAGTTCTCACCGTGCTTCACCTGCTGTCGTTGATTCTGTTGCCATGGATACTAAGA
AAAGGGGGCCTGAAAAGGTTGACCAGAAGCAAATTGTTCATAGGAAGTCCATTGACACCTTTGGACAAAGAACCTCCCAG
TATAGAGGAGTAACAAGGCATAGGTGGACTGGGAGATATGAAGCTCATCTTTGGGACAACAGCTGCAAGAAAGAGGGGCA
AAGCAGGAAAGGAAGACAAGTTTATCTAGGGGGTTATGATATGGAAGAAAAAGCTGCGAGAGCTTATGATCTAGCGGCAC
TCAAGTATTGGGGACCCTCCACTCACATAAACTTTCCTTTGGAAAATTATCAAAATGAACTTGAGGAAATGAAGAACATG
ACTAGACAAGAGTATGTTGCTCATTTGAGAAGAAAAAGCAGCGGATTCTCAAGAGGGGCTTCCATGTACAGAGGAGTAAC
AAGACACCACCAACATGGAAGGTGGCAAGCTCGAATTGGTAGAGTGGCTGGAAACAAAGATCTATATCTTGGAACCTTTA
GTACACAAGAGGAAGCAGCTGAAGCCTATGATATTGCTGCTATAAAATTCCGAGGAGCGAATGCTGTAACCAACTTTGAC
ATCACAAGATATGATGTGGAGAAAATCATGGCAAGCAGCAACCTCCTTAGCAGTGAGCTAGCTAGGCGCAACCGAGAGAC
GGACAATGAAACTCAGTGCATTGATCAAAATCACAATAAGCCTTCTGCATATGAGGACACTCAAGAAGCTATTCTAATGC
ACCAGAAGAGCTGTGAGAGCGAAAATGATCAGTGGAAGATGGTTCTCTACCAATCCTCTCAGCAACTTGAGCAGAATCCA
CCAACAATTGAGAGTGACAGAACTAACCAGTCCTTCGCAGTGGCTTTGGACAACATGTTTCATCAGGAAGTAGAGGAATC
AAGTAAGGCGAGGACGCATGTGTCAAATCCTTCTTCATTGGCCACAAGTTTGAGCAGCTCAAGAGAAGGTAGCCCTGATA
GGACAAGCTTGCCAATGCTCTCTGGAATGCCTTCAACTGCATCAAAACTATTGGCTACTAATCCAAATAACGTGAATTCT
TGGGACCCTTCACCCCATTTGAGGCCAGCACTTACTTTGCCTCAAATGCCAGTTTTTGCAGCTTGGACAGATGCATAG
Microexon DNA seq TTTATCTAG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAACAGCTGCAAGAAAGAGGGGCAAAGCAGGAAAGGAAGACAAGTTTATCTAGGGGGTTATGATATGGAAGAAAAAGCTGCGAGAGCTTATGATCTAGCGGCA
Microexon-tag Amino Acid seq WDNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAA
Transcript ID KRH58159
Gene ID Gm.40892
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.3e-13
Motif start 320
Motif end 378
Protein seq >KRH58159
MKSMENDDNADLNNQNNWLGFSLSPQMHNIGVSSHSQPSSAAEVVPTSFYHHTAPLSSYGFYYGLEAENVGLYSALPIMP
LKSDGSLYGLETLSRSQAQAMATTSTPKLENFLGGEAMGTPHHYECSATETMPLSLDSVFYIQPSRRDPNNNQTYQNHVQ
HISTNQQQQQQELQAYYSTLRNHDMILEGSKQSQTSDNNNLHVQNMGGDDAVPVPGLKSWEVRNFQASHAHESKMIVPHV
EENAGESGSIGSMAYGDLQSLSLSMSPSSQSSSVTSSHRASPAVVDSVAMDTKKRGPEKVDQKQIVHRKSIDTFGQRTSQ
YRGVTRHRWTGRYEAHLWDNSCKKEGQSRKGRQVYLGGYDMEEKAARAYDLAALKYWGPSTHINFPLENYQNELEEMKNM
TRQEYVAHLRRKSSGFSRGASMYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGANAVTNFD
ITRYDVEKIMASSNLLSSELARRNRETDNETQCIDQNHNKPSAYEDTQEAILMHQKSCESENDQWKMVLYQSSQQLEQNP
PTIESDRTNQSFAVALDNMFHQEVEESSKARTHVSNPSSLATSLSSSREGSPDRTSLPMLSGMPSTASKLLATNPNNVNS
WDPSPHLRPALTLPQMPVFAAWTDA*
CDS seq >KRH58159
ATGAAGAGTATGGAAAATGATGACAATGCTGACCTTAATAATCAAAACAATTGGTTGGGTTTCTCACTCTCTCCTCAAAT
GCATAATATAGGAGTTTCTTCACACTCACAACCTTCCTCTGCTGCTGAAGTGGTTCCTACAAGCTTTTACCACCACACTG
CTCCACTTAGTAGCTATGGTTTCTACTATGGACTTGAAGCTGAAAATGTTGGATTGTATTCAGCTTTGCCAATCATGCCC
CTCAAATCTGATGGCTCTCTCTATGGATTGGAAACTTTAAGCAGGTCACAAGCACAAGCAATGGCTACTACTTCAACACC
AAAACTGGAGAACTTCTTAGGTGGGGAAGCCATGGGGACCCCTCATCACTACGAATGTAGTGCCACAGAAACAATGCCTC
TGAGCTTAGACAGTGTTTTTTACATCCAACCCTCACGCCGTGACCCAAATAATAACCAAACCTACCAAAACCATGTTCAA
CACATTAGCACCAACCAACAACAACAACAGCAAGAGCTTCAAGCATATTACTCTACCTTGAGAAACCATGATATGATATT
AGAAGGGTCAAAGCAAAGCCAAACTTCTGACAACAACAATCTTCATGTTCAAAACATGGGTGGTGATGATGCCGTTCCTG
TTCCTGGCCTCAAGAGTTGGGAAGTGAGGAACTTCCAAGCTAGCCATGCACATGAGTCAAAGATGATTGTTCCTCATGTG
GAGGAAAATGCTGGTGAATCAGGGTCCATTGGATCAATGGCTTATGGTGACTTGCAATCGTTGAGCTTGTCCATGAGTCC
TAGCTCTCAGTCTAGCAGTGTCACAAGTTCTCACCGTGCTTCACCTGCTGTCGTTGATTCTGTTGCCATGGATACTAAGA
AAAGGGGGCCTGAAAAGGTTGACCAGAAGCAAATTGTTCATAGGAAGTCCATTGACACCTTTGGACAAAGAACCTCCCAG
TATAGAGGAGTAACAAGGCATAGGTGGACTGGGAGATATGAAGCTCATCTTTGGGACAACAGCTGCAAGAAAGAGGGGCA
AAGCAGGAAAGGAAGACAAGTTTATCTAGGGGGTTATGATATGGAAGAAAAAGCTGCGAGAGCTTATGATCTAGCGGCAC
TCAAGTATTGGGGACCCTCCACTCACATAAACTTTCCTTTGGAAAATTATCAAAATGAACTTGAGGAAATGAAGAACATG
ACTAGACAAGAGTATGTTGCTCATTTGAGAAGAAAAAGCAGCGGATTCTCAAGAGGGGCTTCCATGTACAGAGGAGTAAC
AAGACACCACCAACATGGAAGGTGGCAAGCTCGAATTGGTAGAGTGGCTGGAAACAAAGATCTATATCTTGGAACCTTTA
GTACACAAGAGGAAGCAGCTGAAGCCTATGATATTGCTGCTATAAAATTCCGAGGAGCGAATGCTGTAACCAACTTTGAC
ATCACAAGATATGATGTGGAGAAAATCATGGCAAGCAGCAACCTCCTTAGCAGTGAGCTAGCTAGGCGCAACCGAGAGAC
GGACAATGAAACTCAGTGCATTGATCAAAATCACAATAAGCCTTCTGCATATGAGGACACTCAAGAAGCTATTCTAATGC
ACCAGAAGAGCTGTGAGAGCGAAAATGATCAGTGGAAGATGGTTCTCTACCAATCCTCTCAGCAACTTGAGCAGAATCCA
CCAACAATTGAGAGTGACAGAACTAACCAGTCCTTCGCAGTGGCTTTGGACAACATGTTTCATCAGGAAGTAGAGGAATC
AAGTAAGGCGAGGACGCATGTGTCAAATCCTTCTTCATTGGCCACAAGTTTGAGCAGCTCAAGAGAAGGTAGCCCTGATA
GGACAAGCTTGCCAATGCTCTCTGGAATGCCTTCAACTGCATCAAAACTATTGGCTACTAATCCAAATAACGTGAATTCT
TGGGACCCTTCACCCCATTTGAGGCCAGCACTTACTTTGCCTCAAATGCCAGTTTTTGCAGCTTGGACAGATGCATAG