Microexon ID Gm_8:1171851-1171865:+
Species Glycine max
Coordinates 8:1171851..1171865
Microexon Cluster ID MEP42
Size 15
Phase 0
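The Coordinates, Size and Phase fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record: it assumes the coordinates are 1-based and inclusive, and that a size divisible by 3 is what keeps the reading frame intact. The helper names `parse_region` and `region_length` are hypothetical.

```python
import re

def parse_region(text):
    """Parse 'chrom:start..end' into (chrom, start, end).

    Assumption: both ends are 1-based and inclusive.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"unrecognised region: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def region_length(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1

chrom, start, end = parse_region("8:1171851..1171865")
size = region_length(start, end)

# A size divisible by 3 inserts whole codons and leaves the frame unchanged.
print(chrom, size, size % 3 == 0)  # 8 15 True
```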
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
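The first Microexon-tag DNA Seq above is written with IUPAC ambiguity codes (W, M, Y, K, and so on), which suggests it is a consensus for cluster MEP42 rather than a single soybean sequence. The sketch below shows one way to compare a concrete tag against such a consensus. The function name `mismatches` is hypothetical, and treating the degenerate string as a cluster consensus is an assumption.

```python
# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def mismatches(consensus, seq):
    """Return 1-based positions where seq is not allowed by the consensus."""
    if len(consensus) != len(seq):
        raise ValueError("consensus and sequence differ in length")
    return [
        i + 1
        for i, (code, base) in enumerate(zip(consensus, seq))
        if base not in IUPAC[code]
    ]

# Degenerate tag from this record (assumed to be the cluster consensus).
CONSENSUS = (
    "GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAG"
    "TGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTG"
    "ARCTWGGY"
)
# Concrete Glycine max tag from this record.
TAG = (
    "GCCCATTCTTCTGATTGTAAAATTTTTAGCCGTTCTCTTCTGGCAAAGAG"
    "TGCATCCATTGAGACGGTTGTGTGCTTTCCGTTTAGGGAAGGGGTTATTG"
    "AGTTAGGT"
)

diff = mismatches(CONSENSUS, TAG)
print(f"{len(TAG) - len(diff)} of {len(TAG)} positions agree with the consensus")
print("mismatching positions:", diff)
```

A non-empty list is expected here: position 3, for example, is W (A or T) in the degenerate string but C in the soybean tag.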
Microexon DNA seq AGTGCATCCATTGAG
Microexon Amino Acid seq SASIE
Microexon-tag DNA Seq GCCCATTCTTCTGATTGTAAAATTTTTAGCCGTTCTCTTCTGGCAAAGAGTGCATCCATTGAGACGGTTGTGTGCTTTCCGTTTAGGGAAGGGGTTATTGAGTTAGGT
Microexon-tag Amino Acid Seq AHSSDCKIFSRSLLAKSASIETVVCFPFREGVIELG
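The Structure field (48,15,45) and the Microexon location field (2) describe how the 108-nt tag splits into flanking exon, microexon, flanking exon. The sketch below reproduces the Microexon DNA seq and both amino acid sequences from the tag alone. It assumes the standard genetic code and that translation starts at the first base of the tag (consistent with Phase 0); `split_tag` and `translate` are hypothetical helper names.

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def split_tag(tag, sizes):
    """Cut the tag into consecutive segments of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("segment sizes do not add up to the tag length")
    parts, pos = [], 0
    for n in sizes:
        parts.append(tag[pos:pos + n])
        pos += n
    return parts

TAG = (
    "GCCCATTCTTCTGATTGTAAAATTTTTAGCCGTTCTCTTCTGGCAAAGAG"
    "TGCATCCATTGAGACGGTTGTGTGCTTTCCGTTTAGGGAAGGGGTTATTG"
    "AGTTAGGT"
)
sizes = [int(x) for x in "48,15,45".split(",")]  # Structure field
location = 2                                      # 1-based segment index

parts = split_tag(TAG, sizes)
microexon = parts[location - 1]

print(microexon, translate(microexon))  # AGTGCATCCATTGAG SASIE
print(translate(TAG))  # AHSSDCKIFSRSLLAKSASIETVVCFPFREGVIELG
```

Because all three segment sizes are multiples of 3, translating each segment separately gives the same residues as translating the whole tag.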
Microexon-tag spanning region 1171627-1172295
Microexon-tag prediction score 0.9106
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH41187x
Reference Transcript ID KRH41187
Gene ID GLYMA_08G014900
Gene Name NA
Transcript ID KRH41187
Protein ID KRH41187
Gene ID GLYMA_08G014900
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 1.6e-51
Motif start 6
Motif end 190
Protein seq >KRH41187
MVPENLKKQLALAVRSIHWSYAIFWTDSTTQPGVLSWGEGYYNGDIKTRKTSQGVELNSDQIGLQRSEQLRELFKSLKTV
EVTPQTKRPSAAALSPEDLTDAEWYYLVCMSFIFNIGQGLPGRTLAKGQPIWLNNAHSSDCKIFSRSLLAKSASIETVVC
FPFREGVIELGTTEQVPEDLSVIELIKTSFLNSLHANVPNKSVATLKSRNQEDLSYAAFDHNDYNVKSIPEVGYEIANTT
SPDGSSNAFQANQPLDETFMIESITNGTSQVQNWQVIDDELSNCVHNSMNSSDCISQTFACPENIASAPKSNNPSDPCAR
NFQKCNNPKMTLVDPRSDDLHYQRVLSTLIKSSDQLLMGMHLQKFPQESSFVSWRKEQPMDCKWPRAGTSQKLLKKVLFE
VPQMHLDGLHESQEENDYKEGMRVEADENGMNHVMSERRRRAKLNERFLTLRSMVPSISKDDKVSILDDAIDYLKKLERR
VKELEAHRVVTDIETGTRRSPQDTVERTSDHYFRKNNNGKKPGMKKRKACGVDETEKEINSDALKGSYANDVTVSTSDNE
IVIELKCPSKAGRLLEIMEAINSFNIDFSSVQSTEADGNLYLTIKSVLTGPSVATTKRIKQALQKLASKC*
CDS seq >KRH41187
ATGGTGCCAGAGAACTTGAAGAAACAACTTGCTTTGGCTGTGAGAAGCATCCATTGGAGCTATGCAATCTTCTGGACTGA
TTCAACTACCCAACCCGGGGTGTTGAGCTGGGGGGAAGGGTATTACAATGGAGACATTAAGACTAGGAAAACAAGTCAAG
GAGTAGAACTCAATTCTGACCAAATAGGGTTGCAGAGGAGTGAACAGCTGAGAGAACTATTCAAATCCCTCAAAACTGTA
GAAGTCACCCCTCAAACCAAAAGGCCTTCAGCAGCAGCACTGTCCCCAGAAGATCTCACAGATGCTGAGTGGTATTACTT
GGTTTGCATGTCCTTCATATTCAACATTGGCCAAGGGTTACCAGGAAGAACTCTAGCAAAGGGTCAACCCATTTGGCTGA
ACAATGCCCATTCTTCTGATTGTAAAATTTTTAGCCGTTCTCTTCTGGCAAAGAGTGCATCCATTGAGACGGTTGTGTGC
TTTCCGTTTAGGGAAGGGGTTATTGAGTTAGGTACTACTGAACAGGTCCCAGAAGATTTGAGTGTCATTGAACTGATCAA
AACTTCTTTCTTGAATAGTCTGCATGCCAATGTTCCCAATAAGTCAGTAGCTACATTGAAATCAAGGAACCAGGAAGATC
TTTCTTATGCAGCATTTGATCATAATGACTATAATGTTAAATCAATTCCAGAAGTTGGGTATGAAATAGCCAACACAACC
TCTCCTGATGGTAGTTCAAATGCATTCCAAGCCAATCAACCGCTAGATGAAACATTTATGATTGAAAGTATAACCAATGG
CACTTCTCAAGTCCAAAACTGGCAAGTTATTGATGATGAATTGAGTAACTGTGTCCATAACTCCATGAATTCCAGCGACT
GTATATCACAAACTTTTGCCTGCCCTGAGAATATTGCTTCTGCCCCCAAGTCTAACAACCCTTCTGATCCTTGTGCCCGA
AATTTTCAAAAGTGCAACAACCCAAAAATGACCTTAGTGGATCCTCGAAGTGATGATTTGCACTATCAGAGAGTTCTTTC
TACCCTTATAAAAAGTTCTGACCAGTTACTTATGGGAATGCATTTGCAAAAATTTCCTCAGGAATCAAGCTTTGTTAGTT
GGAGGAAAGAACAGCCAATGGATTGCAAATGGCCAAGAGCAGGAACGTCACAAAAGTTATTGAAGAAAGTATTGTTTGAA
GTTCCTCAAATGCACTTGGATGGGCTGCATGAGTCTCAAGAAGAGAATGACTATAAAGAGGGAATGAGAGTAGAGGCTGA
TGAAAACGGCATGAACCATGTTATGTCAGAAAGGAGGAGGAGAGCAAAACTAAATGAAAGGTTTTTAACCCTTAGGTCAA
TGGTCCCTTCAATCAGTAAGGATGACAAAGTTTCAATACTAGATGATGCTATTGATTACCTTAAAAAGCTCGAGAGAAGG
GTAAAAGAGTTGGAAGCTCACAGGGTGGTAACAGACATAGAGACTGGGACTAGAAGATCACCACAAGATACGGTGGAGAG
GACTTCTGATCATTATTTTAGGAAAAATAATAATGGCAAGAAACCAGGGATGAAAAAGAGGAAGGCTTGTGGTGTAGATG
AGACAGAAAAAGAGATTAATTCAGATGCTTTAAAAGGAAGTTATGCTAATGATGTTACTGTGAGTACCAGTGACAATGAA
ATTGTGATCGAATTGAAGTGCCCCTCGAAAGCAGGAAGGCTGCTAGAAATTATGGAAGCAATCAACAGTTTCAATATAGA
TTTTAGTTCAGTTCAGTCAACAGAAGCTGATGGAAATCTTTATCTGACCATTAAATCTGTGCTCACAGGACCAAGTGTTG
CAACAACCAAAAGAATCAAACAAGCACTCCAAAAATTGGCTTCCAAGTGCTGA
Transcript ID KRH41187
Gene ID Gm.48315
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 1.6e-51
Motif start 6
Motif end 190