Microexon ID Gm_5:39034413-39034427:+
Species Glycine max
Coordinates 5:39034413..39034427
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AGTGCGTCCATTGAG
Microexon Amino Acid seq SASIE
Microexon-tag DNA Seq GCCCATTCTGCTGATTGTAAAATTTTTAGCCGTTCTCTTCTGGCAAAGAGTGCGTCCATTGAGACGGTTGTGTGCTTTCCGTTTAGGGAAGGGGTTATTGAGCTAGGT
Microexon-tag Amino Acid Seq AHSADCKIFSRSLLAKSASIETVVCFPFREGVIELG
Microexon-tag spanning region39034193-39034905
Microexon-tag prediction score0.9232
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH59908x
Reference Transcript ID KRH59908
Gene ID GLYMA_05G208300
Gene Name NA
Transcript ID KRH59908
Protein ID KRH59908
Gene ID GLYMA_05G208300
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 6.4e-52
Motif start 6
Motif end 189
Protein seq >KRH59908
MVPENLKKQLALAVRSIHWSYAIFWTDSTTQPGVLSWGEGYYNGDIKTRKTSQGVELNSDQIGLQRSEQLRELFKSLKTV
EVSPQTKRPSAALSPEDLTDAEWYYLVCMSFIFNIGQGLPGRTLAKGQSIWLNNAHSADCKIFSRSLLAKSASIETVVCF
PFREGVIELGTTEQVSEDLSVIERIKTSFLNSLHVDVPNKSVATLKSRKQEDLSYVAFDHNDYNVESIPEVGYEIANTTS
PNGSSNAIQANQPLDDTLMVESITNGTSQVQNWQVIDDELSNCVHNSMNSSDCISPTFASLENIASAPKCNNPSDPCARD
FQKCNNPKMTLVDPRSDEWHYQRVISTLIKNTDQLLMGMHLQKFPQASSFVSWRKGEPMDSQWPRAGTSQKLLKKVLFEV
PQMHLDGLHESQEENDYKEGMRVEADENGMNHVMSERRRRAKLNQRFLTLRSMVPSISKDDKVSILDDAIEYLKKLERRI
NELEAHRGVTDIETGTRRSPQDTVERTPDHYFSKNNNNNGKKPGMKKRKACGVDEKGREINLDALKGSYANDVIVSTSDN
GIVIEMKCPSRAGRMLEIMEAINSFNIDFSSVQSTEADGNLYLTIKSVLTGPRVATAKRIKLALQKVASKC*
CDS seq >KRH59908
ATGGTGCCAGAGAACTTGAAGAAACAACTTGCTTTGGCTGTGAGAAGCATCCATTGGAGCTATGCAATCTTCTGGACTGA
TTCAACTACCCAACCCGGGGTGTTGAGCTGGGGGGAAGGGTATTACAATGGAGACATTAAGACTAGGAAAACAAGTCAAG
GAGTAGAGCTCAATTCTGACCAAATAGGGTTGCAGAGGAGTGAACAGCTGCGAGAACTATTCAAATCCCTCAAAACTGTA
GAAGTCAGCCCTCAAACTAAAAGGCCTTCAGCAGCACTGTCCCCAGAAGATCTCACAGATGCTGAGTGGTATTACTTGGT
TTGCATGTCCTTCATATTCAACATTGGCCAAGGGTTACCAGGAAGAACTCTAGCGAAGGGTCAATCCATTTGGCTGAACA
ATGCCCATTCTGCTGATTGTAAAATTTTTAGCCGTTCTCTTCTGGCAAAGAGTGCGTCCATTGAGACGGTTGTGTGCTTT
CCGTTTAGGGAAGGGGTTATTGAGCTAGGTACTACTGAACAGGTCTCAGAAGATTTGAGTGTCATTGAACGGATCAAAAC
TTCTTTCTTGAACAGTCTGCATGTCGATGTTCCCAATAAGTCAGTAGCTACATTGAAATCAAGGAAACAGGAAGATCTTT
CTTATGTAGCATTTGATCATAATGACTATAATGTTGAATCGATTCCAGAAGTTGGGTATGAAATAGCCAACACAACCTCT
CCTAATGGTAGTTCAAATGCAATCCAAGCCAATCAACCACTAGATGACACACTTATGGTTGAAAGTATAACCAATGGCAC
TTCTCAAGTCCAAAACTGGCAAGTTATTGATGATGAATTGAGTAACTGTGTCCATAACTCCATGAATTCCAGCGACTGTA
TATCACCAACTTTTGCCAGCCTTGAGAATATTGCTTCTGCCCCCAAGTGTAACAACCCTTCTGATCCTTGTGCCCGAGAT
TTTCAAAAGTGCAACAACCCAAAAATGACCTTAGTGGATCCTCGAAGTGATGAGTGGCATTATCAGAGAGTTATTTCCAC
CCTTATAAAAAACACTGACCAGTTACTTATGGGAATGCATTTGCAAAAATTTCCTCAGGCATCAAGCTTTGTTAGTTGGA
GGAAAGGAGAGCCAATGGATTCCCAATGGCCAAGAGCAGGAACCTCACAAAAGTTATTGAAGAAAGTATTGTTTGAAGTT
CCTCAAATGCACTTGGATGGGCTTCATGAGTCTCAAGAAGAGAATGACTATAAAGAGGGAATGAGAGTAGAAGCTGATGA
AAACGGCATGAACCATGTTATGTCAGAAAGGAGGAGAAGAGCAAAACTAAATCAAAGGTTTTTAACCCTTAGGTCAATGG
TCCCTTCAATCAGTAAGGATGACAAAGTTTCGATACTAGATGATGCAATTGAATACCTTAAAAAGCTTGAGAGAAGGATA
AACGAGTTGGAAGCTCACAGGGGTGTAACAGATATAGAGACTGGGACTAGAAGATCACCACAAGATACGGTGGAGAGGAC
TCCTGATCATTATTTTAGCAAAAATAATAATAATAATGGTAAAAAACCAGGGATGAAAAAGAGGAAGGCTTGTGGTGTAG
ATGAGAAAGGAAGAGAGATTAATTTAGATGCTTTAAAAGGCAGTTATGCTAATGATGTTATTGTGAGTACCAGTGACAAT
GGAATTGTGATCGAAATGAAGTGCCCCTCGAGAGCAGGAAGGATGCTAGAAATTATGGAAGCAATTAACAGTTTCAATAT
AGATTTTAGTTCAGTTCAGTCAACAGAAGCTGATGGAAATCTTTATCTGACCATTAAATCTGTGCTCACAGGACCAAGGG
TTGCAACAGCCAAAAGAATCAAACTAGCACTACAAAAAGTGGCTTCCAAGTGCTGA
Microexon DNA seq AGTGCGTCCATTGAG
Microexon Amino Acid seq SASIE
Microexon-tag DNA Seq GCCCATTCTGCTGATTGTAAAATTTTTAGCCGTTCTCTTCTGGCAAAGAGTGCGTCCATTGAGACGGTTGTGTGCTTTCCGTTTAGGGAAGGGGTTATTGAGCTAGGT
Microexon-tag Amino Acid seq AHSADCKIFSRSLLAKSASIETVVCFPFREGVIELG
Transcript ID KRH59908
Gene ID Gm.41898
Gene Name NA
Pfam domain motif bHLH-MYC_N
Motif E-value 6.4e-52
Motif start 6
Motif end 189
Protein seq >KRH59908
MVPENLKKQLALAVRSIHWSYAIFWTDSTTQPGVLSWGEGYYNGDIKTRKTSQGVELNSDQIGLQRSEQLRELFKSLKTV
EVSPQTKRPSAALSPEDLTDAEWYYLVCMSFIFNIGQGLPGRTLAKGQSIWLNNAHSADCKIFSRSLLAKSASIETVVCF
PFREGVIELGTTEQVSEDLSVIERIKTSFLNSLHVDVPNKSVATLKSRKQEDLSYVAFDHNDYNVESIPEVGYEIANTTS
PNGSSNAIQANQPLDDTLMVESITNGTSQVQNWQVIDDELSNCVHNSMNSSDCISPTFASLENIASAPKCNNPSDPCARD
FQKCNNPKMTLVDPRSDEWHYQRVISTLIKNTDQLLMGMHLQKFPQASSFVSWRKGEPMDSQWPRAGTSQKLLKKVLFEV
PQMHLDGLHESQEENDYKEGMRVEADENGMNHVMSERRRRAKLNQRFLTLRSMVPSISKDDKVSILDDAIEYLKKLERRI
NELEAHRGVTDIETGTRRSPQDTVERTPDHYFSKNNNNNGKKPGMKKRKACGVDEKGREINLDALKGSYANDVIVSTSDN
GIVIEMKCPSRAGRMLEIMEAINSFNIDFSSVQSTEADGNLYLTIKSVLTGPRVATAKRIKLALQKVASKC*
CDS seq >KRH59908
ATGGTGCCAGAGAACTTGAAGAAACAACTTGCTTTGGCTGTGAGAAGCATCCATTGGAGCTATGCAATCTTCTGGACTGA
TTCAACTACCCAACCCGGGGTGTTGAGCTGGGGGGAAGGGTATTACAATGGAGACATTAAGACTAGGAAAACAAGTCAAG
GAGTAGAGCTCAATTCTGACCAAATAGGGTTGCAGAGGAGTGAACAGCTGCGAGAACTATTCAAATCCCTCAAAACTGTA
GAAGTCAGCCCTCAAACTAAAAGGCCTTCAGCAGCACTGTCCCCAGAAGATCTCACAGATGCTGAGTGGTATTACTTGGT
TTGCATGTCCTTCATATTCAACATTGGCCAAGGGTTACCAGGAAGAACTCTAGCGAAGGGTCAATCCATTTGGCTGAACA
ATGCCCATTCTGCTGATTGTAAAATTTTTAGCCGTTCTCTTCTGGCAAAGAGTGCGTCCATTGAGACGGTTGTGTGCTTT
CCGTTTAGGGAAGGGGTTATTGAGCTAGGTACTACTGAACAGGTCTCAGAAGATTTGAGTGTCATTGAACGGATCAAAAC
TTCTTTCTTGAACAGTCTGCATGTCGATGTTCCCAATAAGTCAGTAGCTACATTGAAATCAAGGAAACAGGAAGATCTTT
CTTATGTAGCATTTGATCATAATGACTATAATGTTGAATCGATTCCAGAAGTTGGGTATGAAATAGCCAACACAACCTCT
CCTAATGGTAGTTCAAATGCAATCCAAGCCAATCAACCACTAGATGACACACTTATGGTTGAAAGTATAACCAATGGCAC
TTCTCAAGTCCAAAACTGGCAAGTTATTGATGATGAATTGAGTAACTGTGTCCATAACTCCATGAATTCCAGCGACTGTA
TATCACCAACTTTTGCCAGCCTTGAGAATATTGCTTCTGCCCCCAAGTGTAACAACCCTTCTGATCCTTGTGCCCGAGAT
TTTCAAAAGTGCAACAACCCAAAAATGACCTTAGTGGATCCTCGAAGTGATGAGTGGCATTATCAGAGAGTTATTTCCAC
CCTTATAAAAAACACTGACCAGTTACTTATGGGAATGCATTTGCAAAAATTTCCTCAGGCATCAAGCTTTGTTAGTTGGA
GGAAAGGAGAGCCAATGGATTCCCAATGGCCAAGAGCAGGAACCTCACAAAAGTTATTGAAGAAAGTATTGTTTGAAGTT
CCTCAAATGCACTTGGATGGGCTTCATGAGTCTCAAGAAGAGAATGACTATAAAGAGGGAATGAGAGTAGAAGCTGATGA
AAACGGCATGAACCATGTTATGTCAGAAAGGAGGAGAAGAGCAAAACTAAATCAAAGGTTTTTAACCCTTAGGTCAATGG
TCCCTTCAATCAGTAAGGATGACAAAGTTTCGATACTAGATGATGCAATTGAATACCTTAAAAAGCTTGAGAGAAGGATA
AACGAGTTGGAAGCTCACAGGGGTGTAACAGATATAGAGACTGGGACTAGAAGATCACCACAAGATACGGTGGAGAGGAC
TCCTGATCATTATTTTAGCAAAAATAATAATAATAATGGTAAAAAACCAGGGATGAAAAAGAGGAAGGCTTGTGGTGTAG
ATGAGAAAGGAAGAGAGATTAATTTAGATGCTTTAAAAGGCAGTTATGCTAATGATGTTATTGTGAGTACCAGTGACAAT
GGAATTGTGATCGAAATGAAGTGCCCCTCGAGAGCAGGAAGGATGCTAGAAATTATGGAAGCAATTAACAGTTTCAATAT
AGATTTTAGTTCAGTTCAGTCAACAGAAGCTGATGGAAATCTTTATCTGACCATTAAATCTGTGCTCACAGGACCAAGGG
TTGCAACAGCCAAAAGAATCAAACTAGCACTACAAAAAGTGGCTTCCAAGTGCTGA