Microexon ID Gm_13:43906612-43906620:-
Species Glycine max
Coordinates 13:43906612..43906620
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ACCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTATAGAACTGCTTACCATTTCCAACCTCGCAAAAATTGGATAAATGACCCTAATGGACCAATGAGGTACAAAGGACTTTACCATCTGTTCTATCAATACAATCCA
Microexon-tag Amino Acid Seq PYRTAYHFQPRKNWINDPNGPMRYKGLYHLFYQYNP
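The 49,9,50 structure, the phase, and the two amino acid sequences listed above can be checked directly against the microexon-tag DNA sequence. The following is a minimal Python sketch, not the database's own pipeline. It assumes the tag is translated from its first base with the standard genetic code, and the helper names (`split_tag`, `translate`, `residues_overlapping`) are hypothetical.

```python
# Illustrative consistency check for this record; helper names are hypothetical.

# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Microexon-tag DNA sequence copied from the record (108 nt).
TAG_DNA = "CCTTATAGAACTGCTTACCATTTCCAACCTCGCAAAAATTGGATAAATGACCCTAATGGACCAATGAGGTACAAAGGACTTTACCATCTGTTCTATCAATACAATCCA"


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for n in sizes:
        parts.append(tag[pos:pos + n])
        pos += n
    if pos != len(tag):
        raise ValueError("sizes do not add up to the tag length")
    return parts


def translate(dna):
    """Translate complete codons starting at the first base (assumed frame)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def residues_overlapping(protein, start, end):
    """Residues whose codons overlap the 0-based, half-open range [start, end)."""
    return protein[start // 3:(end - 1) // 3 + 1]


upstream, microexon, downstream = split_tag(TAG_DNA, (49, 9, 50))
tag_protein = translate(TAG_DNA)
# Offset of the microexon start within its codon, under the frame assumption.
phase = len(upstream) % 3
micro_residues = residues_overlapping(tag_protein, 49, 49 + 9)

print(microexon, phase, micro_residues)
print(tag_protein)
```

Under these assumptions the middle piece is the 9-nt microexon, the codon offset agrees with the listed phase of 1, and the four residues whose codons overlap the microexon are DPNG. The microexon is only 9 nt long but touches four codons because it starts after the first base of a codon.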
Microexon-tag spanning region 43906289-43906803
Microexon-tag prediction score 0.936
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH23303x
Reference Transcript ID KRH23303
Gene ID GLYMA_13G349300
Gene Name NA
Transcript ID KRH23303
Protein ID KRH23303
Pfam domain motif Glyco_hydro_32N
Motif E-value 3e-106
Motif start 50
Motif end 372
Protein seq
>KRH23303
MAISPILLLAILSLIYGNGVLPIEATHHVYRNLQTLSSDSSDQPYRTAYHFQPRKNWINDPNGPMRYKGLYHLFYQYNPK
GAVWGNIVWAHSISNDLVNWTPLDHAIYPSQPSDINGCWSGSATILPRGKPAILYTGINPNKHQVQNLAIPKNMSDPLLR
EWVKSPKNPLMAPTISNNINSSSFRDPTTAWLGKDGYWRVLIGSKIHTRGMAILYKSKNFVNWVQAKQPLHSAEGTGMWE
CPDFYPVLDNKGPSTIGLDTSVNGDNVRHVLKVSLDDTKHDHYLIGTYDIAKDIFTPDNGFEDSQTVLRYDYGKYYASKT
IFEDGKNRRVLLGWVNESSSVPDDIKKGWAGIHTIPRAIWLHKSGKQLVQWPVVELESLRVNPVHWPTKVVKGGEMLQVT
GVTAAQADVEISFEVNEFGKAEVLDKWVDPQILCSRKGAAVKGGLGPFGLLVFASRGLQEYTAVFFRIFRYQNKNLVLMC
SDQSRSSLNKDNDMTTYGTFVDMDPLHEKLSLRTLIDRSVVESFGGEGMACITARVYPTIAINKKAQLYVFNNGTAAVKI
TRLSAWSMKKAKIN*
CDS seq
>KRH23303
ATGGCCATATCTCCAATTTTGTTGTTGGCTATCTTATCTCTCATTTATGGCAATGGTGTTCTTCCCATTGAAGCTACCCA
TCATGTTTACAGAAATCTTCAGACTCTATCTTCTGATTCCTCTGATCAACCTTATAGAACTGCTTACCATTTCCAACCTC
GCAAAAATTGGATAAATGACCCTAATGGACCAATGAGGTACAAAGGACTTTACCATCTGTTCTATCAATACAATCCAAAA
GGTGCCGTATGGGGCAATATTGTCTGGGCCCACTCAATATCAAATGATCTTGTGAATTGGACTCCACTGGATCATGCCAT
CTACCCTTCTCAACCGTCTGATATAAACGGTTGTTGGTCAGGCTCAGCCACAATACTCCCTCGGGGCAAGCCAGCCATTT
TATACACAGGAATTAACCCTAATAAGCACCAAGTTCAAAACTTAGCCATACCCAAAAACATGTCTGACCCATTACTTAGG
GAATGGGTTAAGTCACCCAAAAATCCACTAATGGCACCAACTATTTCTAACAATATCAATTCAAGCTCATTTAGGGACCC
TACCACTGCTTGGCTAGGAAAAGATGGATACTGGAGGGTGCTGATTGGAAGCAAAATACACACTAGGGGTATGGCAATTT
TGTACAAGAGCAAAAACTTTGTTAATTGGGTTCAAGCCAAACAACCCCTACATTCAGCTGAAGGCACTGGAATGTGGGAG
TGCCCTGATTTCTATCCAGTGCTGGATAATAAGGGCCCATCAACTATTGGTCTTGACACATCTGTGAATGGTGATAATGT
TAGGCATGTGCTTAAGGTTAGTTTGGATGATACAAAACATGATCATTATTTGATTGGGACTTATGACATTGCCAAGGATA
TCTTCACTCCGGATAACGGATTTGAGGATAGCCAAACTGTCTTAAGATATGACTATGGAAAATATTATGCCTCAAAAACG
ATTTTTGAAGATGGAAAAAACAGAAGGGTCTTATTGGGTTGGGTTAACGAATCCTCAAGTGTTCCGGATGATATCAAGAA
AGGATGGGCTGGAATCCATACCATTCCAAGGGCCATCTGGCTTCATAAATCTGGGAAACAGTTGGTGCAATGGCCGGTGG
TGGAACTTGAAAGCTTGCGTGTGAACCCTGTCCACTGGCCCACCAAAGTGGTCAAAGGTGGTGAAATGCTTCAAGTTACT
GGTGTCACTGCGGCACAGGCTGACGTTGAAATTTCATTTGAAGTGAATGAGTTTGGAAAGGCCGAAGTATTGGACAAATG
GGTGGATCCCCAAATTCTGTGTAGTAGAAAGGGGGCAGCCGTAAAGGGTGGTTTGGGACCCTTTGGCTTGCTAGTTTTTG
CTTCTCGTGGCTTACAAGAGTACACGGCAGTGTTCTTTAGAATATTCAGATACCAAAATAAAAATTTGGTTCTCATGTGT
AGCGACCAAAGCAGATCCTCTTTGAATAAAGATAACGATATGACCACCTATGGGACTTTTGTGGACATGGACCCTCTTCA
TGAGAAGCTGTCGCTAAGAACTTTGATTGATCGCTCAGTAGTGGAGAGTTTTGGAGGAGAGGGAATGGCTTGCATCACAG
CCAGAGTATATCCCACAATAGCAATTAATAAAAAGGCACAACTATATGTTTTCAATAATGGAACTGCCGCTGTCAAGATC
ACAAGATTGAGTGCTTGGAGCATGAAGAAGGCAAAAATCAACTGA
Transcript ID KRH23303
Gene ID Gm.13994
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 3e-106
Motif start 50
Motif end 372