Microexon ID Gm_7:652428-652436:-
Species Glycine max
Coordinates 7:652428..652436
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (with IUPAC ambiguity codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCCAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACAGAACTGCTTATCACTTCCAACCTCCCAAGAATTGGATAAATGATCCCAATGGACCATTGAGATATGCAGGACTTTACCACCTATTCTATCAATACAATCCT
Microexon-tag Amino Acid Seq PYRTAYHFQPPKNWINDPNGPLRYAGLYHLFYQYNP
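The tag structure (49,9,50), the phase, and the amino acid sequences listed above can be cross-checked against each other. The following is a minimal Python sketch, assuming that the three sizes index the tag from its 5' end, that the tag is translated from its first nucleotide, and that phase counts the nucleotides of the split codon carried by the upstream flanking exon. The helper names `translate`, `split_tag`, and `overlapping_peptide` are illustrative only and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Microexon-tag DNA sequence of this record (108 nt), split over two literals.
TAG = ("CCTTACAGAACTGCTTATCACTTCCAACCTCCCAAGAATTGGATAA"
       "ATGATCCCAATGGACCATTGAGATATGCAGGACTTTACCACCTATTCTATCAATACAATCCT")
SIZES = (49, 9, 50)  # flanking exon, microexon, flanking exon


def translate(dna):
    """Translate complete codons from the first nucleotide; drop a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut the tag into consecutive segments of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("segment sizes do not add up to the tag length")
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    return parts


def overlapping_peptide(tag, start, size):
    """Residues whose codons overlap tag[start:start + size] (0-based start)."""
    first_codon = start // 3
    last_codon = (start + size - 1) // 3
    return translate(tag)[first_codon:last_codon + 1]


upstream, microexon, downstream = split_tag(TAG, SIZES)
phase = len(upstream) % 3  # assumption: phase = nt of the split codon in the upstream exon

print(microexon)
print(phase)
print(translate(TAG))
print(overlapping_peptide(TAG, len(upstream), len(microexon)))
```

Under these assumptions the sketch reproduces the record's values: the 9-nt microexon ATCCCAATG, phase 1, the 36-residue tag peptide, and DPNG as the four residues whose codons overlap the microexon.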
Microexon-tag spanning region 651751-652686
Microexon-tag prediction score 0.9256
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH47104x
Reference Transcript ID KRH47104
Gene ID GLYMA_07G008800
Gene Name NA
Transcript ID KRH47104
Protein ID KRH47104
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.5e-108
Motif start 45
Motif end 362
Protein seq >KRH47104
MAMSTILLLTFFSFIYGSAATHHVYRNLQSLSSDSSNQPYRTAYHFQPPKNWINDPNGPLRYAGLYHLFYQYNPKGAVWG
NIVWAHSVSKDLVNWTPLDPAIFPSQPSDINGCWSGSTTLLPGNKPVILYTGIDLLNQQVQNLAQPKNLSDPFLREWVKS
PKNPLMAPTSANKINSSSFRDPTTAWLGKDGHWRVLVGSKRRTRGIAILYRSKDFVNWVQAKHPLYSILGSGMWECPDFF
PVLNNDQLGVDTSVNGYDVRHVLKVSLDDKKHDYYMIGSYNAAKDAFIPDEESNEFVLRYDYGKYYASKTFFDDGKKRRI
LLGWANESSSVAADIKKGWSGIHTIPRALWLHKSGKQLVQWPVVEVEKLRAYPVNLPPQVLKGGKLLPINGVTATQADVE
ISFEVSNLREAEVLDYWTDPQILCSKKGSSIKSGLGPFGLLVFASEGLQEYTSVFFRIFRHQHKYLVLLCSDQSRSSLNK
DNDLTSYGTFVDVDPLHEKLSLRTLIDHSVVESFGGEGRACITARVYPTLAINDEAQLYAFNNGTADVNITKLNAWSMKK
AQIN*
CDS seq >KRH47104
ATGGCTATGTCTACTATTTTGCTTTTGACTTTCTTTTCTTTCATTTATGGCAGTGCAGCTACTCATCACGTTTATAGAAA
TCTTCAGAGTTTATCTTCTGATTCCTCCAACCAACCTTACAGAACTGCTTATCACTTCCAACCTCCCAAGAATTGGATAA
ATGATCCCAATGGACCATTGAGATATGCAGGACTTTACCACCTATTCTATCAATACAATCCTAAAGGTGCAGTTTGGGGA
AATATTGTGTGGGCACATTCAGTGTCAAAGGATCTTGTGAATTGGACTCCACTAGATCCTGCCATTTTTCCATCTCAACC
GTCCGATATAAATGGCTGTTGGTCAGGATCAACCACACTACTTCCTGGGAACAAACCAGTTATTCTATACACTGGAATTG
ACCTATTGAATCAGCAAGTTCAAAACTTGGCCCAACCCAAAAATTTGTCTGACCCATTTCTTAGGGAATGGGTCAAGTCC
CCCAAAAATCCTCTAATGGCACCAACTAGTGCTAACAAGATCAATTCAAGTTCATTTAGAGACCCCACCACTGCTTGGCT
AGGCAAAGATGGGCATTGGAGGGTACTTGTTGGAAGCAAAAGAAGAACTAGGGGAATCGCAATTTTGTATAGGAGCAAAG
ACTTTGTTAATTGGGTTCAAGCCAAACACCCTTTGTATTCAATCCTAGGAAGTGGCATGTGGGAGTGTCCTGATTTTTTC
CCTGTTTTGAATAATGACCAATTGGGTGTTGACACGTCGGTGAATGGTTATGATGTTAGGCACGTGCTCAAGGTTAGCTT
GGATGACAAAAAACATGATTACTATATGATTGGGAGTTACAATGCTGCCAAGGATGCATTTATCCCGGATGAAGAGTCTA
ATGAATTTGTTTTAAGATATGACTATGGAAAATACTATGCCTCAAAAACTTTCTTTGATGATGGGAAGAAGAGAAGGATT
TTGTTAGGGTGGGCTAATGAATCCTCTAGTGTTGCTGCTGATATCAAGAAGGGATGGTCTGGAATCCATACAATTCCAAG
GGCTTTATGGCTGCATAAGTCTGGTAAACAGTTGGTACAGTGGCCAGTGGTGGAAGTTGAAAAGCTACGTGCATACCCAG
TCAACTTGCCCCCCCAAGTGCTAAAAGGAGGCAAGTTGCTTCCAATTAATGGTGTCACAGCAACACAGGCCGATGTTGAA
ATTTCATTTGAAGTGAGTAATTTGAGAGAGGCTGAAGTATTGGACTATTGGACAGACCCCCAAATTCTGTGTAGTAAAAA
GGGTTCATCCATCAAAAGTGGACTGGGCCCATTTGGTCTGTTAGTTTTTGCTTCAGAGGGTTTGCAAGAGTATACATCAG
TTTTCTTCAGAATATTCAGACACCAACACAAATATTTGGTGCTCTTGTGCAGTGATCAAAGCAGGTCTTCCTTAAACAAA
GACAATGATTTGACCAGTTATGGAACTTTTGTGGACGTCGACCCTCTTCATGAGAAGCTATCATTGAGAACCTTGATTGA
TCATTCTGTCGTGGAGAGTTTTGGAGGAGAAGGAAGGGCTTGCATCACGGCTAGAGTTTATCCCACATTAGCAATCAATG
ATGAAGCACAACTATATGCTTTCAATAATGGAACTGCTGATGTCAACATCACAAAACTGAATGCTTGGAGCATGAAGAAA
GCACAGATAAATTGA
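To place the microexon on the transcript, the microexon-tag can be located within the CDS and its offset converted to protein coordinates. The following Python sketch assumes that the Pfam motif start and end (45 and 362) are 1-based, inclusive protein positions. Only the first 240 nt of the CDS are included to keep the example short, and `locate` is an illustrative helper name, not a database function.

```python
# First 240 nt of the KRH47104 CDS (the first three 80-nt lines of the record).
CDS_PREFIX = (
    "ATGGCTATGTCTACTATTTTGCTTTTGACTTTCTTTTCTTTCATTTATGGCAGTGCAGCTACTCATCACGTTTATAGAAA"
    "TCTTCAGAGTTTATCTTCTGATTCCTCCAACCAACCTTACAGAACTGCTTATCACTTCCAACCTCCCAAGAATTGGATAA"
    "ATGATCCCAATGGACCATTGAGATATGCAGGACTTTACCACCTATTCTATCAATACAATCCTAAAGGTGCAGTTTGGGGA"
)

# Microexon-tag DNA sequence of this record (flanking exon, microexon, flanking exon).
TAG = ("CCTTACAGAACTGCTTATCACTTCCAACCTCCCAAGAATTGGATAA"
       "ATGATCCCAATGGACCATTGAGATATGCAGGACTTTACCACCTATTCTATCAATACAATCCT")

MICROEXON_OFFSET_IN_TAG = 49   # size of the upstream flanking exon in the tag
MICROEXON_SIZE = 9
MOTIF_START, MOTIF_END = 45, 362  # assumption: 1-based, inclusive protein positions


def locate(cds, tag, offset_in_tag, size):
    """Return 1-based inclusive (CDS span, residue span) of the microexon."""
    tag_start = cds.find(tag)
    if tag_start < 0:
        raise ValueError("tag not found in the supplied CDS")
    first_nt = tag_start + offset_in_tag      # 0-based index of first microexon nt
    last_nt = first_nt + size - 1             # 0-based index of last microexon nt
    cds_span = (first_nt + 1, last_nt + 1)
    residue_span = (first_nt // 3 + 1, last_nt // 3 + 1)
    return cds_span, residue_span


cds_span, residue_span = locate(CDS_PREFIX, TAG, MICROEXON_OFFSET_IN_TAG, MICROEXON_SIZE)
in_motif = MOTIF_START <= residue_span[0] and residue_span[1] <= MOTIF_END

print(cds_span)      # CDS positions covered by the microexon
print(residue_span)  # protein residues whose codons overlap the microexon
print(in_motif)      # whether those residues fall inside the Pfam motif range
```

With these inputs the microexon maps to CDS positions 164 to 172 and to residues 55 to 58 (DPNG) of the protein sequence, which lie inside the Glyco_hydro_32N motif range given above.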
Transcript ID KRH47104
Gene ID Gm.45555
Gene Name NA