Microexon ID Gm_17:38275930-38275938:+
Species Glycine max
Coordinates 17:38275930..38275938
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq CCTTATAGAACTTGGTACCACTTTCAACCCCCACAAAATTGGATGAACGATCCAAATGCACCTATGTACTACAAAGGAGTTTACCACTTTTTCTACCAACATAACCCC
Microexon-tag Amino Acid Seq PYRTWYHFQPPQNWMNDPNAPMYYKGVYHFFYQHNP
Microexon-tag spanning region 38275736-38276458
Microexon-tag prediction score 0.9455
Overlapped with the annotated transcript (%) 100
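The fields above are mutually consistent, and a short check makes the relationship explicit. The following is a minimal sketch, not the database's own pipeline: it splits the microexon-tag by the listed structure (49,9,50), translates the tag with the standard genetic code, and collects the residues whose codons overlap the microexon. The helper names (`translate`, `split_tag`, `overlapping_residues`) are hypothetical and introduced only for illustration. It also assumes that the tag is in frame from its first base, which holds for this record because the translation reproduces the listed amino acid sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    if pos != len(tag):
        raise ValueError("sizes do not add up to the tag length")
    return parts


def overlapping_residues(protein, start, length):
    """Residues whose codons overlap the nucleotide span [start, start + length)."""
    first = start // 3
    last = (start + length - 1) // 3
    return protein[first:last + 1]


# Values copied from the record above.
TAG = ("CCTTATAGAACTTGGTACCACTTTCAACCCCCACAAAATTGGATGAACG"
       "ATCCAAATG"
       "CACCTATGTACTACAAAGGAGTTTACCACTTTTTCTACCAACATAACCCC")
SIZES = (49, 9, 50)
START, END = 38275930, 38275938

upstream, microexon, downstream = split_tag(TAG, SIZES)
tag_protein = translate(TAG)
microexon_residues = overlapping_residues(tag_protein, SIZES[0], SIZES[1])
microexon_size = END - START + 1  # inclusive coordinates

print(microexon, microexon_size, microexon_residues)
print(tag_protein)
```

Because the microexon begins partway through a codon, its 9 nucleotides touch four codons, which is why the record lists a four-residue amino acid sequence for a 9 nt exon.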
New Transcript ID KRH05439x
Reference Transcript ID KRH05439
Gene ID GLYMA_17G227800
Gene Name NA
Transcript ID KRH05439
Protein ID KRH05439
Gene ID GLYMA_17G227800
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.1e-100
Motif start 31
Motif end 351
Protein seq
>KRH05439
MEINGEGASPHSINSIKFKVPEKQPYRTWYHFQPPQNWMNDPNAPMYYKGVYHFFYQHNPYAPTFGEKMVWAHSVSYDLI
NWIHLNHAIEPSDSYDINSCWSGSATILPGEEEQPVILYTGIDNNKYQVQNMAMPKDLSDPFLREWVKHPQNPAMTPPSG
VEVNNFRDPSTAWQGKDGKWRVVIGAQNGDEGKTILYQSEDFVNWRVELNPFFATDNTGVCECPDFFPVSINSTNGVDAS
VQSQSVRHVLKISYLRRHQDYYFLGKYVYDEGNFVPDVKFTGTSSDLRLDYGKFYASKSFFDHAKNRRILWGWVNECDTR
QNDIEKGWAGLQCIPRQVWLDESGKQLMQWPIEEIEKLRDKQISILGEKLVGGSIIEVSGITASQADVEVLFELPELENV
EWLDESEVDPHLLCSEEYATRSGTIGPFGLLALASEDQTEHTAVFFRIYRASNRYICFMCSDQSRSSLRQDLDKTTYGTI
FDIDPNVKTISLRSLIDRSIIESFGEKGRICITSRVYPSMSIDKNAHLYVFNNGSQSVVISELNAWSMKQAEFGQEESIN
KQ*
CDS seq
>KRH05439
ATGGAGATCAATGGAGAGGGTGCATCCCCACACAGCATAAATTCTATCAAGTTCAAGGTACCTGAGAAACAGCCTTATAG
AACTTGGTACCACTTTCAACCCCCACAAAATTGGATGAACGATCCAAATGCACCTATGTACTACAAAGGAGTTTACCACT
TTTTCTACCAACATAACCCCTATGCACCAACCTTTGGCGAGAAAATGGTGTGGGCTCACTCTGTGTCCTATGATCTCATC
AATTGGATTCATCTGAATCATGCCATTGAACCAAGTGATTCCTATGACATCAACAGCTGCTGGTCAGGCTCAGCCACAAT
ACTCCCAGGTGAAGAAGAACAACCTGTTATTTTGTACACAGGAATTGATAACAATAAATATCAAGTTCAGAACATGGCTA
TGCCAAAGGATCTATCAGACCCTTTCTTAAGGGAATGGGTGAAACACCCTCAGAACCCTGCCATGACACCACCAAGTGGT
GTTGAAGTGAATAACTTCAGAGACCCTTCAACTGCTTGGCAGGGAAAGGATGGAAAATGGAGGGTAGTCATTGGTGCTCA
AAATGGGGATGAAGGGAAGACAATTCTCTACCAAAGTGAGGATTTTGTTAATTGGAGAGTGGAATTGAACCCTTTTTTTG
CAACAGATAACACTGGAGTTTGTGAGTGTCCAGATTTTTTTCCTGTGTCCATCAATAGCACAAATGGGGTGGATGCATCT
GTCCAAAGTCAAAGTGTTAGACATGTCTTGAAGATAAGCTATCTACGTAGACATCAGGACTATTATTTTCTTGGTAAATA
TGTCTATGATGAGGGGAACTTTGTTCCTGATGTTAAATTCACAGGAACTAGTTCGGACTTAAGGCTTGACTATGGTAAGT
TTTATGCTTCAAAGTCATTTTTTGACCATGCTAAGAACAGGAGGATATTGTGGGGGTGGGTGAACGAGTGTGACACTAGA
CAAAATGACATTGAGAAAGGATGGGCTGGTCTACAGTGTATTCCAAGGCAAGTTTGGCTTGATGAAAGTGGGAAGCAGTT
GATGCAGTGGCCAATTGAAGAAATAGAAAAACTACGCGACAAACAAATTAGCATATTGGGGGAGAAACTGGTTGGTGGAT
CAATTATTGAAGTCTCAGGTATCACTGCATCACAGGCCGATGTAGAAGTGTTGTTTGAGCTACCCGAACTAGAGAATGTG
GAGTGGCTAGATGAAAGTGAAGTTGATCCCCACTTGCTGTGTAGTGAAGAGTATGCAACAAGAAGTGGCACAATAGGGCC
ATTTGGTTTGTTAGCTTTAGCTTCTGAGGACCAAACAGAACACACTGCAGTGTTCTTCAGAATATATAGAGCTTCCAATA
GATATATATGCTTCATGTGCAGTGACCAAAGCAGGTCTTCATTGCGGCAGGACCTTGATAAAACCACATATGGAACAATC
TTTGACATTGACCCTAATGTCAAAACGATTTCACTTAGAAGTTTGATTGACCGCTCAATTATTGAGAGTTTTGGGGAGAA
AGGGAGAATTTGTATTACCAGTAGAGTTTATCCCTCGATGTCTATAGACAAAAATGCACATCTTTATGTGTTCAACAATG
GAAGCCAGAGTGTGGTGATCTCTGAACTGAATGCTTGGAGCATGAAGCAAGCAGAATTTGGCCAAGAAGAAAGCATAAAT
AAGCAGTAG
Transcript ID KRH05439
Gene ID Gm.23615
Gene Name NA