Microexon ID Gm_17:38287217-38287225:-
Species Glycine max
Coordinates 17:38287217..38287225
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACCGAACTTGGTACCACTTTCAGCCCCCACAAAATTGGATGAATGATCCAAATGGACCAATGTACTACAAAGGAGTTTACCACTTTTTCTACCAACATAACCCT
Microexon-tag Amino Acid Seq PYRTWYHFQPPQNWMNDPNGPMYYKGVYHFFYQHNP
Microexon-tag spanning region38286749-38287439
Microexon-tag prediction score0.9522
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH05441x
Reference Transcript ID KRH05441
Gene ID GLYMA_17G227900
Gene Name NA
Transcript ID KRH05441
Protein ID KRH05441
Gene ID GLYMA_17G227900
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 6.7e-101
Motif start 30
Motif end 349
Protein seq >KRH05441
MIMEINASPDNINSVKYNVHEKQPYRTWYHFQPPQNWMNDPNGPMYYKGVYHFFYQHNPYAPTFGRHMVWGHSVSYDLIN
WIHLNHILEPSESYDINGCYSGSITTLPVEKPVIMYTGSDTNKHQIQNLAMPKNLSDPFLREWVKDPQNPIMIPPSGIDV
EGFRDPTTAWQGGDGKWRVIIGAKTGDDGKALLYHSDDFVNWKLHPNPLYASDNTGMFECPDFFPVHISGSKSGVDTSIQ
NSSVKHVLKMSYQNKQLEYYFLGEYFPDQEKFIPDADWARTGLDLILDHGMFYASKSFFDNAKKRRILWGWSKECDTTQD
DYEKGWAGLQSIPRQVWLDKSGKWLMQWPIEEVEKLRDKQVSITGEKLIGGSTIEVSGITASQVDVEVLFELPELENAEW
LDESEVDSHLLCSEEYASRSGIIGPFGLLALASEDQTEHTAIFFRIYRAPNRYLCLMCSDQSRSSLRQDLDKTPYGTIFD
IDPNVKTISLRSLIDRSIIESFGEKGRICITSRVYPSLAIDKDAHLYVFNNGSQSVVISELNAWSMKEAEFS*
CDS seq >KRH05441
ATGATCATGGAGATCAATGCATCCCCCGACAACATTAATTCAGTCAAGTACAACGTACATGAAAAACAGCCTTACCGAAC
TTGGTACCACTTTCAGCCCCCACAAAATTGGATGAATGATCCAAATGGACCAATGTACTACAAAGGAGTTTACCACTTTT
TCTACCAACATAACCCTTATGCACCAACCTTTGGTAGGCATATGGTATGGGGTCATTCCGTATCCTATGATCTCATCAAT
TGGATTCATCTAAACCATATTCTTGAACCAAGTGAGTCCTATGACATTAATGGCTGTTATTCAGGCTCAATCACAACCCT
CCCAGTGGAAAAACCTGTTATCATGTATACAGGGAGTGATACTAACAAACATCAAATTCAGAACTTGGCTATGCCAAAGA
ATCTATCAGACCCCTTCTTAAGGGAATGGGTGAAAGACCCCCAAAACCCTATCATGATTCCACCAAGTGGAATTGATGTG
GAGGGTTTCAGAGACCCGACAACTGCATGGCAAGGAGGTGATGGAAAATGGAGAGTGATTATTGGTGCCAAAACGGGTGA
TGATGGGAAGGCTCTTCTCTACCATAGTGATGATTTTGTTAATTGGAAACTGCATCCCAATCCTTTGTATGCATCAGACA
ATACTGGAATGTTTGAGTGTCCAGATTTCTTTCCAGTGCACATAAGTGGCTCAAAGAGTGGGGTTGATACATCAATCCAA
AACTCCAGTGTCAAGCATGTCTTGAAAATGAGTTATCAAAACAAACAACTAGAATACTATTTTCTTGGTGAATATTTTCC
TGATCAGGAAAAGTTTATTCCTGATGCTGATTGGGCAAGAACTGGTTTGGACTTGATATTGGACCATGGAATGTTTTATG
CTTCCAAGTCATTTTTTGACAATGCCAAGAAAAGAAGGATATTGTGGGGATGGTCAAAGGAGTGTGACACCACACAAGAT
GATTATGAGAAAGGATGGGCTGGTCTACAGAGTATTCCAAGGCAAGTTTGGCTTGATAAAAGTGGGAAGTGGTTGATGCA
GTGGCCAATTGAAGAGGTAGAAAAACTACGTGACAAACAAGTTAGCATAACGGGAGAGAAACTAATTGGTGGATCAACTA
TTGAAGTCTCTGGTATTACTGCATCACAAGTCGATGTAGAAGTGTTGTTTGAGCTACCTGAACTAGAGAATGCGGAGTGG
CTAGATGAAAGTGAAGTTGATTCTCACTTGCTGTGTAGTGAAGAATATGCATCAAGAAGTGGCATAATAGGGCCATTTGG
TTTGTTAGCTTTAGCATCTGAGGACCAAACAGAACACACTGCAATTTTCTTCAGAATATATAGAGCTCCCAATAGATATT
TATGCCTCATGTGCAGTGACCAAAGCAGGTCTTCATTGAGGCAGGACCTTGATAAAACCCCATATGGAACAATCTTTGAC
ATTGACCCTAATGTCAAGACGATTTCACTTAGAAGTTTGATTGACCGCTCCATTATTGAGAGTTTTGGGGAGAAAGGGAG
AATTTGTATTACCAGTAGAGTTTATCCCTCGTTGGCTATTGACAAAGATGCACATCTTTATGTTTTCAACAATGGAAGCC
AGAGTGTGGTGATCTCTGAACTGAATGCTTGGAGCATGAAGGAAGCAGAATTTAGTTAA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACCGAACTTGGTACCACTTTCAGCCCCCACAAAATTGGATGAATGATCCAAATGGACCAATGTACTACAAAGGAGTTTACCACTTTTTCTACCAACATAACCCT
Microexon-tag Amino Acid seq PYRTWYHFQPPQNWMNDPNGPMYYKGVYHFFYQHNP
Transcript ID KRH05441
Gene ID Gm.23616
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 6.7e-101
Motif start 30
Motif end 349
Protein seq >KRH05441
MIMEINASPDNINSVKYNVHEKQPYRTWYHFQPPQNWMNDPNGPMYYKGVYHFFYQHNPYAPTFGRHMVWGHSVSYDLIN
WIHLNHILEPSESYDINGCYSGSITTLPVEKPVIMYTGSDTNKHQIQNLAMPKNLSDPFLREWVKDPQNPIMIPPSGIDV
EGFRDPTTAWQGGDGKWRVIIGAKTGDDGKALLYHSDDFVNWKLHPNPLYASDNTGMFECPDFFPVHISGSKSGVDTSIQ
NSSVKHVLKMSYQNKQLEYYFLGEYFPDQEKFIPDADWARTGLDLILDHGMFYASKSFFDNAKKRRILWGWSKECDTTQD
DYEKGWAGLQSIPRQVWLDKSGKWLMQWPIEEVEKLRDKQVSITGEKLIGGSTIEVSGITASQVDVEVLFELPELENAEW
LDESEVDSHLLCSEEYASRSGIIGPFGLLALASEDQTEHTAIFFRIYRAPNRYLCLMCSDQSRSSLRQDLDKTPYGTIFD
IDPNVKTISLRSLIDRSIIESFGEKGRICITSRVYPSLAIDKDAHLYVFNNGSQSVVISELNAWSMKEAEFS*
CDS seq >KRH05441
ATGATCATGGAGATCAATGCATCCCCCGACAACATTAATTCAGTCAAGTACAACGTACATGAAAAACAGCCTTACCGAAC
TTGGTACCACTTTCAGCCCCCACAAAATTGGATGAATGATCCAAATGGACCAATGTACTACAAAGGAGTTTACCACTTTT
TCTACCAACATAACCCTTATGCACCAACCTTTGGTAGGCATATGGTATGGGGTCATTCCGTATCCTATGATCTCATCAAT
TGGATTCATCTAAACCATATTCTTGAACCAAGTGAGTCCTATGACATTAATGGCTGTTATTCAGGCTCAATCACAACCCT
CCCAGTGGAAAAACCTGTTATCATGTATACAGGGAGTGATACTAACAAACATCAAATTCAGAACTTGGCTATGCCAAAGA
ATCTATCAGACCCCTTCTTAAGGGAATGGGTGAAAGACCCCCAAAACCCTATCATGATTCCACCAAGTGGAATTGATGTG
GAGGGTTTCAGAGACCCGACAACTGCATGGCAAGGAGGTGATGGAAAATGGAGAGTGATTATTGGTGCCAAAACGGGTGA
TGATGGGAAGGCTCTTCTCTACCATAGTGATGATTTTGTTAATTGGAAACTGCATCCCAATCCTTTGTATGCATCAGACA
ATACTGGAATGTTTGAGTGTCCAGATTTCTTTCCAGTGCACATAAGTGGCTCAAAGAGTGGGGTTGATACATCAATCCAA
AACTCCAGTGTCAAGCATGTCTTGAAAATGAGTTATCAAAACAAACAACTAGAATACTATTTTCTTGGTGAATATTTTCC
TGATCAGGAAAAGTTTATTCCTGATGCTGATTGGGCAAGAACTGGTTTGGACTTGATATTGGACCATGGAATGTTTTATG
CTTCCAAGTCATTTTTTGACAATGCCAAGAAAAGAAGGATATTGTGGGGATGGTCAAAGGAGTGTGACACCACACAAGAT
GATTATGAGAAAGGATGGGCTGGTCTACAGAGTATTCCAAGGCAAGTTTGGCTTGATAAAAGTGGGAAGTGGTTGATGCA
GTGGCCAATTGAAGAGGTAGAAAAACTACGTGACAAACAAGTTAGCATAACGGGAGAGAAACTAATTGGTGGATCAACTA
TTGAAGTCTCTGGTATTACTGCATCACAAGTCGATGTAGAAGTGTTGTTTGAGCTACCTGAACTAGAGAATGCGGAGTGG
CTAGATGAAAGTGAAGTTGATTCTCACTTGCTGTGTAGTGAAGAATATGCATCAAGAAGTGGCATAATAGGGCCATTTGG
TTTGTTAGCTTTAGCATCTGAGGACCAAACAGAACACACTGCAATTTTCTTCAGAATATATAGAGCTCCCAATAGATATT
TATGCCTCATGTGCAGTGACCAAAGCAGGTCTTCATTGAGGCAGGACCTTGATAAAACCCCATATGGAACAATCTTTGAC
ATTGACCCTAATGTCAAGACGATTTCACTTAGAAGTTTGATTGACCGCTCCATTATTGAGAGTTTTGGGGAGAAAGGGAG
AATTTGTATTACCAGTAGAGTTTATCCCTCGTTGGCTATTGACAAAGATGCACATCTTTATGTTTTCAACAATGGAAGCC
AGAGTGTGGTGATCTCTGAACTGAATGCTTGGAGCATGAAGGAAGCAGAATTTAGTTAA