Microexon ID Gm_15:1992933-1992941:+
Species Glycine max
Coordinates 15:1992933..1992941
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ACCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTATAGAACCGCTTACCATTTCCAACCTCCCAAAAATTGGATAAATGACCCTAATGGACCAATGAGGTACAAAGGACTTTATCATCTCTTCTACCAATACAATCCA
Microexon-tag Amino Acid Seq PYRTAYHFQPPKNWINDPNGPMRYKGLYHLFYQYNP
Microexon-tag spanning region1992732-1993266
Microexon-tag prediction score0.9408
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH10019x
Reference Transcript ID KRH10019
Gene ID GLYMA_15G024600
Gene Name NA
Transcript ID KRH10019
Protein ID KRH10019
Gene ID GLYMA_15G024600
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 7.7e-108
Motif start 51
Motif end 373
Protein seq >KRH10019
MAVSPILLLLAIFSLIYGNGILPIEATHHVYRNLQTLSSDSSDQPYRTAYHFQPPKNWINDPNGPMRYKGLYHLFYQYNP
KGAVWGNIVWAHSVSKDLVNWTPLDHAIYPSQPSDINGCWSGSATILPGGKPAILYTGIDPNNHQVQNLALPKNMSDPLL
REWVKSPKNPLMAPTSANMINSSSFRDPTTAWLGKDGYWRVLIGSKIHTRGMAILYKSKNFVNWVQAKQPLHSAEGTGMW
ECPDFYPVLNNKPSSTIGLDTSVNGDNVRHVLKVSLDDKKHDHYLIGTYDIAKDIFTPDNGFEDSQTVLRYDYGKYYASK
TIFEDGKNRRVLLGWVNESSSVSDDIKKGWAGIHTIPRAIWLHKSGKQLVQWPVVELESLRVNPVHWPTKVVKGGEMLQV
TGVTAAQADVEISFDVNEFGKGEVLDQWVDPQILCSRKGAAVKGGLGPFGLLVFASRGLQEYTAVFFRIFRYQNKNLVLM
CSDQSRSSLNKDNDMTTYGTFVDMDPLHEKLSLRTLIDHSVVESFGGEGRACITARVYPTIAINEKAQLYAFNNGTAAVK
ITRLSAWSMEKAKIN*
CDS seq >KRH10019
ATGGCCGTATCTCCAATTTTGTTGTTGTTGGCTATCTTCTCTCTCATTTATGGCAATGGTATTCTTCCCATTGAAGCTAC
CCACCATGTTTACAGAAATCTTCAGACTCTATCTTCTGATTCCTCTGATCAACCTTATAGAACCGCTTACCATTTCCAAC
CTCCCAAAAATTGGATAAATGACCCTAATGGACCAATGAGGTACAAAGGACTTTATCATCTCTTCTACCAATACAATCCA
AAAGGTGCTGTTTGGGGTAATATTGTGTGGGCCCACTCAGTATCAAAGGATCTTGTGAATTGGACCCCTCTAGATCATGC
CATCTACCCTTCTCAACCGTCTGATATCAACGGTTGTTGGTCAGGCTCAGCCACAATACTTCCTGGGGGCAAACCAGCCA
TTTTATACACAGGAATTGACCCTAATAATCACCAAGTTCAAAACTTAGCCCTACCCAAAAACATGTCTGACCCATTACTT
AGGGAATGGGTTAAGTCACCCAAAAATCCACTAATGGCACCAACTAGTGCTAACATGATCAATTCAAGCTCATTTAGGGA
TCCTACCACTGCTTGGCTAGGCAAAGATGGGTACTGGAGGGTGCTGATTGGAAGCAAAATACACACTAGGGGTATGGCAA
TTTTGTACAAGAGCAAAAACTTTGTTAATTGGGTTCAAGCCAAACAACCCCTACATTCAGCTGAAGGCACTGGAATGTGG
GAGTGCCCTGATTTCTATCCAGTGCTGAATAATAAACCATCATCAACTATTGGTCTTGACACATCTGTGAATGGTGATAA
TGTTAGGCATGTGCTTAAGGTTAGTTTGGATGATAAAAAACATGATCATTATTTGATTGGGACTTATGACATTGCCAAGG
ATATCTTCACTCCGGATAATGGATTTGAGGATAGTCAAACTGTCCTAAGATATGACTATGGAAAATATTATGCCTCAAAA
ACCATTTTTGAGGATGGAAAGAACAGAAGGGTCTTATTGGGTTGGGTTAATGAATCCTCAAGTGTTTCGGATGATATCAA
GAAAGGATGGGCTGGAATCCATACTATTCCAAGGGCCATCTGGCTTCATAAATCTGGAAAACAGTTGGTGCAATGGCCGG
TGGTGGAACTTGAAAGCTTACGTGTGAATCCTGTCCACTGGCCCACCAAAGTGGTCAAAGGTGGTGAAATGCTTCAAGTT
ACTGGTGTTACTGCGGCACAGGCTGACGTTGAAATTTCATTTGACGTGAATGAGTTTGGAAAGGGCGAAGTATTGGACCA
ATGGGTGGATCCCCAAATTCTGTGTAGTAGAAAGGGTGCAGCCGTAAAGGGTGGTTTGGGACCCTTTGGCTTGCTAGTTT
TTGCTTCTCGTGGCTTGCAAGAGTACACGGCAGTATTCTTTAGAATATTCAGATACCAAAATAAAAATTTGGTTCTCATG
TGTAGCGACCAAAGCAGATCCTCTTTGAATAAAGATAACGATATGACCACCTATGGGACTTTTGTGGACATGGACCCTCT
TCATGAGAAGCTGTCACTAAGAACTTTGATTGATCACTCAGTAGTGGAGAGTTTTGGAGGAGAGGGAAGGGCTTGCATCA
CAGCCAGAGTATATCCCACAATAGCAATTAATGAAAAGGCACAACTATATGCTTTCAATAATGGAACTGCCGCCGTCAAG
ATTACAAGATTGAGTGCTTGGAGCATGGAGAAGGCAAAAATAAACTGA
Microexon DNA seq ACCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTATAGAACCGCTTACCATTTCCAACCTCCCAAAAATTGGATAAATGACCCTAATGGACCAATGAGGTACAAAGGACTTTATCATCTCTTCTACCAATACAATCCA
Microexon-tag Amino Acid seq PYRTAYHFQPPKNWINDPNGPMRYKGLYHLFYQYNP
Transcript ID KRH10019
Gene ID Gm.16781
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 7.7e-108
Motif start 51
Motif end 373
Protein seq >KRH10019
MAVSPILLLLAIFSLIYGNGILPIEATHHVYRNLQTLSSDSSDQPYRTAYHFQPPKNWINDPNGPMRYKGLYHLFYQYNP
KGAVWGNIVWAHSVSKDLVNWTPLDHAIYPSQPSDINGCWSGSATILPGGKPAILYTGIDPNNHQVQNLALPKNMSDPLL
REWVKSPKNPLMAPTSANMINSSSFRDPTTAWLGKDGYWRVLIGSKIHTRGMAILYKSKNFVNWVQAKQPLHSAEGTGMW
ECPDFYPVLNNKPSSTIGLDTSVNGDNVRHVLKVSLDDKKHDHYLIGTYDIAKDIFTPDNGFEDSQTVLRYDYGKYYASK
TIFEDGKNRRVLLGWVNESSSVSDDIKKGWAGIHTIPRAIWLHKSGKQLVQWPVVELESLRVNPVHWPTKVVKGGEMLQV
TGVTAAQADVEISFDVNEFGKGEVLDQWVDPQILCSRKGAAVKGGLGPFGLLVFASRGLQEYTAVFFRIFRYQNKNLVLM
CSDQSRSSLNKDNDMTTYGTFVDMDPLHEKLSLRTLIDHSVVESFGGEGRACITARVYPTIAINEKAQLYAFNNGTAAVK
ITRLSAWSMEKAKIN*
CDS seq >KRH10019
ATGGCCGTATCTCCAATTTTGTTGTTGTTGGCTATCTTCTCTCTCATTTATGGCAATGGTATTCTTCCCATTGAAGCTAC
CCACCATGTTTACAGAAATCTTCAGACTCTATCTTCTGATTCCTCTGATCAACCTTATAGAACCGCTTACCATTTCCAAC
CTCCCAAAAATTGGATAAATGACCCTAATGGACCAATGAGGTACAAAGGACTTTATCATCTCTTCTACCAATACAATCCA
AAAGGTGCTGTTTGGGGTAATATTGTGTGGGCCCACTCAGTATCAAAGGATCTTGTGAATTGGACCCCTCTAGATCATGC
CATCTACCCTTCTCAACCGTCTGATATCAACGGTTGTTGGTCAGGCTCAGCCACAATACTTCCTGGGGGCAAACCAGCCA
TTTTATACACAGGAATTGACCCTAATAATCACCAAGTTCAAAACTTAGCCCTACCCAAAAACATGTCTGACCCATTACTT
AGGGAATGGGTTAAGTCACCCAAAAATCCACTAATGGCACCAACTAGTGCTAACATGATCAATTCAAGCTCATTTAGGGA
TCCTACCACTGCTTGGCTAGGCAAAGATGGGTACTGGAGGGTGCTGATTGGAAGCAAAATACACACTAGGGGTATGGCAA
TTTTGTACAAGAGCAAAAACTTTGTTAATTGGGTTCAAGCCAAACAACCCCTACATTCAGCTGAAGGCACTGGAATGTGG
GAGTGCCCTGATTTCTATCCAGTGCTGAATAATAAACCATCATCAACTATTGGTCTTGACACATCTGTGAATGGTGATAA
TGTTAGGCATGTGCTTAAGGTTAGTTTGGATGATAAAAAACATGATCATTATTTGATTGGGACTTATGACATTGCCAAGG
ATATCTTCACTCCGGATAATGGATTTGAGGATAGTCAAACTGTCCTAAGATATGACTATGGAAAATATTATGCCTCAAAA
ACCATTTTTGAGGATGGAAAGAACAGAAGGGTCTTATTGGGTTGGGTTAATGAATCCTCAAGTGTTTCGGATGATATCAA
GAAAGGATGGGCTGGAATCCATACTATTCCAAGGGCCATCTGGCTTCATAAATCTGGAAAACAGTTGGTGCAATGGCCGG
TGGTGGAACTTGAAAGCTTACGTGTGAATCCTGTCCACTGGCCCACCAAAGTGGTCAAAGGTGGTGAAATGCTTCAAGTT
ACTGGTGTTACTGCGGCACAGGCTGACGTTGAAATTTCATTTGACGTGAATGAGTTTGGAAAGGGCGAAGTATTGGACCA
ATGGGTGGATCCCCAAATTCTGTGTAGTAGAAAGGGTGCAGCCGTAAAGGGTGGTTTGGGACCCTTTGGCTTGCTAGTTT
TTGCTTCTCGTGGCTTGCAAGAGTACACGGCAGTATTCTTTAGAATATTCAGATACCAAAATAAAAATTTGGTTCTCATG
TGTAGCGACCAAAGCAGATCCTCTTTGAATAAAGATAACGATATGACCACCTATGGGACTTTTGTGGACATGGACCCTCT
TCATGAGAAGCTGTCACTAAGAACTTTGATTGATCACTCAGTAGTGGAGAGTTTTGGAGGAGAGGGAAGGGCTTGCATCA
CAGCCAGAGTATATCCCACAATAGCAATTAATGAAAAGGCACAACTATATGCTTTCAATAATGGAACTGCCGCCGTCAAG
ATTACAAGATTGAGTGCTTGGAGCATGGAGAAGGCAAAAATAAACTGA