Microexon ID Gm_18:6199289-6199299:+
Species Glycine max
Coordinates 18:6199289..6199299
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAGACTTCAAG
Microexon Amino Acid seq GDFK
Microexon-tag DNA Seq CAAGGATTCATAGATGACCCACATTTGTTTGCATTAAAAATAACGAGAGGAGACTTCAAGGTCAAAGAGATATACAACTATACACAGGATGACTTAATAACTGAAGAT
Microexon-tag Amino Acid Seq QGFIDDPHLFALKITRGDFKVKEIYNYTQDDLITED
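The fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database's own tooling: it assumes the standard genetic code, assumes the microexon-tag starts at a codon boundary, and assumes "Phase" follows the intron-phase convention (the number of codon bases contributed by the upstream exon). The helper names `translate` and `microexon_slice` are hypothetical.

```python
# Values copied from the record above.
COORD_START, COORD_END = 6199289, 6199299  # Coordinates 18:6199289..6199299
LEFT, MICRO, RIGHT = 49, 11, 48            # flanking exon, microexon, flanking exon sizes
TAG_DNA = "CAAGGATTCATAGATGACCCACATTTGTTTGCATTAAAAATAACGAGAGGAGACTTCAAGGTCAAAGAGATATACAACTATACACAGGATGACTTAATAACTGAAGAT"
MICROEXON_DNA = "GAGACTTCAAG"
TAG_PROTEIN = "QGFIDDPHLFALKITRGDFKVKEIYNYTQDDLITED"

# Standard genetic code (assumption), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def microexon_slice(tag: str, left: int, size: int) -> str:
    """Return the microexon portion of a tag given the upstream exon size."""
    return tag[left:left + size]


# Size implied by the inclusive genomic coordinates.
size_from_coords = COORD_END - COORD_START + 1

# Codon bases contributed by the upstream exon (assumes the tag starts in frame 0).
phase = LEFT % 3

# Codons that overlap the microexon, including the one split by the upstream junction.
first_codon = LEFT // 3
last_codon = (LEFT + MICRO - 1) // 3
microexon_peptide = translate(TAG_DNA)[first_codon:last_codon + 1]

print("size from coordinates:", size_from_coords)
print("microexon slice:", microexon_slice(TAG_DNA, LEFT, MICRO))
print("phase:", phase)
print("tag translation:", translate(TAG_DNA))
print("microexon peptide:", microexon_peptide)
```

Under these assumptions, the sliced microexon, the translated tag, and the overlapping peptide can be compared directly with the Microexon DNA seq, Microexon-tag Amino Acid Seq, and Microexon Amino Acid seq fields.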
Microexon-tag spanning region 6199105-6199772
Microexon-tag prediction score 0.9082
Overlapped with the annotated transcript (%) 100
New Transcript ID KRG98341x
Reference Transcript ID KRG98341
Gene ID GLYMA_18G067000
Gene Name NA
Transcript ID KRG98341
Protein ID KRG98341
Gene ID GLYMA_18G067000
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRG98341
MPTATKDMDFAFQTAGANPGLEVWCIENQRLVSVSKSSHGKFYTGSAYLVLNAVFPKIGPPQYDIHYWLGNEAKKVDSSL
ASDKALELDAALGSCSVQYREIQGQESQKFLSYFRPCLIPIEGVFTSKQGNLNGEYHVSLYTCKGDYVVYVKEVPFLRSS
LNHEDVFILDTALKIFLFSGCNSTIQERAKALEVVQYIKENKHGGKCEVATIEDGKFVGDSDVGEFWSLFGGYAPIPRDS
PSVQESEAPPVKLFWINLQGKLCETGSNAFSKEMLETDKCYMLDCDGEIFVWMGRQTLLTERRTTIRAVEEFVRNEGRSN
KTHLTFLSEGLESTIFRSYFTNWPKTVEPRLYEEGKEKVAAIFKHQGYEVKELPEEDNEPSIDCSGTIKVWRVDGDELSL
LSVAELTKLYSGDCYIVQYTFLGNGRDETLFYAWLGSKCVMEDKAAAISHMSTMADSIRTNPVMAQIHEGKEPAQFFSIL
QRLIILKGGNSSGYRKFIEEKGIVDETYNENLVALFRVQGTSPDNMQAIQVDQVSTSLNSSYCYILQSKASIYTWIGSLS
SARDHNLLDRMVELSNPTWLPVSVREGNEPDIFWDALSGKAEYPKGKEIQGFIDDPHLFALKITRGDFKVKEIYNYTQDD
LITEDVLLLDCQREIYVWVGLHSAVKSKQEALNLGLKFLEMDVLVEGLSLNIPIYIVTEGHEPPFFTRFFSWDHSKENIF
GNSFERKLAILKGKPKSLEGHNRTPLKANSRPSTPDGHGSISVFSNGRGRSSSPIPSSAGSDLRQSGDRSLSSSTPVVKK
LFEGSPSQSSAGKTMPQSGSPATELSSSDETASFPQKDRNVDGENTAIYPYERLRVVSANPVTGIDLTKREVYLSNEEFR
EKFGMPKSAFYKLPRWKQNKLKMSLDLF*
CDS seq >KRG98341
ATGCCTACTGCCACTAAAGATATGGATTTCGCATTCCAAACTGCAGGAGCAAACCCAGGCTTAGAAGTTTGGTGTATTGA
GAACCAGCGGTTGGTTTCGGTGTCAAAGTCAAGCCATGGAAAATTCTATACTGGAAGTGCATACTTAGTCTTGAATGCAG
TCTTTCCAAAAATTGGCCCTCCTCAGTATGACATACATTACTGGTTGGGAAATGAAGCCAAGAAGGTAGACTCAAGCTTG
GCATCAGACAAGGCACTTGAACTGGATGCAGCCTTAGGATCGTGTAGTGTTCAATACAGGGAAATTCAAGGCCAAGAATC
GCAGAAGTTTCTGTCATACTTCAGACCTTGTCTTATACCCATTGAAGGAGTGTTTACTTCAAAGCAGGGGAACCTGAATG
GTGAATACCATGTCAGCCTGTATACTTGCAAGGGAGACTATGTTGTTTACGTGAAAGAAGTGCCATTTCTGAGGTCATCG
TTGAATCATGAAGATGTATTCATTCTTGACACTGCCTTAAAAATCTTCCTCTTCAGTGGGTGCAACTCTACCATTCAAGA
AAGAGCCAAAGCTTTGGAGGTTGTTCAGTATATCAAGGAGAATAAGCATGGTGGAAAATGCGAAGTGGCAACAATAGAGG
ATGGAAAATTTGTTGGTGATTCTGATGTGGGTGAATTCTGGAGTTTATTTGGTGGTTATGCTCCCATTCCTCGAGATTCG
CCTTCTGTTCAGGAATCTGAGGCTCCTCCTGTAAAGCTATTTTGGATAAATTTACAGGGAAAACTTTGTGAAACTGGAAG
CAATGCTTTCAGCAAAGAAATGCTTGAGACAGACAAGTGTTATATGTTGGACTGTGATGGTGAGATTTTTGTCTGGATGG
GAAGGCAGACTTTATTGACAGAAAGAAGAACAACAATCAGAGCTGTAGAAGAATTTGTCAGAAATGAAGGCAGATCAAAC
AAGACTCATTTGACATTTTTATCAGAAGGATTGGAAAGTACCATCTTTCGGTCTTACTTTACTAATTGGCCTAAAACAGT
GGAGCCTAGGCTTTATGAGGAAGGCAAAGAAAAAGTGGCAGCCATATTCAAGCACCAGGGTTATGAGGTGAAAGAGCTTC
CTGAAGAAGACAATGAGCCATCTATAGATTGCAGTGGCACAATAAAAGTTTGGCGGGTGGATGGTGATGAATTGTCCCTT
CTTTCAGTTGCAGAACTGACAAAGCTTTACAGTGGAGATTGCTATATAGTACAGTATACATTTCTGGGAAATGGAAGGGA
TGAGACACTATTTTATGCTTGGCTTGGCTCCAAATGTGTAATGGAGGATAAAGCAGCTGCCATTTCCCACATGAGTACTA
TGGCCGATTCAATCAGAACTAATCCTGTTATGGCTCAAATCCATGAGGGTAAGGAACCAGCTCAGTTTTTCTCAATACTT
CAGAGATTAATCATATTGAAGGGGGGAAACAGTTCAGGATATAGGAAGTTTATAGAAGAAAAAGGTATAGTGGATGAAAC
ATACAATGAAAACCTGGTTGCTTTGTTTCGGGTACAAGGTACAAGTCCAGATAATATGCAGGCCATCCAAGTTGATCAAG
TTTCAACCTCCCTGAATTCATCCTATTGCTACATTCTGCAAAGTAAAGCATCTATCTATACTTGGATTGGGAGCCTATCT
TCAGCTAGAGACCATAATCTCCTTGATAGAATGGTGGAACTATCTAATCCAACATGGCTACCTGTTTCTGTGAGGGAAGG
GAATGAGCCTGATATTTTCTGGGATGCTCTCAGTGGAAAAGCAGAGTATCCAAAGGGCAAAGAAATTCAAGGATTCATAG
ATGACCCACATTTGTTTGCATTAAAAATAACGAGAGGAGACTTCAAGGTCAAAGAGATATACAACTATACACAGGATGAC
TTAATAACTGAAGATGTTTTATTGCTTGATTGCCAAAGAGAGATTTATGTGTGGGTTGGTTTACATTCGGCTGTCAAATC
AAAACAAGAAGCTCTTAATCTTGGCCTGAAATTTCTGGAAATGGATGTCCTTGTTGAAGGCCTATCCCTGAACATCCCTA
TTTATATTGTAACGGAAGGTCATGAGCCACCTTTTTTCACTCGTTTCTTTTCATGGGATCACTCAAAAGAAAATATTTTT
GGTAACTCATTCGAGAGAAAGCTTGCAATTCTGAAAGGAAAACCAAAATCTCTAGAGGGACATAATAGAACCCCATTGAA
AGCAAACTCCAGGCCTTCTACTCCTGATGGTCACGGAAGCATTTCTGTTTTCTCCAATGGACGTGGAAGAAGTAGTTCAC
CTATACCTAGTAGTGCAGGCTCAGATCTTAGGCAATCAGGTGATAGGAGTCTTTCTAGCTCTACTCCAGTTGTCAAGAAG
CTCTTCGAAGGATCTCCTTCCCAGAGTAGTGCTGGAAAAACAATGCCACAGTCTGGTTCTCCAGCAACAGAACTGAGCTC
ATCTGATGAGACCGCGAGTTTCCCTCAGAAGGATAGAAATGTGGATGGTGAGAATACGGCAATATACCCCTATGAGCGCC
TGAGAGTGGTTTCTGCTAATCCAGTAACTGGCATCGATTTGACAAAAAGAGAGGTATATTTATCTAATGAAGAGTTCCGC
GAGAAGTTTGGAATGCCAAAATCTGCTTTCTATAAGCTTCCTAGATGGAAACAAAACAAACTGAAGATGTCTCTGGATCT
ATTTTAG
Microexon DNA seq GAGACTTCAAG
Microexon Amino Acid seq GDFK
Microexon-tag DNA Seq CAAGGATTCATAGATGACCCACATTTGTTTGCATTAAAAATAACGAGAGGAGACTTCAAGGTCAAAGAGATATACAACTATACACAGGATGACTTAATAACTGAAGAT
Microexon-tag Amino Acid seq QGFIDDPHLFALKITRGDFKVKEIYNYTQDDLITED
Transcript ID Gm.24641.1
Gene ID Gm.24641
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Gm.24641.1
MPTATKDMDFAFQTAGANPGLEVWCIENQRLVSVSKSSHGKFYTGSAYLVLNAVFPKIGPPQYDIHYWLGNEAKKVDSSL
ASDKALELDAALGSCSVQYREIQGQESQKFLSYFRPCLIPIEGVFTSKQGNLNGEYHVSLYTCKGDYVVYVKEVPFLRSS
LNHEDVFILDTALKIFLFSGCNSTIQERAKALEVVQYIKENKHGGKCEVATIEDGKFVGDSDVGEFWSLFGGYAPIPRDS
PSVQESEAPPVKLFWINLQGKLCETGSNAFSKEMLETDKCYMLDCDGEIFVWMGRQTLLTERRTTIRAVEEFVRNEGRSN
KTHLTFLSEGLESTIFRSYFTNWPKTVEPRLYEEGKEKVAAIFKHQGYEVKELPEEDNEPSIDCSGTIKVWRVDGDELSL
LSVAELTKLYSGDCYIVQYTFLGNGRDETLFYAWLGSKCVMEDKAAAISHMSTMADSIRTNPVMAQIHEGKEPAQFFSIL
QRLIILKGGNSSGYRKFIEEKGIVDETYNENLVALFRVQGTSPDNMQAIQVDQVSTSLNSSYCYILQSKASIYTWIGSLS
SARDHNLLDRMVELSNPTWLPVSVREGNEPDIFWDALSGKAEYPKGKEIQGFIDDPHLFALKITRGDFKVKEIYNYTQDD
LITEDVLLLDCQREIYVWVGLHSAVKSKQEALNLGLKFLEMDVLVEGLSLNIPIYIVTEGHEPPFFTRFFSWDHSKENIF
GNSFERKLAILKGKPKSLEGHNRTPLKANSRPSTPDGHGSISVFSNGRGRSSSPIPSSAGSDLRQSGDRSLSSSTPVVKK
LFEGSPSQSSAGKTMPQSGSPATELSSSDETASFPQKDRNVDGENTAIYPYERLRVVSANPVTGIDLTKREVYLSNEEFR
EKFGMPKSAFYKLPRWKQNKLKMSLDLF*
CDS seq >Gm.24641.1
ATGCCTACTGCCACTAAAGATATGGATTTCGCATTCCAAACTGCAGGAGCAAACCCAGGCTTAGAAGTTTGGTGTATTGA
GAACCAGCGGTTGGTTTCGGTGTCAAAGTCAAGCCATGGAAAATTCTATACTGGAAGTGCATACTTAGTCTTGAATGCAG
TCTTTCCAAAAATTGGCCCTCCTCAGTATGACATACATTACTGGTTGGGAAATGAAGCCAAGAAGGTAGACTCAAGCTTG
GCATCAGACAAGGCACTTGAACTGGATGCAGCCTTAGGATCGTGTAGTGTTCAATACAGGGAAATTCAAGGCCAAGAATC
GCAGAAGTTTCTGTCATACTTCAGACCTTGTCTTATACCCATTGAAGGAGTGTTTACTTCAAAGCAGGGGAACCTGAATG
GTGAATACCATGTCAGCCTGTATACTTGCAAGGGAGACTATGTTGTTTACGTGAAAGAAGTGCCATTTCTGAGGTCATCG
TTGAATCATGAAGATGTATTCATTCTTGACACTGCCTTAAAAATCTTCCTCTTCAGTGGGTGCAACTCTACCATTCAAGA
AAGAGCCAAAGCTTTGGAGGTTGTTCAGTATATCAAGGAGAATAAGCATGGTGGAAAATGCGAAGTGGCAACAATAGAGG
ATGGAAAATTTGTTGGTGATTCTGATGTGGGTGAATTCTGGAGTTTATTTGGTGGTTATGCTCCCATTCCTCGAGATTCG
CCTTCTGTTCAGGAATCTGAGGCTCCTCCTGTAAAGCTATTTTGGATAAATTTACAGGGAAAACTTTGTGAAACTGGAAG
CAATGCTTTCAGCAAAGAAATGCTTGAGACAGACAAGTGTTATATGTTGGACTGTGATGGTGAGATTTTTGTCTGGATGG
GAAGGCAGACTTTATTGACAGAAAGAAGAACAACAATCAGAGCTGTAGAAGAATTTGTCAGAAATGAAGGCAGATCAAAC
AAGACTCATTTGACATTTTTATCAGAAGGATTGGAAAGTACCATCTTTCGGTCTTACTTTACTAATTGGCCTAAAACAGT
GGAGCCTAGGCTTTATGAGGAAGGCAAAGAAAAAGTGGCAGCCATATTCAAGCACCAGGGTTATGAGGTGAAAGAGCTTC
CTGAAGAAGACAATGAGCCATCTATAGATTGCAGTGGCACAATAAAAGTTTGGCGGGTGGATGGTGATGAATTGTCCCTT
CTTTCAGTTGCAGAACTGACAAAGCTTTACAGTGGAGATTGCTATATAGTACAGTATACATTTCTGGGAAATGGAAGGGA
TGAGACACTATTTTATGCTTGGCTTGGCTCCAAATGTGTAATGGAGGATAAAGCAGCTGCCATTTCCCACATGAGTACTA
TGGCCGATTCAATCAGAACTAATCCTGTTATGGCTCAAATCCATGAGGGTAAGGAACCAGCTCAGTTTTTCTCAATACTT
CAGAGATTAATCATATTGAAGGGGGGAAACAGTTCAGGATATAGGAAGTTTATAGAAGAAAAAGGTATAGTGGATGAAAC
ATACAATGAAAACCTGGTTGCTTTGTTTCGGGTACAAGGTACAAGTCCAGATAATATGCAGGCCATCCAAGTTGATCAAG
TTTCAACCTCCCTGAATTCATCCTATTGCTACATTCTGCAAAGTAAAGCATCTATCTATACTTGGATTGGGAGCCTATCT
TCAGCTAGAGACCATAATCTCCTTGATAGAATGGTGGAACTATCTAATCCAACATGGCTACCTGTTTCTGTGAGGGAAGG
GAATGAGCCTGATATTTTCTGGGATGCTCTCAGTGGAAAAGCAGAGTATCCAAAGGGCAAAGAAATTCAAGGATTCATAG
ATGACCCACATTTGTTTGCATTAAAAATAACGAGAGGAGACTTCAAGGTCAAAGAGATATACAACTATACACAGGATGAC
TTAATAACTGAAGATGTTTTATTGCTTGATTGCCAAAGAGAGATTTATGTGTGGGTTGGTTTACATTCGGCTGTCAAATC
AAAACAAGAAGCTCTTAATCTTGGCCTGAAATTTCTGGAAATGGATGTCCTTGTTGAAGGCCTATCCCTGAACATCCCTA
TTTATATTGTAACGGAAGGTCATGAGCCACCTTTTTTCACTCGTTTCTTTTCATGGGATCACTCAAAAGAAAATATTTTT
GGTAACTCATTCGAGAGAAAGCTTGCAATTCTGAAAGGAAAACCAAAATCTCTAGAGGGACATAATAGAACCCCATTGAA
AGCAAACTCCAGGCCTTCTACTCCTGATGGTCACGGAAGCATTTCTGTTTTCTCCAATGGACGTGGAAGAAGTAGTTCAC
CTATACCTAGTAGTGCAGGCTCAGATCTTAGGCAATCAGGTGATAGGAGTCTTTCTAGCTCTACTCCAGTTGTCAAGAAG
CTCTTCGAAGGATCTCCTTCCCAGAGTAGTGCTGGAAAAACAATGCCACAGTCTGGTTCTCCAGCAACAGAACTGAGCTC
ATCTGATGAGACCGCGAGTTTCCCTCAGAAGGATAGAAATGTGGATGGTGAGAATACGGCAATATACCCCTATGAGCGCC
TGAGAGTGGTTTCTGCTAATCCAGTAACTGGCATCGATTTGACAAAAAGAGAGGTATATTTATCTAATGAAGAGTTCCGC
GAGAAGTTTGGAATGCCAAAATCTGCTTTCTATAAGCTTCCTAGATGGAAACAAAACAAACTGAAGATGTCTCTGGATCT
ATTTTAG