Microexon ID Gm_10:17444769-17444779:+
Species Glycine max
Coordinates 10:17444769..17444779
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Microexon DNA seq GAAAGTTCAAT
Microexon Amino Acid seq GKFN
Microexon-tag DNA Seq AATGAGGTTGTCAGAGATCCACATTTGTTCACTTTATCATTTAACAAAGGAAAGTTCAATGTAGAGGAGGTTTACAACTTCTCTCAGGACGACCTGTTGCCGGAGGAT
Microexon-tag Amino Acid Seq NEVVRDPHLFTLSFNKGKFNVEEVYNFSQDDLLPED
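The structure (49,11,48), location (2), and phase fields can be cross-checked against the sequences listed above. The following is a minimal Python sketch, assuming the three sizes partition the 108-nt microexon-tag in order and that the tag is translated from its first base with the standard genetic code. The helper names `split_tag` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Values copied from the record above.
TAG_DNA = ("AATGAGGTTGTCAGAGATCCACATTTGTTCACTTTATCATTTAACAAAG"
           "GAAAGTTCAAT"
           "GTAGAGGAGGTTTACAACTTCTCTCAGGACGACCTGTTGCCGGAGGAT")
TAG_SIZES = (49, 11, 48)          # flanking exon, microexon, flanking exon
MICROEXON_DNA = "GAAAGTTCAAT"
TAG_PROTEIN = "NEVVRDPHLFTLSFNKGKFNVEEVYNFSQDDLLPED"

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, start = [], 0
    for size in sizes:
        parts.append(tag[start:start + size])
        start += size
    return parts


def translate(dna):
    """Translate complete codons from the first base; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


upstream, microexon, downstream = split_tag(TAG_DNA, TAG_SIZES)
tag_protein = translate(TAG_DNA)

print(microexon == MICROEXON_DNA)   # True: element 2 of the tag is the microexon
print(tag_protein == TAG_PROTEIN)   # True
print(tag_protein[16:20])           # GKFN: the codons overlapping the microexon
print(len(upstream) % 3)            # 1: one base of a split codon precedes the microexon
```

The last value agrees with the Phase 1 field if phase is read as the number of bases of the split codon contributed by the upstream exon. The record does not say which phase convention it uses, so that reading is an assumption.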
Microexon-tag spanning region 17444599-17444941
Microexon-tag prediction score 0.9564
Overlapped with the annotated transcript (%) 100
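The coordinate fields can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates on the + strand. It also assumes, which the record does not state, that the spanning region begins at the first base of the upstream flanking exon and ends at the last base of the downstream flanking exon. The intron lengths it derives hold only under that reading.

```python
def span_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


# Values copied from the record above.
microexon = (17444769, 17444779)
tag_span = (17444599, 17444941)
upstream_size, microexon_size, downstream_size = 49, 11, 48

# The microexon coordinates reproduce the Size field.
assert span_length(*microexon) == microexon_size

# Assumed flanking-exon boundaries (hypothetical, see the note above).
upstream = (tag_span[0], tag_span[0] + upstream_size - 1)
downstream = (tag_span[1] - downstream_size + 1, tag_span[1])

# Intron lengths implied by those boundaries.
intron1 = microexon[0] - upstream[1] - 1
intron2 = downstream[0] - microexon[1] - 1

exonic = upstream_size + microexon_size + downstream_size
print(span_length(*tag_span))        # 343
print(exonic)                        # 108
print(intron1, intron2)              # 121 114 (under the stated assumption)
```

The two intron lengths plus the 108 exonic bases add up to the 343-bp spanning region, so the three size fields and the two coordinate fields are mutually consistent under this interpretation.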
New Transcript ID KRH33093x
Reference Transcript ID KRH33093
Gene ID GLYMA_10G099200
Gene Name NA
Transcript ID KRH33093
Protein ID KRH33093
Gene ID GLYMA_10G099200
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 1.3e-05
Motif start 635
Motif end 713
Protein seq >KRH33093
MSSSAKVLDPAFQGVGQRVGTEIWRIENFQPVALPKSEYGKFYTGDSYIILQTTQGKGGTYFYDLHFWIGKDTSQDEAGT
AAIKTVELDAALGGRAVQHREIQGHESDKFLSYFKPCIIPLEGGVASGFKKPEEEEFETRLYVCRGKRVVRLRQVPFARS
SLNHEDVFILDTENKIYQFNGANSNIQERAKALEVIQFLKEKYHEGKCDVAIVDDGKLDTESDSGEFWVLFGGFAPIGKK
VISEDDIIPETIPAQLYSIVDVEIKPVEGELSKSLLENNKCYLLDCGAEVFVWVGRVTQVEERKSACQAVEEFVASQNRP
KSTRITRIIQGYEPHSFKSNFDSWPSGSASTSAEEGRGKVAALLKQQGMGVKGMTKSTPVNEEIPPLLEGGGKIEVWRIN
GNAKNALPKEEIGKFYSGDCYIVLYTYHSGERKEDYFLCCWFGKDSVEEDQTTATRLANTMSTSLKGRPVQGRIFEGKEP
PQFVAIFQPMVVLKGGFSSGYKKLIADKGVSDETYTAESIALIRISGTSIYNNKSVQVDAVPSSLNSTECFVLQSGSTIF
TWHGNQCSFEQQQLAAKVADFLRPGATLKHAKEGTESSAFWSALGGKQSYTSKKVVNEVVRDPHLFTLSFNKGKFNVEEV
YNFSQDDLLPEDILILDTHAEVFIWIGHSVEPKEKRNAFEIGQKYIDLVASLEGLSPHVPLYKVTEGNEPCFFTTYFSWD
HAKAMVMGNSFQKKVSLLFGLGHAVEEKLNGSSPGGPRQRAEALAALSNAFGSSSEKASGLAQDRLNGLGQGGPRQRAEA
LAALNSAFNSSSGTKTFTPRPSGRGQGSQRAAAVAALSQVLMAEKKKSPDGSPVASRSPITEGSATETKSDSSEVEEVAE
AKETEELPPETGSNGDLELKQENAEEGNDGQRMFSYEQLKTKSGHNVPGVDLKRREAYLSEDEFNTVFGMAKEAFYKLPR
WKQDMLKKKYELF*
CDS seq >KRH33093
ATGTCTAGCTCAGCAAAAGTTTTGGATCCTGCATTCCAGGGAGTTGGTCAAAGAGTAGGAACTGAAATATGGAGGATTGA
GAATTTTCAGCCAGTTGCATTGCCCAAATCTGAGTATGGCAAATTCTACACGGGAGATTCGTACATCATCTTGCAGACAA
CTCAAGGCAAAGGAGGCACTTATTTTTATGATTTACACTTTTGGATTGGAAAGGATACAAGTCAGGATGAGGCTGGAACT
GCAGCCATTAAAACTGTTGAACTTGATGCTGCTCTTGGAGGGCGTGCAGTGCAGCACAGGGAAATCCAAGGACATGAGTC
CGACAAGTTTTTGTCATACTTTAAACCATGTATTATACCATTAGAGGGTGGAGTTGCATCTGGGTTTAAAAAACCTGAAG
AAGAGGAGTTTGAAACACGTTTGTATGTATGCAGAGGGAAAAGAGTTGTAAGATTGAGACAGGTCCCTTTTGCAAGGTCT
TCGTTGAACCATGAAGATGTATTCATACTAGATACTGAGAACAAGATTTATCAATTCAATGGTGCAAATTCCAATATTCA
GGAAAGAGCCAAGGCTTTGGAAGTCATACAGTTTCTGAAAGAAAAATACCATGAAGGGAAATGTGATGTTGCAATTGTTG
ACGATGGAAAATTAGACACTGAGTCAGACTCAGGTGAATTTTGGGTCCTCTTTGGTGGTTTTGCTCCCATTGGGAAGAAG
GTAATCAGTGAGGATGATATTATTCCAGAGACAATTCCTGCTCAGCTTTACAGTATTGTTGATGTTGAGATCAAGCCTGT
GGAAGGTGAACTTTCTAAATCACTGTTGGAAAACAACAAATGCTATTTACTGGACTGTGGTGCTGAGGTGTTTGTCTGGG
TTGGTCGTGTGACACAAGTTGAAGAACGAAAATCAGCCTGCCAAGCCGTTGAGGAGTTTGTTGCAAGCCAAAATAGGCCA
AAGTCTACAAGGATAACCCGGATTATTCAAGGTTATGAGCCACATTCATTTAAGTCCAACTTTGATTCTTGGCCATCAGG
ATCTGCTAGTACCAGTGCTGAGGAAGGAAGAGGAAAAGTTGCAGCATTGCTTAAGCAACAAGGCATGGGTGTCAAAGGAA
TGACAAAAAGTACCCCTGTAAATGAGGAAATTCCACCTTTGCTTGAAGGAGGTGGAAAGATAGAGGTATGGCGAATCAAT
GGAAATGCCAAGAATGCATTGCCAAAGGAGGAGATCGGTAAATTTTATAGTGGAGATTGTTACATTGTACTGTACACCTA
CCACTCTGGTGAGCGGAAAGAAGACTACTTCTTGTGCTGTTGGTTTGGCAAAGACAGTGTTGAGGAGGACCAAACAACGG
CTACTAGGTTGGCCAATACAATGTCTACCTCATTAAAGGGTAGACCTGTACAGGGTCGCATATTTGAAGGCAAAGAGCCG
CCACAGTTTGTTGCTATTTTCCAACCAATGGTGGTTCTCAAGGGAGGGTTCAGCTCTGGATACAAGAAACTAATAGCAGA
CAAAGGAGTATCAGATGAGACATACACAGCAGAGAGTATTGCACTTATTCGAATTTCTGGAACTTCTATTTACAACAATA
AATCAGTACAAGTTGATGCAGTGCCATCATCATTGAATTCCACTGAGTGTTTTGTCCTGCAATCTGGCTCTACAATTTTC
ACTTGGCATGGAAATCAGTGTTCCTTTGAGCAGCAACAGCTAGCAGCAAAGGTTGCTGATTTTTTACGGCCAGGAGCTAC
TTTAAAGCATGCTAAAGAAGGAACAGAAAGCTCAGCTTTCTGGTCTGCACTAGGAGGAAAACAAAGTTACACCAGCAAGA
AAGTTGTTAATGAGGTTGTCAGAGATCCACATTTGTTCACTTTATCATTTAACAAAGGAAAGTTCAATGTAGAGGAGGTT
TACAACTTCTCTCAGGACGACCTGTTGCCGGAGGATATCCTTATACTTGACACACATGCAGAAGTGTTTATTTGGATTGG
TCATTCTGTGGAACCCAAAGAAAAGCGAAATGCTTTTGAAATTGGCCAGAAATACATAGATTTGGTTGCATCTCTGGAGG
GGCTATCTCCACATGTACCACTATATAAAGTAACAGAAGGGAATGAACCTTGCTTTTTCACAACATACTTTTCATGGGAT
CATGCAAAAGCTATGGTTATGGGGAACTCATTTCAGAAAAAGGTGTCACTACTCTTTGGACTTGGCCATGCTGTGGAGGA
AAAGTTGAATGGGTCAAGTCCAGGGGGACCAAGACAAAGAGCAGAAGCTTTGGCTGCCTTATCTAATGCATTTGGTTCAT
CTTCTGAGAAAGCATCCGGTTTGGCACAAGATAGATTGAATGGGTTAGGCCAAGGGGGACCAAGGCAAAGGGCAGAAGCT
TTAGCCGCTTTAAACTCTGCATTTAATTCATCATCTGGGACGAAGACTTTTACTCCTAGGCCATCTGGAAGAGGTCAAGG
ATCACAAAGAGCTGCAGCAGTAGCTGCTCTTTCACAAGTTCTTATGGCTGAAAAGAAAAAATCACCGGATGGTTCTCCTG
TTGCTAGCAGGAGTCCTATCACTGAAGGTAGTGCTACTGAAACTAAAAGTGACTCCTCTGAAGTTGAAGAAGTTGCAGAA
GCCAAGGAAACAGAGGAACTTCCCCCTGAGACCGGTAGCAATGGGGATTTGGAACTAAAACAAGAGAATGCGGAGGAAGG
AAATGATGGTCAAAGGATGTTCAGTTATGAGCAATTAAAGACTAAATCTGGTCATAATGTGCCTGGAGTTGATCTTAAAC
GGAGAGAGGCCTATCTGTCAGAGGATGAGTTCAACACTGTATTTGGAATGGCAAAAGAAGCATTTTACAAGTTGCCAAGA
TGGAAGCAAGACATGCTGAAAAAGAAATACGAATTGTTCTAA
Microexon DNA seq GAAAGTTCAAT
Microexon Amino Acid seq GKFN
Microexon-tag DNA Seq AATGAGGTTGTCAGAGATCCACATTTGTTCACTTTATCATTTAACAAAGGAAAGTTCAATGTAGAGGAGGTTTACAACTTCTCTCAGGACGACCTGTTGCCGGAGGAT
Microexon-tag Amino Acid seq NEVVRDPHLFTLSFNKGKFNVEEVYNFSQDDLLPED
Transcript ID Gm.3336.2
Gene ID Gm.3336
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 1.3e-05
Motif start 635
Motif end 713
Protein seq >Gm.3336.2
MSSSAKVLDPAFQGVGQRVGTEIWRIENFQPVALPKSEYGKFYTGDSYIILQTTQGKGGTYFYDLHFWIGKDTSQDEAGT
AAIKTVELDAALGGRAVQHREIQGHESDKFLSYFKPCIIPLEGGVASGFKKPEEEEFETRLYVCRGKRVVRLRQVPFARS
SLNHEDVFILDTENKIYQFNGANSNIQERAKALEVIQFLKEKYHEGKCDVAIVDDGKLDTESDSGEFWVLFGGFAPIGKK
VISEDDIIPETIPAQLYSIVDVEIKPVEGELSKSLLENNKCYLLDCGAEVFVWVGRVTQVEERKSACQAVEEFVASQNRP
KSTRITRIIQGYEPHSFKSNFDSWPSGSASTSAEEGRGKVAALLKQQGMGVKGMTKSTPVNEEIPPLLEGGGKIEVWRIN
GNAKNALPKEEIGKFYSGDCYIVLYTYHSGERKEDYFLCCWFGKDSVEEDQTTATRLANTMSTSLKGRPVQGRIFEGKEP
PQFVAIFQPMVVLKGGFSSGYKKLIADKGVSDETYTAESIALIRISGTSIYNNKSVQVDAVPSSLNSTECFVLQSGSTIF
TWHGNQCSFEQQQLAAKVADFLRPGATLKHAKEGTESSAFWSALGGKQSYTSKKVVNEVVRDPHLFTLSFNKGKFNVEEV
YNFSQDDLLPEDILILDTHAEVFIWIGHSVEPKEKRNAFEIGQKYIDLVASLEGLSPHVPLYKVTEGNEPCFFTTYFSWD
HAKAMVMGNSFQKKVSLLFGLGHAVEEKLNGSSPGGPRQRAEALAALSNAFGSSSEKASGLAQDRLNGLGQGGPRQRAEA
LAALNSAFNSSSGTKTFTPRPSGRGQGSQRAAAVAALSQVLMAEKKKSPDGSPVASRSPITEETKSDSSEVEEVAEAKET
EELPPETGSNGDLELKQENAEEGNDGQRMFSYEQLKTKSGHNVPGVDLKRREAYLSEDEFNTVFGMAKEAFYKLPRWKQD
MLKKKYELF*
CDS seq >Gm.3336.2
ATGTCTAGCTCAGCAAAAGTTTTGGATCCTGCATTCCAGGGAGTTGGTCAAAGAGTAGGAACTGAAATATGGAGGATTGA
GAATTTTCAGCCAGTTGCATTGCCCAAATCTGAGTATGGCAAATTCTACACGGGAGATTCGTACATCATCTTGCAGACAA
CTCAAGGCAAAGGAGGCACTTATTTTTATGATTTACACTTTTGGATTGGAAAGGATACAAGTCAGGATGAGGCTGGAACT
GCAGCCATTAAAACTGTTGAACTTGATGCTGCTCTTGGAGGGCGTGCAGTGCAGCACAGGGAAATCCAAGGACATGAGTC
CGACAAGTTTTTGTCATACTTTAAACCATGTATTATACCATTAGAGGGTGGAGTTGCATCTGGGTTTAAAAAACCTGAAG
AAGAGGAGTTTGAAACACGTTTGTATGTATGCAGAGGGAAAAGAGTTGTAAGATTGAGACAGGTCCCTTTTGCAAGGTCT
TCGTTGAACCATGAAGATGTATTCATACTAGATACTGAGAACAAGATTTATCAATTCAATGGTGCAAATTCCAATATTCA
GGAAAGAGCCAAGGCTTTGGAAGTCATACAGTTTCTGAAAGAAAAATACCATGAAGGGAAATGTGATGTTGCAATTGTTG
ACGATGGAAAATTAGACACTGAGTCAGACTCAGGTGAATTTTGGGTCCTCTTTGGTGGTTTTGCTCCCATTGGGAAGAAG
GTAATCAGTGAGGATGATATTATTCCAGAGACAATTCCTGCTCAGCTTTACAGTATTGTTGATGTTGAGATCAAGCCTGT
GGAAGGTGAACTTTCTAAATCACTGTTGGAAAACAACAAATGCTATTTACTGGACTGTGGTGCTGAGGTGTTTGTCTGGG
TTGGTCGTGTGACACAAGTTGAAGAACGAAAATCAGCCTGCCAAGCCGTTGAGGAGTTTGTTGCAAGCCAAAATAGGCCA
AAGTCTACAAGGATAACCCGGATTATTCAAGGTTATGAGCCACATTCATTTAAGTCCAACTTTGATTCTTGGCCATCAGG
ATCTGCTAGTACCAGTGCTGAGGAAGGAAGAGGAAAAGTTGCAGCATTGCTTAAGCAACAAGGCATGGGTGTCAAAGGAA
TGACAAAAAGTACCCCTGTAAATGAGGAAATTCCACCTTTGCTTGAAGGAGGTGGAAAGATAGAGGTATGGCGAATCAAT
GGAAATGCCAAGAATGCATTGCCAAAGGAGGAGATCGGTAAATTTTATAGTGGAGATTGTTACATTGTACTGTACACCTA
CCACTCTGGTGAGCGGAAAGAAGACTACTTCTTGTGCTGTTGGTTTGGCAAAGACAGTGTTGAGGAGGACCAAACAACGG
CTACTAGGTTGGCCAATACAATGTCTACCTCATTAAAGGGTAGACCTGTACAGGGTCGCATATTTGAAGGCAAAGAGCCG
CCACAGTTTGTTGCTATTTTCCAACCAATGGTGGTTCTCAAGGGAGGGTTCAGCTCTGGATACAAGAAACTAATAGCAGA
CAAAGGAGTATCAGATGAGACATACACAGCAGAGAGTATTGCACTTATTCGAATTTCTGGAACTTCTATTTACAACAATA
AATCAGTACAAGTTGATGCAGTGCCATCATCATTGAATTCCACTGAGTGTTTTGTCCTGCAATCTGGCTCTACAATTTTC
ACTTGGCATGGAAATCAGTGTTCCTTTGAGCAGCAACAGCTAGCAGCAAAGGTTGCTGATTTTTTACGGCCAGGAGCTAC
TTTAAAGCATGCTAAAGAAGGAACAGAAAGCTCAGCTTTCTGGTCTGCACTAGGAGGAAAACAAAGTTACACCAGCAAGA
AAGTTGTTAATGAGGTTGTCAGAGATCCACATTTGTTCACTTTATCATTTAACAAAGGAAAGTTCAATGTAGAGGAGGTT
TACAACTTCTCTCAGGACGACCTGTTGCCGGAGGATATCCTTATACTTGACACACATGCAGAAGTGTTTATTTGGATTGG
TCATTCTGTGGAACCCAAAGAAAAGCGAAATGCTTTTGAAATTGGCCAGAAATACATAGATTTGGTTGCATCTCTGGAGG
GGCTATCTCCACATGTACCACTATATAAAGTAACAGAAGGGAATGAACCTTGCTTTTTCACAACATACTTTTCATGGGAT
CATGCAAAAGCTATGGTTATGGGGAACTCATTTCAGAAAAAGGTGTCACTACTCTTTGGACTTGGCCATGCTGTGGAGGA
AAAGTTGAATGGGTCAAGTCCAGGGGGACCAAGACAAAGAGCAGAAGCTTTGGCTGCCTTATCTAATGCATTTGGTTCAT
CTTCTGAGAAAGCATCCGGTTTGGCACAAGATAGATTGAATGGGTTAGGCCAAGGGGGACCAAGGCAAAGGGCAGAAGCT
TTAGCCGCTTTAAACTCTGCATTTAATTCATCATCTGGGACGAAGACTTTTACTCCTAGGCCATCTGGAAGAGGTCAAGG
ATCACAAAGAGCTGCAGCAGTAGCTGCTCTTTCACAAGTTCTTATGGCTGAAAAGAAAAAATCACCGGATGGTTCTCCTG
TTGCTAGCAGGAGTCCTATCACTGAAGAAACTAAAAGTGACTCCTCTGAAGTTGAAGAAGTTGCAGAAGCCAAGGAAACA
GAGGAACTTCCCCCTGAGACCGGTAGCAATGGGGATTTGGAACTAAAACAAGAGAATGCGGAGGAAGGAAATGATGGTCA
AAGGATGTTCAGTTATGAGCAATTAAAGACTAAATCTGGTCATAATGTGCCTGGAGTTGATCTTAAACGGAGAGAGGCCT
ATCTGTCAGAGGATGAGTTCAACACTGTATTTGGAATGGCAAAAGAAGCATTTTACAAGTTGCCAAGATGGAAGCAAGAC
ATGCTGAAAAAGAAATACGAATTGTTCTAA