Microexon ID Gm_2:32250123-32250133:+
Species Glycine max
Coordinates 2:32250123..32250133
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAAGTTCAAT
Microexon Amino Acid seq GKFN
Microexon-tag DNA Seq AACGAGTTTGTCAGAGATCCACATTTGTTCACTATATCATTTAATAAAGGAAAGTTCAATGTAGAGGAGGTTTACAACTTCTCTCAAGATGACCTGTTGCCGGAGGAT
Microexon-tag Amino Acid Seq NEFVRDPHLFTISFNKGKFNVEEVYNFSQDDLLPED
Microexon-tag spanning region 32249955-32250295
Microexon-tag prediction score 0.947
Overlapped with the annotated transcript (%) 100
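The size, phase, and sequence fields above can be cross-checked against each other. The following is a minimal sketch in Python, not part of the database itself. It assumes that the coordinates are 1-based and inclusive, that the Microexon-tag starts in reading frame, that the standard genetic code applies, and that "Phase" is the number of nucleotides of a split codon lying upstream of the microexon. The names `translate`, `TAG_DNA`, `FLANK5`, `MICRO`, and `FLANK3` are illustrative only.

```python
from itertools import product

# Values copied from the record above.
START, END = 32250123, 32250133
FLANK5, MICRO, FLANK3 = 49, 11, 48  # flanking exon, microexon, flanking exon
TAG_DNA = (
    "AACGAGTTTGTCAGAGATCCACATTTGTTCACTATATCATTTAATAAAGGAAAGTTCAAT"
    "GTAGAGGAGGTTTACAACTTCTCTCAAGATGACCTGTTGCCGGAGGAT"
)

# Standard genetic code, built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Size from inclusive coordinates.
size = END - START + 1

# Slice the microexon out of the tag using the 49,11,48 structure.
microexon = TAG_DNA[FLANK5:FLANK5 + MICRO]

# Assumed phase definition: bases of the split codon contributed upstream.
phase = FLANK5 % 3

# Residues whose codons overlap the microexon.
tag_protein = translate(TAG_DNA)
first_codon = FLANK5 // 3
last_codon = (FLANK5 + MICRO - 1) // 3
microexon_protein = tag_protein[first_codon:last_codon + 1]

print(size, phase, microexon, microexon_protein)
print(tag_protein)
```

Under these assumptions, the sliced microexon and the translated peptides can be compared directly with the "Microexon DNA seq", "Microexon Amino Acid seq", and "Microexon-tag Amino Acid Seq" fields. The first residue of the microexon peptide comes from a codon that is split between the upstream flanking exon and the microexon.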
New Transcript ID KRH72003x
Reference Transcript ID KRH72003
Gene ID GLYMA_02G184600
Gene Name NA
Transcript ID KRH72003
Protein ID KRH72003
Gene ID GLYMA_02G184600
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 2.9e-07
Motif start 635
Motif end 713
Protein seq >KRH72003
MSSSAKVLDPAFQGVGQRVGTEIWRIENFQPVPLPKSEYGKFYMGDSYIILQTTQGKGSTYFYDLHFWIGKHTSQDEAGT
AAIKTVELDAAIGGRAVQHREIQGHESDKFLSYFKPCIIPLEGGVASGFKKPEEEKFETCLYVCRGKRVVRLRQVPFARS
SLNHEDVFILDTQNKIYQFNGANSNIQERAKALEVIQFLKEKYHEGKCDVAIVDDGKLDTESDSGEFWVLFGGFAPIGKK
VISEDDIIPETIPAQLYSIVDGEVKPVEGELSKSLLENNKCYLLDCGAEMFVWVGRVTQVEERKAACQAVEEFVASQNRP
KSTRITRIIQGYETHSFKSNFDSWPSGSASTNAEEGRGKVAALLKQQGMGVKGMTKSTPVNEEIPPLLEGDGKIEVWRIN
GNAKTALPKEEIGKFYSGDCYIVLYTYHSGERKEDYFVCCWFGKDSVEEDQTTATRLANTMSTSLKGRPVQGRIFEGKEP
PQFVAIFQPMVVLKGGLSSGYKKLMADKGASDETYTAESIALIRISGTSIHNNKSVQVDAVPSSLNSTECFVLQSGSTIF
TWHGNQCSFEQQQLAAKVADFLRPGATLKHAKEGTESSAFWSALGGKQSYTSKKVVNEFVRDPHLFTISFNKGKFNVEEV
YNFSQDDLLPEDILILDTHVEVFIWIGHSVDPKEKQNAFDIGQKYIDLAASLEELSPHVPLYKVTEGNEPCFFTTYFSWD
HAKAMVLGNSFQKKVSLLFGFGHAVEEKSNGSSLGGPRQRAEALAALSNAFSSSSEKASSLAQDRLNGLGQGGPRQRAEA
LAALNSAFSSSSGTKTFTPRPSGRGQGSQRAAAVAALSQVLTAEKKKSPDGSPVASRSPITQGSATETKSDSSEVEEVAE
AKETEELPPETGSNGDLEPKQENVEEGNDGQRTFSYEQLKTKSGRNVPGIDLKRREAYLSEEEFNTVFGMTKEAFYKLPR
WKQDMLKKKYELF*
CDS seq >KRH72003
ATGTCTAGCTCAGCAAAAGTTTTGGATCCTGCATTCCAGGGAGTTGGTCAAAGAGTAGGAACTGAAATATGGAGGATTGA
GAATTTTCAGCCAGTTCCATTGCCCAAATCTGAGTATGGGAAATTCTACATGGGAGATTCGTACATCATCTTGCAGACAA
CTCAAGGCAAAGGAAGCACTTATTTTTATGATTTACACTTTTGGATTGGAAAGCATACAAGTCAGGATGAGGCTGGAACT
GCGGCCATTAAAACTGTTGAACTTGATGCTGCTATTGGAGGGCGTGCAGTGCAGCACAGGGAAATCCAAGGGCATGAGTC
TGACAAGTTTTTGTCATACTTTAAACCATGTATTATACCATTGGAGGGTGGAGTTGCATCTGGGTTTAAAAAACCTGAAG
AAGAGAAGTTTGAAACATGTTTGTATGTATGCAGAGGGAAAAGAGTTGTTAGATTGAGACAGGTCCCTTTTGCAAGGTCT
TCATTGAACCATGAAGATGTATTCATACTAGACACTCAGAATAAGATTTATCAATTCAATGGTGCAAATTCCAATATTCA
GGAAAGAGCCAAGGCTTTGGAAGTTATACAGTTTCTGAAAGAAAAATATCATGAAGGGAAATGTGATGTTGCAATTGTTG
ATGATGGAAAATTAGACACTGAGTCAGACTCAGGTGAGTTTTGGGTCCTCTTTGGTGGTTTTGCTCCCATTGGGAAGAAG
GTAATTAGTGAGGATGATATTATTCCAGAGACAATTCCTGCTCAGCTTTATAGTATTGTTGATGGTGAGGTCAAGCCTGT
GGAAGGTGAACTTTCTAAATCACTGTTGGAAAACAACAAATGCTATTTACTGGACTGTGGTGCCGAGATGTTTGTCTGGG
TTGGTCGCGTGACACAAGTTGAAGAACGAAAAGCAGCCTGCCAAGCCGTTGAGGAGTTTGTTGCTAGTCAAAATAGGCCA
AAGTCTACAAGGATAACCCGGATTATTCAAGGTTATGAGACGCATTCATTTAAGTCCAACTTTGATTCTTGGCCATCAGG
ATCTGCTAGTACCAATGCTGAGGAAGGAAGAGGAAAAGTTGCAGCATTGCTTAAGCAACAAGGCATGGGTGTCAAAGGAA
TGACAAAAAGTACCCCTGTAAATGAGGAAATTCCACCTTTGCTTGAAGGAGATGGGAAGATAGAGGTATGGCGAATCAAT
GGAAATGCCAAGACTGCATTGCCGAAGGAGGAGATTGGTAAATTTTATAGTGGAGATTGCTACATTGTATTGTACACCTA
CCATTCTGGTGAGCGGAAAGAAGACTACTTCGTGTGCTGTTGGTTTGGCAAAGACAGTGTTGAGGAGGACCAAACAACAG
CTACTAGGTTAGCCAATACAATGTCTACCTCATTAAAGGGTAGACCTGTACAGGGTCGCATATTTGAAGGCAAAGAGCCG
CCACAGTTTGTTGCTATTTTCCAACCAATGGTGGTTCTCAAGGGAGGGTTGAGCTCTGGATACAAGAAATTAATGGCAGA
CAAAGGTGCATCAGATGAGACATACACAGCAGAGAGTATTGCACTTATTCGAATTTCTGGAACTTCTATTCATAACAATA
AATCAGTACAAGTTGATGCAGTGCCATCATCATTGAATTCTACTGAGTGTTTTGTCCTGCAATCTGGCTCTACAATTTTC
ACTTGGCATGGAAATCAGTGTTCCTTTGAGCAGCAACAGCTAGCAGCAAAGGTTGCTGATTTTTTACGGCCAGGAGCTAC
TTTAAAGCATGCTAAAGAAGGAACAGAAAGCTCAGCTTTCTGGTCTGCACTAGGAGGAAAACAAAGTTACACCAGCAAGA
AAGTTGTTAACGAGTTTGTCAGAGATCCACATTTGTTCACTATATCATTTAATAAAGGAAAGTTCAATGTAGAGGAGGTT
TACAACTTCTCTCAAGATGACCTGTTGCCGGAGGATATCCTTATACTTGACACACATGTAGAAGTGTTTATTTGGATTGG
TCATTCTGTGGACCCAAAGGAAAAGCAAAATGCTTTTGATATCGGCCAGAAATACATAGATTTGGCTGCATCTCTGGAGG
AACTATCTCCACATGTACCACTATATAAAGTAACAGAAGGGAATGAACCTTGCTTTTTCACAACATACTTTTCATGGGAT
CATGCAAAAGCTATGGTTCTGGGGAACTCATTTCAGAAAAAAGTGTCACTACTCTTTGGATTTGGCCATGCTGTGGAGGA
AAAGTCGAATGGGTCAAGTCTAGGGGGACCAAGACAAAGAGCAGAAGCCTTGGCTGCCTTATCTAATGCATTTAGTTCAT
CTTCTGAGAAAGCATCCAGTTTGGCACAAGATAGATTGAATGGGTTAGGCCAAGGAGGACCAAGACAAAGGGCAGAAGCT
TTAGCCGCTTTAAACTCTGCATTTAGTTCATCATCTGGGACGAAGACTTTTACTCCTAGGCCATCTGGAAGAGGTCAAGG
ATCACAAAGAGCTGCAGCAGTGGCTGCTCTTTCACAAGTTCTTACGGCCGAAAAGAAAAAATCACCTGATGGTTCTCCTG
TTGCTAGCAGGAGTCCTATCACTCAAGGTAGCGCTACTGAAACTAAAAGTGACTCCTCTGAAGTTGAAGAAGTTGCAGAA
GCCAAGGAAACAGAGGAACTCCCCCCTGAGACAGGTAGCAATGGGGATTTGGAACCAAAACAAGAAAATGTGGAGGAAGG
AAATGATGGTCAAAGGACGTTCAGTTATGAACAATTAAAGACTAAATCTGGTCGTAATGTGCCTGGAATTGATCTTAAAC
GGAGAGAGGCCTATCTGTCAGAAGAGGAGTTCAACACTGTATTTGGAATGACAAAAGAAGCATTTTACAAGTTGCCAAGA
TGGAAGCAAGACATGCTGAAAAAGAAATACGAATTGTTCTAG
Transcript ID Gm.31222.1
Gene ID Gm.31222
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 2.9e-07
Motif start 635
Motif end 713
Protein seq >Gm.31222.1
MSSSAKVLDPAFQGVGQRVGTEIWRIENFQPVPLPKSEYGKFYMGDSYIILQTTQGKGSTYFYDLHFWIGKHTSQDEAGT
AAIKTVELDAAIGGRAVQHREIQGHESDKFLSYFKPCIIPLEGGVASGFKKPEEEKFETCLYVCRGKRVVRLRQVPFARS
SLNHEDVFILDTQNKIYQFNGANSNIQERAKALEVIQFLKEKYHEGKCDVAIVDDGKLDTESDSGEFWVLFGGFAPIGKK
VISEDDIIPETIPAQLYSIVDGEVKPVEGELSKSLLENNKCYLLDCGAEMFVWVGRVTQVEERKAACQAVEEFVASQNRP
KSTRITRIIQGYETHSFKSNFDSWPSGSASTNAEEGRGKVAALLKQQGMGVKGMTKSTPVNEEIPPLLEGDGKIEVWRIN
GNAKTALPKEEIGKFYSGDCYIVLYTYHSGERKEDYFVCCWFGKDSVEEDQTTATRLANTMSTSLKGRPVQGRIFEGKEP
PQFVAIFQPMVVLKGGLSSGYKKLMADKGASDETYTAESIALIRISGTSIHNNKSVQVDAVPSSLNSTECFVLQSGSTIF
TWHGNQCSFEQQQLAAKVADFLRPGATLKHAKEGTESSAFWSALGGKQSYTSKKVVNEFVRDPHLFTISFNKGKFNVEEV
YNFSQDDLLPEDILILDTHVEVFIWIGHSVDPKEKQNAFDIGQKYIDLAASLEELSPHVPLYKVTEGNEPCFFTTYFSWD
HAKAMVLGNSFQKKVSLLFGFGHAVEEKSNGSSLGGPRQRAEALAALSNAFSSSSEKASSLAQDRLNGLGQGGPRQRAEA
LAALNSAFSSSSGTKTFTPRPSGRGQGSQRAAAVAALSQVLTAEKKKSPDGSPVASRSPITQGSATETKSDSSEVEEVAE
AKETEELPPETGSNGDLEPKQENVEEGNDGQRTFSYEQLKTKSGRNVPGIDLKRREAYLSEEEFNTVFGMTKEAFYKLPR
WKQDMLKKKYELF*
CDS seq >Gm.31222.1
ATGTCTAGCTCAGCAAAAGTTTTGGATCCTGCATTCCAGGGAGTTGGTCAAAGAGTAGGAACTGAAATATGGAGGATTGA
GAATTTTCAGCCAGTTCCATTGCCCAAATCTGAGTATGGGAAATTCTACATGGGAGATTCGTACATCATCTTGCAGACAA
CTCAAGGCAAAGGAAGCACTTATTTTTATGATTTACACTTTTGGATTGGAAAGCATACAAGTCAGGATGAGGCTGGAACT
GCGGCCATTAAAACTGTTGAACTTGATGCTGCTATTGGAGGGCGTGCAGTGCAGCACAGGGAAATCCAAGGGCATGAGTC
TGACAAGTTTTTGTCATACTTTAAACCATGTATTATACCATTGGAGGGTGGAGTTGCATCTGGGTTTAAAAAACCTGAAG
AAGAGAAGTTTGAAACATGTTTGTATGTATGCAGAGGGAAAAGAGTTGTTAGATTGAGACAGGTCCCTTTTGCAAGGTCT
TCATTGAACCATGAAGATGTATTCATACTAGACACTCAGAATAAGATTTATCAATTCAATGGTGCAAATTCCAATATTCA
GGAAAGAGCCAAGGCTTTGGAAGTTATACAGTTTCTGAAAGAAAAATATCATGAAGGGAAATGTGATGTTGCAATTGTTG
ATGATGGAAAATTAGACACTGAGTCAGACTCAGGTGAGTTTTGGGTCCTCTTTGGTGGTTTTGCTCCCATTGGGAAGAAG
GTAATTAGTGAGGATGATATTATTCCAGAGACAATTCCTGCTCAGCTTTATAGTATTGTTGATGGTGAGGTCAAGCCTGT
GGAAGGTGAACTTTCTAAATCACTGTTGGAAAACAACAAATGCTATTTACTGGACTGTGGTGCCGAGATGTTTGTCTGGG
TTGGTCGCGTGACACAAGTTGAAGAACGAAAAGCAGCCTGCCAAGCCGTTGAGGAGTTTGTTGCTAGTCAAAATAGGCCA
AAGTCTACAAGGATAACCCGGATTATTCAAGGTTATGAGACGCATTCATTTAAGTCCAACTTTGATTCTTGGCCATCAGG
ATCTGCTAGTACCAATGCTGAGGAAGGAAGAGGAAAAGTTGCAGCATTGCTTAAGCAACAAGGCATGGGTGTCAAAGGAA
TGACAAAAAGTACCCCTGTAAATGAGGAAATTCCACCTTTGCTTGAAGGAGATGGGAAGATAGAGGTATGGCGAATCAAT
GGAAATGCCAAGACTGCATTGCCGAAGGAGGAGATTGGTAAATTTTATAGTGGAGATTGCTACATTGTATTGTACACCTA
CCATTCTGGTGAGCGGAAAGAAGACTACTTCGTGTGCTGTTGGTTTGGCAAAGACAGTGTTGAGGAGGACCAAACAACAG
CTACTAGGTTAGCCAATACAATGTCTACCTCATTAAAGGGTAGACCTGTACAGGGTCGCATATTTGAAGGCAAAGAGCCG
CCACAGTTTGTTGCTATTTTCCAACCAATGGTGGTTCTCAAGGGAGGGTTGAGCTCTGGATACAAGAAATTAATGGCAGA
CAAAGGTGCATCAGATGAGACATACACAGCAGAGAGTATTGCACTTATTCGAATTTCTGGAACTTCTATTCATAACAATA
AATCAGTACAAGTTGATGCAGTGCCATCATCATTGAATTCTACTGAGTGTTTTGTCCTGCAATCTGGCTCTACAATTTTC
ACTTGGCATGGAAATCAGTGTTCCTTTGAGCAGCAACAGCTAGCAGCAAAGGTTGCTGATTTTTTACGGCCAGGAGCTAC
TTTAAAGCATGCTAAAGAAGGAACAGAAAGCTCAGCTTTCTGGTCTGCACTAGGAGGAAAACAAAGTTACACCAGCAAGA
AAGTTGTTAACGAGTTTGTCAGAGATCCACATTTGTTCACTATATCATTTAATAAAGGAAAGTTCAATGTAGAGGAGGTT
TACAACTTCTCTCAAGATGACCTGTTGCCGGAGGATATCCTTATACTTGACACACATGTAGAAGTGTTTATTTGGATTGG
TCATTCTGTGGACCCAAAGGAAAAGCAAAATGCTTTTGATATCGGCCAGAAATACATAGATTTGGCTGCATCTCTGGAGG
AACTATCTCCACATGTACCACTATATAAAGTAACAGAAGGGAATGAACCTTGCTTTTTCACAACATACTTTTCATGGGAT
CATGCAAAAGCTATGGTTCTGGGGAACTCATTTCAGAAAAAAGTGTCACTACTCTTTGGATTTGGCCATGCTGTGGAGGA
AAAGTCGAATGGGTCAAGTCTAGGGGGACCAAGACAAAGAGCAGAAGCCTTGGCTGCCTTATCTAATGCATTTAGTTCAT
CTTCTGAGAAAGCATCCAGTTTGGCACAAGATAGATTGAATGGGTTAGGCCAAGGAGGACCAAGACAAAGGGCAGAAGCT
TTAGCCGCTTTAAACTCTGCATTTAGTTCATCATCTGGGACGAAGACTTTTACTCCTAGGCCATCTGGAAGAGGTCAAGG
ATCACAAAGAGCTGCAGCAGTGGCTGCTCTTTCACAAGTTCTTACGGCCGAAAAGAAAAAATCACCTGATGGTTCTCCTG
TTGCTAGCAGGAGTCCTATCACTCAAGGTAGCGCTACTGAAACTAAAAGTGACTCCTCTGAAGTTGAAGAAGTTGCAGAA
GCCAAGGAAACAGAGGAACTCCCCCCTGAGACAGGTAGCAATGGGGATTTGGAACCAAAACAAGAAAATGTGGAGGAAGG
AAATGATGGTCAAAGGACGTTCAGTTATGAACAATTAAAGACTAAATCTGGTCGTAATGTGCCTGGAATTGATCTTAAAC
GGAGAGAGGCCTATCTGTCAGAAGAGGAGTTCAACACTGTATTTGGAATGACAAAAGAAGCATTTTACAAGTTGCCAAGA
TGGAAGCAAGACATGCTGAAAAAGAAATACGAATTGTTCTAG