Microexon ID Gm_19:40015104-40015114:-
Species Glycine max
Coordinates 19:40015104..40015114
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAAGTTACAG
Microexon Amino Acid seq GKLQ
Microexon-tag DNA Seq AATGACATTGTCAGAGACCCACATTTGTTCACTTTATCTTTTAATAGAGGAAAGTTACAGGTAGAGGAGGTTTACAACTTTTCCCAAGATGATTTGTTAACAGAAGAT
Microexon-tag Amino Acid Seq NDIVRDPHLFTLSFNRGKLQVEEVYNFSQDDLLTED
Microexon-tag spanning region40014951-40015291
Microexon-tag prediction score0.9515
Overlapped with the annotated transcript (%) 100
New Transcript ID KRG95247x
Reference Transcript ID KRG95247
Gene ID GLYMA_19G138700
Gene Name NA
Transcript ID KRG95247
Protein ID KRG95247
Gene ID GLYMA_19G138700
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 1.6e-08
Motif start 638
Motif end 713
Protein seq >KRG95247
MSSATKVLDPAFQGVGQKVGTEIWRIEDFQPVPLPRSEYGKFYMGDSYIILQTTQGKGGAYLYDIHFWIGKDTSQDEAGT
AAIKNVELDASLGGRAVQHREIQGHESDKFLSYFKPCIIPLEGGIASGFKKPEEEEFETRLYVCRGKRVVRIKQVPFARS
SLNHDDVFILDTQNKIYQFNGANSNIQERAKALEVIQLLKEKHHEGKCDVAIVDDGKLDTESDSGEFWVLFGGFAPIGKK
VISEDDIVPETIPAQLYSIADGEVKPVEGELSKSLLENYKCYLLDCGTEVFVWVGRVTQVEDRKAACQAAEEFVASQKRP
KSTRITRIIQGYETHSFKSNFDFWPSGSATNSADEGRGKVAALLKQQGMGVKGVTKTTPVVEDIPPLLEGGGKMEVWQIS
GSAKTPLSKEDIGKFYSGDCYIVLYTYHSSERKEDYYLCCWFGKDSIEEDQRMAIRLANSMFNSLKGRPVQGRIFDGKEP
PQFIALFHPMVVLKGGLSSGYKKFIADKGLPDETYAAESVALIRISGTSIHNNKVVQVDAVAALLNSTECFVLQSGSAVF
TWHGNQCSLEQQQLAAKVAEFLRPGVSLKLAKEGTETSTFWFALGGKQSYTSKNVTNDIVRDPHLFTLSFNRGKLQVEEV
YNFSQDDLLTEDILILDTHTEVFVWIGQCVDPKEKQKAFEIAQKYIDKAASLEGLSPHVPLYKVTEGNEPCFFTTYFSWD
HAKAMVPGNSFQKKVTLLFGTGHPVEEKSNGSSQGGGPRQRAEALAALNNAFNSSPETTSSADKLNGLNRGGPRQRAEAL
AALNSAFNSSSGTKVYTPRPSGRGQGSQRAAAVAALSSVLTAEKKKTSPETSPVASTSPVVESSNFDTKSESAPSETEVV
EEVADVKETEEVAPEAGTNGDSEQPKQENVEDGRNDSENNNQNVFSYEQLKTKSGSVVSGIDLKQREAYLSDKEFETVFG
MAKEAFSKLPRWKQDMLKRKVDLF*
CDS seq >KRG95247
ATGTCTAGTGCTACAAAAGTGTTGGATCCAGCATTCCAGGGAGTTGGTCAAAAAGTAGGCACTGAAATATGGAGGATTGA
GGATTTTCAGCCAGTTCCATTGCCCAGATCTGAATATGGGAAATTCTACATGGGAGATTCTTACATCATCTTGCAGACAA
CACAAGGCAAAGGAGGTGCTTATTTGTATGACATTCACTTCTGGATTGGAAAGGATACAAGTCAGGATGAGGCTGGAACA
GCAGCCATTAAAAATGTTGAACTCGATGCATCTCTTGGAGGACGTGCAGTGCAACACAGGGAAATTCAAGGGCATGAATC
TGACAAATTTTTGTCCTACTTTAAGCCATGTATAATTCCATTAGAGGGGGGTATCGCATCTGGATTTAAAAAACCTGAAG
AAGAGGAGTTTGAAACACGTTTATATGTATGCAGAGGAAAAAGAGTTGTCAGAATAAAACAGGTCCCTTTTGCACGGTCT
TCATTGAATCATGATGATGTATTCATCCTAGACACTCAGAATAAGATTTATCAATTCAATGGTGCAAACTCCAATATCCA
GGAAAGAGCCAAGGCTCTGGAAGTTATACAGTTGTTGAAGGAAAAACATCATGAGGGAAAATGTGACGTTGCAATTGTTG
ATGATGGCAAGCTGGATACTGAGTCAGACTCAGGTGAATTTTGGGTTCTCTTTGGTGGTTTTGCTCCCATTGGGAAGAAG
GTAATTAGTGAGGATGATATCGTTCCAGAGACCATTCCTGCTCAACTCTATAGTATTGCTGATGGTGAGGTCAAGCCTGT
GGAAGGTGAACTTTCAAAATCACTGCTGGAGAACTACAAATGCTATCTATTGGACTGTGGTACTGAGGTATTTGTCTGGG
TTGGCCGGGTAACACAAGTTGAAGATCGAAAAGCAGCTTGTCAAGCAGCTGAGGAGTTTGTCGCAAGTCAAAAAAGGCCG
AAATCTACAAGGATTACCAGAATCATTCAAGGTTATGAGACACATTCTTTTAAGTCCAACTTTGATTTTTGGCCATCAGG
ATCTGCTACTAACAGTGCTGACGAAGGAAGAGGAAAAGTTGCAGCTTTGCTGAAGCAGCAAGGTATGGGTGTGAAAGGGG
TGACAAAAACTACCCCAGTTGTTGAGGACATTCCACCTCTGCTTGAAGGAGGTGGAAAGATGGAGGTATGGCAAATCAGT
GGAAGTGCTAAGACTCCCTTATCTAAGGAGGATATTGGTAAATTTTATAGTGGAGATTGTTACATAGTACTGTACACTTA
TCACTCTAGCGAGAGAAAGGAAGACTACTACTTGTGTTGTTGGTTTGGAAAAGACAGCATTGAGGAGGACCAAAGAATGG
CTATTCGATTGGCTAACTCAATGTTCAACTCATTAAAGGGTAGACCTGTTCAGGGTCGCATATTTGATGGTAAAGAGCCA
CCACAGTTTATTGCCCTTTTCCATCCAATGGTGGTCCTCAAGGGAGGCTTGAGCTCTGGTTACAAAAAATTTATAGCAGA
TAAAGGTTTGCCGGATGAGACATACGCAGCAGAGAGTGTTGCACTTATTCGGATTTCTGGAACATCCATTCATAATAATA
AAGTGGTGCAAGTAGATGCAGTGGCAGCATTGCTGAACTCTACCGAGTGTTTTGTCCTGCAATCTGGCTCAGCTGTTTTT
ACATGGCATGGGAATCAATGTTCCCTTGAGCAGCAGCAGCTTGCAGCAAAAGTTGCTGAGTTTTTAAGGCCAGGAGTTTC
TTTAAAGCTTGCTAAAGAAGGAACAGAAACCTCAACTTTCTGGTTTGCACTTGGAGGAAAACAAAGTTACACCAGCAAAA
ATGTCACTAATGACATTGTCAGAGACCCACATTTGTTCACTTTATCTTTTAATAGAGGAAAGTTACAGGTAGAGGAGGTT
TACAACTTTTCCCAAGATGATTTGTTAACAGAAGATATCCTGATCCTTGACACGCACACAGAAGTGTTTGTTTGGATTGG
TCAGTGTGTGGACCCAAAAGAAAAGCAAAAAGCTTTTGAAATTGCCCAGAAATACATAGATAAGGCTGCATCTCTGGAAG
GACTATCTCCTCATGTACCACTATACAAAGTAACAGAAGGGAATGAACCTTGCTTTTTCACAACATACTTTTCTTGGGAT
CATGCAAAAGCTATGGTGCCAGGGAACTCATTCCAGAAAAAGGTGACATTACTCTTTGGAACTGGCCATCCTGTAGAGGA
AAAGTCTAATGGGTCAAGTCAAGGAGGGGGACCAAGACAAAGAGCAGAAGCTTTGGCTGCCTTAAATAATGCATTCAATT
CATCTCCTGAGACAACGTCCAGTGCGGATAAGTTGAATGGGTTAAATCGAGGTGGACCGAGACAAAGGGCAGAAGCCTTA
GCAGCCTTAAACTCTGCATTTAATTCATCATCTGGAACCAAAGTTTATACTCCAAGGCCATCTGGAAGAGGTCAAGGATC
ACAAAGAGCAGCAGCAGTAGCTGCTCTCTCTTCAGTTCTTACTGCTGAAAAGAAGAAAACTTCACCTGAAACTTCTCCTG
TGGCTAGCACTAGTCCTGTAGTGGAAAGTAGCAACTTTGACACTAAAAGTGAAAGTGCCCCTTCTGAAACGGAAGTTGTT
GAAGAAGTTGCAGATGTCAAAGAGACAGAAGAAGTTGCCCCTGAAGCTGGTACCAATGGGGATTCAGAACAACCAAAACA
AGAAAATGTGGAGGATGGAAGAAATGACAGTGAAAATAATAATCAAAATGTCTTCAGTTATGAGCAATTAAAGACTAAAT
CTGGTAGTGTTGTGTCTGGAATTGATCTTAAACAGAGAGAGGCTTATCTGTCAGACAAAGAGTTCGAAACTGTATTTGGA
ATGGCCAAAGAAGCATTCTCTAAGTTGCCAAGATGGAAGCAAGACATGCTGAAAAGGAAAGTGGATTTGTTCTAG
Microexon DNA seq GAAAGTTACAG
Microexon Amino Acid seq GKLQ
Microexon-tag DNA Seq AATGACATTGTCAGAGACCCACATTTGTTCACTTTATCTTTTAATAGAGGAAAGTTACAGGTAGAGGAGGTTTACAACTTTTCCCAAGATGATTTGTTAACAGAAGAT
Microexon-tag Amino Acid seq NDIVRDPHLFTLSFNRGKLQVEEVYNFSQDDLLTED
Transcript ID Gm.28088.1
Gene ID Gm.28088
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 1.6e-08
Motif start 638
Motif end 713
Protein seq >Gm.28088.1
MSSATKVLDPAFQGVGQKVGTEIWRIEDFQPVPLPRSEYGKFYMGDSYIILQTTQGKGGAYLYDIHFWIGKDTSQDEAGT
AAIKNVELDASLGGRAVQHREIQGHESDKFLSYFKPCIIPLEGGIASGFKKPEEEEFETRLYVCRGKRVVRIKQVPFARS
SLNHDDVFILDTQNKIYQFNGANSNIQERAKALEVIQLLKEKHHEGKCDVAIVDDGKLDTESDSGEFWVLFGGFAPIGKK
VISEDDIVPETIPAQLYSIADGEVKPVEGELSKSLLENYKCYLLDCGTEVFVWVGRVTQVEDRKAACQAAEEFVASQKRP
KSTRITRIIQGYETHSFKSNFDFWPSGSATNSADEGRGKVAALLKQQGMGVKGVTKTTPVVEDIPPLLEGGGKMEVWQIS
GSAKTPLSKEDIGKFYSGDCYIVLYTYHSSERKEDYYLCCWFGKDSIEEDQRMAIRLANSMFNSLKGRPVQGRIFDGKEP
PQFIALFHPMVVLKGGLSSGYKKFIADKGLPDETYAAESVALIRISGTSIHNNKVVQVDAVAALLNSTECFVLQSGSAVF
TWHGNQCSLEQQQLAAKVAEFLRPGVSLKLAKEGTETSTFWFALGGKQSYTSKNVTNDIVRDPHLFTLSFNRGKLQVEEV
YNFSQDDLLTEDILILDTHTEVFVWIGQCVDPKEKQKAFEIAQKYIDKAASLEGLSPHVPLYKVTEGNEPCFFTTYFSWD
HAKAMVPGNSFQKKVTLLFGTGHPVEEKSNGSSQGGGPRQRAEALAALNNAFNSSPETTSSADKLNGLNRGGPRQRAEAL
AALNSAFNSSSGTKVYTPRPSGRGQGSQRAAAVAALSSVLTAEKKKTSPETSPVASTSPVVESSNFDTKSESAPSETEVV
EEVADVKETEEVAPEAGTNGDSEQPKQENVEDGRNDSENNNQNVFSYEQLKTKSGSVVSGIDLKQREAYLSDKEFETVFG
MAKEAFSKLPRWKQDMLKRKVDLF*
CDS seq >Gm.28088.1
ATGTCTAGTGCTACAAAAGTGTTGGATCCAGCATTCCAGGGAGTTGGTCAAAAAGTAGGCACTGAAATATGGAGGATTGA
GGATTTTCAGCCAGTTCCATTGCCCAGATCTGAATATGGGAAATTCTACATGGGAGATTCTTACATCATCTTGCAGACAA
CACAAGGCAAAGGAGGTGCTTATTTGTATGACATTCACTTCTGGATTGGAAAGGATACAAGTCAGGATGAGGCTGGAACA
GCAGCCATTAAAAATGTTGAACTCGATGCATCTCTTGGAGGACGTGCAGTGCAACACAGGGAAATTCAAGGGCATGAATC
TGACAAATTTTTGTCCTACTTTAAGCCATGTATAATTCCATTAGAGGGGGGTATCGCATCTGGATTTAAAAAACCTGAAG
AAGAGGAGTTTGAAACACGTTTATATGTATGCAGAGGAAAAAGAGTTGTCAGAATAAAACAGGTCCCTTTTGCACGGTCT
TCATTGAATCATGATGATGTATTCATCCTAGACACTCAGAATAAGATTTATCAATTCAATGGTGCAAACTCCAATATCCA
GGAAAGAGCCAAGGCTCTGGAAGTTATACAGTTGTTGAAGGAAAAACATCATGAGGGAAAATGTGACGTTGCAATTGTTG
ATGATGGCAAGCTGGATACTGAGTCAGACTCAGGTGAATTTTGGGTTCTCTTTGGTGGTTTTGCTCCCATTGGGAAGAAG
GTAATTAGTGAGGATGATATCGTTCCAGAGACCATTCCTGCTCAACTCTATAGTATTGCTGATGGTGAGGTCAAGCCTGT
GGAAGGTGAACTTTCAAAATCACTGCTGGAGAACTACAAATGCTATCTATTGGACTGTGGTACTGAGGTATTTGTCTGGG
TTGGCCGGGTAACACAAGTTGAAGATCGAAAAGCAGCTTGTCAAGCAGCTGAGGAGTTTGTCGCAAGTCAAAAAAGGCCG
AAATCTACAAGGATTACCAGAATCATTCAAGGTTATGAGACACATTCTTTTAAGTCCAACTTTGATTTTTGGCCATCAGG
ATCTGCTACTAACAGTGCTGACGAAGGAAGAGGAAAAGTTGCAGCTTTGCTGAAGCAGCAAGGTATGGGTGTGAAAGGGG
TGACAAAAACTACCCCAGTTGTTGAGGACATTCCACCTCTGCTTGAAGGAGGTGGAAAGATGGAGGTATGGCAAATCAGT
GGAAGTGCTAAGACTCCCTTATCTAAGGAGGATATTGGTAAATTTTATAGTGGAGATTGTTACATAGTACTGTACACTTA
TCACTCTAGCGAGAGAAAGGAAGACTACTACTTGTGTTGTTGGTTTGGAAAAGACAGCATTGAGGAGGACCAAAGAATGG
CTATTCGATTGGCTAACTCAATGTTCAACTCATTAAAGGGTAGACCTGTTCAGGGTCGCATATTTGATGGTAAAGAGCCA
CCACAGTTTATTGCCCTTTTCCATCCAATGGTGGTCCTCAAGGGAGGCTTGAGCTCTGGTTACAAAAAATTTATAGCAGA
TAAAGGTTTGCCGGATGAGACATACGCAGCAGAGAGTGTTGCACTTATTCGGATTTCTGGAACATCCATTCATAATAATA
AAGTGGTGCAAGTAGATGCAGTGGCAGCATTGCTGAACTCTACCGAGTGTTTTGTCCTGCAATCTGGCTCAGCTGTTTTT
ACATGGCATGGGAATCAATGTTCCCTTGAGCAGCAGCAGCTTGCAGCAAAAGTTGCTGAGTTTTTAAGGCCAGGAGTTTC
TTTAAAGCTTGCTAAAGAAGGAACAGAAACCTCAACTTTCTGGTTTGCACTTGGAGGAAAACAAAGTTACACCAGCAAAA
ATGTCACTAATGACATTGTCAGAGACCCACATTTGTTCACTTTATCTTTTAATAGAGGAAAGTTACAGGTAGAGGAGGTT
TACAACTTTTCCCAAGATGATTTGTTAACAGAAGATATCCTGATCCTTGACACGCACACAGAAGTGTTTGTTTGGATTGG
TCAGTGTGTGGACCCAAAAGAAAAGCAAAAAGCTTTTGAAATTGCCCAGAAATACATAGATAAGGCTGCATCTCTGGAAG
GACTATCTCCTCATGTACCACTATACAAAGTAACAGAAGGGAATGAACCTTGCTTTTTCACAACATACTTTTCTTGGGAT
CATGCAAAAGCTATGGTGCCAGGGAACTCATTCCAGAAAAAGGTGACATTACTCTTTGGAACTGGCCATCCTGTAGAGGA
AAAGTCTAATGGGTCAAGTCAAGGAGGGGGACCAAGACAAAGAGCAGAAGCTTTGGCTGCCTTAAATAATGCATTCAATT
CATCTCCTGAGACAACGTCCAGTGCGGATAAGTTGAATGGGTTAAATCGAGGTGGACCGAGACAAAGGGCAGAAGCCTTA
GCAGCCTTAAACTCTGCATTTAATTCATCATCTGGAACCAAAGTTTATACTCCAAGGCCATCTGGAAGAGGTCAAGGATC
ACAAAGAGCAGCAGCAGTAGCTGCTCTCTCTTCAGTTCTTACTGCTGAAAAGAAGAAAACTTCACCTGAAACTTCTCCTG
TGGCTAGCACTAGTCCTGTAGTGGAAAGTAGCAACTTTGACACTAAAAGTGAAAGTGCCCCTTCTGAAACGGAAGTTGTT
GAAGAAGTTGCAGATGTCAAAGAGACAGAAGAAGTTGCCCCTGAAGCTGGTACCAATGGGGATTCAGAACAACCAAAACA
AGAAAATGTGGAGGATGGAAGAAATGACAGTGAAAATAATAATCAAAATGTCTTCAGTTATGAGCAATTAAAGACTAAAT
CTGGTAGTGTTGTGTCTGGAATTGATCTTAAACAGAGAGAGGCTTATCTGTCAGACAAAGAGTTCGAAACTGTATTTGGA
ATGGCCAAAGAAGCATTCTCTAAGTTGCCAAGATGGAAGCAAGACATGCTGAAAAGGAAAGTGGATTTGTTCTAG