Microexon ID Gm_3:35251129-35251139:-
Species Glycine max
Coordinates 3:35251129..35251139
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAAGTTACAG
Microexon Amino Acid seq GKLQ
Microexon-tag DNA Seq AATGACATTGTCAGAGACCCCCATTTGTTCACTTTCTCTTTTAATAGAGGAAAGTTACAGGTAGAGGAGGTTTACAACTTTTCCCAAGATGATTTGTTAACAGAGGAT
Microexon-tag Amino Acid Seq NDIVRDPHLFTFSFNRGKLQVEEVYNFSQDDLLTED
Microexon-tag spanning region35250976-35251317
Microexon-tag prediction score0.9604
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH66923x
Reference Transcript ID KRH66923
Gene ID GLYMA_03G136500
Gene Name NA
Transcript ID KRH66923
Protein ID KRH66923
Gene ID GLYMA_03G136500
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 2.1e-07
Motif start 638
Motif end 713
Protein seq >KRH66923
MSSATKVLDPAFQGVGQKVGTEIWRIEDFQPVPLPRPDYGKFYMGDSYIILQTTQGKGSAYLYDIHFWIGKDTSQDEAGT
AAIKTVELDASLGGRAVQHREIQGHESDKFLSYFKPCIIPLEGGVASGFKKPEEEEFETRLYVCRGKRVVRIKQVPFARS
SLNHDDVFILDTQNKIYQFNGANSNIQERAKALEVIQLLKEKYHEGKCDVAIVDDGKLDTESDSGEFWVLFGGFAPIGKK
IISEDDIVPETIPAQLYSIADGEAKPVEGELSKSLLENYKCYLLDCGAEVFVWVGRVTQVEERKAACQAAEEFLTSQKRP
KSTRITRIIQGYETHSFKSNFDSWPSGSATTGADEGRGKVAALLKQQGMGVKGVTKTTSVVEEIPPLLEGGGKMEVWQIN
GSAKTPLPKEDIGKFYSGDCYIVLYTYHSSERKEDYYLCCWFGKDSTEEDQRMAIRLANTMFNSLKGRPVQGRIFDGKEP
PQFIVLFHPMVVLKGGLSSGYKKLIADKGLPDETYTAESVAFIRISGTSTHNNKVVQVDAVAALLNSTECFVLQSGSAVF
TWHGNQCSLEQQQLAAKVAEFLRPGVALKLAKEGTETSTFWFALGGKQSYNNKKVTNDIVRDPHLFTFSFNRGKLQVEEV
YNFSQDDLLTEDILILDTHAEVFVWIGQCVDPKEKQNAFEIAQKYIDKAASLEGLSPHVPLYKVTEGNEPCFFTTYFSWD
HTKAMVPGNSFQKKVTLLFGIGHPVEEKSNGSSQGGGPRQRAEALAALNNAFNSSPEATSSADKSNGLSRGGPRQRAEAL
AALNSAFNSSSGTKVYTPRPSGRGQGSQRAAAVAALSSVLTAEKKKTSPETSPVASTSPVVENSNFDTKSESAPSEKEIV
EEVTEVKETEVVALETGTNGDSEQPKQENVEDGGNDSENNNQNFFSYEQLKTKSGSVVSGIDLKRREAYLSDKEFQAVFG
MAKDAFSKLPRWKQDMLKRKVDLF*
CDS seq >KRH66923
ATGTCTAGTGCTACAAAAGTGTTGGATCCAGCATTCCAGGGAGTTGGTCAAAAAGTAGGGACTGAAATATGGAGGATTGA
GGATTTTCAGCCAGTTCCATTGCCCAGACCTGATTATGGGAAATTCTACATGGGAGATTCTTACATCATCTTGCAGACAA
CACAAGGCAAAGGAAGTGCTTATTTGTATGATATTCACTTCTGGATTGGAAAGGATACAAGTCAGGATGAGGCTGGAACA
GCAGCCATTAAAACTGTTGAACTCGATGCGTCTCTTGGAGGACGTGCAGTGCAACACAGGGAAATTCAAGGGCATGAATC
CGACAAGTTTTTGTCATACTTTAAGCCATGTATAATACCATTAGAGGGAGGTGTTGCATCGGGATTTAAAAAACCTGAAG
AAGAGGAGTTTGAAACACGTTTGTATGTATGCAGAGGAAAAAGAGTTGTCAGAATAAAACAGGTTCCTTTTGCACGGTCT
TCACTGAATCATGATGATGTATTCATCCTAGACACTCAGAATAAGATTTATCAATTCAATGGTGCAAACTCCAATATCCA
GGAAAGAGCCAAGGCTCTGGAAGTTATTCAGTTGTTGAAGGAAAAGTATCATGAGGGAAAATGCGACGTTGCAATTGTTG
ATGATGGAAAGCTGGATACTGAGTCAGACTCAGGTGAATTTTGGGTTCTCTTTGGTGGTTTTGCTCCCATTGGGAAGAAG
ATAATTAGTGAGGATGATATTGTTCCAGAGACCATTCCTGCTCAACTCTATAGTATTGCTGATGGTGAGGCCAAGCCTGT
GGAAGGTGAACTTTCAAAATCACTGCTGGAGAACTACAAATGCTATCTATTGGACTGTGGAGCTGAGGTATTTGTCTGGG
TTGGCCGGGTAACACAAGTTGAAGAACGAAAAGCAGCATGTCAAGCAGCTGAGGAGTTTCTCACAAGTCAAAAAAGGCCG
AAATCTACAAGGATTACCAGAATTATTCAAGGTTATGAGACACATTCGTTTAAGTCCAACTTTGATTCTTGGCCATCAGG
ATCTGCTACTACTGGTGCTGATGAAGGAAGAGGAAAAGTTGCAGCTTTGCTTAAGCAACAAGGTATGGGTGTGAAAGGGG
TGACAAAAACTACCTCAGTTGTTGAGGAAATTCCACCTCTACTTGAAGGAGGTGGAAAGATGGAGGTATGGCAAATCAAT
GGAAGTGCTAAGACTCCATTACCTAAGGAGGATATCGGTAAATTTTATAGTGGAGATTGTTACATAGTACTGTACACTTA
TCACTCTAGTGAGAGGAAGGAGGACTACTACTTGTGTTGTTGGTTTGGAAAAGACAGCACTGAGGAGGACCAAAGAATGG
CTATTCGATTGGCTAACACAATGTTCAACTCATTAAAGGGTAGACCTGTTCAGGGTCGCATATTTGATGGTAAAGAGCCA
CCACAGTTTATTGTTCTTTTCCATCCAATGGTGGTCCTCAAGGGAGGCTTGAGCTCTGGTTACAAAAAATTGATAGCAGA
TAAAGGTTTGCCAGATGAGACATACACAGCAGAGAGTGTTGCATTTATTCGAATTTCTGGAACATCCACTCATAATAATA
AAGTGGTGCAAGTAGATGCAGTGGCAGCATTGCTGAATTCTACCGAGTGTTTTGTCCTGCAATCTGGCTCAGCTGTTTTT
ACATGGCATGGGAATCAATGTTCCCTTGAGCAGCAGCAGCTTGCTGCAAAAGTTGCTGAATTTTTAAGGCCAGGAGTTGC
TTTAAAGCTTGCTAAAGAAGGAACAGAAACCTCAACTTTCTGGTTTGCACTTGGAGGGAAACAAAGTTACAACAACAAAA
AAGTCACTAATGACATTGTCAGAGACCCCCATTTGTTCACTTTCTCTTTTAATAGAGGAAAGTTACAGGTAGAGGAGGTT
TACAACTTTTCCCAAGATGATTTGTTAACAGAGGATATCCTGATCCTTGACACACACGCAGAAGTGTTTGTATGGATTGG
TCAGTGTGTGGACCCAAAAGAAAAGCAAAATGCTTTTGAAATTGCCCAGAAATACATAGATAAGGCTGCATCTCTGGAGG
GACTATCTCCTCATGTACCACTATACAAAGTAACAGAAGGGAATGAACCTTGCTTTTTCACAACATACTTTTCTTGGGAT
CATACAAAAGCTATGGTTCCAGGAAACTCATTCCAGAAAAAGGTGACATTACTCTTTGGAATTGGCCATCCTGTAGAGGA
AAAGTCTAATGGGTCAAGTCAAGGAGGGGGACCAAGACAAAGAGCAGAAGCTTTGGCTGCCTTAAATAATGCATTCAATT
CATCTCCTGAGGCAACATCCAGTGCGGATAAGTCGAATGGGCTAAGTCGAGGAGGACCAAGACAAAGGGCTGAAGCCTTA
GCAGCCTTAAACTCTGCATTTAATTCATCATCTGGAACCAAAGTTTATACTCCTAGGCCATCTGGAAGAGGTCAAGGATC
ACAAAGAGCCGCCGCAGTAGCTGCTCTCTCTTCAGTTCTTACTGCTGAAAAGAAGAAAACTTCACCTGAAACTTCTCCTG
TGGCTAGCACTAGTCCTGTAGTGGAGAATAGCAACTTTGATACTAAAAGTGAAAGTGCCCCTTCTGAAAAGGAAATTGTT
GAAGAAGTTACAGAAGTCAAGGAGACAGAAGTTGTTGCCCTTGAAACTGGTACCAATGGGGATTCAGAACAACCAAAACA
AGAAAATGTGGAGGATGGAGGAAATGACAGTGAAAATAATAATCAAAATTTCTTCAGTTATGAGCAGTTAAAGACCAAAT
CTGGTAGTGTTGTGTCTGGAATTGATCTTAAACGGAGAGAGGCCTATCTGTCAGACAAAGAGTTCCAAGCTGTATTTGGA
ATGGCCAAAGATGCATTCTCCAAGTTGCCAAGATGGAAGCAAGACATGCTGAAAAGAAAAGTGGATTTGTTCTAG
Microexon DNA seq GAAAGTTACAG
Microexon Amino Acid seq GKLQ
Microexon-tag DNA Seq AATGACATTGTCAGAGACCCCCATTTGTTCACTTTCTCTTTTAATAGAGGAAAGTTACAGGTAGAGGAGGTTTACAACTTTTCCCAAGATGATTTGTTAACAGAGGAT
Microexon-tag Amino Acid seq NDIVRDPHLFTFSFNRGKLQVEEVYNFSQDDLLTED
Transcript ID Gm.36039.1
Gene ID Gm.36039
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 2.1e-07
Motif start 638
Motif end 713
Protein seq >Gm.36039.1
MSSATKVLDPAFQGVGQKVGTEIWRIEDFQPVPLPRPDYGKFYMGDSYIILQTTQGKGSAYLYDIHFWIGKDTSQDEAGT
AAIKTVELDASLGGRAVQHREIQGHESDKFLSYFKPCIIPLEGGVASGFKKPEEEEFETRLYVCRGKRVVRIKQVPFARS
SLNHDDVFILDTQNKIYQFNGANSNIQERAKALEVIQLLKEKYHEGKCDVAIVDDGKLDTESDSGEFWVLFGGFAPIGKK
IISEDDIVPETIPAQLYSIADGEAKPVEGELSKSLLENYKCYLLDCGAEVFVWVGRVTQVEERKAACQAAEEFLTSQKRP
KSTRITRIIQGYETHSFKSNFDSWPSGSATTGADEGRGKVAALLKQQGMGVKGVTKTTSVVEEIPPLLEGGGKMEVWQIN
GSAKTPLPKEDIGKFYSGDCYIVLYTYHSSERKEDYYLCCWFGKDSTEEDQRMAIRLANTMFNSLKGRPVQGRIFDGKEP
PQFIVLFHPMVVLKGGLSSGYKKLIADKGLPDETYTAESVAFIRISGTSTHNNKVVQVDAVAALLNSTECFVLQSGSAVF
TWHGNQCSLEQQQLAAKVAEFLRPGVALKLAKEGTETSTFWFALGGKQSYNNKKVTNDIVRDPHLFTFSFNRGKLQVEEV
YNFSQDDLLTEDILILDTHAEVFVWIGQCVDPKEKQNAFEIAQKYIDKAASLEGLSPHVPLYKVTEGNEPCFFTTYFSWD
HTKAMVPGNSFQKKVTLLFGIGHPVEEKSNGSSQGGGPRQRAEALAALNNAFNSSPEATSSADKSNGLSRGGPRQRAEAL
AALNSAFNSSSGTKVYTPRPSGRGQGSQRAAAVAALSSVLTAEKKKTSPETSPVASTSPVVENSNFDTKSESAPSEKEIV
EEVTEVKETEVVALETGTNGDSEQPKQENVEDGGNDSENNNQNFFSYEQLKTKSGSVVSGIDLKRREAYLSDKEFQAVFG
MAKDAFSKLPRWKQDMLKRKVDLF*
CDS seq >Gm.36039.1
ATGTCTAGTGCTACAAAAGTGTTGGATCCAGCATTCCAGGGAGTTGGTCAAAAAGTAGGGACTGAAATATGGAGGATTGA
GGATTTTCAGCCAGTTCCATTGCCCAGACCTGATTATGGGAAATTCTACATGGGAGATTCTTACATCATCTTGCAGACAA
CACAAGGCAAAGGAAGTGCTTATTTGTATGATATTCACTTCTGGATTGGAAAGGATACAAGTCAGGATGAGGCTGGAACA
GCAGCCATTAAAACTGTTGAACTCGATGCGTCTCTTGGAGGACGTGCAGTGCAACACAGGGAAATTCAAGGGCATGAATC
CGACAAGTTTTTGTCATACTTTAAGCCATGTATAATACCATTAGAGGGAGGTGTTGCATCGGGATTTAAAAAACCTGAAG
AAGAGGAGTTTGAAACACGTTTGTATGTATGCAGAGGAAAAAGAGTTGTCAGAATAAAACAGGTTCCTTTTGCACGGTCT
TCACTGAATCATGATGATGTATTCATCCTAGACACTCAGAATAAGATTTATCAATTCAATGGTGCAAACTCCAATATCCA
GGAAAGAGCCAAGGCTCTGGAAGTTATTCAGTTGTTGAAGGAAAAGTATCATGAGGGAAAATGCGACGTTGCAATTGTTG
ATGATGGAAAGCTGGATACTGAGTCAGACTCAGGTGAATTTTGGGTTCTCTTTGGTGGTTTTGCTCCCATTGGGAAGAAG
ATAATTAGTGAGGATGATATTGTTCCAGAGACCATTCCTGCTCAACTCTATAGTATTGCTGATGGTGAGGCCAAGCCTGT
GGAAGGTGAACTTTCAAAATCACTGCTGGAGAACTACAAATGCTATCTATTGGACTGTGGAGCTGAGGTATTTGTCTGGG
TTGGCCGGGTAACACAAGTTGAAGAACGAAAAGCAGCATGTCAAGCAGCTGAGGAGTTTCTCACAAGTCAAAAAAGGCCG
AAATCTACAAGGATTACCAGAATTATTCAAGGTTATGAGACACATTCGTTTAAGTCCAACTTTGATTCTTGGCCATCAGG
ATCTGCTACTACTGGTGCTGATGAAGGAAGAGGAAAAGTTGCAGCTTTGCTTAAGCAACAAGGTATGGGTGTGAAAGGGG
TGACAAAAACTACCTCAGTTGTTGAGGAAATTCCACCTCTACTTGAAGGAGGTGGAAAGATGGAGGTATGGCAAATCAAT
GGAAGTGCTAAGACTCCATTACCTAAGGAGGATATCGGTAAATTTTATAGTGGAGATTGTTACATAGTACTGTACACTTA
TCACTCTAGTGAGAGGAAGGAGGACTACTACTTGTGTTGTTGGTTTGGAAAAGACAGCACTGAGGAGGACCAAAGAATGG
CTATTCGATTGGCTAACACAATGTTCAACTCATTAAAGGGTAGACCTGTTCAGGGTCGCATATTTGATGGTAAAGAGCCA
CCACAGTTTATTGTTCTTTTCCATCCAATGGTGGTCCTCAAGGGAGGCTTGAGCTCTGGTTACAAAAAATTGATAGCAGA
TAAAGGTTTGCCAGATGAGACATACACAGCAGAGAGTGTTGCATTTATTCGAATTTCTGGAACATCCACTCATAATAATA
AAGTGGTGCAAGTAGATGCAGTGGCAGCATTGCTGAATTCTACCGAGTGTTTTGTCCTGCAATCTGGCTCAGCTGTTTTT
ACATGGCATGGGAATCAATGTTCCCTTGAGCAGCAGCAGCTTGCTGCAAAAGTTGCTGAATTTTTAAGGCCAGGAGTTGC
TTTAAAGCTTGCTAAAGAAGGAACAGAAACCTCAACTTTCTGGTTTGCACTTGGAGGGAAACAAAGTTACAACAACAAAA
AAGTCACTAATGACATTGTCAGAGACCCCCATTTGTTCACTTTCTCTTTTAATAGAGGAAAGTTACAGGTAGAGGAGGTT
TACAACTTTTCCCAAGATGATTTGTTAACAGAGGATATCCTGATCCTTGACACACACGCAGAAGTGTTTGTATGGATTGG
TCAGTGTGTGGACCCAAAAGAAAAGCAAAATGCTTTTGAAATTGCCCAGAAATACATAGATAAGGCTGCATCTCTGGAGG
GACTATCTCCTCATGTACCACTATACAAAGTAACAGAAGGGAATGAACCTTGCTTTTTCACAACATACTTTTCTTGGGAT
CATACAAAAGCTATGGTTCCAGGAAACTCATTCCAGAAAAAGGTGACATTACTCTTTGGAATTGGCCATCCTGTAGAGGA
AAAGTCTAATGGGTCAAGTCAAGGAGGGGGACCAAGACAAAGAGCAGAAGCTTTGGCTGCCTTAAATAATGCATTCAATT
CATCTCCTGAGGCAACATCCAGTGCGGATAAGTCGAATGGGCTAAGTCGAGGAGGACCAAGACAAAGGGCTGAAGCCTTA
GCAGCCTTAAACTCTGCATTTAATTCATCATCTGGAACCAAAGTTTATACTCCTAGGCCATCTGGAAGAGGTCAAGGATC
ACAAAGAGCCGCCGCAGTAGCTGCTCTCTCTTCAGTTCTTACTGCTGAAAAGAAGAAAACTTCACCTGAAACTTCTCCTG
TGGCTAGCACTAGTCCTGTAGTGGAGAATAGCAACTTTGATACTAAAAGTGAAAGTGCCCCTTCTGAAAAGGAAATTGTT
GAAGAAGTTACAGAAGTCAAGGAGACAGAAGTTGTTGCCCTTGAAACTGGTACCAATGGGGATTCAGAACAACCAAAACA
AGAAAATGTGGAGGATGGAGGAAATGACAGTGAAAATAATAATCAAAATTTCTTCAGTTATGAGCAGTTAAAGACCAAAT
CTGGTAGTGTTGTGTCTGGAATTGATCTTAAACGGAGAGAGGCCTATCTGTCAGACAAAGAGTTCCAAGCTGTATTTGGA
ATGGCCAAAGATGCATTCTCCAAGTTGCCAAGATGGAAGCAAGACATGCTGAAAAGAAAAGTGGATTTGTTCTAG