Microexon ID Gm_9:5181904-5181914:-
Species Glycine max
Coordinates 9:5181904..5181914
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAATTTAAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AGAGAGCCTGAAAGTGATCCTCACCTATTTTCTTGCCACTTCTCGAAAGGAAATTTAAAGGTGACCGAGGTATACAACTTCTCCCAGGATGATTTGATGACTGAAGAC
Microexon-tag Amino Acid Seq REPESDPHLFSCHFSKGNLKVTEVYNFSQDDLMTED
Microexon-tag spanning region5180783-5182081
Microexon-tag prediction score0.9632
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH37296x
Reference Transcript ID KRH37296
Gene ID GLYMA_09G057400
Gene Name NA
Transcript ID KRH37296
Protein ID KRH37296
Gene ID GLYMA_09G057400
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 1.2e-08
Motif start 637
Motif end 712
Protein seq >KRH37296
MAVSMRDLDPAFQGAGQKAGLEIWRIENFNPVPVPKSSYGKFFTGDSYVILKTTASKSGALRHDIHYWLGKDTSQDEAGA
AAIKTVELDAALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGVSSGFKHPEAEKHKTRLFVCRGKHVVHVKEVPFARA
SLNHDDIFVLDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHEGKCEVAAVEDGKLMADPETGEFWGFFGGFAPLPRK
TASDDDKPTDSRPPKLLCVEKGQAEPVETDSLKRELLDTNKCYILDCGFEVFVWLGRNTSLDERKSASGVADEIVSGTDQ
LKPQIIRVIEGFETVMFRSKFDSWPQTTDVTVSEDGRGKVAALLKRQGVNVKGLLKADPVREEPQPHIDCTGHLQVWHVN
GQEKILLQASDQSKFYSGDCFIFQYTYPGEDKEDCLIGTWIGKNSVEEERASANSLASKMVESMKFLASQARIYEGNEPI
QFHSILQSFIVFKGGISEGYKTYIAQKEIPDDTYNENGVALFRIQGSGPDNMQAIQVEPVASSLNSSYCYILHNGPAVFT
WSGNSTSAENQELVERMLDLIKPNLQSKPQREGSESEQFWDLLGGKSEYPSQKILREPESDPHLFSCHFSKGNLKVTEVY
NFSQDDLMTEDIFVLDCHSEIFVWVGQQVDSKSRMQALSIGEKFLEHDFLLEKLSRVAPIYVVMEGSEPPFFTRFFKWDS
AKAAMLGNSFQRKLTIVKSGGAPVLDKPKRRTSASYGGRSSSVPDKSSQRSSRSMSVSPDRVRVRGRSPAFNALAANFEN
PNSRNLSTPPPVIRKLYPKSVTTDSAILAPKSSAIAALSSSFEQPPSARETMIPRSLKVSPVMPKSNPEKNDKENSVSTR
VESLTIQEDVKEDEVEDEEGLVIYPYERLKIMSTDPVPNIDVTKRETYLSSAEFKEKFGMSKDAFYKLPKWKQNKLKMAV
QLF*
CDS seq >KRH37296
ATGGCTGTTTCCATGAGAGATTTGGATCCAGCTTTCCAGGGAGCTGGACAAAAGGCTGGACTTGAAATATGGCGTATTGA
GAATTTTAATCCAGTTCCTGTCCCAAAGTCCTCTTATGGGAAATTTTTCACTGGAGACTCCTATGTGATCTTAAAGACAA
CTGCCTCAAAAAGTGGTGCTCTGCGCCATGACATCCATTACTGGCTTGGTAAAGACACCAGTCAGGATGAAGCGGGTGCT
GCAGCTATCAAGACAGTTGAGCTGGATGCAGCTCTTGGAGGACGTGCTGTTCAGTATCGTGAAGTACAAGGCCATGAAAC
TGAAAAGTTTCTGTCTTATTTCAAACCATGTATTATCCCTCAAGAAGGTGGAGTTTCTTCTGGTTTCAAACATCCTGAGG
CTGAAAAACATAAGACACGGTTGTTTGTATGCAGAGGGAAACATGTTGTACATGTCAAAGAGGTTCCATTTGCCAGAGCT
TCACTCAACCATGATGATATTTTTGTTCTGGATACCGAATCGAAAATTTTCCAATTTAATGGTTCCAATTCGTCTATTCA
AGAAAGGGCTAAAGCTTTGGAAGTTGTACAGTATATTAAGGATACCTACCATGAAGGGAAATGTGAGGTAGCTGCTGTTG
AGGATGGAAAGTTGATGGCTGATCCTGAAACTGGGGAATTCTGGGGTTTCTTTGGGGGATTTGCTCCTCTTCCACGAAAA
ACAGCCAGCGATGATGATAAGCCTACTGATTCTCGCCCTCCAAAGCTGCTTTGTGTTGAAAAGGGTCAGGCAGAACCTGT
TGAGACTGATTCTTTGAAAAGGGAATTACTAGACACAAATAAATGCTATATTCTTGATTGTGGGTTTGAAGTGTTTGTCT
GGTTGGGAAGAAATACCTCCCTTGATGAAAGAAAAAGCGCAAGTGGAGTTGCAGATGAGATAGTCAGTGGCACTGATCAA
CTGAAACCCCAAATAATTCGTGTGATAGAAGGATTTGAAACAGTGATGTTCAGGTCCAAATTTGATTCTTGGCCTCAGAC
AACTGATGTAACAGTATCTGAAGATGGCCGTGGCAAGGTAGCAGCACTTCTAAAACGTCAAGGAGTAAATGTTAAGGGCT
TGTTGAAAGCTGATCCAGTGAGGGAAGAACCCCAACCCCACATTGATTGCACAGGACATTTGCAGGTTTGGCATGTGAAT
GGTCAGGAGAAGATTCTTCTTCAAGCTTCTGATCAATCAAAATTTTATAGTGGAGATTGCTTCATCTTCCAGTATACATA
TCCTGGAGAGGATAAAGAAGATTGTCTTATAGGAACGTGGATTGGAAAGAATAGTGTTGAGGAAGAACGAGCTTCAGCTA
ATTCATTGGCAAGTAAAATGGTTGAGTCAATGAAGTTTCTTGCTTCCCAGGCTCGTATATATGAAGGCAATGAACCAATT
CAATTTCATTCTATCCTTCAAAGCTTCATTGTTTTTAAGGGTGGGATTAGTGAAGGATACAAGACTTACATTGCACAAAA
GGAAATTCCTGATGATACATACAATGAGAATGGTGTTGCATTATTCCGCATCCAGGGCTCTGGACCAGACAATATGCAAG
CCATACAAGTTGAACCAGTTGCATCTTCCTTGAATTCCTCTTACTGTTACATACTTCACAATGGGCCTGCTGTCTTTACT
TGGTCTGGAAACTCTACAAGTGCAGAAAACCAGGAACTTGTTGAGAGGATGCTGGATTTGATAAAGCCAAATTTACAATC
CAAACCACAAAGGGAAGGTTCCGAATCTGAACAGTTTTGGGATTTGTTAGGAGGAAAATCAGAATATCCCAGTCAAAAGA
TTCTTAGAGAGCCTGAAAGTGATCCTCACCTATTTTCTTGCCACTTCTCGAAAGGAAATTTAAAGGTGACCGAGGTATAC
AACTTCTCCCAGGATGATTTGATGACTGAAGACATTTTTGTCTTGGATTGTCACTCGGAAATCTTTGTCTGGGTTGGCCA
GCAGGTTGACTCCAAGAGTAGAATGCAGGCTCTATCAATTGGTGAGAAATTTCTTGAGCATGATTTTCTTCTAGAAAAAT
TATCTCGTGTAGCTCCAATATATGTTGTCATGGAAGGGAGTGAGCCACCTTTCTTCACACGCTTCTTTAAATGGGATTCT
GCAAAAGCTGCAATGCTGGGAAACTCATTTCAAAGGAAGCTGACAATTGTGAAAAGTGGGGGTGCTCCAGTTTTGGATAA
ACCCAAACGGAGAACATCAGCATCTTATGGGGGAAGGTCGAGTAGTGTGCCAGATAAATCCTCCCAGCGTTCCTCTCGCA
GCATGTCTGTCAGTCCTGATCGTGTTCGTGTGAGGGGGCGGTCTCCAGCCTTTAATGCTCTAGCAGCTAATTTTGAGAAC
CCTAATTCTAGGAACCTTTCAACCCCACCTCCAGTAATTAGAAAGCTGTATCCTAAATCTGTGACAACAGATTCTGCAAT
ACTGGCGCCAAAATCTTCTGCCATAGCTGCACTTAGTTCTTCTTTTGAACAACCACCTTCAGCACGAGAAACCATGATAC
CTCGCTCACTTAAAGTGAGTCCAGTAATGCCCAAATCAAACCCTGAGAAAAATGACAAGGAGAATTCTGTGAGCACCAGA
GTGGAATCTCTTACCATACAGGAAGATGTGAAAGAGGATGAAGTTGAAGATGAGGAAGGTCTTGTGATTTACCCATATGA
ACGCCTTAAAATAATGTCCACAGATCCTGTACCAAATATTGATGTGACTAAGCGAGAGACTTATCTATCATCTGCGGAGT
TCAAAGAGAAATTTGGGATGTCCAAGGATGCCTTTTACAAGTTACCCAAATGGAAACAAAACAAACTCAAAATGGCTGTT
CAGTTATTCTGA
Microexon DNA seq GAAATTTAAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AGAGAGCCTGAAAGTGATCCTCACCTATTTTCTTGCCACTTCTCGAAAGGAAATTTAAAGGTGACCGAGGTATACAACTTCTCCCAGGATGATTTGATGACTGAAGAC
Microexon-tag Amino Acid seq REPESDPHLFSCHFSKGNLKVTEVYNFSQDDLMTED
Transcript ID Gm.52339.1
Gene ID Gm.52339
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 1.2e-08
Motif start 637
Motif end 712
Protein seq >Gm.52339.1
MAVSMRDLDPAFQGAGQKAGLEIWRIENFNPVPVPKSSYGKFFTGDSYVILKTTASKSGALRHDIHYWLGKDTSQDEAGA
AAIKTVELDAALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGVSSGFKHPEAEKHKTRLFVCRGKHVVHVKEVPFARA
SLNHDDIFVLDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHEGKCEVAAVEDGKLMADPETGEFWGFFGGFAPLPRK
TASDDDKPTDSRPPKLLCVEKGQAEPVETDSLKRELLDTNKCYILDCGFEVFVWLGRNTSLDERKSASGVADEIVSGTDQ
LKPQIIRVIEGFETVMFRSKFDSWPQTTDVTVSEDGRGKVAALLKRQGVNVKGLLKADPVREEPQPHIDCTGHLQVWHVN
GQEKILLQASDQSKFYSGDCFIFQYTYPGEDKEDCLIGTWIGKNSVEEERASANSLASKMVESMKFLASQARIYEGNEPI
QFHSILQSFIVFKGGISEGYKTYIAQKEIPDDTYNENGVALFRIQGSGPDNMQAIQVEPVASSLNSSYCYILHNGPAVFT
WSGNSTSAENQELVERMLDLIKPNLQSKPQREGSESEQFWDLLGGKSEYPSQKILREPESDPHLFSCHFSKGNLKVTEVY
NFSQDDLMTEDIFVLDCHSEIFVWVGQQVDSKSRMQALSIGEKFLEHDFLLEKLSRVAPIYVVMEGSEPPFFTRFFKWDS
AKAAMLGNSFQRKLTIVKSGGAPVLDKPKRRTSASYGGRSSSVPDKSSQRSSRSMSVSPDRVRVRGRSPAFNALAANFEN
PNSRNLSTPPPVIRKLYPKSVTTDSAILAPKSSAIAALSSSFEQPPSARETMIPRSLKVSPVMPKSNPEKNDKENSVSTR
VESLTIQEDVKEDEVEDEEGLVIYPYERLKIMSTDPVPNIDVTKRETYLSSAEFKEKFGMSKDAFYKLPKWKQNKLKMAV
QLF*
CDS seq >Gm.52339.1
ATGGCTGTTTCCATGAGAGATTTGGATCCAGCTTTCCAGGGAGCTGGACAAAAGGCTGGACTTGAAATATGGCGTATTGA
GAATTTTAATCCAGTTCCTGTCCCAAAGTCCTCTTATGGGAAATTTTTCACTGGAGACTCCTATGTGATCTTAAAGACAA
CTGCCTCAAAAAGTGGTGCTCTGCGCCATGACATCCATTACTGGCTTGGTAAAGACACCAGTCAGGATGAAGCGGGTGCT
GCAGCTATCAAGACAGTTGAGCTGGATGCAGCTCTTGGAGGACGTGCTGTTCAGTATCGTGAAGTACAAGGCCATGAAAC
TGAAAAGTTTCTGTCTTATTTCAAACCATGTATTATCCCTCAAGAAGGTGGAGTTTCTTCTGGTTTCAAACATCCTGAGG
CTGAAAAACATAAGACACGGTTGTTTGTATGCAGAGGGAAACATGTTGTACATGTCAAAGAGGTTCCATTTGCCAGAGCT
TCACTCAACCATGATGATATTTTTGTTCTGGATACCGAATCGAAAATTTTCCAATTTAATGGTTCCAATTCGTCTATTCA
AGAAAGGGCTAAAGCTTTGGAAGTTGTACAGTATATTAAGGATACCTACCATGAAGGGAAATGTGAGGTAGCTGCTGTTG
AGGATGGAAAGTTGATGGCTGATCCTGAAACTGGGGAATTCTGGGGTTTCTTTGGGGGATTTGCTCCTCTTCCACGAAAA
ACAGCCAGCGATGATGATAAGCCTACTGATTCTCGCCCTCCAAAGCTGCTTTGTGTTGAAAAGGGTCAGGCAGAACCTGT
TGAGACTGATTCTTTGAAAAGGGAATTACTAGACACAAATAAATGCTATATTCTTGATTGTGGGTTTGAAGTGTTTGTCT
GGTTGGGAAGAAATACCTCCCTTGATGAAAGAAAAAGCGCAAGTGGAGTTGCAGATGAGATAGTCAGTGGCACTGATCAA
CTGAAACCCCAAATAATTCGTGTGATAGAAGGATTTGAAACAGTGATGTTCAGGTCCAAATTTGATTCTTGGCCTCAGAC
AACTGATGTAACAGTATCTGAAGATGGCCGTGGCAAGGTAGCAGCACTTCTAAAACGTCAAGGAGTAAATGTTAAGGGCT
TGTTGAAAGCTGATCCAGTGAGGGAAGAACCCCAACCCCACATTGATTGCACAGGACATTTGCAGGTTTGGCATGTGAAT
GGTCAGGAGAAGATTCTTCTTCAAGCTTCTGATCAATCAAAATTTTATAGTGGAGATTGCTTCATCTTCCAGTATACATA
TCCTGGAGAGGATAAAGAAGATTGTCTTATAGGAACGTGGATTGGAAAGAATAGTGTTGAGGAAGAACGAGCTTCAGCTA
ATTCATTGGCAAGTAAAATGGTTGAGTCAATGAAGTTTCTTGCTTCCCAGGCTCGTATATATGAAGGCAATGAACCAATT
CAATTTCATTCTATCCTTCAAAGCTTCATTGTTTTTAAGGGTGGGATTAGTGAAGGATACAAGACTTACATTGCACAAAA
GGAAATTCCTGATGATACATACAATGAGAATGGTGTTGCATTATTCCGCATCCAGGGCTCTGGACCAGACAATATGCAAG
CCATACAAGTTGAACCAGTTGCATCTTCCTTGAATTCCTCTTACTGTTACATACTTCACAATGGGCCTGCTGTCTTTACT
TGGTCTGGAAACTCTACAAGTGCAGAAAACCAGGAACTTGTTGAGAGGATGCTGGATTTGATAAAGCCAAATTTACAATC
CAAACCACAAAGGGAAGGTTCCGAATCTGAACAGTTTTGGGATTTGTTAGGAGGAAAATCAGAATATCCCAGTCAAAAGA
TTCTTAGAGAGCCTGAAAGTGATCCTCACCTATTTTCTTGCCACTTCTCGAAAGGAAATTTAAAGGTGACCGAGGTATAC
AACTTCTCCCAGGATGATTTGATGACTGAAGACATTTTTGTCTTGGATTGTCACTCGGAAATCTTTGTCTGGGTTGGCCA
GCAGGTTGACTCCAAGAGTAGAATGCAGGCTCTATCAATTGGTGAGAAATTTCTTGAGCATGATTTTCTTCTAGAAAAAT
TATCTCGTGTAGCTCCAATATATGTTGTCATGGAAGGGAGTGAGCCACCTTTCTTCACACGCTTCTTTAAATGGGATTCT
GCAAAAGCTGCAATGCTGGGAAACTCATTTCAAAGGAAGCTGACAATTGTGAAAAGTGGGGGTGCTCCAGTTTTGGATAA
ACCCAAACGGAGAACATCAGCATCTTATGGGGGAAGGTCGAGTAGTGTGCCAGATAAATCCTCCCAGCGTTCCTCTCGCA
GCATGTCTGTCAGTCCTGATCGTGTTCGTGTGAGGGGGCGGTCTCCAGCCTTTAATGCTCTAGCAGCTAATTTTGAGAAC
CCTAATTCTAGGAACCTTTCAACCCCACCTCCAGTAATTAGAAAGCTGTATCCTAAATCTGTGACAACAGATTCTGCAAT
ACTGGCGCCAAAATCTTCTGCCATAGCTGCACTTAGTTCTTCTTTTGAACAACCACCTTCAGCACGAGAAACCATGATAC
CTCGCTCACTTAAAGTGAGTCCAGTAATGCCCAAATCAAACCCTGAGAAAAATGACAAGGAGAATTCTGTGAGCACCAGA
GTGGAATCTCTTACCATACAGGAAGATGTGAAAGAGGATGAAGTTGAAGATGAGGAAGGTCTTGTGATTTACCCATATGA
ACGCCTTAAAATAATGTCCACAGATCCTGTACCAAATATTGATGTGACTAAGCGAGAGACTTATCTATCATCTGCGGAGT
TCAAAGAGAAATTTGGGATGTCCAAGGATGCCTTTTACAAGTTACCCAAATGGAAACAAAACAAACTCAAAATGGCTGTT
CAGTTATTCTGA