Microexon ID Gm_15:14052365-14052375:-
Species Glycine max
Coordinates 15:14052365..14052375
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAATTTAAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AGAGAGCCTGAAAGTGATCCTCACCTATTCTCTTGTCATTTCTCGAAAGGAAATTTAAAGGTGACAGAGGTATACAACTTCTCCCAGGATGATTTGATGACAGAAGAT
Microexon-tag Amino Acid Seq REPESDPHLFSCHFSKGNLKVTEVYNFSQDDLMTED
Microexon-tag spanning region14051369-14052541
Microexon-tag prediction score0.9645
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH12272x
Reference Transcript ID KRH12272
Gene ID GLYMA_15G163700
Gene Name NA
Transcript ID KRH12272
Protein ID KRH12272
Gene ID GLYMA_15G163700
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 3.3e-09
Motif start 637
Motif end 712
Protein seq >KRH12272
MAVSMRDLDPAFQGAGQKAGLEIWRIENFNPVPVPKSSYGKFFTGDSYVILKTTASKSGALRHDIHYWLGKDTSQDEAGA
AAIKTVELDAALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGVASGFKHPEAEKHKTRLFVCRGKHVVHVKEVPFARA
SLNHDDIFVLDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHEGKCEVAAVEDGKLMADPETGEFWGFFGGFAPLPRK
TASDDDKPTDSRPPKLLCFEKGQAEPVETDSLKRELLDTNKCYILDCGFEVFVWMGRNTSLDERKIASGVADELVSGTDQ
LKPQIIRVIEGFETVMFRSKFDSWPQITDVTVSEDGRGKVAALLKRQGVNVKGLLKADPVREEPQPHIDCTGHLQVWRVN
GQEKILLQASDQSKFYSGDCFIFQYTYPGEDKEDCLIGTWIGKNSVEEERASANSLASKMVESMKFLASQARIYEGNEPI
QFHSILQSFIVFKGGLSEGYKTYIAQKEIPDDTYNENGVALFRIQGSGPDNMQAIQVEPVASSLNSSYCYILHNGPAVFT
WSGNSTSAENQELVERMLDLIKPNLQSKPQREGSESEQFWDFLGGKSEYPSQKILREPESDPHLFSCHFSKGNLKVTEVY
NFSQDDLMTEDIFILDCHSEIFVWVGQQVDSKSRMQALTIGEKFLEHDFLLEKLSHVAPVYVVMEGSEPPFFTRFFKWDS
AKSSMLGNSFQRKLTIVKSGGAPVLDKPKRRTPVSYGGRSSSVPDKSSQRSSRSMSVSPDRVRVRGRSPAFNALAANFEN
PNARNLSTPPPVIRKLYPKSVTPDSAILAPKSAAIAALSSSFEQPPSARETMIPKSIKVSPVMPKSNPEKNDKENSVSTR
VESLTIQEDVKEDEIEDEEGLVIHPYERLKITSTDPVPNIDVTKRETYLSSAEFKEKFAMSKDAFYKLPKWKQNKLKMAV
QLF*
CDS seq >KRH12272
ATGGCTGTTTCCATGAGAGATTTGGATCCAGCTTTCCAGGGAGCTGGACAAAAGGCTGGACTTGAAATATGGCGTATTGA
GAATTTTAATCCAGTTCCTGTCCCAAAGTCGTCTTACGGGAAATTTTTCACTGGAGACTCCTATGTAATCTTAAAGACAA
CTGCATCAAAAAGTGGTGCTCTGCGCCATGACATCCATTACTGGCTTGGTAAAGACACCAGTCAGGATGAAGCCGGTGCT
GCGGCCATCAAGACAGTTGAGCTGGATGCAGCTCTTGGAGGACGTGCTGTTCAGTATCGTGAAGTACAAGGCCATGAAAC
TGAAAAGTTTCTGTCTTATTTCAAACCATGTATTATCCCACAAGAAGGTGGAGTTGCTTCTGGTTTCAAACATCCTGAGG
CTGAAAAACATAAGACACGGTTGTTTGTATGCAGAGGGAAACATGTTGTACATGTCAAAGAGGTTCCATTTGCTAGAGCT
TCACTCAACCATGATGATATTTTTGTTCTGGATACTGAATCGAAAATTTTCCAATTTAATGGTTCCAATTCATCTATTCA
AGAAAGGGCCAAAGCTTTGGAAGTTGTACAGTATATTAAGGATACCTACCATGAAGGGAAATGTGAGGTAGCTGCTGTTG
AGGATGGAAAGTTGATGGCTGATCCTGAAACTGGGGAATTCTGGGGTTTCTTTGGGGGATTTGCTCCTCTTCCACGAAAA
ACAGCCAGTGATGATGATAAGCCTACTGATTCTCGCCCTCCAAAGCTGCTTTGCTTTGAAAAGGGTCAGGCAGAACCTGT
TGAGACTGATTCTTTGAAAAGGGAATTACTAGACACAAATAAATGCTATATTCTTGATTGTGGGTTTGAAGTGTTTGTCT
GGATGGGAAGAAATACCTCTCTTGATGAAAGAAAAATTGCAAGTGGAGTTGCAGATGAGTTAGTCAGCGGCACTGATCAA
CTGAAACCCCAAATAATTCGTGTGATAGAAGGATTTGAAACAGTGATGTTCAGGTCCAAATTTGATTCTTGGCCTCAGAT
AACTGATGTAACAGTATCTGAAGATGGTCGTGGCAAGGTAGCAGCACTTCTAAAACGTCAAGGAGTAAATGTAAAGGGGT
TGTTGAAAGCTGATCCAGTGAGGGAAGAACCCCAACCCCACATCGATTGCACAGGACATTTGCAGGTTTGGCGCGTGAAT
GGTCAGGAGAAGATTCTTCTTCAAGCTTCTGATCAGTCAAAATTTTATAGTGGAGATTGCTTCATCTTTCAGTATACATA
TCCTGGAGAGGATAAAGAAGATTGTCTTATAGGAACATGGATTGGCAAGAATAGTGTTGAGGAAGAACGAGCTTCAGCTA
ATTCATTGGCAAGCAAAATGGTTGAGTCAATGAAGTTTCTTGCTTCCCAGGCTCGTATATATGAAGGCAACGAACCAATT
CAATTTCATTCTATCCTTCAAAGTTTCATTGTTTTTAAGGGTGGGCTTAGTGAAGGATATAAGACTTACATTGCACAAAA
GGAAATTCCTGATGATACATACAATGAGAATGGTGTTGCATTATTCCGCATCCAGGGCTCTGGACCAGACAATATGCAAG
CCATACAAGTTGAACCCGTTGCATCTTCCTTGAATTCCTCTTATTGTTACATACTTCATAATGGGCCTGCTGTCTTTACT
TGGTCTGGAAACTCTACAAGTGCAGAAAACCAGGAACTTGTTGAGAGGATGCTGGATTTGATAAAGCCAAATTTACAATC
CAAACCACAAAGGGAAGGTTCTGAATCTGAACAGTTTTGGGATTTTTTAGGAGGAAAATCAGAATATCCCAGTCAAAAGA
TTCTTAGAGAGCCTGAAAGTGATCCTCACCTATTCTCTTGTCATTTCTCGAAAGGAAATTTAAAGGTGACAGAGGTATAC
AACTTCTCCCAGGATGATTTGATGACAGAAGATATTTTCATCTTGGATTGTCACTCGGAAATCTTTGTCTGGGTTGGCCA
GCAGGTTGACTCCAAGAGTAGAATGCAGGCTCTAACAATTGGTGAGAAATTTCTTGAGCATGATTTTCTTCTAGAAAAGT
TATCTCATGTAGCTCCAGTATATGTTGTCATGGAAGGGAGTGAGCCACCTTTCTTCACACGCTTCTTTAAATGGGATTCT
GCAAAATCTTCAATGCTGGGAAACTCATTTCAAAGGAAGCTGACAATTGTGAAAAGTGGGGGTGCTCCAGTTTTGGATAA
ACCCAAACGGAGAACACCAGTATCTTATGGGGGAAGGTCGAGTAGTGTGCCAGATAAATCCTCCCAGCGTTCCTCCCGCA
GCATGTCTGTCAGTCCTGATCGTGTTCGAGTGAGGGGCCGGTCTCCAGCCTTTAATGCCCTAGCAGCTAATTTTGAGAAC
CCTAATGCTAGGAACCTTTCAACCCCGCCTCCAGTAATTAGAAAGCTGTATCCAAAATCTGTGACACCAGATTCTGCAAT
ACTGGCGCCAAAATCTGCAGCCATAGCTGCACTTAGTTCTTCTTTTGAACAGCCACCTTCAGCACGAGAAACTATGATAC
CAAAGTCAATTAAAGTGAGTCCAGTAATGCCCAAATCAAACCCTGAGAAAAATGACAAGGAGAATTCTGTGAGCACCAGA
GTGGAATCTCTTACCATACAGGAAGATGTGAAAGAGGATGAAATTGAAGATGAGGAAGGTCTTGTGATTCACCCATATGA
ACGCCTTAAAATAACATCCACAGATCCTGTACCAAATATTGATGTGACTAAGCGAGAGACTTATCTATCATCTGCGGAGT
TCAAAGAGAAATTTGCGATGTCCAAGGATGCCTTTTACAAGTTGCCCAAATGGAAACAAAACAAACTCAAAATGGCTGTT
CAGTTATTCTGA
Microexon DNA seq GAAATTTAAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AGAGAGCCTGAAAGTGATCCTCACCTATTCTCTTGTCATTTCTCGAAAGGAAATTTAAAGGTGACAGAGGTATACAACTTCTCCCAGGATGATTTGATGACAGAAGAT
Microexon-tag Amino Acid seq REPESDPHLFSCHFSKGNLKVTEVYNFSQDDLMTED
Transcript ID Gm.18180.1
Gene ID Gm.18180
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 3.3e-09
Motif start 637
Motif end 712
Protein seq >Gm.18180.1
MAVSMRDLDPAFQGAGQKAGLEIWRIENFNPVPVPKSSYGKFFTGDSYVILKTTASKSGALRHDIHYWLGKDTSQDEAGA
AAIKTVELDAALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGVASGFKHPEAEKHKTRLFVCRGKHVVHVKEVPFARA
SLNHDDIFVLDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHEGKCEVAAVEDGKLMADPETGEFWGFFGGFAPLPRK
TASDDDKPTDSRPPKLLCFEKGQAEPVETDSLKRELLDTNKCYILDCGFEVFVWMGRNTSLDERKIASGVADELVSGTDQ
LKPQIIRVIEGFETVMFRSKFDSWPQITDVTVSEDGRGKVAALLKRQGVNVKGLLKADPVREEPQPHIDCTGHLQVWRVN
GQEKILLQASDQSKFYSGDCFIFQYTYPGEDKEDCLIGTWIGKNSVEEERASANSLASKMVESMKFLASQARIYEGNEPI
QFHSILQSFIVFKGGLSEGYKTYIAQKEIPDDTYNENGVALFRIQGSGPDNMQAIQVEPVASSLNSSYCYILHNGPAVFT
WSGNSTSAENQELVERMLDLIKPNLQSKPQREGSESEQFWDFLGGKSEYPSQKILREPESDPHLFSCHFSKGNLKVTEVY
NFSQDDLMTEDIFILDCHSEIFVWVGQQVDSKSRMQALTIGEKFLEHDFLLEKLSHVAPVYVVMEGSEPPFFTRFFKWDS
AKSSMLGNSFQRKLTIVKSGGAPVLDKPKRRTPVSYGGRSSSVPDKSSQRSSRSMSVSPDRVRVRGRSPAFNALAANFEN
PNARNLSTPPPVIRKLYPKSVTPDSAILAPKSAAIAALSSSFEQPPSARETMIPKSIKVSPVMPKSNPEKNDKENSVSTR
VESLTIQEDVKEDEIEDEEGLVIHPYERLKITSTDPVPNIDVTKRETYLSSAEFKEKFAMSKDAFYKLPKWKQNKLKMAV
QLF*
CDS seq >Gm.18180.1
ATGGCTGTTTCCATGAGAGATTTGGATCCAGCTTTCCAGGGAGCTGGACAAAAGGCTGGACTTGAAATATGGCGTATTGA
GAATTTTAATCCAGTTCCTGTCCCAAAGTCGTCTTACGGGAAATTTTTCACTGGAGACTCCTATGTAATCTTAAAGACAA
CTGCATCAAAAAGTGGTGCTCTGCGCCATGACATCCATTACTGGCTTGGTAAAGACACCAGTCAGGATGAAGCCGGTGCT
GCGGCCATCAAGACAGTTGAGCTGGATGCAGCTCTTGGAGGACGTGCTGTTCAGTATCGTGAAGTACAAGGCCATGAAAC
TGAAAAGTTTCTGTCTTATTTCAAACCATGTATTATCCCACAAGAAGGTGGAGTTGCTTCTGGTTTCAAACATCCTGAGG
CTGAAAAACATAAGACACGGTTGTTTGTATGCAGAGGGAAACATGTTGTACATGTCAAAGAGGTTCCATTTGCTAGAGCT
TCACTCAACCATGATGATATTTTTGTTCTGGATACTGAATCGAAAATTTTCCAATTTAATGGTTCCAATTCATCTATTCA
AGAAAGGGCCAAAGCTTTGGAAGTTGTACAGTATATTAAGGATACCTACCATGAAGGGAAATGTGAGGTAGCTGCTGTTG
AGGATGGAAAGTTGATGGCTGATCCTGAAACTGGGGAATTCTGGGGTTTCTTTGGGGGATTTGCTCCTCTTCCACGAAAA
ACAGCCAGTGATGATGATAAGCCTACTGATTCTCGCCCTCCAAAGCTGCTTTGCTTTGAAAAGGGTCAGGCAGAACCTGT
TGAGACTGATTCTTTGAAAAGGGAATTACTAGACACAAATAAATGCTATATTCTTGATTGTGGGTTTGAAGTGTTTGTCT
GGATGGGAAGAAATACCTCTCTTGATGAAAGAAAAATTGCAAGTGGAGTTGCAGATGAGTTAGTCAGCGGCACTGATCAA
CTGAAACCCCAAATAATTCGTGTGATAGAAGGATTTGAAACAGTGATGTTCAGGTCCAAATTTGATTCTTGGCCTCAGAT
AACTGATGTAACAGTATCTGAAGATGGTCGTGGCAAGGTAGCAGCACTTCTAAAACGTCAAGGAGTAAATGTAAAGGGGT
TGTTGAAAGCTGATCCAGTGAGGGAAGAACCCCAACCCCACATCGATTGCACAGGACATTTGCAGGTTTGGCGCGTGAAT
GGTCAGGAGAAGATTCTTCTTCAAGCTTCTGATCAGTCAAAATTTTATAGTGGAGATTGCTTCATCTTTCAGTATACATA
TCCTGGAGAGGATAAAGAAGATTGTCTTATAGGAACATGGATTGGCAAGAATAGTGTTGAGGAAGAACGAGCTTCAGCTA
ATTCATTGGCAAGCAAAATGGTTGAGTCAATGAAGTTTCTTGCTTCCCAGGCTCGTATATATGAAGGCAACGAACCAATT
CAATTTCATTCTATCCTTCAAAGTTTCATTGTTTTTAAGGGTGGGCTTAGTGAAGGATATAAGACTTACATTGCACAAAA
GGAAATTCCTGATGATACATACAATGAGAATGGTGTTGCATTATTCCGCATCCAGGGCTCTGGACCAGACAATATGCAAG
CCATACAAGTTGAACCCGTTGCATCTTCCTTGAATTCCTCTTATTGTTACATACTTCATAATGGGCCTGCTGTCTTTACT
TGGTCTGGAAACTCTACAAGTGCAGAAAACCAGGAACTTGTTGAGAGGATGCTGGATTTGATAAAGCCAAATTTACAATC
CAAACCACAAAGGGAAGGTTCTGAATCTGAACAGTTTTGGGATTTTTTAGGAGGAAAATCAGAATATCCCAGTCAAAAGA
TTCTTAGAGAGCCTGAAAGTGATCCTCACCTATTCTCTTGTCATTTCTCGAAAGGAAATTTAAAGGTGACAGAGGTATAC
AACTTCTCCCAGGATGATTTGATGACAGAAGATATTTTCATCTTGGATTGTCACTCGGAAATCTTTGTCTGGGTTGGCCA
GCAGGTTGACTCCAAGAGTAGAATGCAGGCTCTAACAATTGGTGAGAAATTTCTTGAGCATGATTTTCTTCTAGAAAAGT
TATCTCATGTAGCTCCAGTATATGTTGTCATGGAAGGGAGTGAGCCACCTTTCTTCACACGCTTCTTTAAATGGGATTCT
GCAAAATCTTCAATGCTGGGAAACTCATTTCAAAGGAAGCTGACAATTGTGAAAAGTGGGGGTGCTCCAGTTTTGGATAA
ACCCAAACGGAGAACACCAGTATCTTATGGGGGAAGGTCGAGTAGTGTGCCAGATAAATCCTCCCAGCGTTCCTCCCGCA
GCATGTCTGTCAGTCCTGATCGTGTTCGAGTGAGGGGCCGGTCTCCAGCCTTTAATGCCCTAGCAGCTAATTTTGAGAAC
CCTAATGCTAGGAACCTTTCAACCCCGCCTCCAGTAATTAGAAAGCTGTATCCAAAATCTGTGACACCAGATTCTGCAAT
ACTGGCGCCAAAATCTGCAGCCATAGCTGCACTTAGTTCTTCTTTTGAACAGCCACCTTCAGCACGAGAAACTATGATAC
CAAAGTCAATTAAAGTGAGTCCAGTAATGCCCAAATCAAACCCTGAGAAAAATGACAAGGAGAATTCTGTGAGCACCAGA
GTGGAATCTCTTACCATACAGGAAGATGTGAAAGAGGATGAAATTGAAGATGAGGAAGGTCTTGTGATTCACCCATATGA
ACGCCTTAAAATAACATCCACAGATCCTGTACCAAATATTGATGTGACTAAGCGAGAGACTTATCTATCATCTGCGGAGT
TCAAAGAGAAATTTGCGATGTCCAAGGATGCCTTTTACAAGTTGCCCAAATGGAAACAAAACAAACTCAAAATGGCTGTT
CAGTTATTCTGA