Microexon ID Gm_13:21376169-21376179:-
Species Glycine max
Coordinates 13:21376169..21376179
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAATTTAAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AGGGAAGCTGAAAATGATCCTCATTTATTTTCTTGCAACTTCTCAGAAGGAAATTTAAAGGTGAAAGAGATTCACAACTTTTCCCAGGATGATTTGATGACAGAAGAT
Microexon-tag Amino Acid Seq REAENDPHLFSCNFSEGNLKVKEIHNFSQDDLMTED
Microexon-tag spanning region21375354-21376357
Microexon-tag prediction score0.9548
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH19052x
Reference Transcript ID KRH19052
Gene ID GLYMA_13G098600
Gene Name NA
Transcript ID KRH19052
Protein ID KRH19052
Gene ID GLYMA_13G098600
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 5.9e-08
Motif start 637
Motif end 712
Protein seq >KRH19052
MSVSMRDLDPAFKGAGQKAGLEIWRIENFNPVAIPQSSYGKFFTGDSYVILKTTASKSGALRHDIHYWLGKDTSQDEAGA
AAIKTVELDAALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVHVKEISFARS
SLNHDDIFILDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHDGKCEIASIEDGKLMADSESGEFWGCFGGFAPLPRR
TVSDDDKPADSHPPKLLCVDKGKAEPIESDSLTKELLDTNKCYILDCGLEVFAWMGRNTSLDERKSASGAADELISGTGR
PKSHIIRVIEGFETVMFKSKFDSWPQASHATMSEEGRGKVAALLKRQGLDVKGLVKSEPEKEEPQPHIDCTGHLQVWRVN
GPEKILLPATDQSKFYNGDCYIFQYSYPGEDKEEYLIGTWVGKNSVEEERASALSLASKMVESMKFLPSQARIYEGSEPI
QFHAILQSCIVFKGGRSDGYKNYIAEKEIPDETYNEDGVALFRIQGTGPDNMQAIQVEPVASSLNSAYCFILHSGPTVFI
WSGGLATSDDQELVERMLDLIKPDVQCKPLKEGLEPEQFWDLLGGKTEYPSQKITREAENDPHLFSCNFSEGNLKVKEIH
NFSQDDLMTEDIYTLDCHSEIFVWVGQQVDSKSRMQALTIGEKFLEHDFLLEGLSREAPIYIVKEGSEPPFFTRFFKWES
AKSAMLGNSFQRKLAIVKNGGTPLMVKHKRRASVTYGGRSSGAPDKSQRSRSMSVSPDRVRVRGRSPAFNALAANFESSN
ARNLSTPPPMIRKLYPKSMAQDTAKLATKSSAIAHLTSSFELTSARENLIPRSQKASSVTPKSNPETSDEEGSLSSRIES
LTIQEDAKEGEAEDDEGLPVYPHERVNTASTDPVEDIDVTKREAYLSSAEFQEKFGMAKNEFYKLPKWKQNKLKMAVQLF
*
CDS seq >KRH19052
ATGTCCGTTTCCATGAGAGATCTGGATCCAGCTTTCAAGGGAGCTGGACAAAAGGCCGGATTGGAAATATGGCGTATCGA
GAATTTTAATCCAGTTGCCATCCCTCAGTCATCTTATGGAAAGTTTTTCACTGGGGACTCCTATGTGATCTTGAAGACAA
CTGCTTCAAAAAGTGGTGCTCTTCGTCATGATATCCATTATTGGCTTGGTAAAGACACCAGTCAGGATGAAGCTGGCGCT
GCAGCCATCAAGACAGTCGAGTTGGATGCAGCTTTAGGAGGACGGGCTGTTCAGTATCGTGAAGTACAAGGTCATGAAAC
TGAAAAGTTCCTCTCTTATTTCAAACCATGCATCATACCTCAAGAAGGTGGGGCTGCCTCAGGTTTTAAGCATGTTGAGG
CTGAAGAACATAAGACACGGTTGTTTGTGTGCAAAGGGAAACATGTGGTACATGTCAAAGAGATTTCTTTCGCTCGATCT
TCACTGAACCATGATGATATTTTTATTCTGGATACCGAGTCCAAAATTTTCCAATTTAATGGTTCCAATTCAAGTATTCA
AGAAAGGGCTAAAGCATTGGAAGTTGTACAATATATCAAGGATACCTACCATGATGGGAAATGTGAGATAGCTTCTATTG
AGGATGGAAAGTTGATGGCTGATTCTGAGAGTGGAGAATTCTGGGGTTGCTTTGGGGGCTTTGCTCCTCTTCCACGGAGA
ACAGTCAGTGATGATGACAAGCCTGCTGATTCTCATCCTCCAAAGCTACTTTGTGTTGACAAGGGGAAAGCAGAACCAAT
TGAATCCGATTCTTTGACAAAGGAATTACTGGACACAAACAAATGTTATATTCTAGATTGTGGGTTGGAAGTTTTTGCAT
GGATGGGAAGAAACACATCTCTTGATGAAAGAAAAAGTGCAAGTGGAGCAGCAGATGAGTTAATCAGTGGCACTGGTCGA
CCAAAGTCCCATATAATTCGTGTAATTGAAGGATTTGAAACAGTAATGTTCAAGTCCAAGTTTGATTCTTGGCCTCAGGC
AAGTCATGCAACAATGTCTGAAGAAGGTCGTGGCAAGGTAGCAGCACTTCTAAAACGTCAAGGATTGGATGTCAAGGGTC
TCGTGAAATCTGAGCCCGAAAAAGAAGAACCTCAACCCCACATAGATTGCACAGGACATTTGCAGGTTTGGCGTGTGAAT
GGTCCGGAAAAGATTCTTCTTCCAGCCACTGATCAGTCAAAATTTTATAATGGAGATTGCTACATCTTTCAATATTCATA
TCCTGGAGAAGATAAGGAAGAGTATCTTATAGGAACATGGGTTGGAAAGAATAGTGTTGAGGAAGAGAGAGCTTCAGCTC
TTTCACTAGCAAGCAAAATGGTTGAGTCAATGAAGTTTCTTCCTTCCCAGGCTCGTATCTATGAAGGCAGTGAACCAATT
CAATTTCATGCCATCCTGCAAAGCTGTATTGTTTTTAAGGGTGGACGTAGTGATGGATACAAGAATTACATTGCGGAGAA
GGAAATTCCAGATGAGACATACAATGAGGATGGTGTTGCATTATTCCGCATCCAGGGCACTGGACCAGACAATATGCAAG
CTATACAAGTTGAACCAGTTGCTTCCTCCTTGAATTCCGCTTATTGCTTCATACTTCATAGCGGGCCCACTGTTTTTATT
TGGTCTGGAGGTTTAGCAACTTCAGATGACCAGGAGCTTGTTGAGAGAATGCTGGATTTGATTAAGCCGGATGTACAATG
CAAACCACTAAAGGAAGGCCTAGAACCGGAACAGTTTTGGGATTTGTTGGGGGGAAAAACGGAATATCCCAGTCAAAAGA
TCACGAGGGAAGCTGAAAATGATCCTCATTTATTTTCTTGCAACTTCTCAGAAGGAAATTTAAAGGTGAAAGAGATTCAC
AACTTTTCCCAGGATGATTTGATGACAGAAGATATTTACACCTTGGATTGTCACTCGGAAATCTTTGTTTGGGTTGGCCA
GCAGGTTGACTCAAAGAGTAGAATGCAGGCTCTAACAATTGGCGAGAAGTTTCTTGAGCATGATTTTCTCCTAGAAGGAT
TATCTCGTGAAGCTCCAATATATATTGTCAAAGAAGGTAGTGAGCCACCTTTCTTCACTCGCTTCTTTAAATGGGAGTCT
GCAAAATCTGCAATGCTAGGAAACTCATTTCAAAGGAAGCTTGCAATCGTGAAAAATGGGGGTACACCACTTATGGTTAA
ACACAAACGAAGAGCATCAGTAACTTATGGGGGAAGGTCTAGTGGTGCCCCAGATAAATCCCAGCGTTCCCGTAGCATGT
CTGTCAGTCCTGATCGTGTTCGTGTGAGGGGCAGATCTCCGGCCTTTAATGCACTAGCAGCTAATTTTGAGAGCTCAAAT
GCAAGGAACCTTTCAACTCCACCTCCGATGATTAGAAAACTTTACCCAAAATCTATGGCACAGGATACAGCAAAATTGGC
AACTAAATCTTCAGCCATAGCTCATCTTACTTCTAGTTTTGAACTAACATCAGCACGAGAAAATTTGATTCCTCGGTCAC
AAAAAGCGAGTTCAGTTACCCCCAAATCAAACCCTGAGACAAGTGATGAGGAGGGTTCTTTGAGCAGCAGGATAGAATCT
CTTACCATACAGGAGGATGCGAAAGAGGGTGAAGCTGAAGACGATGAAGGTCTCCCAGTTTACCCGCATGAACGCGTTAA
CACAGCTTCTACAGATCCTGTAGAAGATATTGACGTGACTAAACGAGAGGCTTATCTGTCATCTGCAGAGTTTCAAGAGA
AATTTGGGATGGCGAAGAATGAATTTTACAAGTTGCCAAAATGGAAACAGAACAAACTCAAAATGGCAGTTCAGTTGTTC
TGA
Microexon DNA seq GAAATTTAAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AGGGAAGCTGAAAATGATCCTCATTTATTTTCTTGCAACTTCTCAGAAGGAAATTTAAAGGTGAAAGAGATTCACAACTTTTCCCAGGATGATTTGATGACAGAAGAT
Microexon-tag Amino Acid seq REAENDPHLFSCNFSEGNLKVKEIHNFSQDDLMTED
Transcript ID KRH19052
Gene ID Gm.11405
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 5.9e-08
Motif start 637
Motif end 712
Protein seq >KRH19052
MSVSMRDLDPAFKGAGQKAGLEIWRIENFNPVAIPQSSYGKFFTGDSYVILKTTASKSGALRHDIHYWLGKDTSQDEAGA
AAIKTVELDAALGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVHVKEISFARS
SLNHDDIFILDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHDGKCEIASIEDGKLMADSESGEFWGCFGGFAPLPRR
TVSDDDKPADSHPPKLLCVDKGKAEPIESDSLTKELLDTNKCYILDCGLEVFAWMGRNTSLDERKSASGAADELISGTGR
PKSHIIRVIEGFETVMFKSKFDSWPQASHATMSEEGRGKVAALLKRQGLDVKGLVKSEPEKEEPQPHIDCTGHLQVWRVN
GPEKILLPATDQSKFYNGDCYIFQYSYPGEDKEEYLIGTWVGKNSVEEERASALSLASKMVESMKFLPSQARIYEGSEPI
QFHAILQSCIVFKGGRSDGYKNYIAEKEIPDETYNEDGVALFRIQGTGPDNMQAIQVEPVASSLNSAYCFILHSGPTVFI
WSGGLATSDDQELVERMLDLIKPDVQCKPLKEGLEPEQFWDLLGGKTEYPSQKITREAENDPHLFSCNFSEGNLKVKEIH
NFSQDDLMTEDIYTLDCHSEIFVWVGQQVDSKSRMQALTIGEKFLEHDFLLEGLSREAPIYIVKEGSEPPFFTRFFKWES
AKSAMLGNSFQRKLAIVKNGGTPLMVKHKRRASVTYGGRSSGAPDKSQRSRSMSVSPDRVRVRGRSPAFNALAANFESSN
ARNLSTPPPMIRKLYPKSMAQDTAKLATKSSAIAHLTSSFELTSARENLIPRSQKASSVTPKSNPETSDEEGSLSSRIES
LTIQEDAKEGEAEDDEGLPVYPHERVNTASTDPVEDIDVTKREAYLSSAEFQEKFGMAKNEFYKLPKWKQNKLKMAVQLF
*
CDS seq >KRH19052
ATGTCCGTTTCCATGAGAGATCTGGATCCAGCTTTCAAGGGAGCTGGACAAAAGGCCGGATTGGAAATATGGCGTATCGA
GAATTTTAATCCAGTTGCCATCCCTCAGTCATCTTATGGAAAGTTTTTCACTGGGGACTCCTATGTGATCTTGAAGACAA
CTGCTTCAAAAAGTGGTGCTCTTCGTCATGATATCCATTATTGGCTTGGTAAAGACACCAGTCAGGATGAAGCTGGCGCT
GCAGCCATCAAGACAGTCGAGTTGGATGCAGCTTTAGGAGGACGGGCTGTTCAGTATCGTGAAGTACAAGGTCATGAAAC
TGAAAAGTTCCTCTCTTATTTCAAACCATGCATCATACCTCAAGAAGGTGGGGCTGCCTCAGGTTTTAAGCATGTTGAGG
CTGAAGAACATAAGACACGGTTGTTTGTGTGCAAAGGGAAACATGTGGTACATGTCAAAGAGATTTCTTTCGCTCGATCT
TCACTGAACCATGATGATATTTTTATTCTGGATACCGAGTCCAAAATTTTCCAATTTAATGGTTCCAATTCAAGTATTCA
AGAAAGGGCTAAAGCATTGGAAGTTGTACAATATATCAAGGATACCTACCATGATGGGAAATGTGAGATAGCTTCTATTG
AGGATGGAAAGTTGATGGCTGATTCTGAGAGTGGAGAATTCTGGGGTTGCTTTGGGGGCTTTGCTCCTCTTCCACGGAGA
ACAGTCAGTGATGATGACAAGCCTGCTGATTCTCATCCTCCAAAGCTACTTTGTGTTGACAAGGGGAAAGCAGAACCAAT
TGAATCCGATTCTTTGACAAAGGAATTACTGGACACAAACAAATGTTATATTCTAGATTGTGGGTTGGAAGTTTTTGCAT
GGATGGGAAGAAACACATCTCTTGATGAAAGAAAAAGTGCAAGTGGAGCAGCAGATGAGTTAATCAGTGGCACTGGTCGA
CCAAAGTCCCATATAATTCGTGTAATTGAAGGATTTGAAACAGTAATGTTCAAGTCCAAGTTTGATTCTTGGCCTCAGGC
AAGTCATGCAACAATGTCTGAAGAAGGTCGTGGCAAGGTAGCAGCACTTCTAAAACGTCAAGGATTGGATGTCAAGGGTC
TCGTGAAATCTGAGCCCGAAAAAGAAGAACCTCAACCCCACATAGATTGCACAGGACATTTGCAGGTTTGGCGTGTGAAT
GGTCCGGAAAAGATTCTTCTTCCAGCCACTGATCAGTCAAAATTTTATAATGGAGATTGCTACATCTTTCAATATTCATA
TCCTGGAGAAGATAAGGAAGAGTATCTTATAGGAACATGGGTTGGAAAGAATAGTGTTGAGGAAGAGAGAGCTTCAGCTC
TTTCACTAGCAAGCAAAATGGTTGAGTCAATGAAGTTTCTTCCTTCCCAGGCTCGTATCTATGAAGGCAGTGAACCAATT
CAATTTCATGCCATCCTGCAAAGCTGTATTGTTTTTAAGGGTGGACGTAGTGATGGATACAAGAATTACATTGCGGAGAA
GGAAATTCCAGATGAGACATACAATGAGGATGGTGTTGCATTATTCCGCATCCAGGGCACTGGACCAGACAATATGCAAG
CTATACAAGTTGAACCAGTTGCTTCCTCCTTGAATTCCGCTTATTGCTTCATACTTCATAGCGGGCCCACTGTTTTTATT
TGGTCTGGAGGTTTAGCAACTTCAGATGACCAGGAGCTTGTTGAGAGAATGCTGGATTTGATTAAGCCGGATGTACAATG
CAAACCACTAAAGGAAGGCCTAGAACCGGAACAGTTTTGGGATTTGTTGGGGGGAAAAACGGAATATCCCAGTCAAAAGA
TCACGAGGGAAGCTGAAAATGATCCTCATTTATTTTCTTGCAACTTCTCAGAAGGAAATTTAAAGGTGAAAGAGATTCAC
AACTTTTCCCAGGATGATTTGATGACAGAAGATATTTACACCTTGGATTGTCACTCGGAAATCTTTGTTTGGGTTGGCCA
GCAGGTTGACTCAAAGAGTAGAATGCAGGCTCTAACAATTGGCGAGAAGTTTCTTGAGCATGATTTTCTCCTAGAAGGAT
TATCTCGTGAAGCTCCAATATATATTGTCAAAGAAGGTAGTGAGCCACCTTTCTTCACTCGCTTCTTTAAATGGGAGTCT
GCAAAATCTGCAATGCTAGGAAACTCATTTCAAAGGAAGCTTGCAATCGTGAAAAATGGGGGTACACCACTTATGGTTAA
ACACAAACGAAGAGCATCAGTAACTTATGGGGGAAGGTCTAGTGGTGCCCCAGATAAATCCCAGCGTTCCCGTAGCATGT
CTGTCAGTCCTGATCGTGTTCGTGTGAGGGGCAGATCTCCGGCCTTTAATGCACTAGCAGCTAATTTTGAGAGCTCAAAT
GCAAGGAACCTTTCAACTCCACCTCCGATGATTAGAAAACTTTACCCAAAATCTATGGCACAGGATACAGCAAAATTGGC
AACTAAATCTTCAGCCATAGCTCATCTTACTTCTAGTTTTGAACTAACATCAGCACGAGAAAATTTGATTCCTCGGTCAC
AAAAAGCGAGTTCAGTTACCCCCAAATCAAACCCTGAGACAAGTGATGAGGAGGGTTCTTTGAGCAGCAGGATAGAATCT
CTTACCATACAGGAGGATGCGAAAGAGGGTGAAGCTGAAGACGATGAAGGTCTCCCAGTTTACCCGCATGAACGCGTTAA
CACAGCTTCTACAGATCCTGTAGAAGATATTGACGTGACTAAACGAGAGGCTTATCTGTCATCTGCAGAGTTTCAAGAGA
AATTTGGGATGGCGAAGAATGAATTTTACAAGTTGCCAAAATGGAAACAGAACAAACTCAAAATGGCAGTTCAGTTGTTC
TGA