Microexon ID Gm_17:4662120-4662130:+
Species Glycine max
Coordinates 17:4662120..4662130
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAATTTAAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AGGGATGCTGAAAATGATCCTCACTTATTTTCTTGCAACTTCTCAGAAGGAAATTTAAAGGTGAAAGAGATTCACAACTTTTCCCAGGATGATTTGATGACAGAAGAT
Microexon-tag Amino Acid Seq RDAENDPHLFSCNFSEGNLKVKEIHNFSQDDLMTED
Microexon-tag spanning region4661945-4662989
Microexon-tag prediction score0.9561
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH02827x
Reference Transcript ID KRH02827
Gene ID GLYMA_17G061200
Gene Name NA
Transcript ID KRH02827
Protein ID KRH02827
Gene ID GLYMA_17G061200
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 5e-09
Motif start 637
Motif end 712
Protein seq >KRH02827
MSISMRDLDPAFKGAGQKAGLEIWRIENFNPVPIPQSSYGKFFTGDSYVILKTTASKSGALRHDIHYWLGKDTSQDEAGA
AAIKTVELDASLGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVHVKEITFARS
SLNHDDIFILDTKSKIFQFNGSNSSIQERAKALEVVQYVKDTYHDGKCEIASIEDGKLMADSESGEFWGCFGGFAPLPRR
TVSDDDKPADSHPPKLLCVDKGKAEPIETDSLTKEFLDTNKCYILDCGLEVFAWMGRNTSLDERKSASVAADELIRGTGR
PKSHIIRVIEGFETVMFKSKFDSWPQASDAPMSEEGRGKVAALLKRQGLDVKGLVKSEPKQEEPQPHIDCTGHLQVWRVN
GQEKILLPATDQSKFYNGDCYIFQYSYPGEDKEEHLIGTWIGKTSVEEERASALSLASKMVESMKFLPSQARIYEGSEPI
QFHAILQSCIVFKGGLSDGYKNYIAEKEIPDETYNEDGVALFRIQGTGPDNMQAIQVEPVASSLNSTYCYILHSGPTVFI
WSGGLATSDDQELVERMLDLIKPDVQCKPLKEGVESEQFWDLLGGKTEYPSQKITRDAENDPHLFSCNFSEGNLKVKEIH
NFSQDDLMTEDIYILDCHSEVFVWVGQQVDSKNRMQALTIGEKFLEHDFLLEALSREAPIYIVKEGSEPPFFTRFFKWES
AKSAMLGNSFQRKLAIVKNGGMPLIVKHKRRASATFGGRSSGAPDKSQRSRSMSVSPDRVRVRGRSPAFNALAANFESSN
ARNLSTPPPMIRKLYPKSVAKDTAQLVPKSSAIAHLTSSFEPFSALENLIPQSQKANSVTPKSNPETSDKEGSMSSRIES
LTIQEDVKEGEAEDDEGLPVYPYERVNTASTDPVEDIDVTKREAYLSSAEFQEKFGTAKNEFYKLPKWKQNKLKMAVQLF
*
CDS seq >KRH02827
ATGTCCATTTCTATGAGAGATCTGGATCCAGCTTTTAAGGGAGCTGGACAAAAGGCCGGATTGGAAATATGGCGTATTGA
GAATTTTAATCCAGTTCCCATCCCTCAGTCATCTTATGGAAAGTTTTTCACTGGGGACTCCTATGTGATCTTGAAGACAA
CTGCTTCAAAAAGTGGTGCTCTTCGTCATGATATCCATTATTGGCTTGGTAAAGACACCAGTCAGGATGAAGCTGGTGCT
GCAGCCATCAAGACAGTCGAGTTGGATGCATCTTTAGGAGGACGGGCTGTTCAGTATCGTGAAGTACAAGGTCATGAAAC
TGAAAAGTTCCTCTCTTATTTCAAACCATGTATCATACCTCAAGAAGGTGGAGCTGCCTCAGGTTTTAAACATGTTGAGG
CTGAAGAACATAAGACACGGTTGTTTGTGTGCAAAGGGAAACATGTAGTGCATGTCAAAGAGATTACTTTCGCTCGATCT
TCACTGAACCATGATGATATTTTTATTCTGGATACCAAGTCCAAAATCTTCCAATTTAATGGTTCCAATTCAAGTATTCA
AGAAAGGGCTAAAGCATTGGAAGTTGTACAGTATGTCAAGGATACCTACCATGATGGGAAATGTGAGATAGCTTCTATTG
AGGATGGAAAGTTGATGGCTGATTCTGAAAGTGGAGAATTCTGGGGTTGCTTTGGGGGCTTTGCTCCTCTTCCACGGAGA
ACAGTCAGTGATGATGACAAGCCTGCTGATTCTCATCCTCCAAAGCTACTTTGTGTTGACAAGGGGAAGGCAGAACCTAT
TGAAACCGATTCTTTGACAAAGGAATTTCTGGACACAAACAAATGTTATATTCTAGATTGTGGGTTGGAAGTTTTTGCAT
GGATGGGAAGAAACACATCTCTTGATGAAAGAAAAAGTGCAAGTGTAGCAGCAGATGAGTTAATCAGGGGCACTGGTCGA
CCAAAGTCCCATATAATTCGTGTAATTGAAGGATTTGAAACAGTAATGTTCAAGTCCAAGTTTGATTCTTGGCCTCAGGC
AAGTGATGCACCAATGTCTGAAGAAGGTCGTGGCAAGGTAGCAGCACTTCTAAAACGTCAAGGATTGGATGTCAAGGGTC
TCGTGAAATCTGAGCCCAAACAAGAAGAACCTCAACCCCACATAGATTGCACAGGACATTTGCAGGTTTGGCGTGTGAAT
GGTCAGGAAAAGATTCTTCTTCCAGCCACTGATCAGTCAAAATTTTATAATGGAGATTGCTACATCTTTCAATATTCATA
TCCTGGAGAAGATAAGGAAGAGCATCTTATAGGAACATGGATTGGAAAGACTAGTGTTGAGGAAGAGAGAGCTTCAGCTC
TTTCACTAGCAAGCAAGATGGTTGAGTCAATGAAGTTTCTTCCTTCCCAAGCTCGTATCTATGAAGGCAGTGAACCAATT
CAATTTCATGCCATCCTGCAAAGCTGTATTGTTTTTAAGGGTGGACTTAGTGATGGATACAAGAATTACATTGCGGAGAA
GGAAATTCCAGATGAGACATACAATGAGGATGGTGTTGCATTATTCCGCATCCAGGGCACTGGACCAGACAATATGCAAG
CTATACAAGTTGAACCAGTTGCTTCCTCCTTGAATTCCACTTATTGCTACATACTTCATAGCGGACCCACTGTTTTTATT
TGGTCTGGAGGTTTAGCAACTTCAGATGACCAGGAGCTTGTTGAGAGAATGCTGGATTTGATTAAGCCGGATGTACAATG
CAAACCACTAAAGGAAGGCGTAGAATCGGAACAGTTTTGGGATTTGTTGGGGGGAAAAACAGAATATCCCAGTCAAAAGA
TCACGAGGGATGCTGAAAATGATCCTCACTTATTTTCTTGCAACTTCTCAGAAGGAAATTTAAAGGTGAAAGAGATTCAC
AACTTTTCCCAGGATGATTTGATGACAGAAGATATTTACATCTTGGATTGTCACTCGGAAGTCTTTGTCTGGGTTGGCCA
GCAGGTTGACTCAAAGAATAGAATGCAGGCTCTAACAATTGGCGAGAAATTTCTTGAGCATGATTTTCTCCTAGAAGCAT
TATCTCGTGAAGCTCCAATATATATTGTCAAAGAAGGTAGTGAGCCACCTTTCTTCACTCGCTTCTTTAAATGGGAGTCT
GCAAAATCTGCAATGCTAGGAAACTCATTTCAAAGGAAGCTTGCAATCGTGAAAAATGGGGGTATGCCACTTATCGTTAA
ACATAAACGAAGAGCATCAGCAACTTTTGGGGGAAGGTCTAGTGGTGCACCAGATAAATCCCAGCGTTCCCGCAGCATGT
CTGTCAGTCCTGATCGTGTTCGTGTGAGGGGCAGATCTCCAGCCTTTAATGCACTAGCAGCTAATTTTGAGAGCTCAAAT
GCTAGGAACCTTTCAACTCCACCTCCAATGATTAGAAAACTGTACCCAAAATCTGTGGCAAAGGATACAGCACAACTGGT
ACCTAAATCTTCAGCCATAGCTCATCTTACTTCTAGTTTTGAACCATTCTCAGCGCTAGAAAATTTGATTCCTCAGTCAC
AAAAAGCGAATTCAGTTACCCCCAAATCAAACCCTGAGACAAGTGACAAGGAGGGTTCTATGAGCAGTAGGATAGAATCT
CTTACCATTCAGGAGGATGTGAAAGAGGGAGAAGCTGAAGATGATGAAGGTCTCCCAGTTTACCCATATGAACGTGTAAA
CACAGCTTCTACAGATCCTGTAGAAGATATTGACGTGACTAAACGAGAGGCTTATCTGTCATCTGCAGAGTTCCAAGAGA
AATTTGGGACGGCAAAGAATGAATTTTATAAGTTGCCAAAATGGAAACAAAACAAACTCAAAATGGCAGTTCAGTTGTTC
TGA
Microexon DNA seq GAAATTTAAAG
Microexon Amino Acid seq GNLK
Microexon-tag DNA Seq AGGGATGCTGAAAATGATCCTCACTTATTTTCTTGCAACTTCTCAGAAGGAAATTTAAAGGTGAAAGAGATTCACAACTTTTCCCAGGATGATTTGATGACAGAAGAT
Microexon-tag Amino Acid seq RDAENDPHLFSCNFSEGNLKVKEIHNFSQDDLMTED
Transcript ID KRH02830
Gene ID Gm.21991
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 5e-09
Motif start 637
Motif end 712
Protein seq >KRH02830
MSISMRDLDPAFKGAGQKAGLEIWRIENFNPVPIPQSSYGKFFTGDSYVILKTTASKSGALRHDIHYWLGKDTSQDEAGA
AAIKTVELDASLGGRAVQYREVQGHETEKFLSYFKPCIIPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVHVKEITFARS
SLNHDDIFILDTKSKIFQFNGSNSSIQERAKALEVVQYVKDTYHDGKCEIASIEDGKLMADSESGEFWGCFGGFAPLPRR
TVSDDDKPADSHPPKLLCVDKGKAEPIETDSLTKEFLDTNKCYILDCGLEVFAWMGRNTSLDERKSASVAADELIRGTGR
PKSHIIRVIEGFETVMFKSKFDSWPQASDAPMSEEGRGKVAALLKRQGLDVKGLVKSEPKQEEPQPHIDCTGHLQVWRVN
GQEKILLPATDQSKFYNGDCYIFQYSYPGEDKEEHLIGTWIGKTSVEEERASALSLASKMVESMKFLPSQARIYEGSEPI
QFHAILQSCIVFKGGLSDGYKNYIAEKEIPDETYNEDGVALFRIQGTGPDNMQAIQVEPVASSLNSTYCYILHSGPTVFI
WSGGLATSDDQELVERMLDLIKPDVQCKPLKEGVESEQFWDLLGGKTEYPSQKITRDAENDPHLFSCNFSEGNLKVKEIH
NFSQDDLMTEDIYILDCHSEVFVWVGQQVDSKNRMQALTIGEKFLEHDFLLEALSREAPIYIVKEGSEPPFFTRFFKWES
AKSAMLGNSFQRKLAIVKNGGMPLIVKHKRRASATFGGRSSGAPDKSQRSRSMSVSPDRVRVRGRSPAFNALAANFESSN
ARNLSTPPPMIRKLYPKSVAKDTAQLVPKSSAIAHLTSSFEPFSALENLIPQSQKANSVTPKSNPETSDKEGSMSSRIES
LTIQEDVKEGEAEDDEGLPVYPYERVNTASTDPVEDIDVTKREAYLSSAEFQEKFGTAKNEFYKLPKWKQNKLKMAVQLF
*
CDS seq >KRH02830
ATGTCCATTTCTATGAGAGATCTGGATCCAGCTTTTAAGGGAGCTGGACAAAAGGCCGGATTGGAAATATGGCGTATTGA
GAATTTTAATCCAGTTCCCATCCCTCAGTCATCTTATGGAAAGTTTTTCACTGGGGACTCCTATGTGATCTTGAAGACAA
CTGCTTCAAAAAGTGGTGCTCTTCGTCATGATATCCATTATTGGCTTGGTAAAGACACCAGTCAGGATGAAGCTGGTGCT
GCAGCCATCAAGACAGTCGAGTTGGATGCATCTTTAGGAGGACGGGCTGTTCAGTATCGTGAAGTACAAGGTCATGAAAC
TGAAAAGTTCCTCTCTTATTTCAAACCATGTATCATACCTCAAGAAGGTGGAGCTGCCTCAGGTTTTAAACATGTTGAGG
CTGAAGAACATAAGACACGGTTGTTTGTGTGCAAAGGGAAACATGTAGTGCATGTCAAAGAGATTACTTTCGCTCGATCT
TCACTGAACCATGATGATATTTTTATTCTGGATACCAAGTCCAAAATCTTCCAATTTAATGGTTCCAATTCAAGTATTCA
AGAAAGGGCTAAAGCATTGGAAGTTGTACAGTATGTCAAGGATACCTACCATGATGGGAAATGTGAGATAGCTTCTATTG
AGGATGGAAAGTTGATGGCTGATTCTGAAAGTGGAGAATTCTGGGGTTGCTTTGGGGGCTTTGCTCCTCTTCCACGGAGA
ACAGTCAGTGATGATGACAAGCCTGCTGATTCTCATCCTCCAAAGCTACTTTGTGTTGACAAGGGGAAGGCAGAACCTAT
TGAAACCGATTCTTTGACAAAGGAATTTCTGGACACAAACAAATGTTATATTCTAGATTGTGGGTTGGAAGTTTTTGCAT
GGATGGGAAGAAACACATCTCTTGATGAAAGAAAAAGTGCAAGTGTAGCAGCAGATGAGTTAATCAGGGGCACTGGTCGA
CCAAAGTCCCATATAATTCGTGTAATTGAAGGATTTGAAACAGTAATGTTCAAGTCCAAGTTTGATTCTTGGCCTCAGGC
AAGTGATGCACCAATGTCTGAAGAAGGTCGTGGCAAGGTAGCAGCACTTCTAAAACGTCAAGGATTGGATGTCAAGGGTC
TCGTGAAATCTGAGCCCAAACAAGAAGAACCTCAACCCCACATAGATTGCACAGGACATTTGCAGGTTTGGCGTGTGAAT
GGTCAGGAAAAGATTCTTCTTCCAGCCACTGATCAGTCAAAATTTTATAATGGAGATTGCTACATCTTTCAATATTCATA
TCCTGGAGAAGATAAGGAAGAGCATCTTATAGGAACATGGATTGGAAAGACTAGTGTTGAGGAAGAGAGAGCTTCAGCTC
TTTCACTAGCAAGCAAGATGGTTGAGTCAATGAAGTTTCTTCCTTCCCAAGCTCGTATCTATGAAGGCAGTGAACCAATT
CAATTTCATGCCATCCTGCAAAGCTGTATTGTTTTTAAGGGTGGACTTAGTGATGGATACAAGAATTACATTGCGGAGAA
GGAAATTCCAGATGAGACATACAATGAGGATGGTGTTGCATTATTCCGCATCCAGGGCACTGGACCAGACAATATGCAAG
CTATACAAGTTGAACCAGTTGCTTCCTCCTTGAATTCCACTTATTGCTACATACTTCATAGCGGACCCACTGTTTTTATT
TGGTCTGGAGGTTTAGCAACTTCAGATGACCAGGAGCTTGTTGAGAGAATGCTGGATTTGATTAAGCCGGATGTACAATG
CAAACCACTAAAGGAAGGCGTAGAATCGGAACAGTTTTGGGATTTGTTGGGGGGAAAAACAGAATATCCCAGTCAAAAGA
TCACGAGGGATGCTGAAAATGATCCTCACTTATTTTCTTGCAACTTCTCAGAAGGAAATTTAAAGGTGAAAGAGATTCAC
AACTTTTCCCAGGATGATTTGATGACAGAAGATATTTACATCTTGGATTGTCACTCGGAAGTCTTTGTCTGGGTTGGCCA
GCAGGTTGACTCAAAGAATAGAATGCAGGCTCTAACAATTGGCGAGAAATTTCTTGAGCATGATTTTCTCCTAGAAGCAT
TATCTCGTGAAGCTCCAATATATATTGTCAAAGAAGGTAGTGAGCCACCTTTCTTCACTCGCTTCTTTAAATGGGAGTCT
GCAAAATCTGCAATGCTAGGAAACTCATTTCAAAGGAAGCTTGCAATCGTGAAAAATGGGGGTATGCCACTTATCGTTAA
ACATAAACGAAGAGCATCAGCAACTTTTGGGGGAAGGTCTAGTGGTGCACCAGATAAATCCCAGCGTTCCCGCAGCATGT
CTGTCAGTCCTGATCGTGTTCGTGTGAGGGGCAGATCTCCAGCCTTTAATGCACTAGCAGCTAATTTTGAGAGCTCAAAT
GCTAGGAACCTTTCAACTCCACCTCCAATGATTAGAAAACTGTACCCAAAATCTGTGGCAAAGGATACAGCACAACTGGT
ACCTAAATCTTCAGCCATAGCTCATCTTACTTCTAGTTTTGAACCATTCTCAGCGCTAGAAAATTTGATTCCTCAGTCAC
AAAAAGCGAATTCAGTTACCCCCAAATCAAACCCTGAGACAAGTGACAAGGAGGGTTCTATGAGCAGTAGGATAGAATCT
CTTACCATTCAGGAGGATGTGAAAGAGGGAGAAGCTGAAGATGATGAAGGTCTCCCAGTTTACCCATATGAACGTGTAAA
CACAGCTTCTACAGATCCTGTAGAAGATATTGACGTGACTAAACGAGAGGCTTATCTGTCATCTGCAGAGTTCCAAGAGA
AATTTGGGACGGCAAAGAATGAATTTTATAAGTTGCCAAAATGGAAACAAAACAAACTCAAAATGGCAGTTCAGTTGTTC
TGA