Microexon ID Gm_13:29431933-29431947:-
Species Glycine max
Coordinates 13:29431933..29431947
Microexon Cluster ID MEP41
Size 15
Phase 0
Pfam Domain Motif DUF974
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CARTTYTTCAAGTTYATTGTTKCWAAYCCACTTTCWGTTAGRACAAAGGTYCGYRYTRTCAAGGAAACTACMTWTYTRGARGCTTGYATWGARAAYCATACAAAATCA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTCCGTGTTATCAAG
Microexon Amino Acid seq VRVIK
Microexon-tag DNA Seq CAATTTTTCAAGTTCATCGTTGCTAATCCACTTTCCGTTAGGACAAAGGTCCGTGTTATCAAGGAGACTACCTTTTTGGAAGCTTGTATTGAAAACCATACAAAATCA
Microexon-tag Amino Acid Seq QFFKFIVANPLSVRTKVRVIKETTFLEACIENHTKS
Microexon-tag spanning region29430401-29432151
Microexon-tag prediction score0.9801
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH20475x
Reference Transcript ID KRH20475
Gene ID GLYMA_13G181100
Gene Name NA
Transcript ID KRH20475
Protein ID KRH20475
Gene ID GLYMA_13G181100
Gene Name NA
Pfam domain motif DUF974
Motif E-value 3.9e-64
Motif start 90
Motif end 319
Protein seq >KRH20475
MSQVGGQGGGGSHSLAFRVMRLCRPSFNVEPPLRLDPTDLFVGEDLFDDPAAKPHSFSSAAAHDDDSDPNYRNRFLLRHF
SDAMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVREVLIKAEIQTERQRILLLDTSKSPVETIRAGGRYDFIVEH
DVKELGPHTLVCTALYNDGDGERKYLPQFFKFIVANPLSVRTKVRVIKETTFLEACIENHTKSNLFMDQVDFEPAQYYSA
TILKGDGHHSEKDSPTREIFKPPILIRSGGGIYNYLYQLKTLSDGSPQTKVEGSNVLGKLQITWRTNLGEPGRLQTQQIL
GTPATKKEIELQVVEVPSIINLQKPFMLKLNLTNQTDRELGPFEVGLSQNVSYGERVVMINGLQSMVLSEVQALGSTNFH
LNLIATKPGIQRITGITVFDTREMKSYEPLPDLEIFVDMD*
CDS seq >KRH20475
ATGAGTCAGGTTGGTGGTCAGGGAGGAGGAGGATCTCATTCGCTGGCGTTCCGCGTGATGCGTTTGTGCCGCCCTTCCTT
CAACGTCGAACCTCCACTCCGCCTCGACCCCACCGACCTCTTCGTCGGCGAGGACCTCTTCGACGACCCCGCTGCCAAAC
CCCACTCTTTCTCCTCCGCTGCCGCCCACGACGACGACTCCGATCCCAACTACCGTAACCGCTTCCTCCTCCGCCACTTC
TCCGACGCCATGGGCCTCTCCGGCCTCCTCGTTCTCCCTCAGTCTTTCGGAGCCATTTATTTGGGGGAGACCTTCTGCAG
CTACATCAGTATAAACAACAGCTCCAATTTTGAAGTCAGAGAAGTCCTTATCAAGGCTGAAATTCAAACAGAGAGACAGA
GAATACTTCTTTTAGACACATCAAAATCGCCAGTTGAAACGATACGTGCGGGTGGGCGTTATGATTTTATTGTAGAACAT
GATGTGAAAGAGCTTGGACCTCACACGCTGGTCTGCACTGCATTGTATAATGATGGTGACGGTGAGCGTAAATATCTTCC
TCAATTTTTCAAGTTCATCGTTGCTAATCCACTTTCCGTTAGGACAAAGGTCCGTGTTATCAAGGAGACTACCTTTTTGG
AAGCTTGTATTGAAAACCATACAAAATCAAACCTTTTCATGGACCAAGTTGATTTTGAACCTGCTCAGTATTATAGTGCA
ACAATACTTAAAGGTGATGGGCACCATTCTGAGAAAGATAGTCCTACAAGGGAGATATTTAAGCCACCAATACTAATTAG
ATCTGGTGGAGGAATTTACAACTATCTCTATCAATTGAAAACGTTATCAGATGGTTCGCCTCAAACAAAAGTTGAAGGAA
GTAATGTTCTTGGTAAACTCCAGATAACATGGCGAACAAATTTGGGTGAACCCGGTCGTTTGCAGACCCAGCAAATATTG
GGGACACCAGCAACAAAGAAGGAGATTGAGTTGCAAGTTGTAGAGGTTCCGTCTATAATTAACCTTCAAAAACCATTCAT
GCTAAAATTGAATCTTACAAACCAGACAGATAGAGAGCTGGGCCCATTTGAAGTTGGCTTATCTCAAAATGTTTCATATG
GGGAGAGAGTTGTTATGATTAATGGCCTTCAATCAATGGTTTTATCAGAGGTTCAGGCTTTAGGATCTACAAATTTCCAC
CTGAATCTCATAGCTACTAAACCTGGAATTCAGAGAATTACGGGAATTACAGTTTTTGATACTAGGGAGATGAAATCTTA
TGAACCACTTCCAGATTTAGAGATTTTTGTGGACATGGATTAA
Microexon DNA seq GTCCGTGTTATCAAG
Microexon Amino Acid seq VRVIK
Microexon-tag DNA Seq CAATTTTTCAAGTTCATCGTTGCTAATCCACTTTCCGTTAGGACAAAGGTCCGTGTTATCAAGGAGACTACCTTTTTGGAAGCTTGTATTGAAAACCATACAAAATCA
Microexon-tag Amino Acid seq QFFKFIVANPLSVRTKVRVIKETTFLEACIENHTKS
Transcript ID KRH20475
Gene ID Gm.12262
Gene Name NA
Pfam domain motif DUF974
Motif E-value 3.9e-64
Motif start 90
Motif end 319
Protein seq >KRH20475
MSQVGGQGGGGSHSLAFRVMRLCRPSFNVEPPLRLDPTDLFVGEDLFDDPAAKPHSFSSAAAHDDDSDPNYRNRFLLRHF
SDAMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVREVLIKAEIQTERQRILLLDTSKSPVETIRAGGRYDFIVEH
DVKELGPHTLVCTALYNDGDGERKYLPQFFKFIVANPLSVRTKVRVIKETTFLEACIENHTKSNLFMDQVDFEPAQYYSA
TILKGDGHHSEKDSPTREIFKPPILIRSGGGIYNYLYQLKTLSDGSPQTKVEGSNVLGKLQITWRTNLGEPGRLQTQQIL
GTPATKKEIELQVVEVPSIINLQKPFMLKLNLTNQTDRELGPFEVGLSQNVSYGERVVMINGLQSMVLSEVQALGSTNFH
LNLIATKPGIQRITGITVFDTREMKSYEPLPDLEIFVDMD*
CDS seq >KRH20475
ATGAGTCAGGTTGGTGGTCAGGGAGGAGGAGGATCTCATTCGCTGGCGTTCCGCGTGATGCGTTTGTGCCGCCCTTCCTT
CAACGTCGAACCTCCACTCCGCCTCGACCCCACCGACCTCTTCGTCGGCGAGGACCTCTTCGACGACCCCGCTGCCAAAC
CCCACTCTTTCTCCTCCGCTGCCGCCCACGACGACGACTCCGATCCCAACTACCGTAACCGCTTCCTCCTCCGCCACTTC
TCCGACGCCATGGGCCTCTCCGGCCTCCTCGTTCTCCCTCAGTCTTTCGGAGCCATTTATTTGGGGGAGACCTTCTGCAG
CTACATCAGTATAAACAACAGCTCCAATTTTGAAGTCAGAGAAGTCCTTATCAAGGCTGAAATTCAAACAGAGAGACAGA
GAATACTTCTTTTAGACACATCAAAATCGCCAGTTGAAACGATACGTGCGGGTGGGCGTTATGATTTTATTGTAGAACAT
GATGTGAAAGAGCTTGGACCTCACACGCTGGTCTGCACTGCATTGTATAATGATGGTGACGGTGAGCGTAAATATCTTCC
TCAATTTTTCAAGTTCATCGTTGCTAATCCACTTTCCGTTAGGACAAAGGTCCGTGTTATCAAGGAGACTACCTTTTTGG
AAGCTTGTATTGAAAACCATACAAAATCAAACCTTTTCATGGACCAAGTTGATTTTGAACCTGCTCAGTATTATAGTGCA
ACAATACTTAAAGGTGATGGGCACCATTCTGAGAAAGATAGTCCTACAAGGGAGATATTTAAGCCACCAATACTAATTAG
ATCTGGTGGAGGAATTTACAACTATCTCTATCAATTGAAAACGTTATCAGATGGTTCGCCTCAAACAAAAGTTGAAGGAA
GTAATGTTCTTGGTAAACTCCAGATAACATGGCGAACAAATTTGGGTGAACCCGGTCGTTTGCAGACCCAGCAAATATTG
GGGACACCAGCAACAAAGAAGGAGATTGAGTTGCAAGTTGTAGAGGTTCCGTCTATAATTAACCTTCAAAAACCATTCAT
GCTAAAATTGAATCTTACAAACCAGACAGATAGAGAGCTGGGCCCATTTGAAGTTGGCTTATCTCAAAATGTTTCATATG
GGGAGAGAGTTGTTATGATTAATGGCCTTCAATCAATGGTTTTATCAGAGGTTCAGGCTTTAGGATCTACAAATTTCCAC
CTGAATCTCATAGCTACTAAACCTGGAATTCAGAGAATTACGGGAATTACAGTTTTTGATACTAGGGAGATGAAATCTTA
TGAACCACTTCCAGATTTAGAGATTTTTGTGGACATGGATTAA