Microexon ID Gm_20:43609905-43609910:-
Species Glycine max
Coordinates 20:43609905..43609910
Microexon Cluster ID MEP10
Size 6
Phase 0
Pfam Domain Motif DUF4788
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 51,6,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TTTGARGAYTAYATYGADCCMCTYAAGRTKTACCTGRMTAGRTACAGAGAGWTGGAGGGTGAYACYAAGGGATCTGCWARRGSTGGWGATGSATCTGCTAARARRGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons (MSA) Not available for Gm_20:43609905-43609910:-
Transcript ID KRG92229
Protein ID KRG92229
Gene ID GLYMA_20G198500
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRG92229
MSDAPPSPTHESGGEQSPRGSSSGAREQDRYLPIANISRIMKKALPPNGKIAKDAKDTMQECVSEFISFITSEASEKCQK
EKRKTINGDDLLWAMATLGFEDYIEPLKVYLARYREAEGDTKGSARSGDGSATPDQVGLAGQNSQLVHQGSLNYIGLQVQ
PQHLVMPSMQSHE*
CDS seq >KRG92229
ATGTCGGATGCGCCACCGAGCCCGACTCATGAGAGTGGGGGCGAGCAGAGCCCGCGCGGTTCGTCGTCCGGCGCGAGGGA
GCAGGACCGGTACCTCCCGATTGCCAACATCAGCCGCATTATGAAGAAGGCTCTGCCTCCCAACGGCAAGATTGCAAAGG
ATGCCAAAGACACCATGCAGGAATGCGTTTCTGAGTTCATCAGCTTCATTACCAGCGAGGCGAGTGAGAAATGCCAGAAG
GAGAAGAGAAAGACAATCAATGGAGACGATTTGCTATGGGCCATGGCCACTTTAGGATTTGAAGACTACATAGAGCCGCT
TAAGGTGTACCTGGCTAGGTACAGAGAGGCGGAGGGTGACACTAAAGGATCTGCTAGAAGTGGTGATGGATCTGCTACAC
CAGATCAAGTTGGCCTTGCAGGTCAAAATTCTCAGCTTGTTCATCAGGGTTCGCTGAACTATATTGGTTTGCAGGTGCAA
CCACAACATCTGGTTATGCCTTCAATGCAAAGCCATGAATAG
Microexon DNA seq GCGGAG
Microexon Amino Acid seq AE
Microexon-tag DNA Seq TTTGAAGACTACATAGAGCCGCTTAAGGTGTACCTGGCTAGGTACAGAGAGGCGGAGGGTGACACTAAAGGATCTGCTAGAAGTGGTGATGGATCTGCTACACCAGAT
Microexon-tag Amino Acid seq FEDYIEPLKVYLARYREAEGDTKGSARSGDGSATPD
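The fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database itself: it cuts the KRG92229 microexon-tag by the 51,6,51 structure and translates the pieces with the standard genetic code. The helper names `translate` and `split_tag` are hypothetical, chosen for illustration, and the sketch assumes the tag starts in frame (consistent with phase 0).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string whose length is a multiple of 3."""
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def split_tag(tag, sizes):
    """Cut a microexon-tag into consecutive pieces of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("sizes do not add up to the tag length")
    parts, start = [], 0
    for size in sizes:
        parts.append(tag[start:start + size])
        start += size
    return parts


# Values copied from the KRG92229 entry in this record.
TAG = "TTTGAAGACTACATAGAGCCGCTTAAGGTGTACCTGGCTAGGTACAGAGAGGCGGAGGGTGACACTAAAGGATCTGCTAGAAGTGGTGATGGATCTGCTACACCAGAT"
STRUCTURE = (51, 6, 51)  # flanking exon, microexon, flanking exon

upstream, microexon, downstream = split_tag(TAG, STRUCTURE)
print(microexon)             # GCGGAG
print(translate(microexon))  # AE
print(translate(TAG))        # FEDYIEPLKVYLARYREAEGDTKGSARSGDGSATPD
```

The middle piece reproduces the Microexon DNA seq and Microexon Amino Acid seq fields, and the full translation reproduces the Microexon-tag Amino Acid seq field. The reported size of 6 likewise agrees with the coordinates, since 43609910 - 43609905 + 1 = 6.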
Transcript ID Gm.34318.1
Gene ID Gm.34318
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Gm.34318.1
MSDAPPSPTHESGGEQSPRGSSSGAREQDRYLPIANISRIMKKALPPNGKIAKDAKDTMQECVSEFISFITSEASEKCQK
EKRKTINGDDLLWAMATLGFEDYIEPLKVYLARYREAEGDTKGSARSGDGSATPDQVGLAGQNSQLVHQGSLNYIGLQVQ
PQHLVMPSMQSHE*
CDS seq >Gm.34318.1
ATGTCGGATGCGCCACCGAGCCCGACTCATGAGAGTGGGGGCGAGCAGAGCCCGCGCGGTTCGTCGTCCGGCGCGAGGGA
GCAGGACCGGTACCTCCCGATTGCCAACATCAGCCGCATTATGAAGAAGGCTCTGCCTCCCAACGGCAAGATTGCAAAGG
ATGCCAAAGACACCATGCAGGAATGCGTTTCTGAGTTCATCAGCTTCATTACCAGCGAGGCGAGTGAGAAATGCCAGAAG
GAGAAGAGAAAGACAATCAATGGAGACGATTTGCTATGGGCCATGGCCACTTTAGGATTTGAAGACTACATAGAGCCGCT
TAAGGTGTACCTGGCTAGGTACAGAGAGGCGGAGGGTGACACTAAAGGATCTGCTAGAAGTGGTGATGGATCTGCTACAC
CAGATCAAGTTGGCCTTGCAGGTCAAAATTCTCAGCTTGTTCATCAGGGTTCGCTGAACTATATTGGTTTGCAGGTGCAA
CCACAACATCTGGTTATGCCTTCAATGCAAAGCCATGAATAG