
Microexon ID | Gm_3:468492-468495:- |
Species | Glycine max | Coordinates | 3:468492..468495 |
Microexon Cluster ID | Unclassified |
Size | 4 |
Gm_3:468492-468495:- does not have available information here.
Transcript ID | KRH64942 |
Protein ID | KRH64942 |
Gene ID | GLYMA_03G005300 |
Gene Name | NA |
Pfam domain motif | MatE |
Motif E-value | 9e-31 |
Motif start | 40 |
Motif end | 200 |
Protein seq | >KRH64942 MEGNLEKKLLSREQKSEEENLSLVKRVWEESKVMWIVAAPAIFTRFTTFGISVISQAFIGHIGSRELAAYALVFTVIIRF ANGILLGMASALSTLCGQAYGAKEYDMMGVYLQRSWIVLFLSAICLLPLFIFTSPILTLLGQDESIAQVARTISIWSIPV LFAYIVSNSCQTFLQSQSKNVIISYLAALSIIIHVSLSWLFTMQFKYGIPGAMISTILAYWIPNIGQLIFITCGWCPETW KGFSFLAFKDLWPVAKLSISSGAMLCLELWYSTILILLTGNMKDAEVQIDALSICINISGWEMMIAFGFMAAVSVRVANE LGRENSKAAKFSIVVTVLTSFAIGFILFVLFLILREKVAYLFTSNEDVATAVGDLSPLLALSLLLNSIQPVLSGVAVGAG WQSTVAYVNIGCYYLIGIPVGIVLGNIIHLQVKGIWIGMLFGTLIQTIILIIITYKTNWDEQVIIARDRINKWSKMVLDH ETITSDN* |
CDS seq | >KRH64942 ATGGAGGGGAATCTTGAGAAGAAGCTGTTGAGCAGAGAGCAAAAATCAGAAGAAGAGAATTTATCATTGGTGAAGAGGGT GTGGGAAGAGAGCAAGGTGATGTGGATAGTGGCAGCACCAGCCATATTCACAAGGTTCACAACCTTTGGCATCAGTGTTA TAAGCCAAGCATTTATTGGTCATATTGGTTCAAGGGAACTCGCTGCTTATGCCCTTGTGTTCACTGTCATCATACGCTTC GCCAATGGAATTCTGTTAGGAATGGCAAGTGCATTGTCAACACTTTGTGGACAAGCATACGGCGCAAAAGAATATGACAT GATGGGAGTGTATCTTCAAAGATCATGGATAGTTTTATTCTTAAGTGCAATCTGTCTTCTTCCGTTGTTCATCTTCACAA GCCCAATTTTGACTCTCTTAGGCCAAGATGAGAGCATAGCACAAGTGGCAAGAACCATTTCTATTTGGTCAATTCCTGTC TTATTTGCTTATATTGTCTCAAACAGCTGCCAGACATTCCTTCAATCTCAAAGCAAGAATGTCATTATTTCATATTTGGC AGCTTTATCAATAATCATTCATGTGTCCCTCTCCTGGCTATTCACAATGCAATTCAAGTATGGGATTCCTGGTGCAATGA TTTCAACAATTTTGGCATACTGGATTCCGAACATTGGTCAACTGATATTTATTACATGTGGTTGGTGCCCTGAAACATGG AAAGGTTTCTCTTTTTTAGCATTCAAAGATCTTTGGCCGGTTGCCAAGCTTTCCATTTCATCTGGTGCCATGTTATGTCT TGAGCTCTGGTATAGCACAATATTGATTCTTTTGACTGGTAACATGAAAGATGCTGAGGTTCAAATTGATGCTCTATCTA TATGTATTAACATCAGTGGATGGGAAATGATGATAGCATTTGGTTTCATGGCTGCTGTTAGTGTTCGAGTGGCAAATGAA CTTGGAAGGGAAAACTCCAAAGCTGCAAAGTTCTCTATAGTTGTGACAGTGCTTACATCATTTGCAATTGGGTTTATCTT ATTTGTTCTTTTTTTAATTTTAAGAGAAAAAGTAGCTTATCTCTTTACTTCAAACGAAGATGTGGCTACTGCTGTGGGGG ATTTGTCACCTTTGTTAGCGCTTTCTTTGTTACTAAATAGTATTCAACCTGTACTCTCAGGGGTGGCTGTTGGAGCAGGG TGGCAGAGCACTGTAGCGTATGTGAACATAGGGTGTTATTACCTCATAGGTATTCCGGTTGGAATAGTACTTGGTAACAT TATTCACTTGCAAGTCAAGGGTATTTGGATTGGAATGTTGTTTGGGACACTAATTCAAACTATAATCCTAATTATAATCA CCTACAAAACTAATTGGGATGAGCAGGTGATTATAGCCCGTGATCGTATTAATAAGTGGTCTAAAATGGTCCTTGATCAT GAAACAATTACATCAGATAATTAG |
Microexon DNA seq | TTAG |
Microexon Amino Acid seq | LG |
Microexon-tag DNA Seq | GCTTATGCCCTTGTGTTCACTGTCATCATACGCTTCGCCAATGGAATTCTGTTAGGAATGGCAAGTGCATTGTCAACACTTTGTGGACAAGCATACGGCGCAAAAGAA |
Microexon-tag Amino Acid seq | AYALVFTVIIRFANGILLGMASALSTLCGQAYGAKE |
Transcript ID | KRH64942 |
Gene ID | Gm.34894 |
Gene Name | NA |
Pfam domain motif | MatE |
Motif E-value | 9e-31 |
Motif start | 40 |
Motif end | 200 |
Protein seq | >KRH64942 MEGNLEKKLLSREQKSEEENLSLVKRVWEESKVMWIVAAPAIFTRFTTFGISVISQAFIGHIGSRELAAYALVFTVIIRF ANGILLGMASALSTLCGQAYGAKEYDMMGVYLQRSWIVLFLSAICLLPLFIFTSPILTLLGQDESIAQVARTISIWSIPV LFAYIVSNSCQTFLQSQSKNVIISYLAALSIIIHVSLSWLFTMQFKYGIPGAMISTILAYWIPNIGQLIFITCGWCPETW KGFSFLAFKDLWPVAKLSISSGAMLCLELWYSTILILLTGNMKDAEVQIDALSICINISGWEMMIAFGFMAAVSVRVANE LGRENSKAAKFSIVVTVLTSFAIGFILFVLFLILREKVAYLFTSNEDVATAVGDLSPLLALSLLLNSIQPVLSGVAVGAG WQSTVAYVNIGCYYLIGIPVGIVLGNIIHLQVKGIWIGMLFGTLIQTIILIIITYKTNWDEQVIIARDRINKWSKMVLDH ETITSDN* |
CDS seq | >KRH64942 ATGGAGGGGAATCTTGAGAAGAAGCTGTTGAGCAGAGAGCAAAAATCAGAAGAAGAGAATTTATCATTGGTGAAGAGGGT GTGGGAAGAGAGCAAGGTGATGTGGATAGTGGCAGCACCAGCCATATTCACAAGGTTCACAACCTTTGGCATCAGTGTTA TAAGCCAAGCATTTATTGGTCATATTGGTTCAAGGGAACTCGCTGCTTATGCCCTTGTGTTCACTGTCATCATACGCTTC GCCAATGGAATTCTGTTAGGAATGGCAAGTGCATTGTCAACACTTTGTGGACAAGCATACGGCGCAAAAGAATATGACAT GATGGGAGTGTATCTTCAAAGATCATGGATAGTTTTATTCTTAAGTGCAATCTGTCTTCTTCCGTTGTTCATCTTCACAA GCCCAATTTTGACTCTCTTAGGCCAAGATGAGAGCATAGCACAAGTGGCAAGAACCATTTCTATTTGGTCAATTCCTGTC TTATTTGCTTATATTGTCTCAAACAGCTGCCAGACATTCCTTCAATCTCAAAGCAAGAATGTCATTATTTCATATTTGGC AGCTTTATCAATAATCATTCATGTGTCCCTCTCCTGGCTATTCACAATGCAATTCAAGTATGGGATTCCTGGTGCAATGA TTTCAACAATTTTGGCATACTGGATTCCGAACATTGGTCAACTGATATTTATTACATGTGGTTGGTGCCCTGAAACATGG AAAGGTTTCTCTTTTTTAGCATTCAAAGATCTTTGGCCGGTTGCCAAGCTTTCCATTTCATCTGGTGCCATGTTATGTCT TGAGCTCTGGTATAGCACAATATTGATTCTTTTGACTGGTAACATGAAAGATGCTGAGGTTCAAATTGATGCTCTATCTA TATGTATTAACATCAGTGGATGGGAAATGATGATAGCATTTGGTTTCATGGCTGCTGTTAGTGTTCGAGTGGCAAATGAA CTTGGAAGGGAAAACTCCAAAGCTGCAAAGTTCTCTATAGTTGTGACAGTGCTTACATCATTTGCAATTGGGTTTATCTT ATTTGTTCTTTTTTTAATTTTAAGAGAAAAAGTAGCTTATCTCTTTACTTCAAACGAAGATGTGGCTACTGCTGTGGGGG ATTTGTCACCTTTGTTAGCGCTTTCTTTGTTACTAAATAGTATTCAACCTGTACTCTCAGGGGTGGCTGTTGGAGCAGGG TGGCAGAGCACTGTAGCGTATGTGAACATAGGGTGTTATTACCTCATAGGTATTCCGGTTGGAATAGTACTTGGTAACAT TATTCACTTGCAAGTCAAGGGTATTTGGATTGGAATGTTGTTTGGGACACTAATTCAAACTATAATCCTAATTATAATCA CCTACAAAACTAATTGGGATGAGCAGGTGATTATAGCCCGTGATCGTATTAATAAGTGGTCTAAAATGGTCCTTGATCAT GAAACAATTACATCAGATAATTAG |