Microexon ID Gm_3:468492-468495:-
Species Glycine max
Coordinates 3:468492..468495
Microexon Cluster ID Unclassified
Size 4
No additional information is available for Gm_3:468492-468495:-.
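The Microexon ID, Coordinates, and Size fields above appear to describe the same interval. The following is a minimal Python sketch, assuming the ID follows the pattern `<prefix>_<chromosome>:<start>-<end>:<strand>` and that coordinates are 1-based and inclusive. Both assumptions are inferred from this record alone, not from a documented format, and the function name `parse_microexon_id` is hypothetical.

```python
import re

def parse_microexon_id(microexon_id):
    """Split an ID such as 'Gm_3:468492-468495:-' into its parts.

    Assumed layout (inferred from this record only):
    <prefix>_<chromosome>:<start>-<end>:<strand>
    """
    match = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "prefix": prefix,
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes inclusive coordinates, so a 4-nt exon spans start..start+3.
        "size": end - start + 1,
    }

record = parse_microexon_id("Gm_3:468492-468495:-")
print(record)
```

Under these assumptions the computed size agrees with the Size field (4), and the trailing `-` is read as the minus strand.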
Transcript ID KRH64942
Protein ID KRH64942
Gene ID GLYMA_03G005300
Gene Name NA
Pfam domain motif MatE
Motif E-value 9e-31
Motif start 40
Motif end 200
Protein seq >KRH64942
MEGNLEKKLLSREQKSEEENLSLVKRVWEESKVMWIVAAPAIFTRFTTFGISVISQAFIGHIGSRELAAYALVFTVIIRF
ANGILLGMASALSTLCGQAYGAKEYDMMGVYLQRSWIVLFLSAICLLPLFIFTSPILTLLGQDESIAQVARTISIWSIPV
LFAYIVSNSCQTFLQSQSKNVIISYLAALSIIIHVSLSWLFTMQFKYGIPGAMISTILAYWIPNIGQLIFITCGWCPETW
KGFSFLAFKDLWPVAKLSISSGAMLCLELWYSTILILLTGNMKDAEVQIDALSICINISGWEMMIAFGFMAAVSVRVANE
LGRENSKAAKFSIVVTVLTSFAIGFILFVLFLILREKVAYLFTSNEDVATAVGDLSPLLALSLLLNSIQPVLSGVAVGAG
WQSTVAYVNIGCYYLIGIPVGIVLGNIIHLQVKGIWIGMLFGTLIQTIILIIITYKTNWDEQVIIARDRINKWSKMVLDH
ETITSDN*
CDS seq >KRH64942
ATGGAGGGGAATCTTGAGAAGAAGCTGTTGAGCAGAGAGCAAAAATCAGAAGAAGAGAATTTATCATTGGTGAAGAGGGT
GTGGGAAGAGAGCAAGGTGATGTGGATAGTGGCAGCACCAGCCATATTCACAAGGTTCACAACCTTTGGCATCAGTGTTA
TAAGCCAAGCATTTATTGGTCATATTGGTTCAAGGGAACTCGCTGCTTATGCCCTTGTGTTCACTGTCATCATACGCTTC
GCCAATGGAATTCTGTTAGGAATGGCAAGTGCATTGTCAACACTTTGTGGACAAGCATACGGCGCAAAAGAATATGACAT
GATGGGAGTGTATCTTCAAAGATCATGGATAGTTTTATTCTTAAGTGCAATCTGTCTTCTTCCGTTGTTCATCTTCACAA
GCCCAATTTTGACTCTCTTAGGCCAAGATGAGAGCATAGCACAAGTGGCAAGAACCATTTCTATTTGGTCAATTCCTGTC
TTATTTGCTTATATTGTCTCAAACAGCTGCCAGACATTCCTTCAATCTCAAAGCAAGAATGTCATTATTTCATATTTGGC
AGCTTTATCAATAATCATTCATGTGTCCCTCTCCTGGCTATTCACAATGCAATTCAAGTATGGGATTCCTGGTGCAATGA
TTTCAACAATTTTGGCATACTGGATTCCGAACATTGGTCAACTGATATTTATTACATGTGGTTGGTGCCCTGAAACATGG
AAAGGTTTCTCTTTTTTAGCATTCAAAGATCTTTGGCCGGTTGCCAAGCTTTCCATTTCATCTGGTGCCATGTTATGTCT
TGAGCTCTGGTATAGCACAATATTGATTCTTTTGACTGGTAACATGAAAGATGCTGAGGTTCAAATTGATGCTCTATCTA
TATGTATTAACATCAGTGGATGGGAAATGATGATAGCATTTGGTTTCATGGCTGCTGTTAGTGTTCGAGTGGCAAATGAA
CTTGGAAGGGAAAACTCCAAAGCTGCAAAGTTCTCTATAGTTGTGACAGTGCTTACATCATTTGCAATTGGGTTTATCTT
ATTTGTTCTTTTTTTAATTTTAAGAGAAAAAGTAGCTTATCTCTTTACTTCAAACGAAGATGTGGCTACTGCTGTGGGGG
ATTTGTCACCTTTGTTAGCGCTTTCTTTGTTACTAAATAGTATTCAACCTGTACTCTCAGGGGTGGCTGTTGGAGCAGGG
TGGCAGAGCACTGTAGCGTATGTGAACATAGGGTGTTATTACCTCATAGGTATTCCGGTTGGAATAGTACTTGGTAACAT
TATTCACTTGCAAGTCAAGGGTATTTGGATTGGAATGTTGTTTGGGACACTAATTCAAACTATAATCCTAATTATAATCA
CCTACAAAACTAATTGGGATGAGCAGGTGATTATAGCCCGTGATCGTATTAATAAGTGGTCTAAAATGGTCCTTGATCAT
GAAACAATTACATCAGATAATTAG
Microexon DNA seq TTAG
Microexon Amino Acid seq LG
Microexon-tag DNA Seq GCTTATGCCCTTGTGTTCACTGTCATCATACGCTTCGCCAATGGAATTCTGTTAGGAATGGCAAGTGCATTGTCAACACTTTGTGGACAAGCATACGGCGCAAAAGAA
Microexon-tag Amino Acid seq AYALVFTVIIRFANGILLGMASALSTLCGQAYGAKE
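The relationship between the four sequence fields above can be checked with a short Python sketch. It translates the microexon-tag DNA with the standard genetic code and locates the microexon inside the tag. It assumes the tag is given in reading frame from its first base and that the first occurrence of `TTAG` in the tag is the microexon itself; neither is stated in the record. The helper name `translate` is hypothetical.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}

def translate(dna):
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

microexon_dna = "TTAG"
tag_dna = (
    "GCTTATGCCCTTGTGTTCACTGTCATCATACGCTTCGCCAATGGAATTCTGTTAGGAATGGCAAGTGC"
    "ATTGTCAACACTTTGTGGACAAGCATACGGCGCAAAAGAA"
)
tag_protein = translate(tag_dna)

# 0-based offset of the microexon in the tag (assumed: first occurrence).
offset = tag_dna.find(microexon_dna)
first_codon = offset // 3
last_codon = (offset + len(microexon_dna) - 1) // 3
# Residues whose codons overlap the microexon.
overlapping_residues = tag_protein[first_codon:last_codon + 1]

print(tag_protein)
print(offset, overlapping_residues)
```

Under these assumptions the translated tag matches the Microexon-tag Amino Acid seq field, and the residues whose codons overlap the microexon are `LG`, which agrees with the Microexon Amino Acid seq field: the exon contributes the full `TTA` codon and the first base of the following `GGA` codon.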
Transcript ID KRH64942
Gene ID Gm.34894