
Microexon ID | Gm_16:32424002-32424015:- |
Species | Glycine max |
Coordinates | 16:32424002..32424015 |
Microexon Cluster ID | MEP39 |
Size (nt) | 14 |
Phase | 1 |
Pfam Domain Motif | Unknown |
Structure of Microexon-tag (sizes in nt of the flanking exons and the microexon, in order) | 24,22,14,48 |
Microexon location in the Microexon-tag | 3 |
Microexon-tag DNA Seq | GRWGMWGGAGRYATGTAYKSYBTYCAASSTTCTGGAGCYMGKGCAGKTGGATTTCCWCAGATGGSMAATGCTGCAGCMATTGCAGCTGCCTTTGSKGGWGGTTTGCCT |
Logo of Microexon-tag DNA Seq | [figure: sequence logo] |
Alignment of exons | [figure: exon alignment] |
No additional information is available for Gm_16:32424002-32424015:-.
Transcript ID | KRH08666 |
Protein ID | KRH08666 |
Gene ID | GLYMA_16G165000 |
Gene Name | NA |
Pfam domain motif | Unknown |
Motif E-value | NA |
Motif start | NA |
Motif end | NA |
Protein seq | >KRH08666 MAEPSKVIHVRNVGHEISENDLLQLFQPFGVITKLVMLRAKNQALLQMQDIPSAVNALQFYANVQPSIRGRNVYVQFSSH QELTTMDQNQAREDEPNRILLVTVHHMLYPITADVLHQVFSPHGFVEKIVTFQKSAGFQALIQYQSRQSAVTARSTLQGR NIYDGCCQLDIQFSNLDELQVNYNNDRSRDFTNPNLPTEQKGRSSQPGYGDAAGMYSGARAGGFSQMANAAAIAAAFGGG LPPGITGTNERCTVLVANLNPDRIDEDKLFNLFSIYGNIVRIKLLRNKPDHALIQMGDGFQAELAVHFLKGAMLFGKRLE VNYSKHANITQGADTHEYVNSNLNRFNRNAAKNYRYCCSPTKMVHLSTLPQDITEEEVVSLLEEHGTIVNSKVFEMNGKK QALVQFETEEQATEALVCKHASPLSGSVVRISFSQLQNI* |
CDS seq | >KRH08666 ATGGCTGAACCTTCCAAGGTCATTCACGTTCGAAATGTGGGGCATGAGATATCTGAAAATGATTTGCTTCAACTATTTCA GCCTTTTGGAGTAATAACAAAGCTTGTGATGTTGCGTGCCAAAAATCAGGCTCTTCTTCAAATGCAAGATATTCCTTCTG CAGTTAATGCTTTACAATTTTATGCAAATGTCCAGCCAAGCATAAGGGGGAGAAATGTTTATGTCCAGTTTTCCTCACAT CAGGAACTAACTACAATGGATCAAAATCAAGCACGAGAAGACGAGCCAAATCGAATTCTCTTAGTTACAGTTCATCACAT GCTGTATCCTATAACAGCGGATGTGCTACATCAAGTGTTTTCTCCCCATGGATTTGTGGAAAAGATTGTAACATTTCAGA AGTCAGCTGGCTTTCAAGCTCTAATCCAGTATCAATCCCGTCAAAGTGCTGTTACTGCCAGAAGTACTCTTCAGGGACGC AATATTTATGATGGTTGTTGTCAGCTGGACATTCAGTTCTCAAACCTTGATGAACTACAAGTGAACTACAATAATGATCG TTCAAGGGACTTCACAAACCCTAATCTGCCTACAGAGCAGAAAGGTCGATCTTCACAACCTGGATATGGTGATGCAGCAG GCATGTATTCAGGAGCCAGGGCAGGTGGGTTCTCTCAGATGGCCAATGCTGCGGCAATTGCAGCTGCCTTTGGGGGAGGT TTGCCTCCTGGCATAACTGGAACAAATGAAAGGTGTACAGTTCTTGTTGCAAATCTCAATCCTGATAGAATAGATGAGGA TAAACTGTTCAACTTGTTCTCCATTTATGGGAACATTGTCAGAATTAAACTTCTCCGAAATAAGCCAGATCATGCACTTA TCCAAATGGGAGATGGTTTCCAAGCTGAATTGGCAGTACATTTTCTGAAGGGAGCCATGTTGTTTGGAAAGCGATTGGAG GTCAACTATTCGAAGCATGCGAACATAACCCAAGGTGCTGATACACATGAGTATGTCAATTCAAATCTCAATCGATTCAA TCGTAATGCTGCCAAGAACTATCGGTACTGCTGCTCACCGACAAAAATGGTCCACTTGTCCACCCTCCCGCAAGACATAA CTGAAGAGGAGGTTGTAAGCCTTTTGGAGGAGCATGGAACCATTGTCAACAGCAAGGTCTTTGAGATGAATGGAAAAAAA CAGGCACTTGTTCAGTTTGAGACTGAGGAGCAGGCTACTGAAGCCCTTGTGTGCAAGCATGCAAGTCCACTTTCTGGATC AGTTGTTCGCATCTCCTTTTCCCAGTTGCAGAATATATGA |
Microexon DNA seq | GTGGGTTCTCTCAG |
Microexon Amino Acid seq | GGFSQ |
Microexon-tag DNA Seq | CCTGGATATGGTGATGCAGCAGGCATGTATTCAGGAGCCAGGGCAGGTGGGTTCTCTCAGATGGCCAATGCTGCGGCAATTGCAGCTGCCTTTGGGGGAGGTTTGCCT |
Microexon-tag Amino Acid seq | PGYGDAAGMYSGARAGGFSQMANAAAIAAAFGGGLP |
Transcript ID | Gm.20853.1 |
Gene ID | Gm.20853 |
Gene Name | NA |
Pfam domain motif | Unknown |
Motif E-value | NA |
Motif start | NA |
Motif end | NA |
Protein seq | >Gm.20853.1 MAEPSKVIHVRNVGHEISENDLLQLFQPFGVITKLVMLRAKNQALLQMQDIPSAVNALQFYANVQPSIRGRNVYVQFSSH QELTTMDQNQAREDEPNRILLVTVHHMLYPITADVLHQVFSPHGFVEKIVTFQKSAGFQALIQYQSRQSAVTARSTLQGR NIYDGCCQLDIQFSNLDELQVNYNNDRSRDFTNPNLPTEQKGRSSQPGYGDAAGMYSGARAGGFSQMANAAAIAAAFGGG LPPGITGTNERCTVLVANLNPDRIDEDKLFNLFSIYGNIVRIKLLRNKPDHALIQMGDGFQAELAVHFLKGAMLFGKRLE VNYSKHANITQGADTHEYVNSNLNRFNRNAAKNYRYCCSPTKMVHLSTLPQDITEEEVVSLLEEHGTIVNSKVFEMNGKK QALVQFETEEQATEALVCKHASPLSGSVVRISFSQLQNI* |
CDS seq | >Gm.20853.1 ATGGCTGAACCTTCCAAGGTCATTCACGTTCGAAATGTGGGGCATGAGATATCTGAAAATGATTTGCTTCAACTATTTCA GCCTTTTGGAGTAATAACAAAGCTTGTGATGTTGCGTGCCAAAAATCAGGCTCTTCTTCAAATGCAAGATATTCCTTCTG CAGTTAATGCTTTACAATTTTATGCAAATGTCCAGCCAAGCATAAGGGGGAGAAATGTTTATGTCCAGTTTTCCTCACAT CAGGAACTAACTACAATGGATCAAAATCAAGCACGAGAAGACGAGCCAAATCGAATTCTCTTAGTTACAGTTCATCACAT GCTGTATCCTATAACAGCGGATGTGCTACATCAAGTGTTTTCTCCCCATGGATTTGTGGAAAAGATTGTAACATTTCAGA AGTCAGCTGGCTTTCAAGCTCTAATCCAGTATCAATCCCGTCAAAGTGCTGTTACTGCCAGAAGTACTCTTCAGGGACGC AATATTTATGATGGTTGTTGTCAGCTGGACATTCAGTTCTCAAACCTTGATGAACTACAAGTGAACTACAATAATGATCG TTCAAGGGACTTCACAAACCCTAATCTGCCTACAGAGCAGAAAGGTCGATCTTCACAACCTGGATATGGTGATGCAGCAG GCATGTATTCAGGAGCCAGGGCAGGTGGGTTCTCTCAGATGGCCAATGCTGCGGCAATTGCAGCTGCCTTTGGGGGAGGT TTGCCTCCTGGCATAACTGGAACAAATGAAAGGTGTACAGTTCTTGTTGCAAATCTCAATCCTGATAGAATAGATGAGGA TAAACTGTTCAACTTGTTCTCCATTTATGGGAACATTGTCAGAATTAAACTTCTCCGAAATAAGCCAGATCATGCACTTA TCCAAATGGGAGATGGTTTCCAAGCTGAATTGGCAGTACATTTTCTGAAGGGAGCCATGTTGTTTGGAAAGCGATTGGAG GTCAACTATTCGAAGCATGCGAACATAACCCAAGGTGCTGATACACATGAGTATGTCAATTCAAATCTCAATCGATTCAA TCGTAATGCTGCCAAGAACTATCGGTACTGCTGCTCACCGACAAAAATGGTCCACTTGTCCACCCTCCCGCAAGACATAA CTGAAGAGGAGGTTGTAAGCCTTTTGGAGGAGCATGGAACCATTGTCAACAGCAAGGTCTTTGAGATGAATGGAAAAAAA CAGGCACTTGTTCAGTTTGAGACTGAGGAGCAGGCTACTGAAGCCCTTGTGTGCAAGCATGCAAGTCCACTTTCTGGATC AGTTGTTCGCATCTCCTTTTCCCAGTTGCAGAATATATGA |
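
The fields above relate to each other arithmetically: the microexon-tag structure (24,22,14,48) partitions the 108-nt microexon-tag DNA sequence into consecutive exons, and the microexon location (3) picks out the 14-nt microexon. The following is a minimal sketch of that relationship, not the database's own procedure. The helper names `split_by_sizes` and `translate` are hypothetical, the standard genetic code is assumed, and the tag is assumed to start in frame, so that the microexon's offset modulo 3 can be compared with the listed phase.

```python
# Values copied from the record above (KRH08666 microexon-tag).
TAG_DNA = (
    "CCTGGATATGGTGATGCAGCAGGC"                          # flanking exon, 24 nt
    "ATGTATTCAGGAGCCAGGGCAG"                            # flanking exon, 22 nt
    "GTGGGTTCTCTCAG"                                    # microexon, 14 nt
    "ATGGCCAATGCTGCGGCAATTGCAGCTGCCTTTGGGGGAGGTTTGCCT"  # flanking exon, 48 nt
)
EXON_SIZES = [24, 22, 14, 48]
MICROEXON_LOCATION = 3  # 1-based position within the tag

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def split_by_sizes(seq, sizes):
    """Cut seq into consecutive pieces with the given lengths (hypothetical helper)."""
    if sum(sizes) != len(seq):
        raise ValueError("exon sizes do not add up to the sequence length")
    parts, start = [], 0
    for size in sizes:
        parts.append(seq[start:start + size])
        start += size
    return parts


def translate(dna):
    """Translate in frame 0, ignoring any trailing partial codon (hypothetical helper)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


exons = split_by_sizes(TAG_DNA, EXON_SIZES)
microexon = exons[MICROEXON_LOCATION - 1]
# Number of tag bases before the microexon; its remainder mod 3 is the number
# of codon bases contributed by the preceding exon (assumes the tag is in frame).
offset = sum(EXON_SIZES[:MICROEXON_LOCATION - 1])

print("microexon:", microexon)
print("offset mod 3:", offset % 3)
print("tag protein:", translate(TAG_DNA))
```

Under these assumptions the sketch recovers the listed microexon DNA sequence, an offset remainder equal to the listed phase, and the listed microexon-tag amino acid sequence. The listed microexon amino acid sequence (GGFSQ) includes the codon that spans the upstream exon boundary, which is why it has five residues for a 14-nt exon.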