Microexon ID Gm_2:6941544-6941557:-
Species Glycine max
Coordinates 2:6941544..6941557
Microexon Cluster ID MEP39
Size 14
Phase 1
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 24,22,14,48
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq GRWGMWGGAGRYATGTAYKSYBTYCAASSTTCTGGAGCYMGKGCAGKTGGATTTCCWCAGATGGSMAATGCTGCAGCMATTGCAGCTGCCTTTGSKGGWGGTTTGCCT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Gm_2:6941544-6941557:- does not have available information here.
Transcript ID KRH70272
Protein ID KRH70272
Gene ID GLYMA_02G080500
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRH70272
MAEPSKVIHVRNVGHEISENDLLQLFQPFGVITKLVMLRAKNQALLQMQDIPSAVNALQFYANVQPSIRGRNVYVQFSSH
QELTTMDQNQAREDEPNRILLVTVHHMLYPITADVLHQVFSPHGFVEKIVTFQKSAGFQALIQYQSRQSAVTARSTLQGR
NIYDGCCQLDIQFSNLDELQVNYNNDRSRDFTNPNLPTEQKGRSSQPGYGDAGGMHSGARAGGFSQMANAAAIAAAFGGG
LPPGITGTNERCTVLVANLNPDRIDEDKMFNLFSIYGNIVRIKLLRNKPDHALIQMGDGFQAELAVHFLKGAMLFGKRLE
VNYSKHANITQGADTHEYANSNLNRFNRNAAKNYRYCCSPTKMIHLSTLPQDITEEEIVSLLEEHGTIVNSKVFEMNGKK
QALVQFETEEQATEALVCKHASPLSGSVVRISFSQLQNI*
CDS seq >KRH70272
ATGGCTGAACCTTCCAAGGTCATTCACGTTCGAAATGTGGGGCATGAGATATCTGAAAATGATTTGCTTCAACTATTTCA
GCCTTTTGGAGTAATAACAAAGCTTGTGATGTTGCGTGCCAAAAATCAGGCTCTTCTTCAAATGCAAGATATTCCTTCTG
CAGTTAATGCTTTACAATTTTATGCAAATGTCCAGCCAAGCATAAGGGGGAGAAATGTTTATGTCCAGTTTTCCTCACAT
CAAGAACTAACTACAATGGATCAAAATCAAGCACGAGAAGATGAGCCAAATCGAATTCTCTTAGTTACAGTTCATCACAT
GCTGTATCCTATAACAGCGGATGTGCTACATCAAGTGTTTTCTCCCCATGGATTTGTGGAAAAGATTGTAACATTCCAGA
AGTCAGCTGGCTTTCAGGCTCTCATCCAGTATCAATCCCGTCAAAGTGCTGTTACTGCCAGAAGTACTCTTCAGGGACGC
AATATTTATGATGGTTGTTGTCAGCTGGACATTCAGTTCTCAAACCTTGATGAACTACAAGTGAACTACAATAATGACCG
TTCAAGGGACTTCACAAACCCTAATCTGCCTACAGAGCAGAAAGGTCGATCTTCACAACCTGGATATGGTGATGCAGGAG
GCATGCATTCTGGAGCCAGGGCAGGTGGATTCTCTCAGATGGCCAATGCTGCAGCAATTGCAGCTGCCTTTGGGGGAGGT
TTGCCTCCTGGCATAACTGGAACAAATGAAAGGTGTACAGTTCTTGTCGCAAATCTCAATCCCGATAGAATAGATGAGGA
TAAAATGTTCAACTTGTTCTCCATTTATGGGAACATTGTCAGAATTAAACTTCTCCGAAATAAGCCAGATCATGCACTTA
TCCAGATGGGAGATGGTTTTCAAGCTGAATTGGCAGTACATTTTCTGAAGGGAGCCATGTTGTTTGGAAAGCGATTGGAG
GTCAACTATTCAAAGCATGCAAACATAACCCAAGGTGCTGATACACACGAGTACGCCAATTCAAATCTCAATCGATTCAA
TCGTAATGCTGCTAAGAACTACCGGTATTGCTGCTCTCCAACAAAGATGATCCACTTGTCCACCCTCCCGCAAGACATAA
CCGAAGAGGAGATTGTGAGCCTTTTGGAGGAGCATGGAACCATTGTCAACAGCAAGGTCTTTGAGATGAATGGAAAAAAG
CAGGCACTTGTTCAGTTTGAGACTGAGGAGCAGGCTACTGAAGCCCTTGTGTGCAAGCATGCAAGCCCACTTTCTGGATC
GGTTGTTCGCATCTCCTTTTCCCAGTTGCAGAATATATGA
Microexon DNA seq GTGGATTCTCTCAG
Microexon Amino Acid seq GGFSQ
Microexon-tag DNA Seq CCTGGATATGGTGATGCAGGAGGCATGCATTCTGGAGCCAGGGCAGGTGGATTCTCTCAGATGGCCAATGCTGCAGCAATTGCAGCTGCCTTTGGGGGAGGTTTGCCT
Microexon-tag Amino Acid seq PGYGDAGGMHSGARAGGFSQMANAAAIAAAFGGGLP
Transcript ID Gm.30202.1
Gene ID Gm.30202
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Gm.30202.1
MAEPSKVIHVRNVGHEISENDLLQLFQPFGVITKLVMLRAKNQALLQMQDIPSAVNALQFYANVQPSIRGRNVYVQFSSH
QELTTMDQNQAREDEPNRILLVTVHHMLYPITADVLHQVFSPHGFVEKIVTFQKSAGFQALIQYQSRQSAVTARSTLQGR
NIYDGCCQLDIQFSNLDELQVNYNNDRSRDFTNPNLPTEQKGRSSQPGYGDAGGMHSGARAGGFSQMANAAAIAAAFGGG
LPPGITGTNERCTVLVANLNPDRIDEDKMFNLFSIYGNIVRIKLLRNKPDHALIQMGDGFQAELAVHFLKGAMLFGKRLE
VNYSKHANITQGADTHEYANSNLNRFNRNAAKNYRYCCSPTKMIHLSTLPQDITEEEIVSLLEEHGTIVNSKVFEMNGKK
QALVQFETEEQATEALVCKHASPLSGSVVRISFSQLQNI*
CDS seq >Gm.30202.1
ATGGCTGAACCTTCCAAGGTCATTCACGTTCGAAATGTGGGGCATGAGATATCTGAAAATGATTTGCTTCAACTATTTCA
GCCTTTTGGAGTAATAACAAAGCTTGTGATGTTGCGTGCCAAAAATCAGGCTCTTCTTCAAATGCAAGATATTCCTTCTG
CAGTTAATGCTTTACAATTTTATGCAAATGTCCAGCCAAGCATAAGGGGGAGAAATGTTTATGTCCAGTTTTCCTCACAT
CAAGAACTAACTACAATGGATCAAAATCAAGCACGAGAAGATGAGCCAAATCGAATTCTCTTAGTTACAGTTCATCACAT
GCTGTATCCTATAACAGCGGATGTGCTACATCAAGTGTTTTCTCCCCATGGATTTGTGGAAAAGATTGTAACATTCCAGA
AGTCAGCTGGCTTTCAGGCTCTCATCCAGTATCAATCCCGTCAAAGTGCTGTTACTGCCAGAAGTACTCTTCAGGGACGC
AATATTTATGATGGTTGTTGTCAGCTGGACATTCAGTTCTCAAACCTTGATGAACTACAAGTGAACTACAATAATGACCG
TTCAAGGGACTTCACAAACCCTAATCTGCCTACAGAGCAGAAAGGTCGATCTTCACAACCTGGATATGGTGATGCAGGAG
GCATGCATTCTGGAGCCAGGGCAGGTGGATTCTCTCAGATGGCCAATGCTGCAGCAATTGCAGCTGCCTTTGGGGGAGGT
TTGCCTCCTGGCATAACTGGAACAAATGAAAGGTGTACAGTTCTTGTCGCAAATCTCAATCCCGATAGAATAGATGAGGA
TAAAATGTTCAACTTGTTCTCCATTTATGGGAACATTGTCAGAATTAAACTTCTCCGAAATAAGCCAGATCATGCACTTA
TCCAGATGGGAGATGGTTTTCAAGCTGAATTGGCAGTACATTTTCTGAAGGGAGCCATGTTGTTTGGAAAGCGATTGGAG
GTCAACTATTCAAAGCATGCAAACATAACCCAAGGTGCTGATACACACGAGTACGCCAATTCAAATCTCAATCGATTCAA
TCGTAATGCTGCTAAGAACTACCGGTATTGCTGCTCTCCAACAAAGATGATCCACTTGTCCACCCTCCCGCAAGACATAA
CCGAAGAGGAGATTGTGAGCCTTTTGGAGGAGCATGGAACCATTGTCAACAGCAAGGTCTTTGAGATGAATGGAAAAAAG
CAGGCACTTGTTCAGTTTGAGACTGAGGAGCAGGCTACTGAAGCCCTTGTGTGCAAGCATGCAAGCCCACTTTCTGGATC
GGTTGTTCGCATCTCCTTTTCCCAGTTGCAGAATATATGA