Microexon ID Gm_19:47564164-47564168:-
Species Glycine max
Coordinates 19:47564164..47564168
Microexon Cluster ID MEP08
Size 5
Phase 1
Pfam Domain Motif Peptidase_C1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 52,5,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TTYGAYGCWMGAACWGMTTGGYCTCADTGYASCACHATTGGRARMATWCTWGATCARGGWCAYTGTGGTTCTTGYTGGGCWTTTGGTGCTGTKGARKCACTRYCWGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCAG
Microexon Amino Acid seq DQ
Microexon-tag DNA Seq TTTGATGCAAGGACAGCTTGGTCTCAGTGTAGCACTATTGGAAGAATTCTAGATCAGGGTCACTGTGGTTCTTGTTGGGCATTTGGTGCTGTTGAATCATTGTCAGAT
Microexon-tag Amino Acid Seq FDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSD
Microexon-tag spanning region47564021-47564342
Microexon-tag prediction score0.9882
Overlapped with the annotated transcript (%) 100
New Transcript ID KRG96636x
Reference Transcript ID KRG96636
Gene ID GLYMA_19G223300
Gene Name NA
Transcript ID KRG96636
Protein ID KRG96636
Gene ID GLYMA_19G223300
Gene Name NA
Pfam domain motif Peptidase_C1
Motif E-value 3.3e-66
Motif start 100
Motif end 333
Protein seq >KRG96636
MASTLLPLATFFLVLSASYLQIAGAKAQPLTSLKLNSPILQESIAKEINENPEAGWEAAINPHFSNYTVEQFKRLLGVKP
TPKKELRSTPAISHPKSLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACC
GFLCGSGCDGGYPLYAWQYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVNAYRVSSD
PHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGEDYWLLANQWNREWGDDGYFKIRR
GTNECGIEEDVTAGLPSTKNLVREVTDMDADAAVSF*
CDS seq >KRG96636
ATGGCTTCAACTCTGCTTCCCCTGGCAACCTTCTTCTTAGTTCTCTCCGCTTCTTATCTCCAGATTGCTGGGGCGAAAGC
ACAACCGTTGACTAGTCTCAAGCTTAACTCTCCGATCCTTCAGGAGTCTATTGCTAAAGAGATCAATGAAAATCCTGAGG
CAGGATGGGAGGCTGCTATAAATCCTCATTTCTCCAATTATACAGTTGAACAATTTAAGCGCCTTCTTGGAGTCAAACCA
ACGCCTAAGAAGGAACTGAGAAGTACACCTGCTATATCTCACCCAAAGTCATTGAAATTGCCAAAGAATTTTGATGCAAG
GACAGCTTGGTCTCAGTGTAGCACTATTGGAAGAATTCTAGATCAGGGTCACTGTGGTTCTTGTTGGGCATTTGGTGCTG
TTGAATCATTGTCAGATCGCTTTTGCATTCATTTTGATGTAAATATCTCTCTCTCTGTTAATGACCTTCTTGCATGCTGT
GGCTTTCTGTGTGGATCTGGTTGTGATGGGGGATATCCCTTGTATGCGTGGCAATACTTAGCCCACCACGGTGTTGTCAC
TGAAGAGTGCGACCCATATTTTGATCAAATTGGCTGTTCTCATCCTGGTTGTGAGCCAGCTTACCGGACTCCCAAGTGTG
TTAAAAAGTGTGTAAGTGGGAACCAAGTTTGGAAGAAGTCAAAACACTATAGTGTCAATGCATACAGAGTGAGCTCTGAT
CCCCATGATATCATGACGGAAGTTTACAAGAATGGGCCAGTTGAAGTTGCATTCACTGTTTATGAGGATTTTGCTCACTA
CAAATCAGGAGTTTACAAACACATCACAGGTTATGAACTAGGTGGTCATGCAGTAAAGCTAATTGGATGGGGAACAACTG
AAGATGGGGAGGATTATTGGCTTCTTGCAAATCAGTGGAACAGAGAATGGGGAGATGATGGTTACTTCAAGATCCGAAGA
GGGACAAACGAATGTGGGATTGAAGAGGATGTAACTGCTGGTTTGCCTTCCACCAAAAACCTCGTCAGAGAGGTGACTGA
TATGGATGCTGACGCTGCTGTTTCATTCTGA
Microexon DNA seq ATCAG
Microexon Amino Acid seq DQ
Microexon-tag DNA Seq TTTGATGCAAGGACAGCTTGGTCTCAGTGTAGCACTATTGGAAGAATTCTAGATCAGGGTCACTGTGGTTCTTGTTGGGCATTTGGTGCTGTTGAATCATTGTCAGAT
Microexon-tag Amino Acid seq FDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSD
Transcript ID KRG96636
Gene ID Gm.28973
Gene Name NA
Pfam domain motif Peptidase_C1
Motif E-value 3.3e-66
Motif start 100
Motif end 333
Protein seq >KRG96636
MASTLLPLATFFLVLSASYLQIAGAKAQPLTSLKLNSPILQESIAKEINENPEAGWEAAINPHFSNYTVEQFKRLLGVKP
TPKKELRSTPAISHPKSLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACC
GFLCGSGCDGGYPLYAWQYLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVNAYRVSSD
PHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGEDYWLLANQWNREWGDDGYFKIRR
GTNECGIEEDVTAGLPSTKNLVREVTDMDADAAVSF*
CDS seq >KRG96636
ATGGCTTCAACTCTGCTTCCCCTGGCAACCTTCTTCTTAGTTCTCTCCGCTTCTTATCTCCAGATTGCTGGGGCGAAAGC
ACAACCGTTGACTAGTCTCAAGCTTAACTCTCCGATCCTTCAGGAGTCTATTGCTAAAGAGATCAATGAAAATCCTGAGG
CAGGATGGGAGGCTGCTATAAATCCTCATTTCTCCAATTATACAGTTGAACAATTTAAGCGCCTTCTTGGAGTCAAACCA
ACGCCTAAGAAGGAACTGAGAAGTACACCTGCTATATCTCACCCAAAGTCATTGAAATTGCCAAAGAATTTTGATGCAAG
GACAGCTTGGTCTCAGTGTAGCACTATTGGAAGAATTCTAGATCAGGGTCACTGTGGTTCTTGTTGGGCATTTGGTGCTG
TTGAATCATTGTCAGATCGCTTTTGCATTCATTTTGATGTAAATATCTCTCTCTCTGTTAATGACCTTCTTGCATGCTGT
GGCTTTCTGTGTGGATCTGGTTGTGATGGGGGATATCCCTTGTATGCGTGGCAATACTTAGCCCACCACGGTGTTGTCAC
TGAAGAGTGCGACCCATATTTTGATCAAATTGGCTGTTCTCATCCTGGTTGTGAGCCAGCTTACCGGACTCCCAAGTGTG
TTAAAAAGTGTGTAAGTGGGAACCAAGTTTGGAAGAAGTCAAAACACTATAGTGTCAATGCATACAGAGTGAGCTCTGAT
CCCCATGATATCATGACGGAAGTTTACAAGAATGGGCCAGTTGAAGTTGCATTCACTGTTTATGAGGATTTTGCTCACTA
CAAATCAGGAGTTTACAAACACATCACAGGTTATGAACTAGGTGGTCATGCAGTAAAGCTAATTGGATGGGGAACAACTG
AAGATGGGGAGGATTATTGGCTTCTTGCAAATCAGTGGAACAGAGAATGGGGAGATGATGGTTACTTCAAGATCCGAAGA
GGGACAAACGAATGTGGGATTGAAGAGGATGTAACTGCTGGTTTGCCTTCCACCAAAAACCTCGTCAGAGAGGTGACTGA
TATGGATGCTGACGCTGCTGTTTCATTCTGA