
Microexon ID | Zm_7:158113714-158113718:- |
Species | Zea mays | Coordinates | 7:158113714..158113718 |
Microexon Cluster ID | MEP08 |
Size | 5 |
Phase | 1 |
Pfam Domain Motif | Peptidase_C1 |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 52,5,51 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq | TTYGAYGCWMGAACWGMTTGGYCTCADTGYASCACHATTGGRARMATWCTWGATCARGGWCAYTGTGGTTCTTGYTGGGCWTTTGGTGCTGTKGARKCACTRYCWGAT |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Microexon DNA seq | ATCAA |
Microexon Amino Acid seq | DQ |
Microexon-tag DNA Seq | TTTGACGCTAGATCTGCATGGTCCCGTTGCAGCACAATTGGGAACATACTTGATCAAGGTCACTGTGGCTCTTGTTGGGCTTTTGGTGCTGTGGAGTGCCTCCAGGAC |
Microexon-tag Amino Acid Seq | FDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQD |
Microexon-tag spanning region | 158113575-158113888 |
Microexon-tag prediction score | 0.969 |
Overlapped with the annotated transcript (%) | 91.15 |
New Transcript ID | Zm00001d021615_T001x |
Reference Transcript ID | Zm00001d021615_T001 |
Gene ID | Zm00001d021615 |
Gene Name | cysteine protease4 |
Transcript ID | Zm00001d021615_T002 |
Protein ID | Zm00001d021615_P002 |
Gene ID | Zm00001d021615 |
Gene Name | cysteine protease4 |
Pfam domain motif | Unknown |
Motif E-value | NA |
Motif start | NA |
Motif end | NA |
Protein seq | >Zm00001d021615_P002 MGGELLLALLLVSAAAAPQVLGVGNGDNHMRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALS NVPVKTYSRSLELPKEFDARSAWSRCSTIGNILDQYNVLSDDNRVTVALVGLLVLWSASRTVFAFTSTWRVDTEDIAVLI SILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQVWQEK KHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWN RGWGDDGYFKIIRGKNECGIEEGVVAGMPSTKNMVPNFGGAVGRAIV* |
CDS seq | >Zm00001d021615_T002 ATGGGCGGCGAACTGCTGCTTGCGCTCCTCCTCGTCTCTGCTGCTGCCGCCCCTCAGGTACTTGGAGTTGGCAATGGAGA CAATCACATGAGAATCATCCAGGAGGACATCATTGAGACAGTCAACAACCATCCCAGTGCTGGGTGGACAGCCTCACGTA ATCCTTACTTTTCAAACTATACCATTGCGCAATTTAAGCACATACTTGGAGTGAAACCAGCACCACAGAATGCACTAAGC AATGTTCCTGTCAAAACTTATTCAAGATCACTGGAACTTCCAAAAGAGTTTGACGCTAGATCTGCATGGTCCCGTTGCAG CACAATTGGGAACATACTTGATCAATATAATGTCTTGTCTGATGACAACAGGGTCACTGTGGCTCTTGTTGGGCTTTTGG TGCTGTGGAGTGCCTCCAGGACCGTTTTTGCATTCACCTCAACATGGCGAGTGGATACAGAAGACATTGCTGTTCTCATA AGCATTTTACTTTCAGTCAATGACCTACTGGCATGCTGCGGTTTTATGTGCGGCGATGGGTGTGATGGAGGCTATCCTAT AGAGGCATGGCGCTACTTTGTTCAAAATGGTGTTGTTACGGATGAGTGTGATCCATACTTCGACCCGGTCGGTTGCAAGC ATCCTGGATGCGAACCTGCTTATCCTACACCAAAGTGTGAAAAGAAATGCAAGGAGCAGAACCAAGTTTGGCAGGAAAAG AAGCATTTCAGCATTGATGCGTACAGAATAAATTCAGATCCACATGACATAATGGCAGAGGTCTACAAAAATGGTCCTGT AGAAGTTGCTTTCACAGTTTACGAGGATTTCGCACACTACAAATCTGGAGTGTACAAGCACATCACCGGTGGCATTATGG GTGGCCATGCCGTCAAGTTGATTGGATGGGGAACCAGTGATGCTGGAGAGGATTACTGGCTTCTGGCAAATCAGTGGAAT AGAGGCTGGGGCGATGATGGATACTTCAAGATCATAAGGGGCAAAAATGAATGTGGCATCGAGGAGGGCGTTGTTGCTGG AATGCCATCGACAAAGAATATGGTTCCAAACTTCGGCGGTGCCGTTGGAAGAGCTATAGTTTAA |
Microexon DNA seq | ATCAA |
Microexon Amino Acid seq | DQ |
Microexon-tag DNA Seq | TTTGACGCTAGATCTGCATGGTCCCGTTGCAGCACAATTGGGAACATACTTGATCAAGGTCACTGTGGCTCTTGTTGGGCTTTTGGTGCTGTGGAGTGCCTCCAGGAC |
Microexon-tag Amino Acid seq | FDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQD |
Transcript ID | Zm.31690.1 |
Gene ID | Zm.31690 |
Gene Name | cysteine protease4 |
Pfam domain motif | Peptidase_C1 |
Motif E-value | 3.4e-66 |
Motif start | 93 |
Motif end | 327 |
Protein seq | >Zm.31690.1 MGGELLLALLLVSAAAAPQVLGVGNGDNHMRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALS NVPVKTYSRSLELPKEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDG CDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAE VYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI EEGVVAGMPSTKNMVPNFGGAVGRAIV* |
CDS seq | >Zm.31690.1 ATGGGCGGCGAACTGCTGCTTGCGCTCCTCCTCGTCTCTGCTGCTGCCGCCCCTCAGGTACTTGGAGTTGGCAATGGAGA CAATCACATGAGAATCATCCAGGAGGACATCATTGAGACAGTCAACAACCATCCCAGTGCTGGGTGGACAGCCTCACGTA ATCCTTACTTTTCAAACTATACCATTGCGCAATTTAAGCACATACTTGGAGTGAAACCAGCACCACAGAATGCACTAAGC AATGTTCCTGTCAAAACTTATTCAAGATCACTGGAACTTCCAAAAGAGTTTGACGCTAGATCTGCATGGTCCCGTTGCAG CACAATTGGGAACATACTTGATCAAGGTCACTGTGGCTCTTGTTGGGCTTTTGGTGCTGTGGAGTGCCTCCAGGACCGTT TTTGCATTCACCTCAACATGAGCATTTTACTTTCAGTCAATGACCTACTGGCATGCTGCGGTTTTATGTGCGGCGATGGG TGTGATGGAGGCTATCCTATAGAGGCATGGCGCTACTTTGTTCAAAATGGTGTTGTTACGGATGAGTGTGATCCATACTT CGACCCGGTCGGTTGCAAGCATCCTGGATGCGAACCTGCTTATCCTACACCAAAGTGTGAAAAGAAATGCAAGGAGCAGA ACCAAGTTTGGCAGGAAAAGAAGCATTTCAGCATTGATGCGTACAGAATAAATTCAGATCCACATGACATAATGGCAGAG GTCTACAAAAATGGTCCTGTAGAAGTTGCTTTCACAGTTTACGAGGATTTCGCACACTACAAATCTGGAGTGTACAAGCA CATCACCGGTGGCATTATGGGTGGCCATGCCGTCAAGTTGATTGGATGGGGAACCAGTGATGCTGGAGAGGATTACTGGC TTCTGGCAAATCAGTGGAATAGAGGCTGGGGCGATGATGGATACTTCAAGATCATAAGGGGCAAAAATGAATGTGGCATC GAGGAGGGCGTTGTTGCTGGAATGCCATCGACAAAGAATATGGTTCCAAACTTCGGCGGTGCCGTTGGAAGAGCTATAGT TTAA |