Microexon ID Zm_7:158113714-158113718:-
Species Zea mays
Coordinates 7:158113714..158113718
Microexon Cluster ID MEP08
Size 5
Phase 1
Pfam Domain Motif Peptidase_C1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 52,5,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TTYGAYGCWMGAACWGMTTGGYCTCADTGYASCACHATTGGRARMATWCTWGATCARGGWCAYTGTGGTTCTTGYTGGGCWTTTGGTGCTGTKGARKCACTRYCWGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCAA
Microexon Amino Acid seq DQ
Microexon-tag DNA Seq TTTGACGCTAGATCTGCATGGTCCCGTTGCAGCACAATTGGGAACATACTTGATCAAGGTCACTGTGGCTCTTGTTGGGCTTTTGGTGCTGTGGAGTGCCTCCAGGAC
Microexon-tag Amino Acid Seq FDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQD
Microexon-tag spanning region158113575-158113888
Microexon-tag prediction score0.969
Overlapped with the annotated transcript (%) 91.15
New Transcript ID Zm00001d021615_T001x
Reference Transcript ID Zm00001d021615_T001
Gene ID Zm00001d021615
Gene Name cysteine protease4
Transcript ID Zm00001d021615_T002
Protein ID Zm00001d021615_P002
Gene ID Zm00001d021615
Gene Name cysteine protease4
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Zm00001d021615_P002
MGGELLLALLLVSAAAAPQVLGVGNGDNHMRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALS
NVPVKTYSRSLELPKEFDARSAWSRCSTIGNILDQYNVLSDDNRVTVALVGLLVLWSASRTVFAFTSTWRVDTEDIAVLI
SILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQVWQEK
KHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWN
RGWGDDGYFKIIRGKNECGIEEGVVAGMPSTKNMVPNFGGAVGRAIV*
CDS seq >Zm00001d021615_T002
ATGGGCGGCGAACTGCTGCTTGCGCTCCTCCTCGTCTCTGCTGCTGCCGCCCCTCAGGTACTTGGAGTTGGCAATGGAGA
CAATCACATGAGAATCATCCAGGAGGACATCATTGAGACAGTCAACAACCATCCCAGTGCTGGGTGGACAGCCTCACGTA
ATCCTTACTTTTCAAACTATACCATTGCGCAATTTAAGCACATACTTGGAGTGAAACCAGCACCACAGAATGCACTAAGC
AATGTTCCTGTCAAAACTTATTCAAGATCACTGGAACTTCCAAAAGAGTTTGACGCTAGATCTGCATGGTCCCGTTGCAG
CACAATTGGGAACATACTTGATCAATATAATGTCTTGTCTGATGACAACAGGGTCACTGTGGCTCTTGTTGGGCTTTTGG
TGCTGTGGAGTGCCTCCAGGACCGTTTTTGCATTCACCTCAACATGGCGAGTGGATACAGAAGACATTGCTGTTCTCATA
AGCATTTTACTTTCAGTCAATGACCTACTGGCATGCTGCGGTTTTATGTGCGGCGATGGGTGTGATGGAGGCTATCCTAT
AGAGGCATGGCGCTACTTTGTTCAAAATGGTGTTGTTACGGATGAGTGTGATCCATACTTCGACCCGGTCGGTTGCAAGC
ATCCTGGATGCGAACCTGCTTATCCTACACCAAAGTGTGAAAAGAAATGCAAGGAGCAGAACCAAGTTTGGCAGGAAAAG
AAGCATTTCAGCATTGATGCGTACAGAATAAATTCAGATCCACATGACATAATGGCAGAGGTCTACAAAAATGGTCCTGT
AGAAGTTGCTTTCACAGTTTACGAGGATTTCGCACACTACAAATCTGGAGTGTACAAGCACATCACCGGTGGCATTATGG
GTGGCCATGCCGTCAAGTTGATTGGATGGGGAACCAGTGATGCTGGAGAGGATTACTGGCTTCTGGCAAATCAGTGGAAT
AGAGGCTGGGGCGATGATGGATACTTCAAGATCATAAGGGGCAAAAATGAATGTGGCATCGAGGAGGGCGTTGTTGCTGG
AATGCCATCGACAAAGAATATGGTTCCAAACTTCGGCGGTGCCGTTGGAAGAGCTATAGTTTAA
Microexon DNA seq ATCAA
Microexon Amino Acid seq DQ
Microexon-tag DNA Seq TTTGACGCTAGATCTGCATGGTCCCGTTGCAGCACAATTGGGAACATACTTGATCAAGGTCACTGTGGCTCTTGTTGGGCTTTTGGTGCTGTGGAGTGCCTCCAGGAC
Microexon-tag Amino Acid seq FDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQD
Transcript ID Zm.31690.1
Gene ID Zm.31690
Gene Name cysteine protease4
Pfam domain motif Peptidase_C1
Motif E-value 3.4e-66
Motif start 93
Motif end 327
Protein seq >Zm.31690.1
MGGELLLALLLVSAAAAPQVLGVGNGDNHMRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALS
NVPVKTYSRSLELPKEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDG
CDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAE
VYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI
EEGVVAGMPSTKNMVPNFGGAVGRAIV*
CDS seq >Zm.31690.1
ATGGGCGGCGAACTGCTGCTTGCGCTCCTCCTCGTCTCTGCTGCTGCCGCCCCTCAGGTACTTGGAGTTGGCAATGGAGA
CAATCACATGAGAATCATCCAGGAGGACATCATTGAGACAGTCAACAACCATCCCAGTGCTGGGTGGACAGCCTCACGTA
ATCCTTACTTTTCAAACTATACCATTGCGCAATTTAAGCACATACTTGGAGTGAAACCAGCACCACAGAATGCACTAAGC
AATGTTCCTGTCAAAACTTATTCAAGATCACTGGAACTTCCAAAAGAGTTTGACGCTAGATCTGCATGGTCCCGTTGCAG
CACAATTGGGAACATACTTGATCAAGGTCACTGTGGCTCTTGTTGGGCTTTTGGTGCTGTGGAGTGCCTCCAGGACCGTT
TTTGCATTCACCTCAACATGAGCATTTTACTTTCAGTCAATGACCTACTGGCATGCTGCGGTTTTATGTGCGGCGATGGG
TGTGATGGAGGCTATCCTATAGAGGCATGGCGCTACTTTGTTCAAAATGGTGTTGTTACGGATGAGTGTGATCCATACTT
CGACCCGGTCGGTTGCAAGCATCCTGGATGCGAACCTGCTTATCCTACACCAAAGTGTGAAAAGAAATGCAAGGAGCAGA
ACCAAGTTTGGCAGGAAAAGAAGCATTTCAGCATTGATGCGTACAGAATAAATTCAGATCCACATGACATAATGGCAGAG
GTCTACAAAAATGGTCCTGTAGAAGTTGCTTTCACAGTTTACGAGGATTTCGCACACTACAAATCTGGAGTGTACAAGCA
CATCACCGGTGGCATTATGGGTGGCCATGCCGTCAAGTTGATTGGATGGGGAACCAGTGATGCTGGAGAGGATTACTGGC
TTCTGGCAAATCAGTGGAATAGAGGCTGGGGCGATGATGGATACTTCAAGATCATAAGGGGCAAAAATGAATGTGGCATC
GAGGAGGGCGTTGTTGCTGGAATGCCATCGACAAAGAATATGGTTCCAAACTTCGGCGGTGCCGTTGGAAGAGCTATAGT
TTAA