Microexon ID Zm_5:173404705-173404713:+
Species Zea mays
Coordinates 5:173404705..173404713
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ACCCCAACG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq CTGCTCAGGACGGGGTACCATTTCCAGCCGCCAATGAACTGGATCAACGACCCCAACGCTCCGCTGTACTACAAAGGATGGTATCACCTGTTCTACCAGTACAACCCC
Microexon-tag Amino Acid Seq MLRTGYHFQPPMNWINDPNAPLYYKGWYHLFYQYNP
Microexon-tag spanning region173404022-173405330
Microexon-tag prediction score0.9614
Overlapped with the annotated transcript (%) 91.67
New Transcript ID Zm00001d016708_T002x
Reference Transcript ID Zm00001d016708_T002
Gene ID Zm00001d016708
Gene Name cell wall invertase1
Zm_5:173404705-173404713:+ does not have available information here.
Microexon DNA seq ACCCCAACG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq CTGCTCAGGACGGGGTACCATTTCCAGCCGCCAATGAACTGGATCAACGACCCCAACGCTCCGCTGTACTACAAAGGATGGTATCACCTGTTCTACCAGTACAACCCC
Microexon-tag Amino Acid seq MLRTGYHFQPPMNWINDPNAPLYYKGWYHLFYQYNP
Transcript ID Zm.24477.3
Gene ID Zm.24477
Gene Name cell wall invertase1
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.4e-103
Motif start 58
Motif end 377
Protein seq >Zm.24477.3
MGTRPRGVVLAPWAVVLVLVLALRLAGASHVIHRSLEAEAAPSVPASIVSPLLRTGYHFQPPMNWINDPNAPLYYKGWYH
LFYQYNPKGAVWGNIVWAHSVSRDLINWVALEPAIYPSIPSDKYGCWSGSATILEDGTPAILYTGIDRADINYQVQVLAL
PKDASDPLLREWEKPEEYNPVATPAAGGINATQFRDPTTAWRHAGHWRMLVGSVRGARGMALVYRSRDFRKWTKAKHPLH
SAALTGMWECPDFFPVSGPGLQAGLDTSAPGRKYVLKSSLDLTRYDYYTIGSYDGGKDRYYPDDPAGDYHHRLRYDYGNY
YASKTFYDPVERRRVLLGWANESDSVTDDKAKGWAGIHAIPRKIWLDPTGKQLLQWPIHEVEKLRGKAVSVDAKLVKPGD
HFEVTGIATYQADVEVSFELELEAGTSLLEKAEAFDPAYDDDAQKLCGVKGADARGGVGPFGLWVLASADLQERTAVFFR
VFRDGHGKPKVLMCTDPTKSSLSPDLYKPTFAGFVDADISSGKITLRSLIDRSVVESFGAGGKTCILSRVYPSIAVGKDA
HLYVFNNGEVDVTVSGLTAWEMKKPLMNGA*
CDS seq >Zm.24477.3
ATGGGGACTCGGCCGCGGGGGGTCGTCCTCGCGCCATGGGCGGTGGTGCTGGTGCTGGTGCTCGCGCTTCGTCTGGCAGG
CGCGTCGCATGTGATCCACCGCAGCCTGGAGGCCGAGGCGGCGCCGTCGGTCCCGGCCTCCATTGTCAGCCCCCTGCTCA
GGACGGGGTACCATTTCCAGCCGCCAATGAACTGGATCAACGACCCCAACGCTCCGCTGTACTACAAAGGATGGTATCAC
CTGTTCTACCAGTACAACCCCAAGGGCGCGGTATGGGGCAACATCGTGTGGGCGCACTCGGTGTCGCGCGACCTGATCAA
CTGGGTGGCGCTGGAGCCCGCCATCTACCCCAGCATCCCGTCCGACAAGTACGGCTGCTGGTCGGGCTCGGCGACGATCC
TGGAGGACGGCACGCCGGCGATCCTGTACACGGGGATCGACCGGGCGGACATCAACTACCAGGTGCAGGTGCTGGCGCTC
CCCAAGGACGCGTCCGACCCGCTGCTCCGCGAGTGGGAGAAGCCGGAGGAGTACAACCCGGTGGCGACGCCGGCGGCCGG
CGGCATCAACGCGACGCAGTTCCGCGACCCGACGACGGCGTGGCGGCACGCGGGGCACTGGCGGATGCTGGTGGGCAGCG
TGCGCGGCGCGCGCGGGATGGCGCTGGTGTACCGGAGCCGGGACTTCAGGAAGTGGACCAAGGCCAAGCACCCGCTGCAC
TCGGCGGCGCTGACGGGGATGTGGGAGTGCCCCGACTTCTTCCCGGTGTCCGGGCCGGGGCTGCAGGCCGGCCTCGACAC
CTCCGCGCCCGGAAGGAAGTACGTGCTCAAGAGCAGCCTGGACCTCACCCGCTACGACTACTACACCATCGGGTCGTACG
ACGGCGGCAAGGACCGGTACTACCCCGACGACCCCGCCGGCGACTACCACCACCGCCTGCGCTACGACTACGGCAACTAC
TACGCGTCCAAGACGTTCTACGACCCCGTGGAGCGCCGCCGCGTGCTGCTCGGGTGGGCCAACGAGTCCGACAGCGTCAC
CGACGACAAGGCCAAGGGCTGGGCCGGCATCCATGCGATCCCAAGGAAGATCTGGCTGGACCCCACCGGGAAGCAGCTGC
TGCAGTGGCCCATCCACGAGGTCGAGAAGCTCAGGGGGAAGGCCGTCAGTGTGGACGCCAAGCTGGTCAAGCCCGGCGAC
CATTTTGAGGTCACAGGCATCGCAACTTATCAGGCTGACGTGGAGGTGAGCTTCGAGCTGGAGCTGGAGGCGGGGACGAG
CCTGCTGGAGAAGGCGGAGGCGTTCGACCCGGCGTACGACGACGACGCGCAGAAGCTGTGCGGAGTCAAGGGCGCGGACG
CCAGGGGCGGCGTGGGGCCATTCGGCCTCTGGGTGCTGGCCTCCGCCGACCTGCAGGAGCGGACGGCCGTCTTCTTCAGG
GTGTTCAGGGACGGACACGGCAAGCCCAAGGTGCTCATGTGCACCGACCCCACCAAGTCGTCTCTTAGTCCGGATCTGTA
CAAGCCGACCTTCGCGGGCTTCGTCGACGCCGACATCTCCAGCGGCAAGATCACCCTTAGAAGCTTGATCGATCGGTCCG
TGGTCGAGAGCTTCGGCGCGGGAGGCAAGACGTGCATCCTGTCACGGGTGTACCCGTCCATCGCCGTCGGGAAGGACGCT
CACCTCTACGTTTTCAACAACGGCGAGGTGGACGTCACGGTGTCCGGCCTGACCGCCTGGGAGATGAAGAAACCGCTGAT
GAACGGCGCCTGA