Microexon ID Os_4:33943054-33943062:-
Species Oryza sativa
Coordinates 4:33943054..33943062
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag consensus DNA Seq (IUPAC ambiguity codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Microexon DNA seq ATCCGAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CGTGACCGAACGGCGTACCACTTCCAACCCGCCAAGAACTGGCAGAACGATCCGAATGGGCCTGTGTACTACAATGGCATGTACCACCTCTTCTACCAGTACAACCCG
Microexon-tag Amino Acid Seq RDRTAYHFQPAKNWQNDPNGPVYYNGMYHLFYQYNP
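The fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming that the 49,9,50 structure gives the lengths of the upstream flanking exon, the microexon, and the downstream flanking exon in that order, and that the microexon-tag is translated from its first nucleotide. The names split_tag and translate are illustrative helpers, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Values copied from the record.
TAG = ("CGTGACCGAACGGCGTACCACTTCCAACCCGCCAAGAACTGGCAGAACGATCCGAATGGG"
       "CCTGTGTACTACAATGGCATGTACCACCTCTTCTACCAGTACAACCCG")
SIZES = (49, 9, 50)  # assumed order: upstream exon, microexon, downstream exon


def translate(dna):
    """Translate complete codons of dna, starting at its first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Split the tag into (upstream, microexon, downstream) by segment size."""
    up, micro, down = sizes
    if len(tag) != up + micro + down:
        raise ValueError("segment sizes do not add up to the tag length")
    return tag[:up], tag[up:up + micro], tag[up + micro:]


upstream, microexon, downstream = split_tag(TAG, SIZES)
tag_protein = translate(TAG)

# Offset of the microexon start within a codon (0, 1 or 2).
codon_offset = len(upstream) % 3

# Residues whose codons contain at least one microexon base.
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
microexon_residues = tag_protein[first_codon:last_codon + 1]

print(microexon, tag_protein, codon_offset, microexon_residues)
```

Under these assumptions the slice reproduces the listed microexon DNA seq, the translation reproduces the listed microexon-tag amino acid seq, and the codon offset of 1 agrees with the Phase field. Because the 9-nt microexon starts inside a codon, it touches four codons, which is why its amino acid seq has four residues.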
Microexon-tag spanning region 33942892-33943256
Microexon-tag prediction score 0.9698
Overlapped with the annotated transcript (%) 100
New Transcript ID Os04t0664800-00x
Reference Transcript ID Os04t0664800-00
Gene ID Os04g0664800
Gene Name OsCIN6
Transcript ID Os04t0664800-00
Protein ID Os04t0664800-00
Gene ID Os04g0664800
Gene Name OsCIN6
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.1e-101
Motif start 42
Motif end 371
Protein seq >Os04t0664800-00
MALAGLPLSVFAIAVHFCLVFSSSSSPPVCPANGHRDRTAYHFQPAKNWQNDPNGPVYYNGMYHLFYQYNPHGALWDVGN
LSWGHSVSGDLVNWAALDNALDPTAPFDANGCASGSVTILPDGVPVVMYSGIDARRRQVQNVAFPKNPRDPLLREWTKPG
YNPVIPVPADVSPDNFRDPTTAWLGSDGLWRFAISAVADGVGATLVYRSADFLRWERNAAPLHASRDAVMAECPDLFPVA
EHGEDGLDLDASAIGGAGAGVRHVLKVSMPDTLEDYYMVGRYDDADDTFTVPPEDLEAHGDDYRRWRRIDHGHLYASKTF
YDAGKKRRVLWAWVNESDSEADDVTKGWSGLQSFPRAVWLDEGGRQLVQWPVEEIETLRRKRGVLLGGNEVEAGGLREIG
GIAGSQADVEVAFEIASLAGADRLEPDHLRDPDALCGENGAAVHGGIGPFGLLVMASGDLRERTAVFFRVFRLSHGYTVL
MCTDLTRSTSRAGVYKPSHGGFVDIDIEKDRAISLRTLIDHSIVESFGGGGRTCMTARVYPEHVATGSSHLYVFNNASDA
VKVSKLEAWELATASVNAGDDGLISYGGPVCAAQVQ*
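To relate the microexon to the Pfam motif coordinates, the tag peptide can be located in the protein sequence. The following is a minimal Python sketch, assuming that Motif start and Motif end are 1-based, inclusive residue positions. Only the first 80 residues of the protein are included, since the tag lies within them; locate is an illustrative helper.

```python
# First 80 residues of Os04t0664800-00, copied from the record (prefix only).
PROTEIN_PREFIX = (
    "MALAGLPLSVFAIAVHFCLVFSSSSSPPVCPANGHRDRTAYHFQPAKNWQNDPNGPVYYNGMYHLFYQYNPHGALWDVGN"
)
TAG_PEPTIDE = "RDRTAYHFQPAKNWQNDPNGPVYYNGMYHLFYQYNP"
MICROEXON_PEPTIDE = "DPNG"
MOTIF_START, MOTIF_END = 42, 371  # assumed 1-based, inclusive


def locate(peptide, protein):
    """Return the 1-based inclusive (start, end) of the first match."""
    idx = protein.find(peptide)
    if idx < 0:
        raise ValueError("peptide not found in protein")
    return idx + 1, idx + len(peptide)


tag_span = locate(TAG_PEPTIDE, PROTEIN_PREFIX)

# Position of the microexon-encoded residues, via their offset in the tag.
offset = TAG_PEPTIDE.find(MICROEXON_PEPTIDE)
micro_span = (tag_span[0] + offset,
              tag_span[0] + offset + len(MICROEXON_PEPTIDE) - 1)

micro_in_motif = MOTIF_START <= micro_span[0] and micro_span[1] <= MOTIF_END
tag_in_motif = MOTIF_START <= tag_span[0] and tag_span[1] <= MOTIF_END

print(tag_span, micro_span, micro_in_motif, tag_in_motif)
```

Under these assumptions the tag peptide spans residues 36 to 71 and the microexon-encoded residues span 52 to 55, so the microexon lies inside the Glyco_hydro_32N motif while the tag begins a few residues upstream of the motif start.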
CDS seq >Os04t0664800-00
ATGGCGTTGGCTGGATTGCCCCTCTCCGTCTTTGCCATCGCTGTCCATTTTTGCCTTGTTTTCTCCTCCTCCTCTTCTCC
TCCGGTTTGCCCTGCAAACGGCCACCGTGACCGAACGGCGTACCACTTCCAACCCGCCAAGAACTGGCAGAACGATCCGA
ATGGGCCTGTGTACTACAATGGCATGTACCACCTCTTCTACCAGTACAACCCGCACGGCGCGCTCTGGGACGTCGGCAAC
CTCTCCTGGGGGCACTCCGTCTCCGGCGACCTCGTGAACTGGGCCGCCCTCGACAACGCGCTCGATCCCACAGCGCCATT
CGACGCCAATGGTTGCGCGTCGGGGTCAGTCACCATCCTCCCCGACGGCGTGCCCGTCGTCATGTACTCCGGCATCGACG
CCCGCCGCCGGCAGGTCCAGAACGTCGCGTTCCCCAAGAACCCTCGCGACCCGCTCCTCCGCGAGTGGACCAAGCCCGGG
TACAACCCGGTCATCCCTGTCCCCGCCGACGTCTCGCCGGACAATTTCCGGGACCCCACCACCGCCTGGCTCGGCAGCGA
CGGCCTGTGGCGGTTCGCCATCTCCGCCGTGGCCGACGGCGTGGGCGCGACGCTCGTGTACCGGAGCGCCGACTTCCTGC
GGTGGGAGCGCAACGCGGCGCCGCTGCACGCCTCGCGGGACGCGGTCATGGCCGAGTGCCCCGACCTGTTCCCCGTCGCC
GAGCACGGCGAGGACGGGCTCGACCTCGACGCGTCGGCGATCGGCGGCGCCGGCGCCGGCGTGAGGCACGTCCTCAAGGT
CAGCATGCCGGACACCCTCGAGGACTACTACATGGTCGGACGGTACGACGACGCGGACGACACGTTCACCGTGCCGCCGG
AGGACCTGGAAGCCCACGGCGATGACTACCGGCGGTGGCGGCGGATCGACCACGGCCACCTGTACGCGTCCAAGACGTTC
TACGACGCGGGCAAGAAACGGCGCGTGCTGTGGGCGTGGGTGAACGAGTCCGACAGCGAGGCCGACGACGTCACCAAGGG
CTGGTCCGGCCTTCAGTCGTTTCCGCGGGCGGTGTGGCTGGACGAGGGCGGGAGGCAGCTGGTGCAGTGGCCGGTGGAGG
AGATCGAGACGCTGAGGCGGAAACGCGGCGTTCTGCTCGGCGGAAACGAGGTGGAGGCGGGCGGGCTGCGCGAGATCGGC
GGCATCGCGGGCTCGCAGGCGGACGTGGAGGTAGCGTTCGAGATCGCGAGCCTCGCGGGCGCCGACCGCCTCGAGCCCGA
CCATTTGCGTGACCCCGACGCGCTGTGCGGGGAGAATGGCGCGGCGGTGCACGGCGGAATCGGCCCGTTCGGGCTGCTCG
TCATGGCATCCGGCGACCTGCGCGAGCGCACCGCCGTTTTCTTCAGGGTGTTCAGGCTCTCGCACGGGTACACGGTCCTC
ATGTGCACGGACCTGACACGGTCAACTTCAAGAGCAGGGGTGTACAAGCCATCCCACGGAGGATTCGTGGACATAGACAT
AGAGAAGGACAGGGCTATATCGCTTCGAACCCTGATCGATCATTCGATCGTGGAGAGTTTTGGCGGTGGGGGGCGGACGT
GCATGACGGCTCGAGTGTACCCTGAACATGTAGCAACGGGGAGCAGCCACCTGTACGTGTTCAACAATGCGTCAGATGCG
GTGAAGGTGTCCAAGCTGGAGGCATGGGAGCTCGCGACGGCTAGTGTCAATGCTGGAGACGACGGGCTGATTTCGTATGG
TGGTCCTGTATGTGCTGCTCAAGTGCAGTAA
Transcript ID Os04t0664800-00
Gene ID Os.24503
Gene Name OsCIN6