Microexon ID Os_9:4165840-4165848:+
Species Oryza sativa
Coordinates 9:4165840..4165848
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
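
The Microexon ID appears to pack the same location that the Coordinates and Size fields give separately. A minimal sketch of unpacking it, assuming the layout is `<species prefix>_<chromosome>:<start>-<end>:<strand>` and that the coordinates are 1-based and inclusive (an assumption, but one consistent with Size 9 for 4165840..4165848). The function name `parse_microexon_id` is hypothetical and not part of the database.

```python
import re

# Assumed layout: <prefix>_<chromosome>:<start>-<end>:<strand>
ID_PATTERN = re.compile(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])")

def parse_microexon_id(microexon_id):
    """Split a microexon ID into its parts (hypothetical helper)."""
    match = ID_PATTERN.fullmatch(microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "prefix": prefix,
        "chromosome": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes 1-based inclusive coordinates.
        "size": end - start + 1,
    }

location = parse_microexon_id("Os_9:4165840-4165848:+")
print(location)
```

Under the inclusive-coordinate assumption the computed size agrees with the Size field of this record.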
Microexon DNA seq ATCCGAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq GCCAGAAGAACTGCCTATCACTTCCAGCCTGCAAAGAACTGGCAGAATGATCCGAATGGGCCTATGTACCATAACGGTATGTACCATTTGTTCTACCAGTACAACCCG
Microexon-tag Amino Acid Seq ARRTAYHFQPAKNWQNDPNGPMYHNGMYHLFYQYNP
Microexon-tag spanning region 4165632-4169003
Microexon-tag prediction score 0.9508
Overlapped with the annotated transcript (%) 100
New Transcript ID Os09t0255000-01x
Reference Transcript ID Os09t0255000-01
Gene ID Os09g0255000
Gene Name OSINV1
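
The structure field (49,9,50) and the sequences above can be cross-checked against each other. The sketch below splits the Microexon-tag DNA Seq by those sizes and translates it with the standard genetic code. It assumes the tag is in reading frame 0 from its first nucleotide, which matches the listed amino acid sequence for this record. The helper names `split_tag` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    if pos != len(tag):
        raise ValueError("sizes do not add up to the tag length")
    return parts

def translate(dna):
    """Translate complete codons starting at frame 0 (assumed frame)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

TAG = ("GCCAGAAGAACTGCCTATCACTTCCAGCCTGCAAAGAACTGGCAGAATG"
       "ATCCGAATG"
       "GGCCTATGTACCATAACGGTATGTACCATTTGTTCTACCAGTACAACCCG")

upstream, microexon, downstream = split_tag(TAG, (49, 9, 50))
tag_protein = translate(TAG)

# Residues whose codons overlap the microexon (0-based nucleotides 49..57).
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
microexon_residues = tag_protein[first_codon:last_codon + 1]

print(microexon, tag_protein, microexon_residues)
```

The 9-nt microexon touches four codons because the 49-nt upstream flank leaves one nucleotide of the first codon outside the microexon. This is consistent with Phase 1 if the phase counts codon nucleotides lying upstream of the exon boundary, which is an assumption about the convention used here.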
Transcript ID Os09t0255000-01
Protein ID Os09t0255000-01
Gene ID Os09g0255000
Gene Name OSINV1
Pfam domain motif Glyco_hydro_32N
Motif E-value 6.8e-103
Motif start 44
Motif end 366
Protein seq >Os09t0255000-01
MARLGLAVCAASFHLFLLLASTSSLRRAPTEADTANHARRTAYHFQPAKNWQNDPNGPMYHNGMYHLFYQYNPHSALWDI
GNLSWGHSVSGDLLNWAALDTALDPTSPFDANGCWSGSATILPGALPAILYTGIDASKEQVQNVAFAKNPSDPLLREWEK
PAYNPVIALPADVPGDKFRDPSTAWLGRDGLWRIAVSAEVDGVASTLVYRSKDFVRWERNAAPLHASRAAGMVECPDLFP
VAERGEDGLDTSANGAGGVRHVLKLSVMDTLQDYYMVGTYDDAADAFSPAEPERGDDCRSWRRLDYGHVYASKSFFDVRK
NRRVLWAWANESDSQADDVARGWSGVQTFPRKMWLAKDGKQLLQWPIEEIKTLRRKRAGLWQGTRLGAGAVQEIVGVASS
QADVEVVFKIPSLEEAERVDDPNRLLDPQKLCGEKGAAVRGGVGPFGLLVMASGDLHEHTAVFFRVFRHHDKYKLLMCTD
LTKSSTRAGVYKPAYGGFVDMDIDDHKTISLRTLIDHSVVESFGGGGRACITARVYPEHVATSSSHLYVFNNGSDAVKVA
KLEAWDLATATVNVVVGDHHGLVAPALELEPTRTTQ*
CDS seq >Os09t0255000-01
ATGGCGAGATTAGGCCTTGCCGTCTGCGCCGCCTCCTTCCACCTCTTCCTTCTTCTTGCGTCGACCTCCTCGCTCCGGCG
AGCTCCCACGGAGGCGGACACGGCCAACCATGCCAGAAGAACTGCCTATCACTTCCAGCCTGCAAAGAACTGGCAGAATG
ATCCGAATGGGCCTATGTACCATAACGGTATGTACCATTTGTTCTACCAGTACAACCCGCATAGCGCGCTCTGGGACATC
GGCAACCTCTCCTGGGGCCACTCCGTCTCCGGCGACCTCCTCAACTGGGCCGCTCTCGACACCGCGCTCGACCCCACCTC
GCCTTTCGACGCCAATGGCTGCTGGTCGGGCTCCGCCACCATCCTCCCCGGCGCCCTCCCGGCCATCCTCTACACCGGCA
TCGACGCCAGCAAGGAGCAGGTCCAGAACGTCGCCTTCGCCAAGAACCCCTCCGATCCGCTCCTCCGCGAGTGGGAGAAG
CCCGCGTACAATCCGGTCATCGCGCTCCCGGCCGATGTCCCCGGCGACAAGTTCCGTGACCCCTCGACGGCATGGCTCGG
CCGTGACGGCCTGTGGCGGATCGCCGTCTCCGCCGAGGTGGACGGCGTCGCGTCCACGCTCGTCTACAGGAGCAAGGACT
TCGTCCGGTGGGAGCGGAACGCCGCGCCGCTGCACGCGTCGCGCGCCGCGGGCATGGTGGAGTGCCCGGACCTGTTCCCC
GTAGCGGAGCGCGGCGAGGATGGCCTCGACACGTCGGCGAACGGCGCCGGCGGCGTGCGCCACGTCCTGAAGCTGAGCGT
GATGGACACGCTCCAGGACTACTACATGGTCGGCACGTACGACGACGCGGCAGACGCCTTCTCGCCGGCGGAGCCTGAGC
GCGGCGACGACTGCCGGAGCTGGCGGCGGCTGGACTACGGGCACGTGTACGCGTCCAAGTCGTTCTTCGACGTGCGCAAG
AACCGGCGCGTTCTGTGGGCGTGGGCGAACGAGTCCGACAGCCAGGCCGACGACGTCGCCCGCGGCTGGTCCGGCGTGCA
GACATTCCCGAGGAAGATGTGGCTAGCCAAGGACGGCAAGCAGCTGCTGCAGTGGCCGATCGAGGAGATCAAGACGCTGA
GGAGGAAGCGCGCCGGTCTCTGGCAAGGCACCAGGCTAGGCGCCGGCGCGGTGCAGGAGATCGTCGGCGTGGCGAGCTCG
CAGGCGGACGTGGAGGTGGTGTTCAAGATTCCGAGCCTGGAGGAGGCCGAGAGGGTGGACGACCCCAACCGGCTGCTGGA
TCCCCAGAAGCTGTGCGGTGAGAAGGGCGCCGCCGTGCGGGGCGGCGTCGGCCCGTTCGGGCTCCTCGTAATGGCCTCCG
GCGACCTGCACGAGCACACCGCCGTCTTCTTCAGGGTGTTCAGGCACCATGACAAGTACAAGCTTCTTATGTGCACCGAC
TTGACAAAGTCGTCGACAAGAGCAGGGGTGTACAAGCCGGCTTACGGCGGATTCGTGGACATGGACATCGACGATCACAA
AACCATATCGCTGAGAACGTTGATTGACCATTCGGTGGTGGAGAGCTTCGGCGGCGGCGGGCGGGCGTGCATCACGGCGC
GGGTTTACCCGGAGCACGTGGCGACGAGCAGCAGCCATTTGTACGTGTTCAACAACGGGAGCGACGCCGTTAAGGTGGCC
AAGCTGGAGGCGTGGGACCTGGCGACGGCGACCGTCAACGTTGTTGTTGGAGACCACCACGGCCTTGTTGCGCCGGCGTT
GGAGTTGGAGCCGACACGCACGACGCAGTGA
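
The listed motif range can be compared with the position of the microexon in the protein. A minimal sketch, assuming Motif start and Motif end are 1-based residue positions in the protein sequence above. Only the first 80 residues of the protein (its first line) are embedded, which is enough to contain the whole microexon-tag.

```python
# First line (80 residues) of the protein sequence for Os09t0255000-01.
PROTEIN_PREFIX = (
    "MARLGLAVCAASFHLFLLLASTSSLRRAPTEADTANHARR"
    "TAYHFQPAKNWQNDPNGPMYHNGMYHLFYQYNPHSALWDI"
)
TAG_AA = "ARRTAYHFQPAKNWQNDPNGPMYHNGMYHLFYQYNP"
MICROEXON_AA = "DPNG"
MOTIF_START, MOTIF_END = 44, 366  # assumed 1-based, inclusive

# 1-based position of the tag, then of the microexon residues, in the protein.
tag_start = PROTEIN_PREFIX.index(TAG_AA) + 1
tag_end = tag_start + len(TAG_AA) - 1
microexon_start = tag_start + TAG_AA.index(MICROEXON_AA)
microexon_end = microexon_start + len(MICROEXON_AA) - 1

inside_motif = MOTIF_START <= microexon_start and microexon_end <= MOTIF_END
print(tag_start, tag_end, microexon_start, microexon_end, inside_motif)
```

Under these assumptions the microexon residues lie inside the Glyco_hydro_32N range, while the tag itself begins a few residues upstream of Motif start.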
Transcript ID Os09t0255000-01
Gene ID Os.36827
Gene Name OSINV1
Pfam domain motif Glyco_hydro_32N
Motif E-value 6.9e-103
Motif start 44
Motif end 366