Microexon ID Os_4:26766100-26766108:+
Species Oryza sativa
Coordinates 4:26766100..26766108
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCGAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAGCGCACGGCGTTCCACTTCCAGCCCCCCAACAATTGGATGAACGATCCGAACGGTCCACTGTACTACAAGGGATGGTACCATCTGTTCTACCAATGGAACCCG
Microexon-tag Amino Acid Seq WQRTAFHFQPPNNWMNDPNGPLYYKGWYHLFYQWNP
Microexon-tag spanning region 26765867-26766288
Microexon-tag prediction score 0.9719
Overlapped with the annotated transcript (%) 100
New Transcript ID Os04t0535600-01x
Reference Transcript ID Os04t0535600-01
Gene ID Os04g0535600
Gene Name OSINV2
Transcript ID Os04t0535600-01
Protein ID Os04t0535600-01
Pfam domain motif Glyco_hydro_32N
Motif E-value 5.5e-109
Motif start 125
Motif end 448
Protein seq >Os04t0535600-01
MIPAISPMMDGAAPLLPETSPESRQQRDPERGKRRTPVLPAVVASAVVLLGLAALFLVYGFHDGGDGRAAVLAPGTVEVA
ASSSRGVVEGVSEKSTTPALRLGGGAVRDYAWTNSMLSWQRTAFHFQPPNNWMNDPNGPLYYKGWYHLFYQWNPDSAVWG
NITWGHAVSRDLIHWLHLPLAMVPDHWYDINGVWTGSATQLPDGRIVMLYTGATEESVQVQNLAEPADPNDPLLREWSKA
EANPVLVPPPGIGLTDFRDPTTAWRNPADSAWRITIGSKDRDHAGLALVYKTEDFLHYDLLPTLLHVVKGTGMWECVDLY
PVSTSPAVEDGLETSTPPGPGVKHVLKASLDDDRNDYYAIGTYDGETDTWTPDNADIDVGIGLRYDYGKFYASKTFYDPV
GRRRVLWGWIGETDSERADILKGWASLQSIPRTVMLDTKTGSNLLQWPVVEVENLRMRGKSFDGLDVSPGSVVPLDVGKA
TQLDIEAVFEVDTSAADGVVTEAGAAAYSCGTGGGAVGRGLMGPFGLLVLADDQLSERTAVFFYLVKGVDGNLTTFFCQD
ELRSSKANDLVKRVYGSLVPVLDGENLSIRILVDHSIVEGFAQGGRTCITSRVYPTKAIYESAKIFLFNNATNVRVTAKS
LKIWELNSAYIRPYVD*
CDS seq >Os04t0535600-01
ATGATCCCGGCCATCTCCCCGATGATGGACGGCGCCGCGCCTCTGCTTCCCGAGACGAGCCCCGAGAGCCGCCAGCAACG
TGATCCGGAGAGGGGGAAGAGGCGGACGCCGGTCCTCCCGGCCGTCGTCGCGTCCGCCGTGGTCCTTCTCGGCCTCGCGG
CGCTCTTCCTGGTGTATGGATTTCACGACGGTGGAGACGGGAGGGCGGCCGTACTCGCGCCCGGCACCGTGGAGGTCGCG
GCCTCGTCGTCCCGCGGCGTCGTGGAGGGCGTCTCGGAGAAGTCCACCACCCCGGCGCTGCGCCTCGGCGGCGGCGCGGT
CCGGGACTACGCCTGGACCAACTCGATGCTGTCCTGGCAGCGCACGGCGTTCCACTTCCAGCCCCCCAACAATTGGATGA
ACGATCCGAACGGTCCACTGTACTACAAGGGATGGTACCATCTGTTCTACCAATGGAACCCGGACTCGGCCGTGTGGGGG
AACATCACCTGGGGCCATGCCGTGTCGCGCGATCTCATCCACTGGCTGCACCTGCCGCTCGCCATGGTGCCCGACCACTG
GTACGACATCAACGGCGTCTGGACCGGCTCGGCGACGCAGCTGCCCGACGGCCGGATCGTCATGCTCTACACCGGCGCCA
CGGAGGAGTCGGTGCAGGTGCAGAACCTGGCCGAGCCGGCCGACCCGAACGACCCGCTGCTGCGGGAATGGAGCAAGGCG
GAGGCGAACCCGGTGCTGGTGCCGCCGCCGGGCATCGGGCTGACGGACTTCCGCGACCCGACGACGGCGTGGCGCAACCC
GGCCGACTCGGCGTGGCGGATCACCATCGGGTCCAAGGACCGCGACCACGCGGGGCTGGCGCTCGTGTACAAGACGGAGG
ACTTCCTGCACTACGACCTGCTGCCCACGCTGCTGCACGTCGTCAAGGGCACCGGCATGTGGGAGTGCGTGGACTTGTAC
CCGGTGTCCACCTCGCCGGCCGTCGAGGACGGGCTCGAGACGTCCACCCCGCCGGGGCCCGGCGTGAAGCACGTACTCAA
GGCCAGCCTCGACGACGACAGAAACGACTACTACGCCATCGGCACCTACGACGGCGAGACCGATACCTGGACGCCGGACA
ACGCCGACATCGACGTCGGGATCGGGCTCCGGTACGACTACGGCAAGTTCTACGCGTCCAAGACGTTCTACGACCCCGTC
GGGCGGCGTCGCGTGCTATGGGGGTGGATCGGCGAGACCGACAGCGAGCGGGCTGACATACTCAAGGGCTGGGCCTCCCT
TCAGTCAATTCCGAGGACAGTTATGCTGGACACGAAGACTGGCAGCAACCTGCTCCAGTGGCCGGTGGTGGAGGTGGAGA
ACCTCCGTATGCGCGGCAAGAGCTTCGACGGCCTCGACGTCTCTCCCGGCTCCGTCGTGCCGTTGGACGTCGGCAAAGCA
ACACAGCTGGACATCGAGGCCGTGTTCGAGGTGGACACTTCGGCAGCCGACGGTGTCGTCACGGAGGCTGGAGCAGCAGC
ATACAGCTGCGGCACGGGCGGCGGTGCGGTTGGCCGTGGCTTGATGGGCCCGTTCGGGCTTCTCGTGCTGGCCGATGATC
AGTTGTCGGAGCGGACGGCTGTCTTCTTCTACCTGGTCAAGGGAGTCGACGGCAACCTTACAACCTTCTTCTGCCAAGAC
GAGCTCAGGTCATCAAAAGCAAACGATCTAGTTAAGAGAGTCTATGGCAGCTTGGTACCTGTGCTAGATGGCGAAAACCT
GTCAATAAGGATTTTGGTTGATCACTCCATAGTTGAGGGCTTCGCTCAAGGAGGGAGGACATGCATTACCTCGCGTGTGT
ATCCCACCAAAGCCATCTACGAGTCGGCCAAAATCTTTCTTTTCAACAATGCCACAAATGTTAGAGTCACTGCCAAATCG
CTAAAGATTTGGGAGCTGAACTCTGCTTACATCCGTCCATATGTAGACTAG
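The microexon-tag fields in this record (structure 49,9,50; phase 1; microexon ATCCGAACG; amino acids DPNG) can be cross-checked against one another. The following is a minimal sketch, not the database's own method. It assumes that the tag is read in frame from its first base, that the standard genetic code applies, and that "phase 1" means one base of the interrupted codon lies in the upstream flanking exon. The names TAG, SIZES, translate and touched are hypothetical.

```python
# Standard genetic code in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Values copied from the record above.
TAG = (
    "TGGCAGCGCACGGCGTTCCACTTCCAGCCCCCCAACAATTGGATGAACGATCCGAACGGTCCACTGTAC"
    "TACAAGGGATGGTACCATCTGTTCTACCAATGGAACCCG"
)
SIZES = (49, 9, 50)  # flanking exon, microexon, flanking exon


def translate(dna: str) -> str:
    """Translate complete codons of an in-frame DNA string."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


up_len, micro_len, down_len = SIZES
upstream = TAG[:up_len]
microexon = TAG[up_len:up_len + micro_len]
downstream = TAG[up_len + micro_len:]

# Assumption: phase = number of codon bases supplied by the upstream exon.
phase = up_len % 3

protein = translate(TAG)

# Amino acids whose codons contain at least one microexon base.
first_codon = up_len // 3
last_codon = (up_len + micro_len - 1) // 3
touched = protein[first_codon:last_codon + 1]

print(len(TAG), microexon, phase, touched)
print(protein)
```

Because the 9 nt microexon starts one base into a codon, it contributes to four codons rather than three, which is why the record lists a four-residue amino acid sequence for a 9 nt exon.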
Transcript ID Os04t0535600-01
Gene ID Os.23469
Gene Name OSINV2