Microexon ID Os_4:20412657-20412665:+
Species Oryza sativa
Coordinates 4:20412657..20412665
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCGAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq AAGCTCCGGACCGGGTACCACTTCCAACCACCCAAGCACTGGATCAACGATCCGAACGGTCCGATGTACTATAAGGGCTTGTACCATCTGTTCTACCAGTACAACCCC
Microexon-tag Amino Acid Seq KLRTGYHFQPPKHWINDPNGPMYYKGLYHLFYQYNP
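The fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it splits the rice microexon-tag by the reported structure (49,9,50), translates it with the standard genetic code, and recovers the residues whose codons overlap the microexon. The helper names (`split_tag`, `translate`) are hypothetical, and the sketch assumes that the tag starts in reading frame 0 and that "Phase" counts the codon bases contributed by the upstream flanking exon.

```python
# Values copied from the record; helper names are illustrative only.
TAG_DNA = (
    "AAGCTCCGGACCGGGTACCACTTCCAACCACCCAAGCACTGGATCAACG"  # 49 nt upstream
    "ATCCGAACG"                                          # 9 nt microexon
    "GTCCGATGTACTATAAGGGCTTGTACCATCTGTTCTACCAGTACAACCCC"  # 50 nt downstream
)
STRUCTURE = (49, 9, 50)  # flanking exon, microexon, flanking exon sizes

# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def split_tag(tag, structure):
    """Cut the tag into (upstream, microexon, downstream) by segment sizes."""
    up, micro, down = structure
    if len(tag) != up + micro + down:
        raise ValueError("tag length does not match the structure")
    return tag[:up], tag[up:up + micro], tag[up + micro:]


def translate(dna):
    """Translate complete codons in frame 0 (assumes unambiguous A/C/G/T)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


upstream, microexon, downstream = split_tag(TAG_DNA, STRUCTURE)
tag_protein = translate(TAG_DNA)

# Codon bases left in the upstream exon (assumed meaning of "Phase").
phase = len(upstream) % 3

# Residues whose codons overlap the microexon, including split codons.
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
microexon_residues = tag_protein[first_codon:last_codon + 1]

print(microexon, phase, microexon_residues)
```

Because the microexon begins one base into a codon, its 9 nt touch four codons, which is why the record lists four residues for a 9 nt exon.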
Microexon-tag spanning region 20412507-20412837
Microexon-tag prediction score 0.9668
Overlapped with the annotated transcript (%) 100
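The coordinate fields can be combined to sketch where each tag segment sits on chromosome 4. This is an illustrative calculation under stated assumptions, not output from the database: coordinates are taken as 1-based and inclusive, the feature is on the + strand, and the spanning region is assumed to begin at the first base of the upstream flanking segment and end at the last base of the downstream one. The function names are hypothetical.

```python
def tag_exon_coordinates(span, microexon, structure):
    """Return (start, end) for the three tag segments on the + strand.

    Assumes 1-based inclusive coordinates and that `span` is bounded by
    the outer ends of the two flanking segments.
    """
    span_start, span_end = span
    micro_start, micro_end = microexon
    up, micro, down = structure
    if micro_end - micro_start + 1 != micro:
        raise ValueError("microexon coordinates disagree with its size")
    return [
        (span_start, span_start + up - 1),   # upstream flanking segment
        (micro_start, micro_end),            # microexon
        (span_end - down + 1, span_end),     # downstream flanking segment
    ]


def intron_lengths(exons):
    """Gap sizes between consecutive segments."""
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(exons, exons[1:])]


exons = tag_exon_coordinates(
    span=(20412507, 20412837),
    microexon=(20412657, 20412665),
    structure=(49, 9, 50),
)
introns = intron_lengths(exons)

span_length = 20412837 - 20412507 + 1
exonic = sum(end - start + 1 for start, end in exons)

print(exons)
print(introns, span_length, exonic)
```

Under these assumptions the 331 nt spanning region splits into 108 exonic bases and two gaps of 101 nt and 122 nt, which agrees with the reported microexon size of 9.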
New Transcript ID Os04t0413200-01x
Reference Transcript ID Os04t0413200-01
Gene ID Os04g0413200
Gene Name OSINV4
Transcript ID Os04t0413200-01
Protein ID Os04t0413200-01
Gene ID Os04g0413200
Gene Name OSINV4
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.3e-107
Motif start 51
Motif end 377
Protein seq >Os04t0413200-01
MATARARAALVFVALLQMAAVVVVRASHVVYPELQSLEAKHVDGKLRTGYHFQPPKHWINDPNGPMYYKGLYHLFYQYNP
KGAVWGNIEWAHSVSTDLIDWTALEPGIYPSKTFDEKGCWSGSATVLPSGVPVIMYTGIDPDERQVQNVAYPVNLSDPYL
REWYKPDYNPIINPDGGINASAFRDPTTAWYGPDGHWRLLVGSKVNMKGLAVLYRSRDFKKWVKAHHPLHSAHTGMWECP
DFFPVAVAGGSRHYRRGVDTAELHDAAVAEEVKYVLKVSLDLTRYEYYTVGWYDHATDRYVPDAAFPDNDYGLRYDYGDF
YASKSFYDPAKRRRIVWGWANESDTVPDDRRKGWAGIQAIPRKLWLSADGKQLVQWPVEELKALRAKHVNVTDKVIKKGN
YFEVTGFKSVQSDVDMAFAIKDLSKAEEFDPAWRTDAEALCKKLGSDVDGGVGPFGLWALASGDLKERTAVFFRVFKAND
SSHVVLMCNDPTRSSYESKIYRPTFAGFVDVDIAKNKQIALRTLIDHSVVESFGARGKTCILTRVYPRKAVGDDAHLFVF
NNGESDVKVTNLDAWEMKTPKMNAEE*
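As a consistency check on the motif fields, the sketch below locates the microexon-tag peptide in the protein and tests whether the microexon residues fall inside the reported Glyco_hydro_32N interval (51 to 377). It is a minimal sketch with hypothetical helper names. Only the first 80 residues of the protein are included, which is enough to contain the tag, and protein positions are assumed to be 1-based and inclusive.

```python
# First 80 residues of Os04t0413200-01, copied from the record.
PROTEIN_PREFIX = (
    "MATARARAALVFVALLQMAAVVVVRASHVVYPELQSLEAKHVDG"
    "KLRTGYHFQPPKHWINDPNGPMYYKGLYHLFYQYNP"
)
TAG_AA = "KLRTGYHFQPPKHWINDPNGPMYYKGLYHLFYQYNP"
MICROEXON_AA = "DPNG"
MOTIF = (51, 377)  # Motif start / Motif end


def locate(sub, seq):
    """Return 1-based inclusive (start, end) of the first match of sub."""
    idx = seq.find(sub)
    if idx < 0:
        raise ValueError(f"{sub!r} not found")
    return idx + 1, idx + len(sub)


tag_start, tag_end = locate(TAG_AA, PROTEIN_PREFIX)

# Place the microexon residues via their offset inside the tag, so an
# unrelated earlier match elsewhere in the protein cannot be picked up.
offset = TAG_AA.find(MICROEXON_AA)
micro_start = tag_start + offset
micro_end = micro_start + len(MICROEXON_AA) - 1

inside_motif = MOTIF[0] <= micro_start and micro_end <= MOTIF[1]

print((tag_start, tag_end), (micro_start, micro_end), inside_motif)
```

With these inputs the tag peptide begins a few residues before the motif start, while the four microexon residues lie within the motif interval.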
CDS seq >Os04t0413200-01
ATGGCGACGGCGAGGGCGAGGGCCGCGCTGGTGTTTGTGGCTCTGCTGCAGATGGCGGCAGTGGTGGTGGTGCGCGCGTC
GCACGTCGTCTACCCGGAGCTCCAGTCGCTGGAGGCGAAGCACGTCGACGGGAAGCTCCGGACCGGGTACCACTTCCAAC
CACCCAAGCACTGGATCAACGATCCGAACGGTCCGATGTACTATAAGGGCTTGTACCATCTGTTCTACCAGTACAACCCC
AAGGGCGCCGTGTGGGGGAACATCGAGTGGGCGCACTCGGTTTCGACGGACCTGATCGACTGGACGGCGCTGGAGCCGGG
GATCTACCCGTCCAAGACGTTCGACGAGAAGGGCTGCTGGTCGGGCTCCGCCACCGTGCTCCCCAGCGGCGTGCCGGTGA
TCATGTACACCGGCATCGACCCCGACGAGCGGCAGGTCCAGAACGTCGCCTACCCGGTGAACCTCTCCGACCCGTACCTC
CGCGAGTGGTACAAGCCCGACTACAACCCCATCATCAACCCGGACGGCGGCATCAACGCCAGCGCGTTCCGCGACCCGAC
CACCGCCTGGTACGGGCCCGACGGCCACTGGCGGCTCCTCGTCGGCAGCAAGGTGAACATGAAGGGGCTCGCCGTGCTGT
ACCGGAGCCGGGACTTCAAGAAGTGGGTCAAGGCGCACCACCCGCTGCACTCGGCGCACACCGGGATGTGGGAGTGCCCG
GACTTCTTCCCCGTCGCTGTGGCGGGCGGGAGCCGCCACTACCGCCGCGGCGTCGACACCGCCGAGCTGCACGACGCCGC
CGTGGCGGAGGAGGTCAAGTACGTGCTCAAGGTGAGCCTCGACCTGACGCGGTACGAGTACTACACCGTCGGCTGGTACG
ACCACGCCACCGACCGGTACGTCCCCGACGCCGCCTTCCCCGACAACGACTACGGCCTCCGCTACGACTACGGCGACTTC
TACGCATCCAAGTCGTTCTACGACCCGGCCAAGCGCCGCCGCATCGTCTGGGGCTGGGCCAACGAGTCCGACACCGTACC
CGACGACCGCCGAAAGGGCTGGGCCGGCATCCAGGCGATACCGAGGAAGCTCTGGCTGTCGGCGGACGGGAAGCAGCTGG
TGCAGTGGCCGGTGGAGGAGCTCAAGGCGCTGCGAGCCAAGCACGTCAATGTCACTGACAAGGTCATCAAGAAGGGCAAC
TACTTCGAGGTCACCGGCTTCAAGTCCGTGCAGTCGGATGTGGATATGGCGTTCGCGATCAAGGACCTGAGCAAGGCGGA
GGAGTTCGACCCGGCGTGGCGGACGGACGCGGAGGCGCTGTGCAAGAAGCTCGGCTCGGACGTCGACGGCGGCGTGGGGC
CGTTCGGGCTGTGGGCGCTGGCCTCCGGCGACCTCAAGGAGAGGACGGCCGTCTTCTTCAGGGTGTTCAAGGCCAACGAC
TCCTCGCACGTCGTCCTCATGTGCAACGACCCTACCAGGTCATCGTACGAGTCGAAGATCTACAGGCCGACCTTCGCCGG
CTTCGTCGACGTCGACATCGCCAAGAACAAACAAATCGCCCTCCGGACATTGATCGATCACTCCGTGGTGGAGAGCTTCG
GGGCGCGCGGCAAGACGTGCATCCTGACGAGGGTGTACCCGAGGAAAGCCGTCGGCGACGACGCGCACCTCTTCGTCTTC
AACAACGGCGAGTCGGACGTCAAGGTCACCAACCTGGACGCCTGGGAGATGAAGACCCCGAAGATGAACGCGGAGGAGTA
G
Microexon DNA seq ATCCGAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq AAGCTCCGGACCGGGTACCACTTCCAACCACCCAAGCACTGGATCAACGATCCGAACGGTCCGATGTACTATAAGGGCTTGTACCATCTGTTCTACCAGTACAACCCC
Microexon-tag Amino Acid seq KLRTGYHFQPPKHWINDPNGPMYYKGLYHLFYQYNP
Transcript ID Os04t0413200-01
Gene ID Os.22515
Gene Name OSINV4
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.3e-107
Motif start 51
Motif end 377
Protein seq >Os04t0413200-01
MATARARAALVFVALLQMAAVVVVRASHVVYPELQSLEAKHVDGKLRTGYHFQPPKHWINDPNGPMYYKGLYHLFYQYNP
KGAVWGNIEWAHSVSTDLIDWTALEPGIYPSKTFDEKGCWSGSATVLPSGVPVIMYTGIDPDERQVQNVAYPVNLSDPYL
REWYKPDYNPIINPDGGINASAFRDPTTAWYGPDGHWRLLVGSKVNMKGLAVLYRSRDFKKWVKAHHPLHSAHTGMWECP
DFFPVAVAGGSRHYRRGVDTAELHDAAVAEEVKYVLKVSLDLTRYEYYTVGWYDHATDRYVPDAAFPDNDYGLRYDYGDF
YASKSFYDPAKRRRIVWGWANESDTVPDDRRKGWAGIQAIPRKLWLSADGKQLVQWPVEELKALRAKHVNVTDKVIKKGN
YFEVTGFKSVQSDVDMAFAIKDLSKAEEFDPAWRTDAEALCKKLGSDVDGGVGPFGLWALASGDLKERTAVFFRVFKAND
SSHVVLMCNDPTRSSYESKIYRPTFAGFVDVDIAKNKQIALRTLIDHSVVESFGARGKTCILTRVYPRKAVGDDAHLFVF
NNGESDVKVTNLDAWEMKTPKMNAEE*
CDS seq >Os04t0413200-01
ATGGCGACGGCGAGGGCGAGGGCCGCGCTGGTGTTTGTGGCTCTGCTGCAGATGGCGGCAGTGGTGGTGGTGCGCGCGTC
GCACGTCGTCTACCCGGAGCTCCAGTCGCTGGAGGCGAAGCACGTCGACGGGAAGCTCCGGACCGGGTACCACTTCCAAC
CACCCAAGCACTGGATCAACGATCCGAACGGTCCGATGTACTATAAGGGCTTGTACCATCTGTTCTACCAGTACAACCCC
AAGGGCGCCGTGTGGGGGAACATCGAGTGGGCGCACTCGGTTTCGACGGACCTGATCGACTGGACGGCGCTGGAGCCGGG
GATCTACCCGTCCAAGACGTTCGACGAGAAGGGCTGCTGGTCGGGCTCCGCCACCGTGCTCCCCAGCGGCGTGCCGGTGA
TCATGTACACCGGCATCGACCCCGACGAGCGGCAGGTCCAGAACGTCGCCTACCCGGTGAACCTCTCCGACCCGTACCTC
CGCGAGTGGTACAAGCCCGACTACAACCCCATCATCAACCCGGACGGCGGCATCAACGCCAGCGCGTTCCGCGACCCGAC
CACCGCCTGGTACGGGCCCGACGGCCACTGGCGGCTCCTCGTCGGCAGCAAGGTGAACATGAAGGGGCTCGCCGTGCTGT
ACCGGAGCCGGGACTTCAAGAAGTGGGTCAAGGCGCACCACCCGCTGCACTCGGCGCACACCGGGATGTGGGAGTGCCCG
GACTTCTTCCCCGTCGCTGTGGCGGGCGGGAGCCGCCACTACCGCCGCGGCGTCGACACCGCCGAGCTGCACGACGCCGC
CGTGGCGGAGGAGGTCAAGTACGTGCTCAAGGTGAGCCTCGACCTGACGCGGTACGAGTACTACACCGTCGGCTGGTACG
ACCACGCCACCGACCGGTACGTCCCCGACGCCGCCTTCCCCGACAACGACTACGGCCTCCGCTACGACTACGGCGACTTC
TACGCATCCAAGTCGTTCTACGACCCGGCCAAGCGCCGCCGCATCGTCTGGGGCTGGGCCAACGAGTCCGACACCGTACC
CGACGACCGCCGAAAGGGCTGGGCCGGCATCCAGGCGATACCGAGGAAGCTCTGGCTGTCGGCGGACGGGAAGCAGCTGG
TGCAGTGGCCGGTGGAGGAGCTCAAGGCGCTGCGAGCCAAGCACGTCAATGTCACTGACAAGGTCATCAAGAAGGGCAAC
TACTTCGAGGTCACCGGCTTCAAGTCCGTGCAGTCGGATGTGGATATGGCGTTCGCGATCAAGGACCTGAGCAAGGCGGA
GGAGTTCGACCCGGCGTGGCGGACGGACGCGGAGGCGCTGTGCAAGAAGCTCGGCTCGGACGTCGACGGCGGCGTGGGGC
CGTTCGGGCTGTGGGCGCTGGCCTCCGGCGACCTCAAGGAGAGGACGGCCGTCTTCTTCAGGGTGTTCAAGGCCAACGAC
TCCTCGCACGTCGTCCTCATGTGCAACGACCCTACCAGGTCATCGTACGAGTCGAAGATCTACAGGCCGACCTTCGCCGG
CTTCGTCGACGTCGACATCGCCAAGAACAAACAAATCGCCCTCCGGACATTGATCGATCACTCCGTGGTGGAGAGCTTCG
GGGCGCGCGGCAAGACGTGCATCCTGACGAGGGTGTACCCGAGGAAAGCCGTCGGCGACGACGCGCACCTCTTCGTCTTC
AACAACGGCGAGTCGGACGTCAAGGTCACCAACCTGGACGCCTGGGAGATGAAGACCCCGAAGATGAACGCGGAGGAGTA
G