Microexon ID Os_4:20423070-20423078:+
Species Oryza sativa
Coordinates 4:20423070..20423078
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCGAACG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq GAGCTCCGGACTGGGTATCACTTCCAGCCACCCAAGAACTGGATCAATGATCCGAACGCGCCGATGTACTACAAGGGGTGGTACCATCTGTTCTACCAGTACAACCCC
Microexon-tag Amino Acid Seq ELRTGYHFQPPKNWINDPNAPMYYKGWYHLFYQYNP
Microexon-tag spanning region 20422378-20423615
Microexon-tag prediction score 0.97
Overlapped with the annotated transcript (%) 100
New Transcript ID Os04t0413500-01x
Reference Transcript ID Os04t0413500-01
Gene ID Os04g0413500
Gene Name GIF1
Transcript ID Os04t0413500-01
Protein ID Os04t0413500-01
Gene ID Os04g0413500
Gene Name GIF1
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.5e-104
Motif start 59
Motif end 384
Protein seq >Os04t0413500-01
MGVLGSRVAWAWLVQLLLLQQLAGASHVVYDDLELQAAATTADGVPPSIVDSELRTGYHFQPPKNWINDPNAPMYYKGWY
HLFYQYNPKGAVWGNIVWAHSVSRDLINWVALKPAIEPSIRADKYGCWSGSATMMADGTPVIMYTGVNRPDVNYQVQNVA
LPRNGSDPLLREWVKPGHNPVIVPEGGINATQFRDPTTAWRGADGHWRLLVGSLAGQSRGVAYVYRSRDFRRWTRAAQPL
HSAPTGMWECPDFYPVTADGRREGVDTSSAVVDAAASARVKYVLKNSLDLRRYDYYTVGTYDRKAERYVPDDPAGDEHHI
RYDYGNFYASKTFYDPAKRRRILWGWANESDTAADDVAKGWAGIQAIPRKVWLDPSGKQLLQWPIEEVERLRGKWPVILK
DRVVKPGEHVEVTGLQTAQADVEVSFEVGSLEAAERLDPAMAYDAQRLCSARGADARGGVGPFGLWVLASAGLEEKTAVF
FRVFRPAARGGGAGKPVVLMCTDPTKSSRNPNMYQPTFAGFVDTDITNGKISLRSLIDRSVVESFGAGGKACILSRVYPS
LAIGKNARLYVFNNGKAEIKVSQLTAWEMKKPVMMNGA*
CDS seq >Os04t0413500-01
ATGGGAGTTCTTGGTAGTAGGGTCGCTTGGGCATGGCTGGTCCAGCTGCTGCTGCTCCAGCAGCTCGCCGGAGCGTCGCA
CGTCGTCTACGACGACCTCGAGCTGCAGGCGGCTGCTACCACAGCGGACGGCGTGCCGCCGTCCATCGTCGACTCTGAGC
TCCGGACTGGGTATCACTTCCAGCCACCCAAGAACTGGATCAATGATCCGAACGCGCCGATGTACTACAAGGGGTGGTAC
CATCTGTTCTACCAGTACAACCCCAAGGGCGCCGTGTGGGGGAACATCGTGTGGGCGCACTCAGTGTCACGTGACCTCAT
CAACTGGGTGGCGCTCAAGCCGGCCATCGAGCCCAGCATCAGGGCCGACAAGTACGGCTGCTGGTCGGGGTCGGCGACGA
TGATGGCCGACGGGACGCCGGTGATCATGTACACCGGCGTCAACCGCCCCGACGTCAACTACCAGGTGCAGAACGTGGCG
CTGCCGAGGAACGGGTCGGACCCGCTGCTGCGCGAGTGGGTGAAGCCCGGCCACAACCCGGTGATCGTGCCCGAGGGCGG
CATCAACGCGACGCAGTTCCGCGACCCGACCACCGCGTGGCGCGGGGCCGACGGCCACTGGCGGCTGCTCGTCGGCAGCC
TCGCGGGGCAGTCCCGCGGCGTGGCGTACGTGTACCGGAGCAGGGACTTCCGGCGGTGGACGCGCGCGGCGCAGCCGCTG
CACTCGGCGCCCACGGGGATGTGGGAGTGCCCGGACTTCTACCCGGTCACCGCGGACGGCCGCCGCGAGGGCGTCGACAC
CTCGTCCGCCGTCGTCGACGCCGCCGCCTCGGCGCGCGTCAAGTACGTGCTCAAGAACAGCCTCGACCTGCGCCGGTACG
ACTACTACACCGTCGGAACGTACGACCGGAAGGCCGAGCGGTACGTGCCGGACGACCCCGCCGGCGACGAGCACCACATC
CGCTACGACTACGGCAACTTCTACGCCTCCAAGACGTTCTACGACCCGGCGAAGCGCCGCCGCATCCTCTGGGGATGGGC
CAACGAGTCCGACACCGCCGCCGACGACGTGGCCAAGGGCTGGGCCGGAATCCAGGCGATTCCGAGGAAAGTGTGGCTGG
ACCCAAGTGGGAAGCAACTGTTGCAGTGGCCAATCGAGGAGGTCGAGAGGCTGAGAGGGAAGTGGCCGGTCATTCTCAAG
GACAGGGTGGTCAAGCCAGGGGAACACGTCGAGGTGACCGGGCTACAAACTGCACAGGCTGACGTGGAGGTGAGCTTCGA
GGTGGGGAGCCTGGAGGCGGCGGAGCGGCTGGACCCGGCGATGGCGTACGACGCGCAGCGGCTGTGCAGCGCGCGGGGCG
CCGACGCGAGGGGCGGCGTGGGGCCGTTCGGCCTGTGGGTGCTCGCGTCCGCGGGGCTGGAGGAGAAGACCGCCGTGTTC
TTCAGGGTGTTCAGGCCGGCGGCGCGCGGCGGCGGCGCCGGCAAGCCCGTCGTGCTCATGTGCACCGACCCCACCAAGTC
ATCGCGCAACCCGAACATGTACCAGCCGACGTTTGCAGGGTTCGTTGACACGGACATCACCAACGGGAAGATATCTCTGA
GGAGCCTGATCGACAGGTCGGTTGTTGAGAGCTTCGGGGCTGGAGGAAAGGCGTGCATCCTGTCGAGGGTGTACCCGTCG
CTGGCCATCGGCAAGAACGCGCGCCTTTACGTTTTCAATAACGGGAAGGCGGAGATCAAGGTGTCGCAGCTCACCGCGTG
GGAGATGAAGAAGCCGGTCATGATGAATGGAGCCTAA
Transcript ID Os04t0413500-01
Gene ID Os.22517
Gene Name GIF1
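The fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database itself, that splits the microexon-tag by its 49,9,50 structure, translates it with the standard genetic code, and recovers the residues whose codons overlap the microexon. The helper names (`split_tag`, `translate`, `overlapping_residues`) are hypothetical, and the sketch assumes that the tag starts in reading frame 0 and that "Phase" counts the bases of the first overlapping codon that lie in the upstream flanking exon.

```python
# Values copied verbatim from the record above.
TAG_DNA = "GAGCTCCGGACTGGGTATCACTTCCAGCCACCCAAGAACTGGATCAATGATCCGAACGCGCCGATGTACTACAAGGGGTGGTACCATCTGTTCTACCAGTACAACCCC"
TAG_STRUCTURE = (49, 9, 50)  # flanking exon, microexon, flanking exon sizes
MICROEXON_DNA = "ATCCGAACG"
TAG_PROTEIN = "ELRTGYHFQPPKNWINDPNAPMYYKGWYHLFYQYNP"
MICROEXON_PROTEIN = "DPNA"

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def split_tag(tag, structure):
    """Split a tag into (upstream exon, microexon, downstream exon)."""
    up, mid, down = structure
    if up + mid + down != len(tag):
        raise ValueError("structure does not add up to the tag length")
    return tag[:up], tag[up:up + mid], tag[up + mid:]


def translate(dna):
    """Translate from frame 0 (an assumption), ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def overlapping_residues(protein, start, size):
    """Residues whose codons overlap the exon at 0-based nucleotide offset `start`."""
    first_codon = start // 3
    last_codon = (start + size - 1) // 3
    return protein[first_codon:last_codon + 1]


upstream, microexon, downstream = split_tag(TAG_DNA, TAG_STRUCTURE)
tag_protein = translate(TAG_DNA)
residues = overlapping_residues(tag_protein, TAG_STRUCTURE[0], TAG_STRUCTURE[1])
# Bases of the first overlapping codon that lie in the upstream exon.
phase = TAG_STRUCTURE[0] % 3

print(microexon == MICROEXON_DNA)
print(tag_protein == TAG_PROTEIN)
print(residues == MICROEXON_PROTEIN)
print(phase)
```

Under these assumptions, the 9-nt microexon touches four codons rather than three because it starts one base into a codon, which is consistent with the four-residue "Microexon Amino Acid seq" and the "Phase 1" field.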