Microexon ID Os_4:30466049-30466059:-
Species Oryza sativa
Coordinates 4:30466049..30466059
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AAAATCTAAAG
Microexon Amino Acid seq ENLK
Microexon-tag DNA Seq AAAGAAAATGAAAGCGACCCTCATCTTTTCTCATGTATCTTATCCAAGGAAAATCTAAAGGTCAAGGAAATACACCACTTTACTCAGGATGACCTGATGGCAGAAGAT
Microexon-tag Amino Acid Seq KENESDPHLFSCILSKENLKVKEIHHFTQDDLMAED
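
The fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it splits the microexon-tag DNA by the listed exon sizes (49,11,48), translates it with the standard genetic code, and recovers the codons that overlap the microexon. The helper names (`translate`, `split_tag`) are hypothetical, and the phase calculation assumes that "Phase" here means the number of codon bases already supplied by the upstream exon.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    return parts


# Values copied from the record.
TAG_DNA = (
    "AAAGAAAATGAAAGCGACCCTCATCTTTTCTCATGTATCTTATCCAAGGAAAATCTAAAG"
    "GTCAAGGAAATACACCACTTTACTCAGGATGACCTGATGGCAGAAGAT"
)
EXON_SIZES = (49, 11, 48)  # flanking exon, microexon, flanking exon

upstream, microexon, downstream = split_tag(TAG_DNA, EXON_SIZES)

# Assumption: phase = bases of a split codon contributed by the upstream exon.
phase = len(upstream) % 3

# Codons that overlap the microexon, including the codon split by the junction.
first = (len(upstream) // 3) * 3
last = -(-(len(upstream) + len(microexon)) // 3) * 3  # round up to a codon end
microexon_aa = translate(TAG_DNA[first:last])

print(microexon, phase, microexon_aa)
print(translate(TAG_DNA))
```

Under these assumptions the sliced microexon, the phase, and both translations agree with the Microexon DNA seq, Phase, Microexon Amino Acid seq, and Microexon-tag Amino Acid Seq fields of this record.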
Microexon-tag spanning region 30465906-30466233
Microexon-tag prediction score 0.9278
Overlapped with the annotated transcript (%) 100
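
The coordinate fields can be checked with simple arithmetic. The sketch below assumes 1-based, inclusive genomic coordinates (an assumption that is consistent with the listed Size of 11) and parses the Microexon ID with a hypothetical helper, `parse_microexon_id`; the ID layout it expects is inferred from this one record only.

```python
def parse_microexon_id(microexon_id):
    """Split an ID such as 'Os_4:30466049-30466059:-' into its parts.

    The layout (species code, chromosome, start-end, strand) is inferred
    from this record and is an assumption, not a documented format.
    """
    prefix, region, strand = microexon_id.split(":")
    species_code, chromosome = prefix.split("_")
    start, end = (int(x) for x in region.split("-"))
    return species_code, chromosome, start, end, strand


def inclusive_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


species_code, chromosome, start, end, strand = parse_microexon_id(
    "Os_4:30466049-30466059:-"
)
microexon_size = inclusive_length(start, end)

# Microexon-tag spanning region and exon sizes, copied from the record.
span_length = inclusive_length(30465906, 30466233)
exonic_length = 49 + 11 + 48

# Whatever the span covers beyond the three exons is intronic sequence.
intronic_length = span_length - exonic_length

print(chromosome, strand, microexon_size)
print(span_length, exonic_length, intronic_length)
```

With these inputs the microexon length comes out as 11, matching the Size field, and the 328-nt spanning region contains 108 exonic and 220 intronic nucleotides. Because the strand is "-", the first transcribed base of the microexon is the one at the higher coordinate.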
New Transcript ID Os04t0604000-01x
Reference Transcript ID Os04t0604000-01
Gene ID Os04g0604000
Gene Name VLN4
Transcript ID Os04t0604000-01
Protein ID Os04t0604000-01
Gene ID Os04g0604000
Gene Name VLN4
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Os04t0604000-01
MSISMKDVDPAFRGVGQKDGLEVWRIENFKPVPVPTSSHGKFYMGDSYIILKTTALKNGSFRHDLHYWLGKDTSQDEAGT
AAILTVELDAALGGRAVQYREVQGGETEKLLSYFRPCIMPQPGGVASGFNHVEVNQQDHVTRLYVCQGKHVVHVKEVPFV
RSSLNHEDIFILDTANKIFQFNGSNSCIQERAKALEVVQYIKDTFHEGKCEVAAVEDGKLMADTEAGEFWGLFGGFAPLP
KKTSSEDNGDDKETVTKLLCFNQGTLEHISFESLEHELLETNKCYLLDCGAEMYVWMGRGTSLQVRKGASEAAEKLLIDE
NRKGSNVIKVIEGFETIMFKSKFNKWPPTPDLKLSSEDGRGKVAALLRSQGLDVKGLMKAAPEEEEPQPYIDCTGHLQVW
RVNGDGKTLLSSSDQSKLYTGDCYIFQYTYTGDDKEECLIGTWFGKKSVEEDRTSAISLASKMFQAAKFQAAQARLYEGK
EPIQFFVIFQSLQVFKGGLSSGYKNFIAVNGTDDDTYVEGGLALFRIQGSGSENMQAIQVDAVSSSLNSSYCYILHNGNT
VFTWTGNLTTSLDNDLVERQLDVIKPDLPSRSQKEGRETDQFWELLGGKCKYSNKKIGKENESDPHLFSCILSKENLKVK
EIHHFTQDDLMAEDIFVLDCRTDLFVWVGQEVDAKLRSQAMDIGEKFLLHDFLMENLSQDTPIFIVTEGSEPQFFTRFFT
WDSAKSLMHGSSYQRKLAIVKGGATPSLDKPKRRTPAFSGRNAGQDKSQQRTRSMSHSPERHRIRGRSPAFTAIASAFEN
PSTRYLSTPPPAVKKLFPRSGGSELPKTSSKQSAINALTSAFEGPTKSTIPKSVKASPEAEKAIQEEGSTIGESENEPED
DENSTIYPYERLTTTSDDPAPDIDVTKREVYLSSVEFTEKFGMTRASFKNLPKWKQNRLKSDLQLF*
CDS seq >Os04t0604000-01
ATGTCAATTTCTATGAAAGATGTAGATCCGGCTTTCCGTGGAGTTGGACAAAAGGATGGTTTGGAGGTATGGCGTATTGA
AAACTTCAAGCCTGTTCCCGTGCCAACATCTTCACACGGAAAATTTTACATGGGTGATTCATATATCATCTTGAAGACGA
CAGCCTTGAAAAATGGTTCATTTCGCCATGATCTCCACTACTGGCTTGGAAAGGATACTAGTCAGGACGAAGCTGGAACT
GCTGCAATTCTAACTGTGGAGCTTGATGCTGCCCTTGGAGGGCGTGCTGTCCAGTACAGGGAAGTACAAGGCGGTGAAAC
TGAAAAGCTTCTCTCCTATTTTAGACCATGCATCATGCCACAGCCAGGAGGGGTAGCTTCTGGGTTCAATCATGTAGAGG
TCAATCAGCAAGATCATGTTACCCGCTTATATGTGTGCCAAGGAAAGCATGTTGTTCATGTTAAAGAGGTTCCTTTTGTT
CGTTCATCCCTTAATCACGAGGACATATTCATTTTGGATACCGCGAACAAAATTTTCCAGTTCAATGGCTCTAACTCATG
CATTCAAGAGAGAGCAAAAGCTCTTGAAGTTGTGCAATATATCAAAGATACTTTCCATGAGGGCAAGTGCGAAGTTGCAG
CTGTTGAGGATGGAAAGTTGATGGCCGATACTGAAGCTGGTGAATTTTGGGGTTTGTTTGGTGGTTTTGCTCCTCTCCCA
AAGAAGACATCTTCAGAGGACAATGGGGATGATAAGGAAACTGTGACCAAATTGCTATGTTTTAACCAAGGAACGCTGGA
GCATATTAGTTTTGAATCTTTGGAGCATGAGTTACTTGAGACAAACAAATGCTACTTGTTGGACTGTGGAGCTGAAATGT
ATGTTTGGATGGGCAGAGGTACTTCTTTGCAAGTGAGAAAGGGTGCAAGTGAAGCTGCTGAGAAATTGCTCATTGATGAG
AACCGAAAAGGATCAAATGTTATCAAAGTGATTGAGGGATTCGAAACAATCATGTTCAAGTCAAAATTTAACAAGTGGCC
GCCTACTCCTGATTTGAAGCTGTCATCTGAGGATGGCCGAGGCAAAGTGGCAGCTCTACTCAGAAGTCAAGGATTGGATG
TTAAAGGATTGATGAAGGCTGCTCCTGAAGAAGAAGAACCCCAACCTTATATTGATTGCACTGGTCATTTACAGGTCTGG
CGAGTAAATGGCGATGGCAAGACTCTTCTTTCATCTTCTGATCAATCAAAACTTTACACCGGAGATTGCTACATTTTTCA
ATACACATACACTGGAGATGATAAGGAGGAATGTCTTATTGGAACTTGGTTTGGGAAGAAGAGTGTTGAGGAGGACAGAA
CATCGGCAATTTCACTAGCTAGCAAGATGTTTCAGGCTGCAAAATTCCAGGCTGCCCAGGCTCGCCTCTATGAAGGGAAA
GAACCGATTCAATTTTTCGTCATATTTCAGAGTCTTCAAGTTTTTAAGGGTGGACTTAGCTCTGGATACAAGAACTTTAT
TGCTGTAAATGGTACCGATGACGACACTTACGTTGAAGGTGGGCTTGCTCTCTTCCGGATTCAAGGCTCAGGATCAGAAA
ACATGCAAGCAATTCAAGTTGATGCAGTGTCTTCGTCATTAAATTCATCCTACTGCTACATTCTACACAATGGAAACACT
GTGTTCACATGGACTGGGAACCTTACAACCTCACTGGATAATGACTTGGTTGAGAGACAGCTAGATGTAATTAAGCCAGA
TCTGCCATCCAGGTCACAAAAGGAGGGGAGAGAAACCGACCAATTCTGGGAACTCTTAGGTGGAAAATGCAAGTATTCAA
ACAAAAAAATAGGAAAAGAAAATGAAAGCGACCCTCATCTTTTCTCATGTATCTTATCCAAGGAAAATCTAAAGGTCAAG
GAAATACACCACTTTACTCAGGATGACCTGATGGCAGAAGATATTTTCGTTCTAGACTGCCGCACCGACTTGTTTGTTTG
GGTTGGACAGGAGGTGGATGCCAAATTGAGATCACAAGCGATGGACATCGGTGAGAAATTTCTTCTACATGATTTCCTTA
TGGAAAATCTCTCGCAAGACACACCAATTTTTATTGTTACAGAAGGAAGTGAGCCACAGTTTTTTACTAGGTTCTTCACT
TGGGACTCAGCAAAATCGCTGATGCATGGCAGTTCATACCAGAGGAAGCTTGCAATAGTAAAGGGTGGAGCAACTCCATC
GCTTGATAAACCTAAAAGGCGAACACCAGCGTTTTCAGGAAGGAACGCAGGACAAGATAAATCTCAGCAGCGCACAAGAA
GTATGTCCCACAGCCCAGAACGTCACCGTATTCGAGGAAGATCTCCAGCTTTCACCGCAATAGCTTCTGCCTTTGAGAAC
CCAAGTACCCGGTATCTTTCCACCCCTCCCCCTGCTGTCAAGAAGCTTTTCCCAAGATCCGGAGGGTCTGAATTGCCAAA
GACATCATCCAAACAATCAGCTATCAATGCTCTCACCAGTGCTTTCGAGGGTCCTACGAAAAGTACAATACCTAAGTCTG
TAAAAGCGAGCCCCGAGGCAGAGAAGGCAATACAGGAGGAAGGCTCAACGATCGGTGAAAGTGAAAACGAGCCAGAAGAT
GATGAGAACAGCACAATCTACCCATATGAACGTTTGACCACCACATCTGATGATCCTGCTCCTGACATTGATGTTACCAA
GCGAGAGGTCTACTTATCATCAGTTGAGTTCACAGAGAAGTTTGGCATGACAAGGGCATCATTCAAAAACCTTCCAAAAT
GGAAGCAAAACAGGCTAAAGTCTGATCTCCAGCTCTTTTAG
Transcript ID Os04t0604000-01
Gene ID Os.23969
Gene Name VLN4
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA