Microexon ID At_1:20566930-20566938:+
Species Arabidopsis thaliana
Coordinates 1:20566930..20566938
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCGTACCGGACCGGTTATCACTTTCAACCTCTCAAAAATTGGATGAACGATCCTAATGGACCAATGATATACAAAGGAATATACCATCTTTTTTACCAATATAACCCA
Microexon-tag Amino Acid Seq PYRTGYHFQPLKNWMNDPNGPMIYKGIYHLFYQYNP
Microexon-tag spanning region20566725-20567081
Microexon-tag prediction score0.9487
Overlapped with the annotated transcript (%) 100
New Transcript ID AT1G55120.1x
Reference Transcript ID AT1G55120.1
Gene ID AT1G55120
Gene Name CWINV3
Transcript ID AT1G55120.1
Protein ID AT1G55120.1
Gene ID AT1G55120
Gene Name CWINV3
Pfam domain motif Glyco_hydro_32N
Motif E-value 8.2e-107
Motif start 43
Motif end 365
Protein seq >AT1G55120.1
MAKLNRSNIGLSLLLSMFLANFITDLEASSHQDLNQPYRTGYHFQPLKNWMNDPNGPMIYKGIYHLFYQYNPYGAVWDVR
IVWGHSTSVDLVNWISQPPAFNPSQPSDINGCWSGSVTILPNGKPVILYTGIDQNKGQVQNVAVPVNISDPYLREWSKPP
QNPLMTTNAVNGINPDRFRDPTTAWLGRDGEWRVIVGSSTDDRRGLAILYKSRDFFNWTQSMKPLHYEDLTGMWECPDFF
PVSITGSDGVETSSVGENGIKHVLKVSLIETLHDYYTIGSYDREKDVYVPDLGFVQNESAPRLDYGKYYASKTFYDDVKK
RRILWGWVNESSPAKDDIEKGWSGLQSFPRKIWLDESGKELLQWPIEEIETLRGQQVNWQKKVLKAGSTLQVHGVTAAQA
DVEVSFKVKELEKADVIEPSWTDPQKICSQGDLSVMSGLGPFGLMVLASNDMEEYTSVYFRIFKSNDDTNKKTKYVVLMC
SDQSRSSLNDENDKSTFGAFVAIDPSHQTISLRTLIDHSIVESYGGGGRTCITSRVYPKLAIGENANLFVFNKGTQSVDI
LTLSAWSLKSAQINGDLMSPFIEREESRSPNHQF*
CDS seq >AT1G55120.1
ATGGCGAAATTAAATCGTTCCAACATTGGTCTCTCACTGTTACTATCTATGTTCCTCGCAAACTTTATCACCGATCTTGA
AGCTTCTTCTCATCAAGATCTCAACCAACCGTACCGGACCGGTTATCACTTTCAACCTCTCAAAAATTGGATGAACGATC
CTAATGGACCAATGATATACAAAGGAATATACCATCTTTTTTACCAATATAACCCATACGGTGCCGTTTGGGATGTGAGA
ATCGTGTGGGGACACTCCACGTCAGTTGACTTAGTCAACTGGATTTCACAACCTCCAGCTTTTAATCCATCTCAGCCGTC
AGATATCAACGGTTGTTGGTCAGGCTCCGTCACGATTCTACCAAACGGCAAACCTGTGATCCTCTACACCGGCATTGACC
AAAACAAAGGTCAAGTCCAAAACGTCGCCGTTCCGGTTAACATCTCTGATCCTTATCTCCGAGAATGGTCAAAACCGCCG
CAAAATCCTCTAATGACAACTAACGCGGTTAACGGAATTAACCCGGACCGGTTTCGTGATCCGACCACCGCGTGGCTTGG
ACGTGACGGAGAATGGCGAGTAATCGTCGGAAGCTCGACGGACGATCGACGAGGATTAGCGATTCTTTACAAAAGCAGAG
ATTTCTTCAACTGGACGCAATCAATGAAGCCTTTACATTACGAAGACTTAACCGGAATGTGGGAATGTCCTGATTTTTTC
CCGGTTTCGATAACCGGATCGGACGGTGTAGAAACGTCGTCGGTTGGTGAGAATGGGATTAAGCATGTGCTTAAAGTGAG
TTTGATTGAGACATTGCATGATTATTACACGATTGGGAGTTATGATCGTGAGAAAGATGTTTACGTACCGGATCTTGGGT
TTGTGCAAAACGAATCAGCTCCGAGGTTAGATTACGGGAAATATTACGCGTCGAAAACGTTTTACGATGATGTTAAAAAA
CGAAGGATCTTGTGGGGTTGGGTTAATGAATCGTCTCCAGCTAAAGATGATATTGAGAAGGGTTGGTCTGGTCTTCAGTC
ATTTCCGAGGAAAATATGGCTCGATGAATCAGGAAAGGAATTGCTACAGTGGCCGATTGAAGAGATTGAGACATTGCGTG
GGCAACAAGTCAACTGGCAAAAGAAAGTTCTCAAAGCAGGATCTACTCTCCAAGTTCATGGTGTCACTGCTGCGCAGGCA
GATGTTGAGGTATCGTTCAAGGTGAAGGAATTGGAAAAGGCGGATGTGATTGAACCGAGTTGGACCGATCCACAAAAGAT
ATGTAGTCAGGGAGATTTATCGGTTATGTCGGGTTTAGGACCATTTGGTTTGATGGTTTTGGCATCCAATGACATGGAAG
AGTACACGTCCGTTTACTTCAGAATTTTCAAGTCAAATGACGATACTAACAAGAAGACTAAATACGTGGTGTTGATGTGC
AGTGACCAAAGCAGATCATCGTTAAATGATGAAAATGATAAATCAACCTTTGGTGCTTTTGTTGCGATAGACCCTTCTCA
CCAAACAATTTCTCTTAGGACTTTGATTGATCACTCGATAGTGGAGAGTTATGGTGGAGGAGGCAGAACATGTATAACCT
CAAGAGTGTATCCAAAATTGGCAATCGGAGAAAATGCAAATCTGTTTGTCTTCAACAAAGGCACTCAAAGTGTTGATATC
TTGACCCTAAGTGCTTGGAGTTTAAAGTCTGCTCAAATCAATGGCGACTTAATGTCACCATTTATTGAGCGTGAAGAGTC
ACGCTCACCTAATCATCAGTTTTGA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCGTACCGGACCGGTTATCACTTTCAACCTCTCAAAAATTGGATGAACGATCCTAATGGACCAATGATATACAAAGGAATATACCATCTTTTTTACCAATATAACCCA
Microexon-tag Amino Acid seq PYRTGYHFQPLKNWMNDPNGPMIYKGIYHLFYQYNP
Transcript ID AT1G55120.1
Gene ID At.4749
Gene Name CWINV3
Pfam domain motif Glyco_hydro_32N
Motif E-value 8.2e-107
Motif start 43
Motif end 365
Protein seq >AT1G55120.1
MAKLNRSNIGLSLLLSMFLANFITDLEASSHQDLNQPYRTGYHFQPLKNWMNDPNGPMIYKGIYHLFYQYNPYGAVWDVR
IVWGHSTSVDLVNWISQPPAFNPSQPSDINGCWSGSVTILPNGKPVILYTGIDQNKGQVQNVAVPVNISDPYLREWSKPP
QNPLMTTNAVNGINPDRFRDPTTAWLGRDGEWRVIVGSSTDDRRGLAILYKSRDFFNWTQSMKPLHYEDLTGMWECPDFF
PVSITGSDGVETSSVGENGIKHVLKVSLIETLHDYYTIGSYDREKDVYVPDLGFVQNESAPRLDYGKYYASKTFYDDVKK
RRILWGWVNESSPAKDDIEKGWSGLQSFPRKIWLDESGKELLQWPIEEIETLRGQQVNWQKKVLKAGSTLQVHGVTAAQA
DVEVSFKVKELEKADVIEPSWTDPQKICSQGDLSVMSGLGPFGLMVLASNDMEEYTSVYFRIFKSNDDTNKKTKYVVLMC
SDQSRSSLNDENDKSTFGAFVAIDPSHQTISLRTLIDHSIVESYGGGGRTCITSRVYPKLAIGENANLFVFNKGTQSVDI
LTLSAWSLKSAQINGDLMSPFIEREESRSPNHQF*
CDS seq >AT1G55120.1
ATGGCGAAATTAAATCGTTCCAACATTGGTCTCTCACTGTTACTATCTATGTTCCTCGCAAACTTTATCACCGATCTTGA
AGCTTCTTCTCATCAAGATCTCAACCAACCGTACCGGACCGGTTATCACTTTCAACCTCTCAAAAATTGGATGAACGATC
CTAATGGACCAATGATATACAAAGGAATATACCATCTTTTTTACCAATATAACCCATACGGTGCCGTTTGGGATGTGAGA
ATCGTGTGGGGACACTCCACGTCAGTTGACTTAGTCAACTGGATTTCACAACCTCCAGCTTTTAATCCATCTCAGCCGTC
AGATATCAACGGTTGTTGGTCAGGCTCCGTCACGATTCTACCAAACGGCAAACCTGTGATCCTCTACACCGGCATTGACC
AAAACAAAGGTCAAGTCCAAAACGTCGCCGTTCCGGTTAACATCTCTGATCCTTATCTCCGAGAATGGTCAAAACCGCCG
CAAAATCCTCTAATGACAACTAACGCGGTTAACGGAATTAACCCGGACCGGTTTCGTGATCCGACCACCGCGTGGCTTGG
ACGTGACGGAGAATGGCGAGTAATCGTCGGAAGCTCGACGGACGATCGACGAGGATTAGCGATTCTTTACAAAAGCAGAG
ATTTCTTCAACTGGACGCAATCAATGAAGCCTTTACATTACGAAGACTTAACCGGAATGTGGGAATGTCCTGATTTTTTC
CCGGTTTCGATAACCGGATCGGACGGTGTAGAAACGTCGTCGGTTGGTGAGAATGGGATTAAGCATGTGCTTAAAGTGAG
TTTGATTGAGACATTGCATGATTATTACACGATTGGGAGTTATGATCGTGAGAAAGATGTTTACGTACCGGATCTTGGGT
TTGTGCAAAACGAATCAGCTCCGAGGTTAGATTACGGGAAATATTACGCGTCGAAAACGTTTTACGATGATGTTAAAAAA
CGAAGGATCTTGTGGGGTTGGGTTAATGAATCGTCTCCAGCTAAAGATGATATTGAGAAGGGTTGGTCTGGTCTTCAGTC
ATTTCCGAGGAAAATATGGCTCGATGAATCAGGAAAGGAATTGCTACAGTGGCCGATTGAAGAGATTGAGACATTGCGTG
GGCAACAAGTCAACTGGCAAAAGAAAGTTCTCAAAGCAGGATCTACTCTCCAAGTTCATGGTGTCACTGCTGCGCAGGCA
GATGTTGAGGTATCGTTCAAGGTGAAGGAATTGGAAAAGGCGGATGTGATTGAACCGAGTTGGACCGATCCACAAAAGAT
ATGTAGTCAGGGAGATTTATCGGTTATGTCGGGTTTAGGACCATTTGGTTTGATGGTTTTGGCATCCAATGACATGGAAG
AGTACACGTCCGTTTACTTCAGAATTTTCAAGTCAAATGACGATACTAACAAGAAGACTAAATACGTGGTGTTGATGTGC
AGTGACCAAAGCAGATCATCGTTAAATGATGAAAATGATAAATCAACCTTTGGTGCTTTTGTTGCGATAGACCCTTCTCA
CCAAACAATTTCTCTTAGGACTTTGATTGATCACTCGATAGTGGAGAGTTATGGTGGAGGAGGCAGAACATGTATAACCT
CAAGAGTGTATCCAAAATTGGCAATCGGAGAAAATGCAAATCTGTTTGTCTTCAACAAAGGCACTCAAAGTGTTGATATC
TTGACCCTAAGTGCTTGGAGTTTAAAGTCTGCTCAAATCAATGGCGACTTAATGTCACCATTTATTGAGCGTGAAGAGTC
ACGCTCACCTAATCATCAGTTTTGA