Microexon ID At_3:4535501-4535509:-
Species Arabidopsis thaliana
Coordinates 3:4535501..4535509
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (degenerate, IUPAC ambiguity codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCCTACCGGACCGGTTTCCATTTCCAACCCCCCAAAAATTGGATGAACGATCCTAATGGGCCTATGATATACAAAGGAATATATCATCTTTTCTACCAATGGAACCCG
Microexon-tag Amino Acid Seq PYRTGFHFQPPKNWMNDPNGPMIYKGIYHLFYQWNP
Microexon-tag spanning region 4535186-4535684
Microexon-tag prediction score 0.9462
Overlapped with the annotated transcript (%) 100
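The size, phase, structure, and sequence fields above can be cross-checked against each other. The following is a minimal Python sketch, not the database's own method: it splits the microexon-tag by the 49,9,50 structure, translates it with the standard genetic code, and reads off the residues whose codons overlap the microexon. It assumes that "Phase 1" means the upstream flanking exon contributes one base to the codon that the microexon completes; the helper names `split_tag` and `translate` are hypothetical.

```python
# Values copied from the record above.
TAG_DNA = "CCCTACCGGACCGGTTTCCATTTCCAACCCCCCAAAAATTGGATGAACGATCCTAATGGGCCTATGATATACAAAGGAATATATCATCTTTTCTACCAATGGAACCCG"
TAG_AA = "PYRTGFHFQPPKNWMNDPNGPMIYKGIYHLFYQWNP"
MICROEXON_DNA = "ATCCTAATG"
STRUCTURE = (49, 9, 50)  # flanking exon, microexon, flanking exon
PHASE = 1                # assumed: bases of a split codon left in the upstream exon

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def split_tag(dna, sizes):
    """Cut a spliced tag sequence into consecutive exons of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(dna[pos:pos + size])
        pos += size
    return parts


def translate(dna):
    """Translate complete codons from position 0; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


upstream, microexon, downstream = split_tag(TAG_DNA, STRUCTURE)

# Codons that contain at least one microexon base (0-based codon indices).
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
microexon_aa = translate(TAG_DNA)[first_codon:last_codon + 1]

print(microexon)           # ATCCTAATG
print(len(upstream) % 3)   # 1
print(microexon_aa)        # DPNG
```

Under this reading, the 9 nt microexon yields four residues because it borrows one base (G) from the upstream exon and two bases (GG) from the downstream exon.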
New Transcript ID AT3G13790.1x
Reference Transcript ID AT3G13790.1
Gene ID AT3G13790
Gene Name CWINV1
Transcript ID AT3G13790.1
Protein ID AT3G13790.1
Gene ID AT3G13790
Gene Name CWINV1
Pfam domain motif Glyco_hydro_32N
Motif E-value 6e-111
Motif start 56
Motif end 378
Protein seq >AT3G13790.1
MTKEVCSNIGLWLLLTLLIGNYVVNLEASHHVYKRLTQSTNTKSPSVNQPYRTGFHFQPPKNWMNDPNGPMIYKGIYHLF
YQWNPKGAVWGNIVWAHSTSTDLINWDPHPPAIFPSAPFDINGCWSGSATILPNGKPVILYTGIDPKNQQVQNIAEPKNL
SDPYLREWKKSPLNPLMAPDAVNGINASSFRDPTTAWLGQDKKWRVIIGSKIHRRGLAITYTSKDFLKWEKSPEPLHYDD
GSGMWECPDFFPVTRFGSNGVETSSFGEPNEILKHVLKISLDDTKHDYYTIGTYDRVKDKFVPDNGFKMDGTAPRYDYGK
YYASKTFFDSAKNRRILWGWTNESSSVEDDVEKGWSGIQTIPRKIWLDRSGKQLIQWPVREVERLRTKQVKNLRNKVLKS
GSRLEVYGVTAAQADVEVLFKVRDLEKADVIEPSWTDPQLICSKMNVSVKSGLGPFGLMVLASKNLEEYTSVYFRIFKAR
QNSNKYVVLMCSDQSRSSLKEDNDKTTYGAFVDINPHQPLSLRALIDHSVVESFGGKGRACITSRVYPKLAIGKSSHLFA
FNYGYQSVDVLNLNAWSMNSAQIS*
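To relate the microexon to the Pfam motif range listed above, the microexon-tag peptide can be located in the protein sequence. This is a rough sketch under two assumptions: that "Motif start" and "Motif end" are 1-based residue positions in this protein, and that the first 160 residues of the protein (the first two sequence lines) are enough to find the tag. The helper `span_1based` is a hypothetical name.

```python
# First 160 residues of AT3G13790.1, copied from the protein sequence above.
PROTEIN_PREFIX = (
    "MTKEVCSNIGLWLLLTLLIGNYVVNLEASHHVYKRLTQSTNTKSPSVNQPYRTGFHFQPPKNWMNDPNGPMIYKGIYHLF"
    "YQWNPKGAVWGNIVWAHSTSTDLINWDPHPPAIFPSAPFDINGCWSGSATILPNGKPVILYTGIDPKNQQVQNIAEPKNL"
)
TAG_AA = "PYRTGFHFQPPKNWMNDPNGPMIYKGIYHLFYQWNP"
MICROEXON_AA = "DPNG"
MOTIF_START, MOTIF_END = 56, 378  # assumed to be 1-based protein coordinates


def span_1based(haystack, needle, offset=0):
    """Return the 1-based inclusive (start, end) of needle in haystack."""
    idx = haystack.find(needle)
    if idx < 0:
        raise ValueError("sequence not found")
    return offset + idx + 1, offset + idx + len(needle)


tag_span = span_1based(PROTEIN_PREFIX, TAG_AA)
# Search inside the tag so that the microexon residues are placed unambiguously.
micro_span = span_1based(TAG_AA, MICROEXON_AA, offset=tag_span[0] - 1)
micro_in_motif = MOTIF_START <= micro_span[0] and micro_span[1] <= MOTIF_END

print(tag_span)        # (50, 85)
print(micro_span)      # (66, 69)
print(micro_in_motif)  # True
```

With these assumptions, the microexon residues fall inside the Glyco_hydro_32N motif, while the microexon-tag itself begins six residues upstream of the motif start.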
CDS seq >AT3G13790.1
ATGACCAAAGAAGTTTGCTCCAACATTGGACTTTGGTTATTGCTCACGTTACTTATTGGTAACTATGTCGTCAATCTTGA
AGCCTCGCACCATGTCTACAAGAGACTTACCCAAAGCACTAACACCAAATCTCCTTCCGTAAACCAGCCCTACCGGACCG
GTTTCCATTTCCAACCCCCCAAAAATTGGATGAACGATCCTAATGGGCCTATGATATACAAAGGAATATATCATCTTTTC
TACCAATGGAACCCGAAAGGAGCCGTGTGGGGTAACATCGTGTGGGCTCATTCCACGTCAACAGACTTAATCAATTGGGA
TCCACATCCTCCAGCTATCTTCCCATCTGCACCCTTCGATATCAACGGATGCTGGTCCGGTTCAGCTACTATTCTCCCTA
ATGGAAAACCGGTTATCCTCTATACCGGAATCGACCCTAAGAACCAACAGGTCCAAAACATAGCCGAGCCTAAGAATCTC
TCCGATCCTTATCTCCGAGAATGGAAAAAGTCGCCGTTAAATCCTCTCATGGCTCCTGACGCCGTTAACGGAATCAACGC
CAGCTCGTTCCGTGACCCAACCACCGCGTGGCTAGGCCAAGACAAGAAATGGAGAGTGATCATCGGAAGCAAGATTCACC
GTCGTGGACTAGCCATTACTTACACGAGTAAAGACTTTCTAAAATGGGAAAAATCTCCAGAGCCGTTGCATTACGACGAC
GGAAGTGGAATGTGGGAATGTCCTGATTTTTTCCCGGTCACGAGGTTTGGTTCTAACGGCGTGGAAACGTCTTCGTTTGG
TGAACCTAATGAGATTTTGAAGCACGTGTTGAAAATAAGTTTGGACGACACGAAACATGATTATTACACGATTGGTACGT
ACGATCGGGTTAAAGATAAATTCGTACCGGACAATGGTTTCAAGATGGACGGTACGGCTCCGAGATACGATTACGGAAAG
TATTACGCGTCTAAAACGTTTTTTGACTCGGCTAAGAACCGGAGAATCTTGTGGGGTTGGACTAACGAGTCATCGTCGGT
TGAGGATGATGTTGAGAAAGGCTGGTCCGGTATTCAGACGATTCCAAGGAAAATATGGCTTGATAGATCAGGGAAACAAT
TAATTCAGTGGCCGGTTAGGGAAGTTGAAAGATTACGTACAAAACAAGTCAAAAACTTACGCAACAAAGTTCTAAAGTCA
GGATCTAGGCTTGAAGTCTATGGTGTGACAGCTGCACAGGCGGATGTAGAAGTATTGTTCAAAGTGAGAGACTTGGAGAA
AGCGGATGTGATAGAACCAAGTTGGACTGATCCGCAGTTGATTTGTAGCAAGATGAATGTATCGGTTAAGTCTGGTTTAG
GTCCATTCGGTTTAATGGTTTTGGCATCTAAGAATTTGGAAGAGTACACATCTGTTTATTTTAGAATCTTCAAAGCCCGT
CAAAACAGCAATAAGTACGTTGTGCTCATGTGCAGTGACCAAAGCAGATCTTCGCTGAAGGAAGATAATGACAAAACGAC
ATACGGAGCTTTTGTGGATATTAATCCTCACCAACCACTATCCCTCAGAGCCTTGATTGATCATTCAGTAGTGGAGAGTT
TCGGTGGAAAGGGAAGAGCATGCATTACCTCAAGAGTGTATCCAAAATTGGCAATAGGAAAAAGTTCACATCTCTTTGCT
TTTAATTATGGATATCAAAGTGTTGATGTCTTAAACTTAAATGCTTGGAGCATGAACTCTGCCCAAATCAGTTGA
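The same check can be made at the nucleotide level by locating the microexon-tag in the CDS. The sketch below uses only the first 320 nt of the CDS (its first four lines), which is an assumption about how much sequence is needed to contain the tag. As before, reading "Phase 1" as the offset of the microexon start within its codon is an interpretation rather than something the record states.

```python
# First 320 nt of the AT3G13790.1 CDS, copied from the sequence above.
CDS_PREFIX = (
    "ATGACCAAAGAAGTTTGCTCCAACATTGGACTTTGGTTATTGCTCACGTTACTTATTGGTAACTATGTCGTCAATCTTGA"
    "AGCCTCGCACCATGTCTACAAGAGACTTACCCAAAGCACTAACACCAAATCTCCTTCCGTAAACCAGCCCTACCGGACCG"
    "GTTTCCATTTCCAACCCCCCAAAAATTGGATGAACGATCCTAATGGGCCTATGATATACAAAGGAATATATCATCTTTTC"
    "TACCAATGGAACCCGAAAGGAGCCGTGTGGGGTAACATCGTGTGGGCTCATTCCACGTCAACAGACTTAATCAATTGGGA"
)
TAG_DNA = "CCCTACCGGACCGGTTTCCATTTCCAACCCCCCAAAAATTGGATGAACGATCCTAATGGGCCTATGATATACAAAGGAATATATCATCTTTTCTACCAATGGAACCCG"
MICROEXON_DNA = "ATCCTAATG"
UPSTREAM_EXON_SIZE = 49  # first value of the 49,9,50 structure

tag_start = CDS_PREFIX.find(TAG_DNA)          # 0-based offset in the CDS
micro_start = tag_start + UPSTREAM_EXON_SIZE  # 0-based offset of the microexon
micro_end = micro_start + len(MICROEXON_DNA)  # exclusive end

print(tag_start % 3)                             # 0, the tag starts on a codon boundary
print(micro_start % 3)                           # 1, consistent with Phase 1
print(micro_start + 1, micro_end)                # 197 205 (1-based, inclusive)
print(CDS_PREFIX[micro_start:micro_end])         # ATCCTAATG
```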
Transcript ID AT3G13790.1
Gene ID At.13325
Gene Name CWINV1
Pfam domain motif Glyco_hydro_32N
Motif E-value 6e-111
Motif start 56
Motif end 378