Microexon ID At_2:15176905-15176913:-
Species Arabidopsis thaliana
Coordinates 2:15176905..15176913
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (cluster consensus, IUPAC ambiguity codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CTTCACAGACCTAGTTTTCACTTTCAACCTCCAAAGCATTGGATTAACGATCCAAATGGTCCAGTATACTACAAAGGTCTCTACCATCTCTTCTACCAATATAACACC
Microexon-tag Amino Acid Seq LHRPSFHFQPPKHWINDPNGPVYYKGLYHLFYQYNT
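The tag structure (49,9,50), the phase, and the microexon sequences listed above can be cross-checked against each other. The following is a minimal sketch, assuming the three sizes are consecutive segment lengths along the coding strand, that the tag is in frame from its first base, and that the phase equals the upstream flank length modulo 3. The helper names (`split_tag`, `translate`, `residues_overlapping`) are hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed to apply to this nuclear gene).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Values copied from the record.
TAG = "CTTCACAGACCTAGTTTTCACTTTCAACCTCCAAAGCATTGGATTAACGATCCAAATGGTCCAGTATACTACAAAGGTCTCTACCATCTCTTCTACCAATATAACACC"
SIZES = (49, 9, 50)  # flanking exon, microexon, flanking exon


def split_tag(tag, sizes):
    """Split the tag into consecutive segments of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    return parts


def translate(dna):
    """Translate DNA in frame 0; a trailing partial codon is dropped."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def residues_overlapping(peptide, start, end):
    """Residues whose codons overlap the 0-based half-open span [start, end)."""
    return peptide[start // 3:(end - 1) // 3 + 1]


upstream, microexon, downstream = split_tag(TAG, SIZES)
peptide = translate(TAG)
phase = len(upstream) % 3  # bases of a split codon left in the upstream exon
microexon_residues = residues_overlapping(
    peptide, len(upstream), len(upstream) + len(microexon)
)

print(microexon)           # ATCCAAATG
print(peptide)             # LHRPSFHFQPPKHWINDPNGPVYYKGLYHLFYQYNT
print(phase)               # 1
print(microexon_residues)  # DPNG
```

Because the microexon starts one base into a codon, its 9 nucleotides touch four codons, which is why the record lists a four-residue peptide (DPNG) for a 9 nt exon.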
Microexon-tag spanning region 15176769-15177644
Microexon-tag prediction score 0.9346
Overlapped with the annotated transcript (%) 100
New Transcript ID AT2G36190.1x
Reference Transcript ID AT2G36190.1
Gene ID AT2G36190
Gene Name CWINV4
Transcript ID AT2G36190.1
Protein ID AT2G36190.1
Pfam domain motif Glyco_hydro_32N
Motif E-value 3e-104
Motif start 54
Motif end 372
Protein seq >AT2G36190.1
MAISNVISVLLLLLVLINLSNQNIKGIDAFHQIYEELQSESVESVNHLHRPSFHFQPPKHWINDPNGPVYYKGLYHLFYQ
YNTKGAVWGNIIWAHSVSKDLVNWEALEPALSPSKWFDIGGTWSGSITIVPGKGPIILYTGVNQNETQLQNYAIPEDPSD
PYLRKWIKPDDNPIAIPDYTMNGSAFRDPTTAWFSKDGHWRTVVGSKRKRRGIAYIYRSRDFKHWVKAKHPVHSKQSTGM
WECPDFFPVSLTDFRNGLDLDYVGPNTKHVLKVSLDITRYEYYTLGKYDLKKDRYIPDGNTPDGWEGLRFDYGNFYASKT
FFDYKKNRRILWGWANESDTVEDDILKGWAGLQVIPRTVLLDSSKKQLVFWPVEEIESLRGNYVRMNNHDIKMGQRIEVK
GITPAQADVEVTFYVGSLEKAEIFDPSFTWKPLELCNIKGSNVRGGVGPFGLITLATPDLEEYTPVFFRVFNDTKTHKPK
VLMCSDARPSSLKQDTGLLAKDRMYKPSFAGFVDVDMADGRISLRSLIDHSVVESFGALGKTVITSRVYPVKAVKENAHL
YVFNNGTQTVTIESLNAWNMDRPLQMNDGAL*
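Whether the microexon-encoded residues fall inside the reported Pfam motif can be checked from the protein sequence and the motif coordinates. This is a minimal sketch, assuming Motif start and Motif end are 1-based inclusive protein positions; only the first 83 residues of the protein are copied here, and `locate` is a hypothetical helper name.

```python
# First 83 residues of AT2G36190.1, copied from the protein sequence above.
PROTEIN_NTERM = (
    "MAISNVISVLLLLLVLINLSNQNIKGIDAFHQIYEELQSESVESVNHLHRPSFHFQPPKHWINDPNGPVYYKGLYHLFYQ"
    "YNT"
)
TAG_PEPTIDE = "LHRPSFHFQPPKHWINDPNGPVYYKGLYHLFYQYNT"
MICROEXON_PEPTIDE = "DPNG"
MOTIF_START, MOTIF_END = 54, 372  # assumed 1-based, inclusive


def locate(protein, peptide):
    """Return the 1-based inclusive (start, end) of peptide within protein."""
    idx = protein.find(peptide)
    if idx < 0:
        raise ValueError("peptide not found in protein")
    return idx + 1, idx + len(peptide)


# Locate the whole tag peptide first, since a 4-residue match alone
# could be ambiguous in a longer protein.
tag_start, tag_end = locate(PROTEIN_NTERM, TAG_PEPTIDE)
offset = TAG_PEPTIDE.find(MICROEXON_PEPTIDE)
me_start = tag_start + offset
me_end = me_start + len(MICROEXON_PEPTIDE) - 1
inside_motif = MOTIF_START <= me_start and me_end <= MOTIF_END

print(tag_start, tag_end)  # 48 83
print(me_start, me_end)    # 64 67
print(inside_motif)        # True
```

Under these assumptions the tag peptide begins a few residues upstream of the motif start, while the microexon residues themselves lie within the Glyco_hydro_32N motif.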
CDS seq >AT2G36190.1
ATGGCTATTTCAAATGTTATTTCTGTCTTATTATTATTGCTTGTACTAATCAATTTAAGCAATCAAAATATCAAAGGAAT
TGATGCATTTCATCAGATTTACGAAGAATTGCAATCTGAATCAGTCGAGTCAGTGAATCATCTTCACAGACCTAGTTTTC
ACTTTCAACCTCCAAAGCATTGGATTAACGATCCAAATGGTCCAGTATACTACAAAGGTCTCTACCATCTCTTCTACCAA
TATAACACCAAAGGTGCGGTTTGGGGTAATATTATATGGGCCCATTCGGTTTCTAAAGACTTGGTTAACTGGGAGGCTCT
TGAACCGGCTCTTAGTCCTTCAAAATGGTTCGACATCGGAGGTACATGGTCCGGTTCAATAACAATCGTACCGGGAAAAG
GACCGATTATCCTCTATACCGGTGTTAACCAGAACGAAACTCAGTTGCAAAACTATGCAATCCCAGAGGACCCATCAGAC
CCATACCTAAGGAAATGGATTAAACCGGACGATAACCCGATTGCAATCCCAGACTATACAATGAACGGTTCAGCATTCCG
TGACCCGACAACCGCTTGGTTCTCCAAAGACGGGCATTGGAGAACCGTGGTAGGGTCAAAAAGAAAGCGTAGAGGAATTG
CTTACATCTACAGAAGCCGAGATTTCAAGCATTGGGTCAAAGCTAAGCACCCGGTTCACTCTAAACAGTCAACCGGTATG
TGGGAATGTCCTGATTTCTTCCCGGTTTCCTTAACCGATTTCCGAAACGGTTTGGACTTGGATTACGTCGGTCCAAACAC
CAAGCATGTGTTGAAGGTTAGCTTGGACATTACCCGGTACGAGTATTACACGCTTGGTAAATACGATCTTAAGAAGGACC
GGTACATACCGGACGGTAATACTCCCGATGGTTGGGAGGGTTTAAGATTCGATTACGGTAATTTCTACGCTTCCAAGACA
TTCTTTGACTACAAAAAGAACAGAAGAATCTTATGGGGTTGGGCTAATGAATCTGACACCGTTGAAGATGATATTTTGAA
GGGTTGGGCTGGTCTTCAGGTGATTCCAAGAACGGTGCTCCTTGATTCAAGCAAGAAGCAACTTGTGTTTTGGCCTGTTG
AAGAGATAGAGTCATTAAGAGGTAACTACGTCCGAATGAACAATCATGACATCAAGATGGGTCAACGCATAGAAGTCAAA
GGAATCACTCCTGCTCAAGCCGATGTGGAAGTAACGTTCTATGTTGGAAGTCTAGAGAAAGCTGAGATATTCGATCCGAG
TTTCACGTGGAAACCATTGGAATTGTGTAACATTAAAGGCTCGAATGTGAGAGGCGGTGTAGGACCTTTTGGTCTGATCA
CATTAGCGACTCCTGATTTGGAAGAGTACACTCCTGTCTTCTTTAGAGTCTTCAACGACACTAAAACTCATAAGCCTAAA
GTTCTCATGTGCTCCGATGCTAGACCGTCGTCGTTAAAGCAAGACACGGGTTTACTCGCAAAAGATAGGATGTATAAACC
ATCGTTTGCTGGTTTTGTGGATGTTGATATGGCTGATGGAAGGATCTCTCTAAGGAGTTTGATTGATCATTCAGTAGTAG
AGAGCTTTGGAGCATTAGGTAAAACGGTGATAACATCAAGAGTGTATCCAGTGAAAGCAGTAAAAGAGAATGCACATTTG
TATGTCTTTAACAATGGAACACAAACTGTGACCATAGAGAGTCTCAATGCTTGGAATATGGATCGTCCTCTGCAAATGAA
TGATGGAGCTCTTTAA
Transcript ID AT2G36190.1
Gene ID At.10500
Gene Name CWINV4