Microexon ID At_3:19508934-19508942:-
Species Arabidopsis thaliana
Coordinates 3:19508934..19508942
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (degenerate consensus, IUPAC codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Microexon DNA seq ATCCGAATG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq CTTCACAGAACCGCTTATCACTTTCAACCTCCCCGCCATTGGATCAACGATCCGAATGCTCCAATGCTCTACAAGGGAGTTTACCATCTCTTCTACCAATACAATCCC
Microexon-tag Amino Acid Seq LHRTAYHFQPPRHWINDPNAPMLYKGVYHLFYQYNP
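The relationship between the tag structure (49,9,50), the phase, and the four residues listed for the microexon can be checked with a short sketch. It assumes that the tag is read in frame from its first base and that the phase equals the upstream flank length modulo 3; the helper names (`split_tag`, `translate`) are illustrative and not part of the database.

```python
# Values copied from the record; helper names are hypothetical.
TAG_DNA = ("CTTCACAGAACCGCTTATCACTTTCAACCTCCCCGCCATTGGATCAACG"    # 49 nt flank
           "ATCCGAATG"                                            # 9 nt microexon
           "CTCCAATGCTCTACAAGGGAGTTTACCATCTCTTCTACCAATACAATCCC")  # 50 nt flank
TAG_STRUCTURE = (49, 9, 50)
MICROEXON_DNA = "ATCCGAATG"
TAG_PROTEIN = "LHRTAYHFQPPRHWINDPNAPMLYKGVYHLFYQYNP"

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons starting at the first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, structure):
    """Cut the tag into (upstream flank, microexon, downstream flank)."""
    up, mid, down = structure
    assert len(tag) == up + mid + down
    return tag[:up], tag[up:up + mid], tag[up + mid:]


upstream, microexon, downstream = split_tag(TAG_DNA, TAG_STRUCTURE)
phase = len(upstream) % 3  # bases of a split codon left in the upstream exon

# Codons touched by the microexon, as 0-based codon indices within the tag.
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3
microexon_residues = translate(TAG_DNA)[first_codon:last_codon + 1]

print(microexon, phase, microexon_residues)
```

Because the microexon starts one base into a codon, its 9 nt touch four codons rather than three, which is why the record lists four residues for a 9 nt exon.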
Microexon-tag spanning region 19508789-19509138
Microexon-tag prediction score 0.9499
Overlapped with the annotated transcript (%) 100
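The coordinate fields can be cross-checked against each other. The sketch below assumes that the Microexon ID follows the pattern `prefix_chromosome:start-end:strand` (inferred from this record only) and that all coordinates are 1-based and inclusive; the function name `parse_microexon_id` is hypothetical.

```python
import re


def parse_microexon_id(microexon_id):
    """Split an ID such as 'At_3:19508934-19508942:-' into its parts.

    The pattern is an assumption inferred from this single record.
    """
    match = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = match.groups()
    return {
        "prefix": prefix,
        "chrom": chrom,
        "start": int(start),
        "end": int(end),
        "strand": strand,
    }


def span_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


exon = parse_microexon_id("At_3:19508934-19508942:-")
exon_size = span_length(exon["start"], exon["end"])

# Values copied from the record.
TAG_SPAN = (19508789, 19509138)   # Microexon-tag spanning region
TAG_STRUCTURE = (49, 9, 50)       # flanking exon, microexon, flanking exon

genomic_span = span_length(*TAG_SPAN)
exonic_length = sum(TAG_STRUCTURE)
non_exonic = genomic_span - exonic_length  # bases in the span outside the tag

print(exon_size, genomic_span, exonic_length, non_exonic)
```

Under these assumptions the ID yields the listed size of 9, and the spanning region is longer than the 108 nt tag, the difference being genomic sequence between the three exon segments.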
New Transcript ID AT3G52600.1x
Reference Transcript ID AT3G52600.1
Gene ID AT3G52600
Gene Name CWINV2
Transcript ID AT3G52600.1
Protein ID AT3G52600.1
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.2e-104
Motif start 52
Motif end 371
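To see where the microexon sits relative to the Glyco_hydro_32N motif, the microexon-tag peptide can be located in the protein and compared with the motif coordinates. This sketch assumes that "Motif start" and "Motif end" are 1-based, inclusive protein positions; it uses only the first 160 residues of the protein sequence from this record, which is enough to contain the tag.

```python
# First two lines (160 residues) of the AT3G52600.1 protein from the record.
PROTEIN_PREFIX = (
    "MSAPKFGYVLLLIVLINISNNGVDAFHKVFKKLQSKSTSLESVSPLHRTAYHFQPPRHWINDPNAPMLYKGVYHLFYQYN"
    "PKGAVWGNIVWAHSVSKDLINWEALEPAIYPSKWFDINGTWSGSATHVPGKGPVILYTGITENQTQIQNYAIPQDLSDPY"
)
TAG_PROTEIN = "LHRTAYHFQPPRHWINDPNAPMLYKGVYHLFYQYNP"
TAG_STRUCTURE = (49, 9, 50)        # flanking exon, microexon, flanking exon (nt)
MOTIF_START, MOTIF_END = 52, 371   # assumed 1-based, inclusive


def contains(outer_start, outer_end, inner_start, inner_end):
    """True if the inner interval lies entirely within the outer interval."""
    return outer_start <= inner_start and inner_end <= outer_end


offset = PROTEIN_PREFIX.find(TAG_PROTEIN)
if offset < 0:
    raise ValueError("tag peptide not found in the protein prefix")

# Convert to 1-based, inclusive protein coordinates.
tag_start = offset + 1
tag_end = offset + len(TAG_PROTEIN)

# Residues whose codons include at least one microexon base.
upstream, microexon, _ = TAG_STRUCTURE
micro_start = tag_start + upstream // 3
micro_end = tag_start + (upstream + microexon - 1) // 3

print(tag_start, tag_end, micro_start, micro_end)
print(contains(MOTIF_START, MOTIF_END, micro_start, micro_end))
```

With these inputs the tag peptide begins a few residues before the motif start, while the residues encoded by the microexon fall inside the motif.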
Protein seq >AT3G52600.1
MSAPKFGYVLLLIVLINISNNGVDAFHKVFKKLQSKSTSLESVSPLHRTAYHFQPPRHWINDPNAPMLYKGVYHLFYQYN
PKGAVWGNIVWAHSVSKDLINWEALEPAIYPSKWFDINGTWSGSATHVPGKGPVILYTGITENQTQIQNYAIPQDLSDPY
LKTWIKPDDNPIVKPDNGENGSAFRDPTTAWFNKKDGYWRMLVGSKRKNRGIAYMYKSRDFKKWVKSKRPIHSRKKTGMW
ECPDFFPVSVTDKKNGLDFSYDGPNAKHVLKVSLDLTRYEYYTLGTYDTKKDRYRPDGYTPDGWDGLRFDYGNYYASKTF
FDDKTNRRILWGWANESDTVQDDTVKGWAGIQLIPRTILLDSSGKQLVFWPIEEIESLRGKNVQMTNQKMEMGQRFEVQG
ITPAQVDVDVTFNVGNLEKAEKFDESFATKPLELCNLKGSNVNGGVGPFGLITLATSDLEEYTPVFFRVFKDAASNKPKV
LMCSDAKPSSLKKDTGTDAKERMYKPSFAGFVDVGLLDGKISLRSLIDHSVVESFGAKGKTVITSRVYPTKAVGEKAHLF
VFNNGSQPVTVESLNAWNMQKPLKMNQGAK*
CDS seq >AT3G52600.1
ATGAGTGCTCCAAAGTTTGGTTATGTATTACTATTGATTGTATTAATCAATATTAGCAATAATGGCGTCGATGCATTTCA
CAAAGTTTTCAAGAAATTGCAATCAAAATCGACATCATTGGAGTCAGTAAGTCCTCTTCACAGAACCGCTTATCACTTTC
AACCTCCCCGCCATTGGATCAACGATCCGAATGCTCCAATGCTCTACAAGGGAGTTTACCATCTCTTCTACCAATACAAT
CCCAAAGGTGCGGTTTGGGGTAACATTGTGTGGGCTCACTCGGTTTCTAAGGACTTGATCAATTGGGAAGCCCTTGAACC
AGCCATTTACCCATCCAAATGGTTCGACATCAACGGTACATGGTCCGGTTCAGCTACCCACGTACCGGGAAAAGGACCGG
TTATCCTCTACACCGGTATCACCGAGAACCAGACTCAGATTCAAAACTACGCCATTCCACAAGATCTTTCCGACCCATAC
CTCAAGACATGGATAAAGCCAGACGATAACCCCATCGTAAAACCCGATAATGGCGAGAACGGATCCGCTTTCCGTGACCC
GACCACGGCTTGGTTCAACAAAAAAGATGGGTATTGGAGAATGCTTGTTGGCTCAAAGAGAAAGAACAGAGGAATTGCTT
ATATGTACAAGAGCCGTGACTTCAAAAAATGGGTCAAAAGCAAACGTCCTATCCACTCAAGAAAGAAAACCGGTATGTGG
GAATGTCCCGATTTCTTCCCGGTATCCGTAACCGACAAGAAAAACGGTTTGGACTTCAGCTACGACGGTCCAAACGCCAA
GCATGTGTTGAAGGTTAGTTTGGACTTGACCAGATACGAGTACTACACTCTTGGAACGTATGACACCAAGAAGGATCGTT
ACAGGCCAGACGGTTACACTCCTGACGGTTGGGATGGTTTGAGATTTGATTATGGTAACTACTATGCGTCAAAGACATTC
TTTGATGACAAGACGAACAGAAGAATTCTTTGGGGGTGGGCCAATGAATCCGACACCGTTCAAGATGATACCGTGAAGGG
TTGGGCCGGAATTCAGCTTATCCCAAGAACAATCTTGCTTGACTCTAGCGGTAAGCAACTCGTGTTTTGGCCTATTGAAG
AGATTGAGTCATTGAGAGGAAAGAATGTCCAAATGACCAACCAGAAAATGGAGATGGGCCAACGCTTTGAGGTCCAAGGA
ATCACTCCTGCTCAGGTGGATGTGGATGTGACATTCAACGTTGGAAATCTAGAGAAGGCCGAAAAGTTTGACGAAAGCTT
CGCAACTAAACCACTAGAGTTATGTAACTTGAAAGGTTCGAATGTGAATGGTGGAGTTGGTCCTTTTGGTTTGATCACGT
TGGCTACATCTGACTTAGAAGAATACACTCCTGTTTTCTTTAGAGTCTTCAAAGATGCCGCATCCAACAAGCCTAAGGTT
CTCATGTGCTCTGACGCCAAGCCTTCGAGTCTCAAAAAAGACACCGGGACTGACGCCAAGGAGAGGATGTACAAACCATC
GTTTGCTGGTTTTGTGGATGTCGGTCTCCTTGACGGAAAGATCTCTCTAAGGAGTTTGATTGATCACTCGGTTGTGGAGA
GCTTTGGAGCAAAAGGAAAGACAGTGATAACATCTAGAGTGTATCCGACAAAAGCAGTAGGAGAAAAAGCTCATTTGTTT
GTCTTTAACAACGGCTCACAACCTGTGACCGTAGAGAGCCTCAATGCATGGAACATGCAGAAGCCTCTGAAGATGAACCA
AGGTGCAAAGTGA
Transcript ID AT3G52600.1
Gene ID At.16230
Gene Name CWINV2