Microexon ID At_5:3839851-3839859:+
Species Arabidopsis thaliana
Coordinates 5:3839851..3839859
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAACG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq CTTAACCGGACGTCCTTCCATTTCCAACCTCAGAGAAATTGGCTCAACGATCCAAACGCGCCAATGTATTACAAAGGATTCTACCATTTGTTCTACCAGAATAACCCT
Microexon-tag Amino Acid Seq LNRTSFHFQPQRNWLNDPNAPMYYKGFYHLFYQNNP
Microexon-tag spanning region 3839532-3840004
Microexon-tag prediction score 0.938
Overlapped with the annotated transcript (%) 100
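The structure field (49,9,50), the phase, and the two amino acid fields above can be cross-checked against the microexon-tag DNA sequence. The following is a minimal sketch, not part of the original record. It assumes that the tag starts in reading frame 0, that the standard genetic code applies, and that "phase" means the number of upstream-exon bases carried into the codon split by the exon boundary. The helper names (`translate`, `split_tag`) are illustrative only.

```python
# Values copied from the record above.
TAG_DNA = (
    "CTTAACCGGACGTCCTTCCATTTCCAACCTCAGAGAAATTGGCTCAACGATCCAAACGCG"
    "CCAATGTATTACAAAGGATTCTACCATTTGTTCTACCAGAATAACCCT"
)
EXON_SIZES = (49, 9, 50)  # upstream flanking exon, microexon, downstream flanking exon

# Standard genetic code, built from the usual TCAG-ordered amino acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    if pos != len(tag):
        raise ValueError("exon sizes do not add up to the tag length")
    return parts


upstream, microexon, downstream = split_tag(TAG_DNA, EXON_SIZES)

# Assumed phase convention: bases of the upstream exon left over after whole codons.
phase = len(upstream) % 3

# Codons that contain at least one microexon base (0-based codon indices).
first_codon = len(upstream) // 3
last_codon = (len(upstream) + len(microexon) - 1) // 3

tag_protein = translate(TAG_DNA)
microexon_residues = tag_protein[first_codon:last_codon + 1]

print(microexon, phase, microexon_residues)
print(tag_protein)
```

Because the microexon starts one base into a codon, its 9 nt touch four codons, which is why the record lists four residues for a 9 nt exon.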
New Transcript ID AT5G11920.1x
Reference Transcript ID AT5G11920.1
Gene ID AT5G11920
Gene Name CWINV6
Transcript ID AT5G11920.1
Protein ID AT5G11920.1
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.4e-98
Motif start 21
Motif end 341
Protein seq >AT5G11920.1
MADVMEQNLLQTAVLNRTSFHFQPQRNWLNDPNAPMYYKGFYHLFYQNNPLAPEFSRTRIIWGHSVSQDMVNWIQLEPAL
VPSESFDINSCWSGSATILPDGRPVILYTGLDVNNKQQVTVVAEPKDVSDPLLREWVKPKYNPVMVPPSNVPFNCFRDPT
EAWKGQDGKWRVLIGAKEKDTEKGMAILYRSDDFVQWTKYPVPLLESEGTGMWECPDFFPVSITGKEGVDTSVNNASVRH
VLKASFGGNDCYVIGKYSSETEDFSADYEFTNTSADLRYDHGTFYASKAFFDSVKNRRINWGWVIETDSKEDDFKKGWAG
LMTLPREIWMDTSGKKLMQWPIEEINNLRTKSVSLDDCYEFKTGSTFEISGITAAQADVEVTFNLPFLENNPEILDADQV
DDATLFDRDSSVGCVYGPFGLLALASSDLSEQTAIFFKVIRRGNGYAVVMCSSEKRSSLRDNIKKSSHGAFLDIDPRHEK
ISLRCLIDHSIIESYGVGGKTVITSRVYPKLAIGEAAKLYVFNDGENGVIMTSLEAWSMRNAQINSNPTY*
CDS seq >AT5G11920.1
ATGGCTGATGTAATGGAACAGAATCTTCTTCAAACCGCTGTGCTTAACCGGACGTCCTTCCATTTCCAACCTCAGAGAAA
TTGGCTCAACGATCCAAACGCGCCAATGTATTACAAAGGATTCTACCATTTGTTCTACCAGAATAACCCTTTGGCTCCCG
AATTCAGTAGGACAAGAATCATATGGGGACACTCTGTTTCACAAGACATGGTCAATTGGATCCAGCTCGAACCAGCCCTT
GTCCCTTCTGAGTCCTTTGACATCAACAGCTGCTGGTCAGGATCCGCCACGATCCTCCCTGATGGCAGACCTGTAATTTT
GTACACTGGACTCGACGTCAACAACAAACAGCAAGTCACAGTTGTTGCCGAACCAAAGGACGTTTCTGACCCTTTGCTTC
GTGAGTGGGTTAAGCCAAAGTACAATCCTGTGATGGTTCCGCCAAGTAATGTCCCTTTTAATTGTTTCCGTGATCCCACG
GAGGCGTGGAAAGGGCAAGATGGGAAATGGAGAGTGCTCATAGGAGCTAAGGAGAAAGATACTGAGAAAGGAATGGCGAT
TTTGTACCGAAGCGATGATTTTGTCCAGTGGACAAAGTATCCGGTGCCTTTACTTGAGTCAGAAGGAACCGGAATGTGGG
AATGCCCTGATTTTTTCCCGGTTTCTATCACTGGTAAAGAAGGTGTTGACACTTCGGTAAACAATGCTAGTGTGAGGCAT
GTCTTGAAGGCGAGTTTTGGAGGCAATGATTGCTATGTCATTGGTAAATACTCTTCTGAGACTGAAGACTTTTCAGCGGA
TTATGAGTTCACTAACACTAGTGCAGATTTGAGATATGATCATGGAACGTTTTATGCCTCAAAGGCGTTCTTCGATAGTG
TTAAAAATAGGAGGATCAACTGGGGATGGGTCATAGAGACTGATAGCAAAGAAGATGATTTTAAGAAAGGATGGGCTGGC
CTTATGACTCTTCCCAGGGAAATTTGGATGGACACAAGTGGAAAGAAGCTGATGCAATGGCCAATTGAAGAAATCAACAA
TCTCCGGACCAAAAGTGTTAGCCTTGATGATTGCTATGAATTCAAAACCGGCTCTACCTTTGAAATCTCAGGCATCACTG
CTGCCCAAGCAGATGTAGAAGTGACTTTTAATCTGCCTTTCCTGGAAAATAATCCCGAGATACTTGATGCTGACCAAGTT
GATGATGCGACTCTGTTTGATCGTGATAGCTCGGTTGGATGTGTTTACGGGCCTTTTGGATTGCTAGCATTGGCTTCCAG
TGATTTATCAGAACAAACCGCAATCTTCTTTAAAGTTATTCGTCGCGGTAACGGATATGCAGTTGTAATGTGCAGCAGCG
AGAAGAGGTCTTCGTTGAGAGACAACATAAAAAAATCTTCGCATGGAGCATTCCTAGATATTGATCCAAGGCATGAGAAG
ATCTCATTAAGATGTTTGATCGATCACTCGATTATAGAGAGCTACGGAGTAGGAGGAAAAACTGTGATAACATCTAGAGT
TTATCCAAAATTGGCAATTGGTGAAGCCGCTAAGCTTTATGTCTTCAATGATGGAGAAAATGGTGTGATCATGACGTCCC
TGGAAGCTTGGAGCATGAGAAATGCCCAAATCAATTCAAACCCAACTTATTAG
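To relate the microexon to the Pfam motif span (21 to 341), the microexon-tag peptide can be located in the protein sequence. This is a rough sketch under two assumptions: the motif start and end are 1-based, inclusive protein positions, and only the first 80 residues of the protein (the first line of the protein sequence above) are needed to find the tag. All variable names are illustrative.

```python
# First 80 residues of the AT5G11920.1 protein, copied from the record above.
PROTEIN_N_TERM = (
    "MADVMEQNLLQTAVLNRTSFHFQPQRNWLNDPNAPMYYKGFYHLFYQNNPLAPEFSRTRIIWGHSVSQDMVNWIQLEPAL"
)
TAG_PEPTIDE = "LNRTSFHFQPQRNWLNDPNAPMYYKGFYHLFYQNNP"
MICROEXON_PEPTIDE = "DPNA"
MOTIF_START, MOTIF_END = 21, 341  # assumed 1-based, inclusive

# Locate the tag peptide in the protein (str.find returns -1 when absent).
tag_index = PROTEIN_N_TERM.find(TAG_PEPTIDE)
if tag_index < 0:
    raise ValueError("tag peptide not found in the protein fragment")

tag_start = tag_index + 1                     # convert to 1-based
tag_end = tag_start + len(TAG_PEPTIDE) - 1

# Offset of the microexon-encoded residues inside the tag peptide.
offset = TAG_PEPTIDE.find(MICROEXON_PEPTIDE)
micro_start = tag_start + offset
micro_end = micro_start + len(MICROEXON_PEPTIDE) - 1

# True when every microexon-encoded residue lies within the motif span.
micro_in_motif = MOTIF_START <= micro_start and micro_end <= MOTIF_END

print(f"tag residues {tag_start}-{tag_end}")
print(f"microexon residues {micro_start}-{micro_end}, inside motif: {micro_in_motif}")
```

Under these assumptions the tag begins a few residues upstream of the motif start, while the residues encoded by the microexon fall inside the motif span.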
Transcript ID AT5G11920.1
Gene ID At.22805
Gene Name CWINV6
Pfam domain motif Glyco_hydro_32N
Motif E-value 4.4e-98
Motif start 21
Motif end 341