Microexon ID At_1:456757-456761:+
Species Arabidopsis thaliana
Coordinates 1:456757..456761
Microexon Cluster ID MEP08
Size 5
Phase 1
Pfam Domain Motif Peptidase_C1
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 52,5,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TTYGAYGCWMGAACWGMTTGGYCTCADTGYASCACHATTGGRARMATWCTWGATCARGGWCAYTGTGGTTCTTGYTGGGCWTTTGGTGCTGTKGARKCACTRYCWGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCAG
Microexon Amino Acid seq DQ
Microexon-tag DNA Seq TTTGATGCTAGAACCGCTTGGTCACAGTGCACCAGTATTGGAAGGATCTTAGATCAGGGTCACTGTGGTTCTTGCTGGGCCTTTGGTGCTGTTGAATCACTGTCTGAC
Microexon-tag Amino Acid Seq FDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSD
Microexon-tag spanning region456595-456893
Microexon-tag prediction score0.9797
Overlapped with the annotated transcript (%) 100
New Transcript ID AT1G02305.1x
Reference Transcript ID AT1G02305.1
Gene ID AT1G02305
Gene Name CATHB2
Transcript ID AT1G02305.1
Protein ID AT1G02305.1
Gene ID AT1G02305
Gene Name CATHB2
Pfam domain motif Peptidase_C1
Motif E-value 4.5e-68
Motif start 106
Motif end 340
Protein seq >AT1G02305.1
MADNCIRLLHSASVFFCLGLLISSFNLLQGIAAENLSKQKLTSWILQNEIVKEVNENPNAGWKASFNDRFANATVAEFKR
LLGVKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVN
DLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSA
YKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDG
YFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSDDLLVSSF*
CDS seq >AT1G02305.1
ATGGCTGATAATTGTATCAGACTTCTTCACTCAGCCTCTGTTTTCTTCTGTTTAGGGCTTCTAATTTCATCCTTCAACTT
GTTGCAGGGTATTGCAGCTGAAAATCTTTCCAAGCAGAAACTGACCTCATGGATTCTTCAGAATGAGATTGTAAAGGAAG
TCAATGAGAATCCAAACGCTGGATGGAAAGCTTCTTTCAATGATCGGTTTGCAAACGCCACTGTTGCAGAGTTTAAGCGC
CTTCTTGGTGTTAAACCAACACCAAAGACGGAATTTTTGGGTGTGCCTATTGTAAGCCATGATATATCTTTGAAGCTTCC
AAAAGAATTTGATGCTAGAACCGCTTGGTCACAGTGCACCAGTATTGGAAGGATCTTAGATCAGGGTCACTGTGGTTCTT
GCTGGGCCTTTGGTGCTGTTGAATCACTGTCTGACAGATTCTGCATCAAATATAACATGAATGTTTCTTTATCTGTCAAT
GATCTTTTAGCATGTTGTGGATTCCTTTGCGGTCAAGGTTGTAATGGTGGATACCCAATTGCTGCGTGGCGGTACTTTAA
GCACCACGGTGTAGTCACTGAAGAGTGTGATCCATACTTCGACAATACTGGTTGCTCGCACCCGGGATGTGAACCCGCTT
ACCCTACACCAAAATGTGCAAGGAAATGTGTTAGCGGAAACCAGCTTTGGCGTGAATCAAAACATTATGGTGTTAGTGCG
TACAAGGTCAGATCTCACCCTGATGACATTATGGCAGAAGTTTACAAAAATGGACCTGTTGAGGTTGCCTTCACTGTTTA
CGAGGACTTTGCGCATTACAAATCTGGAGTGTACAAGCACATAACAGGTACTAACATTGGAGGTCATGCTGTTAAACTTA
TTGGCTGGGGAACTTCTGATGACGGTGAAGATTATTGGTTGCTTGCAAATCAATGGAACCGAAGCTGGGGTGATGATGGG
TACTTCAAGATCAGGAGAGGAACAAACGAATGTGGCATTGAACATGGTGTTGTAGCTGGTTTACCTTCAGACAGGAACGT
AGTTAAAGGTATTACTACTTCAGATGATCTTCTTGTTTCCTCATTTTAA
Microexon DNA seq ATCAG
Microexon Amino Acid seq DQ
Microexon-tag DNA Seq TTTGATGCTAGAACCGCTTGGTCACAGTGCACCAGTATTGGAAGGATCTTAGATCAGGGTCACTGTGGTTCTTGCTGGGCCTTTGGTGCTGTTGAATCACTGTCTGAC
Microexon-tag Amino Acid seq FDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSD
Transcript ID At.139.2
Gene ID At.139
Gene Name CATHB2
Pfam domain motif Peptidase_C1
Motif E-value 4.5e-68
Motif start 103
Motif end 337
Protein seq >At.139.2
MADSCCIRLHLLASVFLLLFSSFNLQGIAAENLSKQKLTSLILQNEIVKEVNENPNAGWKASFNDRFANATVAEFKRLLG
VKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLL
ACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKV
RSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFK
IRRGTNECGIEHGVVAGLPSDRNVVKGITTSDDLLVSSF*
CDS seq >At.139.2
ATGGCTGATAGTTGTTGTATCAGACTTCACTTATTAGCCTCTGTTTTCTTGCTCTTATTTTCATCCTTCAACTTGCAGGG
TATTGCAGCGGAAAATCTTTCCAAGCAGAAACTGACCTCACTGATTCTTCAGAATGAGATTGTAAAGGAAGTCAATGAGA
ATCCAAACGCTGGATGGAAAGCTTCTTTCAATGATCGGTTTGCAAACGCCACTGTTGCAGAGTTTAAGCGCCTTCTTGGT
GTTAAACCAACACCAAAGACGGAATTTTTGGGTGTGCCTATTGTAAGCCATGATATATCTTTGAAGCTTCCAAAAGAATT
TGATGCTAGAACCGCTTGGTCACAGTGCACCAGTATTGGAAGGATCTTAGATCAGGGTCACTGTGGTTCTTGCTGGGCCT
TTGGTGCTGTTGAATCACTGTCTGACAGATTCTGCATCAAATATAACATGAATGTTTCTTTATCTGTCAATGATCTTTTA
GCATGTTGTGGATTCCTTTGCGGTCAAGGTTGTAATGGTGGATACCCAATTGCTGCGTGGCGGTACTTTAAGCACCACGG
TGTAGTCACTGAAGAGTGTGATCCATACTTCGACAATACTGGTTGCTCGCACCCGGGATGTGAACCCGCTTACCCTACAC
CAAAATGTGCAAGGAAATGTGTTAGCGGAAACCAGCTTTGGCGTGAATCAAAACATTATGGTGTTAGTGCGTACAAGGTC
AGATCTCACCCTGATGACATTATGGCAGAAGTTTACAAAAATGGACCTGTTGAGGTTGCCTTCACTGTTTACGAGGACTT
TGCGCATTACAAATCTGGAGTGTACAAGCACATAACAGGTACTAACATTGGAGGTCATGCTGTTAAACTTATTGGCTGGG
GAACTTCTGATGACGGTGAAGATTATTGGTTGCTTGCAAATCAATGGAACCGAAGCTGGGGTGATGATGGGTACTTCAAG
ATCAGGAGAGGAACAAACGAATGTGGCATTGAACATGGTGTTGTAGCTGGTTTACCTTCAGACAGGAACGTAGTTAAAGG
TATTACTACTTCAGATGATCTTCTTGTTTCCTCATTTTAA