Microexon ID Vv_4:943987-943995:-
Species Vistis vinifera
Coordinates 4:943987..943995
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACAGGACTGCCTACCACTTCCAGCCTCCCAAAAACTGGATGAATGATCCTAATGGGCCAATGTACTACAATGGAGTTTACCATCTGTTCTATCAGTACAATCCC
Microexon-tag Amino Acid Seq PYRTAYHFQPPKNWMNDPNGPMYYNGVYHLFYQYNP
Microexon-tag spanning region943220-944176
Microexon-tag prediction score0.9753
Overlapped with the annotated transcript (%) 91.67
New Transcript ID VIT_04s0008g01140.t01x
Reference Transcript ID VIT_04s0008g01140.t01
Gene ID VIT_04s0008g01140
Gene Name NA
Vv_4:943987-943995:- does not have available information here.
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACAGGACTGCCTACCACTTCCAGCCTCCCAAAAACTGGATGAATGATCCTAATGGGCCAATGTACTACAATGGAGTTTACCATCTGTTCTATCAGTACAATCCC
Microexon-tag Amino Acid seq PYRTAYHFQPPKNWMNDPNGPMYYNGVYHLFYQYNP
Transcript ID Vv.22512.1
Gene ID Vv.22512
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 6.1e-102
Motif start 67
Motif end 379
Protein seq >Vv.22512.1
MLQSINCHRDGGKSAVKLRKEMGRFGIWVVGLCLMVGGHGIEGETSHHSYRNLQSDPADQPYRTAYHFQPPKNWMNDPNG
PMYYNGVYHLFYQYNPYAAVWGNITWAHSTSYDLVNWVHLELAIKPTDPFDINGCWSGSATILTGEEPVIIYTGKDSQNR
QVQNLSVPKNISDPLLREWIKSPHNPLMTPIDGIDASNFRDPTTAWQGSDKVWRILVGSLINGHGTALLYRSRDFVNWNK
SQTPLHSSNKTGMWECPDFYPVSISSRNGVETSVQNAETRHVLKASFNGNDYYIMGKYVPETDTYLVETGFLDAGSDLRY
DYGKFYASKTFFDAAKKRRILWAWIQEADKDTEKGWSGLQSFPRSVLLDQNGQRLVQWPVKEIAILHKNQVTFHNKELRG
GSVIEVSGITASQADVEVSFDFPHLEEAELMDPSWTDPQALCSRKNVSVKGGIGPFGLLVLASNNLTEQTAIFFRIFKST
QEKHIVLMCSDQSRSSLRQDVDKTIYGAFVDIDLNHEQISLRSLIDHSIVESFGGKGKTCITARVYPELAINTEAHLYAF
NSGNQTLNISTLSAWSMKNAEMVPTN*
CDS seq >Vv.22512.1
ATGCTTCAAAGTATTAATTGCCATAGGGACGGAGGGAAATCTGCTGTAAAGCTGAGGAAAGAGATGGGGAGATTTGGGAT
TTGGGTTGTGGGGTTGTGCCTTATGGTGGGTGGACATGGGATTGAGGGAGAGACCTCTCATCATAGTTACAGAAATCTTC
AATCTGATCCAGCGGACCAGCCTTACAGGACTGCCTACCACTTCCAGCCTCCCAAAAACTGGATGAATGATCCTAATGGG
CCAATGTACTACAATGGAGTTTACCATCTGTTCTATCAGTACAATCCCTATGCTGCAGTATGGGGTAACATTACATGGGC
ACATTCCACATCTTATGATCTTGTCAATTGGGTTCATCTTGAACTTGCTATTAAACCAACTGACCCTTTTGACATCAATG
GTTGCTGGTCTGGTTCTGCCACAATCCTTACCGGAGAAGAACCTGTCATTATATACACTGGAAAAGATTCTCAAAACCGC
CAAGTCCAAAACTTGTCTGTGCCCAAGAACATATCTGACCCACTCCTTAGAGAATGGATAAAATCACCCCATAATCCTCT
AATGACTCCTATTGATGGCATTGATGCAAGCAATTTCAGAGACCCTACCACTGCTTGGCAAGGCTCAGACAAAGTATGGA
GAATTCTTGTTGGAAGCCTGATAAATGGTCATGGCACAGCACTTCTTTATCGAAGTAGAGACTTTGTGAACTGGAATAAG
AGCCAAACCCCTCTTCATTCATCCAATAAAACTGGAATGTGGGAGTGTCCAGACTTCTACCCTGTCAGTATTAGCAGCAG
AAATGGGGTTGAAACCTCTGTTCAAAATGCAGAGACACGCCATGTGCTTAAGGCAAGCTTCAATGGGAATGACTACTACA
TAATGGGAAAATATGTGCCTGAAACCGATACCTATCTGGTTGAGACTGGTTTCCTGGATGCGGGTTCAGATTTGAGATAT
GATTATGGGAAATTTTATGCATCCAAAACATTTTTTGATGCTGCGAAAAAGAGGCGGATACTGTGGGCCTGGATACAAGA
AGCTGACAAAGATACTGAAAAAGGATGGTCTGGGCTTCAGTCCTTCCCTAGAAGTGTTTTGCTCGATCAAAATGGCCAAC
GATTAGTACAGTGGCCGGTCAAAGAAATAGCGATACTACACAAGAACCAAGTGACCTTCCACAATAAGGAGCTGAGGGGT
GGTTCTGTAATTGAAGTTTCTGGTATCACAGCTTCTCAGGCTGATGTAGAAGTTTCATTTGATTTCCCACATTTGGAAGA
AGCTGAGCTGATGGATCCAAGTTGGACTGATCCCCAAGCCCTCTGTAGTCGAAAGAATGTATCAGTTAAAGGCGGGATCG
GGCCATTTGGCTTGTTGGTTTTGGCTTCAAACAATTTGACAGAACAAACTGCAATCTTCTTTCGCATCTTCAAAAGTACT
CAAGAGAAACACATCGTGCTTATGTGCAGTGATCAGAGCAGGTCCTCTTTGAGACAAGACGTGGACAAAACCATCTATGG
AGCTTTTGTTGATATAGACCTTAATCACGAGCAGATTTCGCTCAGAAGCTTGATTGACCATTCTATTGTTGAGAGTTTTG
GGGGGAAGGGGAAGACTTGCATTACTGCCAGGGTTTATCCAGAGTTGGCTATTAATACAGAAGCACACCTTTATGCATTT
AACAGTGGAAATCAGACTCTGAATATCTCAACACTTAGTGCTTGGAGCATGAAGAATGCTGAGATGGTTCCTACCAATTG
A