Microexon ID Ha_13:114502896-114502904:-
Species Helianthus annuus
Coordinates 13:114502896..114502904
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ACCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACAGGACAGCATATCACTTCCAGCCTTCCAAAAACTGGATGAATGACCCTAATGGGCCAATGTATTACAAGGGGGTCTACCATTTTTTCTATCAATTCAATCCA
Microexon-tag Amino Acid Seq PYRTAYHFQPSKNWMNDPNGPMYYKGVYHFFYQFNP
Microexon-tag spanning region114502225-114503097
Microexon-tag prediction score0.9525
Overlapped with the annotated transcript (%) 91.67
New Transcript ID OTG01909x
Reference Transcript ID OTG01909
Gene ID HannXRQ_Chr13g0407171
Gene Name NA
Ha_13:114502896-114502904:- does not have available information here.
Microexon DNA seq ACCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTACAGGACAGCATATCACTTCCAGCCTTCCAAAAACTGGATGAATGACCCTAATGGGCCAATGTATTACAAGGGGGTCTACCATTTTTTCTATCAATTCAATCCA
Microexon-tag Amino Acid seq PYRTAYHFQPSKNWMNDPNGPMYYKGVYHFFYQFNP
Transcript ID Ha.16357.1
Gene ID Ha.16357
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 6.2e-97
Motif start 57
Motif end 375
Protein seq >Ha.16357.1
MMLFKTKLSSLNILMKIYGFCVFSLCCFLIFNGVQVEAFKNHTLLLNVSQPYRTAYHFQPSKNWMNDPNGPMYYKGVYHF
FYQFNPYGPLWGNISWAHAISHDLINWVHLDIALVPNEPYDINGCFSGSATILPNGEPAILYTGVDGNLHQLQNLAFPKN
ISDPLLNEWVKWSLNPILSPPDESNPFLYRDPSTGWMGPDGEWRVVIGTQIDHQGAAILYKSKDFQSWTMSPSPLQLSNI
TTFWECPDFYPVICNGKSGLDTSVISVQGKDIKHVLKASFNNQDYYIMGKYDPESDHYDVDADFMKSKGWLRYDYGRFYA
SKSFYDGDKKRRILWSWVCEGDTAPDAIEKGWSGLQTIPRSIQLSKNEDQLVQWPVKELEKLRTRKVHFKNKKLRGRSMI
EISGITASQADVEITFRLSKHKHAEVLVPEVIDPQLLCTQKNASMGGKFGPFGLLVLASKDLTEHTAVFFRVFRTANSFR
VIMCVDPSRSSSKPLIDKPIYGAFLALDPRHAKISLRTLIDHSIVESFGGEGLACITSRIYPVLAVGEQAHLFAFNNGTQ
SLSISSLSAWSMKKAQIV*
CDS seq >Ha.16357.1
ATGATGCTTTTCAAAACTAAACTTTCATCTCTGAATATTTTGATGAAGATATATGGTTTTTGTGTATTTTCTTTGTGTTG
TTTCCTGATTTTTAATGGAGTCCAGGTTGAAGCTTTCAAGAACCATACATTGCTACTGAATGTTTCACAGCCTTACAGGA
CAGCATATCACTTCCAGCCTTCCAAAAACTGGATGAATGACCCTAATGGGCCAATGTATTACAAGGGGGTCTACCATTTT
TTCTATCAATTCAATCCATATGGTCCACTATGGGGTAACATATCATGGGCTCATGCCATATCGCATGATCTTATCAACTG
GGTACATCTTGATATTGCTCTGGTACCAAACGAGCCTTACGATATCAATGGGTGCTTTTCTGGTTCCGCAACGATTCTAC
CAAATGGTGAACCGGCGATCCTATACACAGGTGTTGATGGCAATTTACACCAACTACAAAATTTGGCATTTCCGAAAAAC
ATATCGGACCCGCTACTAAATGAATGGGTAAAATGGTCACTTAACCCAATATTGAGTCCTCCAGATGAATCTAACCCATT
CTTATATAGAGATCCATCAACGGGTTGGATGGGTCCGGATGGCGAATGGAGGGTTGTTATTGGAACTCAGATTGATCATC
AAGGAGCAGCTATTCTTTACAAAAGTAAGGATTTTCAAAGTTGGACTATGTCTCCGAGCCCGTTACAGTTATCTAACATA
ACAACATTTTGGGAATGTCCTGACTTTTATCCTGTTATTTGTAACGGGAAAAGCGGGCTTGATACATCAGTTATATCTGT
TCAAGGGAAGGACATTAAGCATGTTCTTAAAGCTAGTTTTAATAACCAGGATTATTATATAATGGGAAAATATGATCCAG
AAAGTGACCACTACGACGTTGATGCTGACTTTATGAAGAGTAAAGGGTGGTTACGGTATGATTATGGGAGGTTTTATGCT
TCAAAGTCGTTTTATGATGGTGATAAGAAGAGAAGGATATTATGGTCATGGGTTTGTGAAGGTGATACTGCACCAGATGC
TATCGAAAAAGGTTGGTCTGGCCTTCAGACGATTCCTAGGAGCATCCAGCTTAGTAAAAATGAAGACCAGTTGGTGCAGT
GGCCTGTCAAGGAACTAGAGAAACTACGAACACGGAAAGTGCATTTCAAGAATAAAAAACTCAGGGGCCGGTCCATGATT
GAAATTTCTGGCATCACAGCTTCGCAGGCTGATGTAGAAATCACGTTTCGCTTGTCAAAACACAAACATGCTGAGGTGTT
GGTTCCTGAAGTAATTGATCCCCAACTTCTTTGTACACAAAAGAATGCTTCAATGGGGGGAAAATTTGGGCCTTTCGGAT
TGCTGGTCTTGGCTTCGAAGGACCTTACTGAACACACTGCGGTCTTCTTTCGTGTCTTTAGAACCGCTAACAGTTTTAGA
GTGATCATGTGTGTTGACCCAAGCAGGTCGTCTTCAAAACCACTGATTGACAAGCCCATCTATGGAGCTTTTCTTGCACT
TGATCCTCGACACGCAAAGATTTCCTTGAGAACCTTGATAGATCACTCCATTGTAGAAAGTTTTGGTGGAGAAGGATTGG
CTTGCATTACATCAAGAATTTATCCAGTATTGGCAGTTGGAGAACAAGCACATCTTTTTGCATTCAACAATGGCACTCAG
AGTTTGAGTATTTCAAGTTTAAGTGCTTGGAGTATGAAAAAAGCTCAAATTGTCTAA