Microexon ID Ha_4:168279614-168279628:-
Species Helianthus annuus
Coordinates 4:168279614..168279628
Microexon Cluster ID MEP41
Size 15
Phase 0
Pfam Domain Motif DUF974
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CARTTYTTCAAGTTYATTGTTKCWAAYCCACTTTCWGTTAGRACAAAGGTYCGYRYTRTCAAGGAAACTACMTWTYTRGARGCTTGYATWGARAAYCATACAAAATCA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTCCGCGTTGTGAAG
Microexon Amino Acid seq VRVVK
Microexon-tag DNA Seq CAGTATTTCAAGTTCATGGTTTCAAATCCCCTTTCTGTTAGGACAAAGGTCCGCGTTGTGAAGGACACTACTTATCTGGAGACTTGCCTAGAAAATAATACAAAATCA
Microexon-tag Amino Acid Seq QYFKFMVSNPLSVRTKVRVVKDTTYLETCLENNTKS
Microexon-tag spanning region168279372-168280125
Microexon-tag prediction score0.9675
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG29608x
Reference Transcript ID OTG29608
Gene ID HannXRQ_Chr04g0124141
Gene Name NA
Transcript ID OTG29608
Protein ID OTG29608
Gene ID HannXRQ_Chr04g0124141
Gene Name NA
Pfam domain motif DUF974
Motif E-value 1.5e-60
Motif start 88
Motif end 314
Protein seq >OTG29608
MSSTQSLHSLAFRVMRLCRPTFHVETPLQFDPCDLVVGEDLFDSPSAAPRHLLHSHSASNDSSSADLTYRNRFVLGDDSD
AMGLPGLLVLPQSFGAIYLGETFCSYISINNSSNFDARDIIIKSEIQTERQRIMLLDTSKSPVESIRAGGRYDFIVEHDV
KELGAHTLVCTALYYDGDAERKYLPQYFKFMVSNPLSVRTKVRVVKDTTYLETCLENNTKSNLFMDQVEFEPAPRWSATI
LKADVHHSEKGGSTRETFKPPVLIRAGGGIYNYLYALKMSSAPTKGEGSNILGKLQITWRTNLGEPGRLQTQHITGNPIT
RKEIDLKVVKVPSVIILEKPFLVHLSLTNLTGKKLGPFEAWLSLNDSKEEKAVMISGLQRMDLPAVEAFASLEFQLNLIS
AKLGVQKISGITVFDTMEKKTYDPLPDLEIFVDTDL*
CDS seq >OTG29608
ATGAGCAGCACACAATCGTTGCACTCGCTGGCCTTCAGGGTAATGCGGCTATGCCGTCCGACATTCCACGTCGAAACTCC
TCTCCAGTTCGATCCTTGCGATCTTGTCGTTGGCGAGGATCTGTTCGACAGTCCTTCAGCCGCCCCTCGCCACCTTCTTC
ACTCTCATTCCGCCTCTAATGATTCGTCATCTGCCGATCTCACCTACCGCAACCGCTTTGTTCTCGGTGATGATTCCGAT
GCCATGGGGCTCCCTGGTCTCCTTGTTCTCCCTCAGTCCTTCGGGGCAATATATCTTGGGGAGACATTTTGTAGTTATAT
AAGCATTAACAACAGCTCAAATTTTGATGCCAGGGATATAATAATCAAGTCTGAAATACAAACAGAAAGGCAGAGAATAA
TGCTTTTAGATACATCAAAATCACCTGTTGAAAGCATAAGAGCAGGAGGACGCTACGACTTCATCGTTGAACATGATGTG
AAGGAACTTGGTGCACACACATTGGTCTGTACTGCTCTGTATTATGACGGTGATGCCGAACGCAAGTATCTTCCACAGTA
TTTCAAGTTCATGGTTTCAAATCCCCTTTCTGTTAGGACAAAGGTCCGCGTTGTGAAGGACACTACTTATCTGGAGACTT
GCCTAGAAAATAATACAAAATCAAACCTATTCATGGACCAAGTTGAATTTGAGCCAGCTCCACGGTGGAGTGCGACAATA
CTTAAAGCCGATGTCCACCATTCAGAAAAGGGTGGTTCTACTAGAGAAACATTCAAACCACCTGTTCTTATTAGAGCAGG
TGGAGGAATTTATAATTATCTTTATGCATTGAAAATGTCATCTGCACCTACAAAAGGCGAGGGGAGTAATATTCTTGGTA
AACTTCAGATAACATGGCGTACAAATTTGGGTGAACCCGGGCGCCTGCAAACACAACATATAACTGGCAATCCCATCACG
CGAAAAGAGATTGATTTGAAGGTAGTAAAAGTGCCATCTGTTATCATCTTAGAAAAACCCTTTCTGGTGCATTTGAGTCT
CACAAACTTAACTGGAAAGAAGCTGGGGCCCTTTGAAGCTTGGTTATCCCTCAATGATTCAAAAGAGGAAAAGGCTGTTA
TGATTAGTGGACTTCAAAGGATGGATTTACCAGCGGTGGAGGCATTTGCATCGTTGGAGTTTCAACTGAATTTGATCTCT
GCCAAACTTGGAGTGCAGAAAATCAGTGGCATTACGGTGTTTGATACGATGGAGAAAAAAACCTATGACCCGTTACCCGA
TCTAGAAATATTCGTCGATACAGATCTATAA
Microexon DNA seq GTCCGCGTTGTGAAG
Microexon Amino Acid seq VRVVK
Microexon-tag DNA Seq CAGTATTTCAAGTTCATGGTTTCAAATCCCCTTTCTGTTAGGACAAAGGTCCGCGTTGTGAAGGACACTACTTATCTGGAGACTTGCCTAGAAAATAATACAAAATCA
Microexon-tag Amino Acid seq QYFKFMVSNPLSVRTKVRVVKDTTYLETCLENNTKS
Transcript ID OTG29608
Gene ID Ha.42616
Gene Name NA
Pfam domain motif DUF974
Motif E-value 1.5e-60
Motif start 88
Motif end 314
Protein seq >OTG29608
MSSTQSLHSLAFRVMRLCRPTFHVETPLQFDPCDLVVGEDLFDSPSAAPRHLLHSHSASNDSSSADLTYRNRFVLGDDSD
AMGLPGLLVLPQSFGAIYLGETFCSYISINNSSNFDARDIIIKSEIQTERQRIMLLDTSKSPVESIRAGGRYDFIVEHDV
KELGAHTLVCTALYYDGDAERKYLPQYFKFMVSNPLSVRTKVRVVKDTTYLETCLENNTKSNLFMDQVEFEPAPRWSATI
LKADVHHSEKGGSTRETFKPPVLIRAGGGIYNYLYALKMSSAPTKGEGSNILGKLQITWRTNLGEPGRLQTQHITGNPIT
RKEIDLKVVKVPSVIILEKPFLVHLSLTNLTGKKLGPFEAWLSLNDSKEEKAVMISGLQRMDLPAVEAFASLEFQLNLIS
AKLGVQKISGITVFDTMEKKTYDPLPDLEIFVDTDL*
CDS seq >OTG29608
ATGAGCAGCACACAATCGTTGCACTCGCTGGCCTTCAGGGTAATGCGGCTATGCCGTCCGACATTCCACGTCGAAACTCC
TCTCCAGTTCGATCCTTGCGATCTTGTCGTTGGCGAGGATCTGTTCGACAGTCCTTCAGCCGCCCCTCGCCACCTTCTTC
ACTCTCATTCCGCCTCTAATGATTCGTCATCTGCCGATCTCACCTACCGCAACCGCTTTGTTCTCGGTGATGATTCCGAT
GCCATGGGGCTCCCTGGTCTCCTTGTTCTCCCTCAGTCCTTCGGGGCAATATATCTTGGGGAGACATTTTGTAGTTATAT
AAGCATTAACAACAGCTCAAATTTTGATGCCAGGGATATAATAATCAAGTCTGAAATACAAACAGAAAGGCAGAGAATAA
TGCTTTTAGATACATCAAAATCACCTGTTGAAAGCATAAGAGCAGGAGGACGCTACGACTTCATCGTTGAACATGATGTG
AAGGAACTTGGTGCACACACATTGGTCTGTACTGCTCTGTATTATGACGGTGATGCCGAACGCAAGTATCTTCCACAGTA
TTTCAAGTTCATGGTTTCAAATCCCCTTTCTGTTAGGACAAAGGTCCGCGTTGTGAAGGACACTACTTATCTGGAGACTT
GCCTAGAAAATAATACAAAATCAAACCTATTCATGGACCAAGTTGAATTTGAGCCAGCTCCACGGTGGAGTGCGACAATA
CTTAAAGCCGATGTCCACCATTCAGAAAAGGGTGGTTCTACTAGAGAAACATTCAAACCACCTGTTCTTATTAGAGCAGG
TGGAGGAATTTATAATTATCTTTATGCATTGAAAATGTCATCTGCACCTACAAAAGGCGAGGGGAGTAATATTCTTGGTA
AACTTCAGATAACATGGCGTACAAATTTGGGTGAACCCGGGCGCCTGCAAACACAACATATAACTGGCAATCCCATCACG
CGAAAAGAGATTGATTTGAAGGTAGTAAAAGTGCCATCTGTTATCATCTTAGAAAAACCCTTTCTGGTGCATTTGAGTCT
CACAAACTTAACTGGAAAGAAGCTGGGGCCCTTTGAAGCTTGGTTATCCCTCAATGATTCAAAAGAGGAAAAGGCTGTTA
TGATTAGTGGACTTCAAAGGATGGATTTACCAGCGGTGGAGGCATTTGCATCGTTGGAGTTTCAACTGAATTTGATCTCT
GCCAAACTTGGAGTGCAGAAAATCAGTGGCATTACGGTGTTTGATACGATGGAGAAAAAAACCTATGACCCGTTACCCGA
TCTAGAAATATTCGTCGATACAGATCTATAA