Microexon ID Ha_15:146518341-146518349:+
Species Helianthus annuus
Coordinates 15:146518341..146518349
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAACGATCTGTTTACCACTTTCAACCCGATAAGAATTTCATCAGTGATCCTAATGGTCCGTTATATCACATGGGTTGGTACCATTTATTCTATCAATACAACCCG
Microexon-tag Amino Acid Seq WQRSVYHFQPDKNFISDPNGPLYHMGWYHLFYQYNP
Microexon-tag spanning region146517643-146522006
Microexon-tag prediction score0.9406
Overlapped with the annotated transcript (%) 100
New Transcript ID OTF96403x
Reference Transcript ID OTF96403
Gene ID HannXRQ_Chr15g0493771
Gene Name NA
Transcript ID OTF96403
Protein ID OTF96403
Gene ID HannXRQ_Chr15g0493771
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.5e-97
Motif start 111
Motif end 430
Protein seq >OTF96403
MAFPVLTPLLAYTDLKKRPVSTGTPKSGRSSSPRKILASLFVYILVLSSLTAVIYNQLLRPQGRLQFTAITIYSGPKQFE
IKPASSIGVESPEKVVGDELYSIGWQRSVYHFQPDKNFISDPNGPLYHMGWYHLFYQYNPHSAIWGNITWGHAVSKDLIN
WFHLPLAMVPDHWYDLKGVMTGSATVLPDGRIMMYYTGNAQDLSQLQCVAYPANSSDPLLVEWVKYEHNPILAPPLGVGL
KDFRDPSTLWIGPDGKYRMVMGSKHDNNIGCALIYHTTNFTHFELSDEVLHSVPGTGMWECVDLYPISTRDTYGLEMSSY
ESDAKYVLKQSGDDDRHDWYAIGTYDPVKDKWYPDDPEMDVGIGLRYDYGKFYASKTFYDPSKRRRVLWGYVGETDLQQN
DLKKGWANMLNVPRTVVLDPKTQSNLLQWPVEEIETLRSKKYHEFKDVELRPGSLVPLDIGSTAELDISASFEVEEASMS
ATLEADEFNCTTSEGSAMRGVLGPFGVVVLADATLSEQTPVYFYIAKNIDGTSRTYFCADESRSSKLLDVGKVVYGSTVP
ILHGENYNMRLLVDHSIVESFAQGGRTVITSRVYPTKAIYEAAKVFLFNNGTSITVKASLKIWKMGGAKLNKYPF*
CDS seq >OTF96403
ATGGCTTTTCCTGTCCTCACCCCTCTTCTTGCTTACACTGACCTTAAAAAACGGCCGGTGTCCACCGGAACTCCGAAATC
CGGTCGATCATCGTCGCCAAGAAAGATTCTCGCTAGTTTATTTGTATACATTCTAGTCCTTTCATCCTTGACTGCCGTAA
TTTACAACCAACTGCTAAGACCTCAAGGTCGCCTACAGTTCACTGCCATCACAATTTATTCGGGGCCGAAGCAGTTCGAG
ATCAAACCGGCATCTTCGATAGGAGTGGAGTCGCCGGAGAAGGTCGTCGGAGATGAGTTATATAGCATAGGATGGCAACG
ATCTGTTTACCACTTTCAACCCGATAAGAATTTCATCAGTGATCCTAATGGTCCGTTATATCACATGGGTTGGTACCATT
TATTCTATCAATACAACCCGCATTCAGCAATATGGGGTAATATTACTTGGGGCCATGCAGTCTCTAAAGATCTCATCAAC
TGGTTCCACCTCCCTCTAGCCATGGTTCCGGACCACTGGTACGACCTCAAAGGCGTCATGACCGGCTCCGCCACCGTCCT
CCCAGACGGCCGGATCATGATGTATTACACCGGCAATGCACAAGATCTCTCTCAGCTTCAATGTGTTGCGTACCCCGCTA
ATTCGTCCGACCCCCTTCTTGTTGAATGGGTCAAGTACGAACATAACCCTATTCTCGCTCCTCCTCTTGGAGTCGGTCTC
AAAGACTTCCGAGACCCGTCAACTCTCTGGATCGGGCCAGACGGGAAGTATCGAATGGTTATGGGGTCAAAGCATGACAA
TAACATAGGGTGTGCCCTAATTTACCACACCACCAATTTCACTCATTTTGAGTTATCCGATGAAGTGCTCCATTCAGTTC
CGGGAACGGGTATGTGGGAATGTGTGGATCTTTACCCGATATCCACGAGAGATACATACGGGCTGGAAATGTCGAGTTAT
GAGTCGGATGCTAAGTATGTGTTGAAGCAAAGTGGGGATGACGATAGGCATGATTGGTATGCAATCGGGACATATGATCC
GGTGAAAGATAAATGGTATCCAGATGATCCTGAAATGGACGTGGGTATCGGGTTGAGATACGACTATGGAAAGTTTTATG
CATCAAAGACGTTTTACGACCCGAGTAAGAGGAGACGGGTCTTATGGGGCTATGTTGGAGAAACAGATCTTCAACAAAAT
GACCTAAAAAAAGGATGGGCAAATATGTTGAATGTTCCAAGAACCGTGGTGTTGGACCCAAAGACACAATCTAACTTGTT
ACAATGGCCTGTTGAGGAAATTGAGACTTTGAGATCTAAAAAGTACCATGAATTTAAAGATGTCGAGTTACGGCCCGGAT
CACTTGTTCCACTTGACATAGGCTCAACCGCAGAGTTGGACATTAGTGCCTCATTTGAGGTAGAGGAAGCTTCAATGAGT
GCAACCTTAGAAGCTGATGAGTTCAACTGCACGACGAGCGAGGGTTCCGCCATGAGGGGTGTTTTGGGACCGTTTGGAGT
TGTGGTTCTAGCCGATGCAACACTTTCAGAACAAACACCTGTTTATTTCTACATTGCAAAGAACATAGATGGCACCTCTA
GAACTTATTTCTGTGCTGATGAATCAAGGTCATCAAAGCTTTTAGATGTGGGTAAAGTTGTATATGGAAGCACGGTTCCT
ATACTCCATGGTGAAAACTACAACATGAGGTTATTGGTGGACCATTCAATCGTAGAAAGCTTTGCACAAGGAGGAAGAAC
AGTAATTACATCAAGAGTGTATCCTACAAAAGCAATATATGAAGCAGCAAAAGTGTTTTTATTCAACAATGGCACTAGCA
TCACTGTTAAGGCATCTCTCAAGATATGGAAGATGGGTGGAGCAAAACTTAACAAATATCCCTTTTAA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAACGATCTGTTTACCACTTTCAACCCGATAAGAATTTCATCAGTGATCCTAATGGTCCGTTATATCACATGGGTTGGTACCATTTATTCTATCAATACAACCCG
Microexon-tag Amino Acid seq WQRSVYHFQPDKNFISDPNGPLYHMGWYHLFYQYNP
Transcript ID OTF96403
Gene ID Ha.25102
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.5e-97
Motif start 111
Motif end 430
Protein seq >OTF96403
MAFPVLTPLLAYTDLKKRPVSTGTPKSGRSSSPRKILASLFVYILVLSSLTAVIYNQLLRPQGRLQFTAITIYSGPKQFE
IKPASSIGVESPEKVVGDELYSIGWQRSVYHFQPDKNFISDPNGPLYHMGWYHLFYQYNPHSAIWGNITWGHAVSKDLIN
WFHLPLAMVPDHWYDLKGVMTGSATVLPDGRIMMYYTGNAQDLSQLQCVAYPANSSDPLLVEWVKYEHNPILAPPLGVGL
KDFRDPSTLWIGPDGKYRMVMGSKHDNNIGCALIYHTTNFTHFELSDEVLHSVPGTGMWECVDLYPISTRDTYGLEMSSY
ESDAKYVLKQSGDDDRHDWYAIGTYDPVKDKWYPDDPEMDVGIGLRYDYGKFYASKTFYDPSKRRRVLWGYVGETDLQQN
DLKKGWANMLNVPRTVVLDPKTQSNLLQWPVEEIETLRSKKYHEFKDVELRPGSLVPLDIGSTAELDISASFEVEEASMS
ATLEADEFNCTTSEGSAMRGVLGPFGVVVLADATLSEQTPVYFYIAKNIDGTSRTYFCADESRSSKLLDVGKVVYGSTVP
ILHGENYNMRLLVDHSIVESFAQGGRTVITSRVYPTKAIYEAAKVFLFNNGTSITVKASLKIWKMGGAKLNKYPF*
CDS seq >OTF96403
ATGGCTTTTCCTGTCCTCACCCCTCTTCTTGCTTACACTGACCTTAAAAAACGGCCGGTGTCCACCGGAACTCCGAAATC
CGGTCGATCATCGTCGCCAAGAAAGATTCTCGCTAGTTTATTTGTATACATTCTAGTCCTTTCATCCTTGACTGCCGTAA
TTTACAACCAACTGCTAAGACCTCAAGGTCGCCTACAGTTCACTGCCATCACAATTTATTCGGGGCCGAAGCAGTTCGAG
ATCAAACCGGCATCTTCGATAGGAGTGGAGTCGCCGGAGAAGGTCGTCGGAGATGAGTTATATAGCATAGGATGGCAACG
ATCTGTTTACCACTTTCAACCCGATAAGAATTTCATCAGTGATCCTAATGGTCCGTTATATCACATGGGTTGGTACCATT
TATTCTATCAATACAACCCGCATTCAGCAATATGGGGTAATATTACTTGGGGCCATGCAGTCTCTAAAGATCTCATCAAC
TGGTTCCACCTCCCTCTAGCCATGGTTCCGGACCACTGGTACGACCTCAAAGGCGTCATGACCGGCTCCGCCACCGTCCT
CCCAGACGGCCGGATCATGATGTATTACACCGGCAATGCACAAGATCTCTCTCAGCTTCAATGTGTTGCGTACCCCGCTA
ATTCGTCCGACCCCCTTCTTGTTGAATGGGTCAAGTACGAACATAACCCTATTCTCGCTCCTCCTCTTGGAGTCGGTCTC
AAAGACTTCCGAGACCCGTCAACTCTCTGGATCGGGCCAGACGGGAAGTATCGAATGGTTATGGGGTCAAAGCATGACAA
TAACATAGGGTGTGCCCTAATTTACCACACCACCAATTTCACTCATTTTGAGTTATCCGATGAAGTGCTCCATTCAGTTC
CGGGAACGGGTATGTGGGAATGTGTGGATCTTTACCCGATATCCACGAGAGATACATACGGGCTGGAAATGTCGAGTTAT
GAGTCGGATGCTAAGTATGTGTTGAAGCAAAGTGGGGATGACGATAGGCATGATTGGTATGCAATCGGGACATATGATCC
GGTGAAAGATAAATGGTATCCAGATGATCCTGAAATGGACGTGGGTATCGGGTTGAGATACGACTATGGAAAGTTTTATG
CATCAAAGACGTTTTACGACCCGAGTAAGAGGAGACGGGTCTTATGGGGCTATGTTGGAGAAACAGATCTTCAACAAAAT
GACCTAAAAAAAGGATGGGCAAATATGTTGAATGTTCCAAGAACCGTGGTGTTGGACCCAAAGACACAATCTAACTTGTT
ACAATGGCCTGTTGAGGAAATTGAGACTTTGAGATCTAAAAAGTACCATGAATTTAAAGATGTCGAGTTACGGCCCGGAT
CACTTGTTCCACTTGACATAGGCTCAACCGCAGAGTTGGACATTAGTGCCTCATTTGAGGTAGAGGAAGCTTCAATGAGT
GCAACCTTAGAAGCTGATGAGTTCAACTGCACGACGAGCGAGGGTTCCGCCATGAGGGGTGTTTTGGGACCGTTTGGAGT
TGTGGTTCTAGCCGATGCAACACTTTCAGAACAAACACCTGTTTATTTCTACATTGCAAAGAACATAGATGGCACCTCTA
GAACTTATTTCTGTGCTGATGAATCAAGGTCATCAAAGCTTTTAGATGTGGGTAAAGTTGTATATGGAAGCACGGTTCCT
ATACTCCATGGTGAAAACTACAACATGAGGTTATTGGTGGACCATTCAATCGTAGAAAGCTTTGCACAAGGAGGAAGAAC
AGTAATTACATCAAGAGTGTATCCTACAAAAGCAATATATGAAGCAGCAAAAGTGTTTTTATTCAACAATGGCACTAGCA
TCACTGTTAAGGCATCTCTCAAGATATGGAAGATGGGTGGAGCAAAACTTAACAAATATCCCTTTTAA