Microexon ID Ha_14:114560011-114560024:-
Species Helianthus annuus
Coordinates 14:114560011..114560024
Microexon Cluster ID MEP40
Size 14
Phase 1
Pfam Domain Motif SBP_bac_10
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,14,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GTYATCCCMYTRKCMAACTWYTCHRTBGAYRCCRMTTATTTTCCAGTKTCMTTCTTYGAGCTTYTAGGWYTRCTRGVRARCWTGAARGGCATMACATCAGAMWMRGTR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTCATTCTTTGAG
Microexon Amino Acid seq VSFFE
Microexon-tag DNA Seq GTCATTCCATTAGCAAACTTTTCCATGGATGTCAGTTATTTTCCAGTTTCATTCTTTGAGCTTCTAGGACTCATGCCATCATTGAAAGGCATCACATCAGAAAGGATA
Microexon-tag Amino Acid Seq VIPLANFSMDVSYFPVSFFELLGLMPSLKGITSERI
Microexon-tag spanning region114559182-114560207
Microexon-tag prediction score0.9494
Overlapped with the annotated transcript (%) 100
New Transcript ID OTF98235x
Reference Transcript ID OTF98235
Gene ID HannXRQ_Chr14g0443391
Gene Name NA
Transcript ID OTF98235
Protein ID OTF98235
Gene ID HannXRQ_Chr14g0443391
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >OTF98235
MLLLYMAVLSSAFSGLLLLNGGVMVEGVAVKVTGNISKIEDASYFRIYYGNTFKVIKNDLDGKSYLLIQSNSKMAAKTKY
CTPRIKSFVIPLANFSMDVSYFPVSFFELLGLMPSLKGITSERIASPCLLKLYNEGQLQMLNKSEPQQFSQYTAHFISYN
SNQQTQSCNYVAFVPIGEQTPLQRAEWIKYLGVFANVETRANEIYDAVKSNYMCLANSAASKKLKPIVAWMEFNDGVWTF
TKEAYKLKYIEDAGGENLDESINKITYNITTIEDMEQLHAILCTVDVLIDGTLTPDPFGYNATLFLQNLNIEDQSCLAFL
SHESVWRHDKRLSTDLALDWFDGAVSQPQLVLADLMEMLFPTGNYTTTYFRNLLKGEEATSLGLENCDRDTFTAMEPTII
ACT*
CDS seq >OTF98235
ATGTTACTATTGTATATGGCGGTTTTATCGTCGGCTTTTAGCGGTCTTTTGCTTCTCAATGGCGGAGTAATGGTGGAGGG
CGTCGCTGTAAAAGTGACCGGTAACATTTCGAAGATAGAAGACGCTAGTTATTTTCGTATATATTATGGTAATACCTTCA
AAGTCATCAAAAACGACCTTGATGGCAAGAGCTATCTTCTCATCCAGAGTAACTCAAAGATGGCAGCAAAGACCAAATAT
TGTACGCCAAGGATCAAATCATTTGTCATTCCATTAGCAAACTTTTCCATGGATGTCAGTTATTTTCCAGTTTCATTCTT
TGAGCTTCTAGGACTCATGCCATCATTGAAAGGCATCACATCAGAAAGGATAGCTTCTCCATGTCTCCTTAAACTCTATA
ATGAGGGACAACTCCAAATGCTTAACAAGAGTGAGCCACAACAATTTTCTCAGTATACCGCGCATTTCATCAGCTATAAC
TCCAATCAACAAACTCAGTCATGCAACTATGTTGCTTTTGTCCCCATCGGCGAGCAAACTCCCCTCCAAAGGGCCGAGTG
GATCAAGTACTTGGGAGTTTTTGCAAATGTGGAAACGAGAGCAAATGAAATCTATGATGCTGTGAAAAGCAATTACATGT
GCTTGGCTAATTCTGCCGCAAGCAAAAAGCTCAAACCAATAGTGGCTTGGATGGAGTTTAATGATGGTGTTTGGACTTTC
ACAAAAGAAGCATACAAGCTAAAGTATATAGAAGATGCAGGTGGAGAAAACCTGGATGAGTCCATAAACAAAATAACATA
CAACATAACAACTATTGAAGATATGGAACAACTTCATGCTATCTTATGTACCGTAGATGTATTGATTGATGGAACGTTAA
CCCCAGACCCGTTTGGCTACAATGCAACGTTGTTTCTTCAAAACCTCAATATAGAGGATCAATCTTGTTTAGCATTTCTT
TCACATGAAAGCGTGTGGAGACACGATAAACGCCTTTCAACTGACCTGGCTCTTGATTGGTTCGATGGAGCTGTGTCGCA
ACCGCAACTAGTCCTTGCGGACCTAATGGAAATGTTGTTTCCAACGGGGAATTATACGACTACTTACTTTAGAAACCTCC
TAAAGGGCGAAGAGGCTACAAGCCTTGGTCTTGAAAACTGCGATCGCGATACTTTTACGGCCATGGAGCCAACTATAATA
GCATGCACATAG
Microexon DNA seq TTTCATTCTTTGAG
Microexon Amino Acid seq VSFFE
Microexon-tag DNA Seq GTCATTCCATTAGCAAACTTTTCCATGGATGTCAGTTATTTTCCAGTTTCATTCTTTGAGCTTCTAGGACTCATGCCATCATTGAAAGGCATCACATCAGAAAGGATA
Microexon-tag Amino Acid seq VIPLANFSMDVSYFPVSFFELLGLMPSLKGITSERI
Transcript ID OTF98235
Gene ID Ha.19979
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >OTF98235
MLLLYMAVLSSAFSGLLLLNGGVMVEGVAVKVTGNISKIEDASYFRIYYGNTFKVIKNDLDGKSYLLIQSNSKMAAKTKY
CTPRIKSFVIPLANFSMDVSYFPVSFFELLGLMPSLKGITSERIASPCLLKLYNEGQLQMLNKSEPQQFSQYTAHFISYN
SNQQTQSCNYVAFVPIGEQTPLQRAEWIKYLGVFANVETRANEIYDAVKSNYMCLANSAASKKLKPIVAWMEFNDGVWTF
TKEAYKLKYIEDAGGENLDESINKITYNITTIEDMEQLHAILCTVDVLIDGTLTPDPFGYNATLFLQNLNIEDQSCLAFL
SHESVWRHDKRLSTDLALDWFDGAVSQPQLVLADLMEMLFPTGNYTTTYFRNLLKGEEATSLGLENCDRDTFTAMEPTII
ACT*
CDS seq >OTF98235
ATGTTACTATTGTATATGGCGGTTTTATCGTCGGCTTTTAGCGGTCTTTTGCTTCTCAATGGCGGAGTAATGGTGGAGGG
CGTCGCTGTAAAAGTGACCGGTAACATTTCGAAGATAGAAGACGCTAGTTATTTTCGTATATATTATGGTAATACCTTCA
AAGTCATCAAAAACGACCTTGATGGCAAGAGCTATCTTCTCATCCAGAGTAACTCAAAGATGGCAGCAAAGACCAAATAT
TGTACGCCAAGGATCAAATCATTTGTCATTCCATTAGCAAACTTTTCCATGGATGTCAGTTATTTTCCAGTTTCATTCTT
TGAGCTTCTAGGACTCATGCCATCATTGAAAGGCATCACATCAGAAAGGATAGCTTCTCCATGTCTCCTTAAACTCTATA
ATGAGGGACAACTCCAAATGCTTAACAAGAGTGAGCCACAACAATTTTCTCAGTATACCGCGCATTTCATCAGCTATAAC
TCCAATCAACAAACTCAGTCATGCAACTATGTTGCTTTTGTCCCCATCGGCGAGCAAACTCCCCTCCAAAGGGCCGAGTG
GATCAAGTACTTGGGAGTTTTTGCAAATGTGGAAACGAGAGCAAATGAAATCTATGATGCTGTGAAAAGCAATTACATGT
GCTTGGCTAATTCTGCCGCAAGCAAAAAGCTCAAACCAATAGTGGCTTGGATGGAGTTTAATGATGGTGTTTGGACTTTC
ACAAAAGAAGCATACAAGCTAAAGTATATAGAAGATGCAGGTGGAGAAAACCTGGATGAGTCCATAAACAAAATAACATA
CAACATAACAACTATTGAAGATATGGAACAACTTCATGCTATCTTATGTACCGTAGATGTATTGATTGATGGAACGTTAA
CCCCAGACCCGTTTGGCTACAATGCAACGTTGTTTCTTCAAAACCTCAATATAGAGGATCAATCTTGTTTAGCATTTCTT
TCACATGAAAGCGTGTGGAGACACGATAAACGCCTTTCAACTGACCTGGCTCTTGATTGGTTCGATGGAGCTGTGTCGCA
ACCGCAACTAGTCCTTGCGGACCTAATGGAAATGTTGTTTCCAACGGGGAATTATACGACTACTTACTTTAGAAACCTCC
TAAAGGGCGAAGAGGCTACAAGCCTTGGTCTTGAAAACTGCGATCGCGATACTTTTACGGCCATGGAGCCAACTATAATA
GCATGCACATAG