Microexon ID Ha_12:39551665-39551678:-
Species Helianthus annuus
Coordinates 12:39551665..39551678
Microexon Cluster ID MEP40
Size 14
Phase 1
Pfam Domain Motif SBP_bac_10
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,14,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GTYATCCCMYTRKCMAACTWYTCHRTBGAYRCCRMTTATTTTCCAGTKTCMTTCTTYGAGCTTYTAGGWYTRCTRGVRARCWTGAARGGCATMACATCAGAMWMRGTR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTCATTCTTCGAG
Microexon Amino Acid seq VSFFE
Microexon-tag DNA Seq GTCGTCCCAATATCAAACTTCTCCATGGATGCTCGTTATTTTCCAGTTTCATTCTTCGAGCTTCTAGGCCTCATGCCGACGATGAAGGGCATCACATCAGAGAAGGTA
Microexon-tag Amino Acid Seq VVPISNFSMDARYFPVSFFELLGLMPTMKGITSEKV
Microexon-tag spanning region39551297-39551953
Microexon-tag prediction score0.9607
Overlapped with the annotated transcript (%) 80
New Transcript ID OTG04738x
Reference Transcript ID OTG04738
Gene ID HannXRQ_Chr12g0365711
Gene Name NA
Ha_12:39551665-39551678:- does not have available information here.
Microexon DNA seq TTTCATTCTTCGAG
Microexon Amino Acid seq VSFFE
Microexon-tag DNA Seq GTCGTCCCAATATCAAACTTCTCCATGGATGCTCGTTATTTTCCAGTTTCATTCTTCGAGCTTCTAGGCCTCATGCCGACGATGAAGGGCATCACATCAGAGAAGGTA
Microexon-tag Amino Acid seq VVPISNFSMDARYFPVSFFELLGLMPTMKGITSEKV
Transcript ID Ha.12112.3
Gene ID Ha.12112
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Ha.12112.3
MLVCMEVLLLALGGLLLLVGGGMVEGAAIKVNVGNMSKLDDAAYFHLYYGNTFKVIKNDLDGKSYLLIQNNSRMAPRTKY
CTPRIKSFVVPISNFSMDARYFPVSFFELLGLMPTMKGITSEKVASPCLLKLYNEGELQMLNNSEPQQFSQYAAHFIGHD
DDQETQSCNYVTFFPRDEQAPLQRAEWIKYLGIFANDEVRANEIYDAVKSNYMCLVNSVASKNLKPTVAWMEFNDGAWSF
TDEAYKLKYVEDAGGENLDDSINKITYNTSSIEDLELLHAILCTVDVVIDGTLTTDPFGYNATLFLQNLNIDDKSCFGFL
SSQSVWRHDKRLSTDLALDWFDGAVSQPQLVLADLIEVLFPSGNYTTTYFRNLLKEEEPITLGPENCTRDTFTPMELTLI
TCP*
CDS seq >Ha.12112.3
ATGTTAGTGTGTATGGAGGTTCTTTTATTGGCGTTGGGCGGTCTTTTGCTTCTCGTTGGCGGAGGTATGGTAGAGGGTGC
AGCCATCAAAGTCAACGTTGGTAACATGTCGAAGCTAGACGATGCTGCTTACTTTCATTTATATTATGGTAATACCTTCA
AAGTCATCAAAAACGACCTCGATGGCAAGAGCTATCTTCTCATCCAGAATAACTCAAGGATGGCGCCAAGGACTAAATAT
TGCACACCAAGGATCAAATCATTTGTCGTCCCAATATCAAACTTCTCCATGGATGCTCGTTATTTTCCAGTTTCATTCTT
CGAGCTTCTAGGCCTCATGCCGACGATGAAGGGCATCACATCAGAGAAGGTAGCGTCTCCATGTCTGCTTAAACTGTACA
ATGAGGGTGAACTTCAAATGCTCAACAACAGTGAGCCACAACAATTTTCTCAATATGCCGCACATTTCATCGGCCATGAC
GATGACCAAGAAACTCAATCGTGCAATTACGTTACTTTTTTTCCCAGGGACGAGCAAGCTCCTCTCCAAAGGGCGGAATG
GATCAAGTACTTGGGAATTTTTGCAAACGATGAAGTGAGAGCAAATGAAATCTATGATGCCGTGAAAAGCAATTACATGT
GCTTGGTTAATTCAGTAGCAAGCAAAAACCTCAAACCAACAGTGGCCTGGATGGAGTTTAATGATGGTGCTTGGTCTTTC
ACAGATGAAGCATACAAGCTAAAGTATGTAGAAGATGCAGGTGGAGAGAATTTGGATGACTCCATAAACAAAATAACATA
CAACACATCATCTATTGAAGATTTGGAACTACTACATGCCATCTTATGTACCGTAGATGTAGTGATTGACGGAACATTAA
CGACAGACCCGTTTGGCTACAACGCAACATTGTTTCTCCAAAACCTCAATATAGATGATAAATCTTGTTTTGGTTTTCTT
TCATCTCAAAGCGTGTGGAGACACGATAAACGTCTTTCAACTGATCTAGCTCTTGATTGGTTTGATGGAGCAGTGTCACA
ACCACAACTAGTCCTAGCGGACCTAATAGAAGTTCTCTTTCCATCAGGGAATTATACAACAACTTACTTTAGAAACCTCC
TAAAGGAAGAAGAGCCTATAACCCTTGGTCCTGAAAACTGCACTCGAGATACTTTCACTCCAATGGAGCTAACTCTAATA
ACATGCCCATAG