Microexon ID Ha_9:62788162-62788167:-
Species Helianthus annuus
Coordinates 9:62788162..62788167
Microexon Cluster ID MEP10
Size 6
Phase 0
Pfam Domain Motif DUF4788
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 51,6,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TTTGARGAYTAYATYGADCCMCTYAAGRTKTACCTGRMTAGRTACAGAGAGWTGGAGGGTGAYACYAAGGGATCTGCWARRGSTGGWGATGSATCTGCTAARARRGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTAGAG
Microexon Amino Acid seq LE
Microexon-tag DNA Seq TTCGAAGATTACATTGATCCACTCAAAGCTTACCTATCTAGGTACAGAGAGTTAGAGGGTGATAGTAAGGGGTCGGGTAGGGGTGGAGATGGATCTGGTAAGAAAGAT
Microexon-tag Amino Acid Seq FEDYIDPLKAYLSRYRELEGDSKGSGRGGDGSGKKD
Microexon-tag spanning region62788029-62788612
Microexon-tag prediction score0.965
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG14183x
Reference Transcript ID OTG14183
Gene ID HannXRQ_Chr09g0246541
Gene Name NA
Transcript ID OTG14183
Protein ID OTG14183
Gene ID HannXRQ_Chr09g0246541
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >OTG14183
MKKALPTNGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKAYLSRYRELEGD
SKGSGRGGDGSGKKDTIESQLASNAQVIYSSRFFYTRHELCEFSRLNFQARGSWEHHLAVHLCKPNTRRTFKLCVY*
CDS seq >OTG14183
ATGAAAAAGGCGTTGCCCACCAACGGGAAGATTGCCAAGGATGCCAAGGATACGGTGCAGGAATGCGTTTCTGAGTTTAT
CAGTTTCATCACTAGCGAGGCTAGTGACAAATGCCAGAAAGAGAAAAGGAAGACGATCAACGGTGATGACTTGTTGTGGG
CGATGGCTACTTTAGGCTTCGAAGATTACATTGATCCACTCAAAGCTTACCTATCTAGGTACAGAGAGTTAGAGGGTGAT
AGTAAGGGGTCGGGTAGGGGTGGAGATGGATCTGGTAAGAAAGATACTATTGAATCTCAACTTGCTTCTAATGCACAGGT
GATTTACTCATCAAGGTTCTTTTACACAAGGCATGAACTTTGTGAATTCTCACGTCTGAACTTTCAGGCAAGGGGGAGCT
GGGAGCATCATCTTGCTGTTCACTTATGCAAGCCAAACACTCGTCGAACTTTCAAGCTTTGTGTTTACTGA
Microexon DNA seq TTAGAG
Microexon Amino Acid seq LE
Microexon-tag DNA Seq TTCGAAGATTACATTGATCCACTCAAAGCTTACCTATCTAGGTACAGAGAGTTAGAGGGTGATAGTAAGGGGTCGGGTAGGGGTGGAGATGGATCTGGTAAGAAAGAT
Microexon-tag Amino Acid seq FEDYIDPLKAYLSRYRELEGDSKGSGRGGDGSGKKD
Transcript ID Ha.55414.2
Gene ID Ha.55414
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Ha.55414.2
MADGPGSPAGGSQSGGDRSPQSNYNIREQDRFLPIANISRIMKKALPTNGKIAKDAKDTVQECVSEFISFITSEASDKCQ
KEKRKTINGDDLLWAMATLGFEDYIDPLKAYLSRYRELEGDSKGSGRGGDGSGKKDTIESQLASNAQFTHQGSFTQGMNF
VNSHV*
CDS seq >Ha.55414.2
ATGGCGGACGGTCCTGGTAGTCCGGCCGGTGGTAGTCAGAGCGGCGGAGACAGGAGTCCTCAGTCGAATTATAATATTAG
AGAGCAGGATAGGTTTTTGCCCATTGCGAATATCAGTAGGATTATGAAAAAGGCGTTGCCCACCAACGGGAAGATTGCCA
AGGATGCCAAGGATACGGTGCAGGAATGCGTTTCTGAGTTTATCAGTTTCATCACTAGCGAGGCTAGTGACAAATGCCAG
AAAGAGAAAAGGAAGACGATCAACGGTGATGACTTGTTGTGGGCGATGGCTACTTTAGGCTTCGAAGATTACATTGATCC
ACTCAAAGCTTACCTATCTAGGTACAGAGAGTTAGAGGGTGATAGTAAGGGGTCGGGTAGGGGTGGAGATGGATCTGGTA
AGAAAGATACTATTGAATCTCAACTTGCTTCTAATGCACAGTTTACTCATCAAGGTTCTTTTACACAAGGCATGAACTTT
GTGAATTCTCACGTCTGA