Microexon ID Ha_9:141124590-141124604:-
Species Helianthus annuus
Coordinates 9:141124590..141124604
Microexon Cluster ID MEP44
Size 15
Phase 1
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,15,47
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq RTTTATGRGCTTTGTGATRTTGACTTGAAAGATTTYAGYMTBCAAGCWTWTGGRCARCAAGGBTGTYTACTTCGRAGCTTRCCTRCAGATRTKGTRTTTGACAAYWCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq CTTATGGAGATGAAG
Microexon Amino Acid seq AYGDEG
Microexon-tag DNA Seq GTTTATGAACTTTGTGATGTTGACCTGAAGGATTTTAGCCTCCAAGCTTATGGAGATGAAGGGTGTCTGCTTAGAAGTTTGCCTGCAGATGTGGGGTATGATAATTCA
Microexon-tag Amino Acid Seq VYELCDVDLKDFSLQAYGDEGCLLRSLPADVGYDNS
Microexon-tag spanning region141124294-141124800
Microexon-tag prediction score0.9534
Overlapped with the annotated transcript (%) 97.3
New Transcript ID OTG15208x
Reference Transcript ID OTG15208
Gene ID HannXRQ_Chr09g0257931
Gene Name NA
Ha_9:141124590-141124604:- does not have available information here.
Microexon DNA seq CTTATGGAGATGAAG
Microexon Amino Acid seq AYGDEG
Microexon-tag DNA Seq GTTTATGAACTTTGTGATGTTGACCTGAAGGATTTTAGCCTCCAAGCTTATGGAGATGAAGGGTGTCTGCTTAGAAGTTTGCCTGCAGATGTGGGGTATGATAATTCA
Microexon-tag Amino Acid seq VYELCDVDLKDFSLQAYGDEGCLLRSLPADVGYDNS
Transcript ID Ha.56589.1
Gene ID Ha.56589
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Ha.56589.1
MDSVKATTLLEGSLCFHIFLFTILLSISLNVIAEKLPPTSQSVSLPSSGIFQPIEISPSVLPNYPYLGKVLPPMYPYFPK
TYKPVLTGRCPFNFSSISNIMEKTASDCSVLLASFVGNVICCPQFASSLHIFQGHFSKNSDSLVLQNATTADDCFSDIVE
VLASKGANSSIPNMCSYKSLNLTGGSCPVKDISTFEKTVNTSKLLDACSVIDPLKECCRSVCQPAIMEAAFQISSAQTIV
DKSMNTGLSNHVDVLNDCKGVVYSWLSRKLPSDAADTSFRILSACKVNKVCPLEMKEPVEVIKACKNVSAQSASCCSSLN
SYIAGVQRQMLITNKQAIMCASLLGSKLRKGGVLTDVYELCDVDLKDFSLQAYGDEGCLLRSLPADVGYDNSSGFSFTCD
LSDNIAAPWPSSSSMASFSLCAPEMSLPALPTSDTVYPGCNGLFLDLFIIVLVLFIVPLNDLL*
CDS seq >Ha.56589.1
ATGGATTCCGTAAAGGCCACAACACTTCTCGAAGGTTCATTGTGTTTTCACATCTTTCTATTCACAATCTTACTATCTAT
TTCCTTAAATGTAATTGCCGAGAAGCTTCCCCCCACGTCTCAATCAGTTAGTCTTCCTAGTTCCGGAATCTTCCAGCCTA
TAGAAATATCGCCTTCAGTTTTACCAAATTACCCCTATCTCGGGAAGGTATTACCACCAATGTACCCTTACTTTCCGAAA
ACATACAAACCGGTATTAACGGGAAGATGCCCGTTTAATTTCTCTTCTATTTCAAACATCATGGAAAAAACAGCATCAGA
CTGTTCTGTACTTTTAGCATCATTCGTAGGAAACGTAATATGTTGCCCGCAATTCGCGAGTTCACTTCACATATTTCAGG
GACATTTCAGTAAAAATTCCGATAGTTTAGTTCTTCAAAACGCTACCACTGCTGACGATTGTTTCTCGGACATTGTCGAA
GTTTTAGCCAGCAAAGGTGCAAACAGCTCGATCCCGAACATGTGCTCTTATAAATCATTGAATTTAACTGGTGGTTCGTG
TCCTGTGAAGGATATAAGCACGTTTGAGAAAACGGTGAACACGAGTAAGTTATTGGACGCGTGTAGCGTAATCGACCCGC
TTAAAGAGTGCTGCAGATCGGTTTGCCAACCGGCAATTATGGAAGCTGCGTTTCAAATTTCTTCAGCGCAAACAATAGTT
GATAAAAGTATGAACACGGGTTTATCTAATCACGTTGATGTGTTGAATGACTGCAAAGGGGTTGTGTATTCATGGCTTTC
GAGAAAACTTCCATCAGATGCTGCAGATACTTCCTTCAGAATATTATCTGCTTGTAAAGTTAACAAAGTTTGCCCATTGG
AAATGAAGGAGCCGGTTGAAGTGATAAAAGCTTGTAAGAATGTGTCTGCTCAAAGTGCTTCGTGTTGTAGTTCATTGAAT
TCATACATTGCGGGAGTACAAAGGCAAATGTTGATCACAAATAAACAAGCAATAATGTGTGCATCATTGTTGGGGTCAAA
GTTACGGAAAGGCGGAGTACTAACCGACGTTTATGAACTTTGTGATGTTGACCTGAAGGATTTTAGCCTCCAAGCTTATG
GAGATGAAGGGTGTCTGCTTAGAAGTTTGCCTGCAGATGTGGGGTATGATAATTCAAGTGGTTTCAGTTTCACATGTGAT
TTAAGTGACAATATTGCTGCTCCATGGCCTTCATCGTCATCTATGGCATCCTTTTCTCTTTGTGCTCCCGAGATGTCATT
GCCTGCACTACCCACATCAGATACGGTATACCCAGGCTGCAATGGGTTGTTTTTGGATCTCTTTATAATCGTACTTGTGC
TTTTTATAGTTCCTTTGAACGATTTGTTGTAA