Microexon ID Ha_17:17084014-17084022:+
Species Helianthus annuus
Coordinates 17:17084014..17084022
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTCACCGTACGGCGTTCCACTTTCAACCTCCCCAAAACTGGATGAATGATCCTAACGGACCAATGTACTACAATGGACTGTACCATCTGTTTTATCAGTACAATCCA
Microexon-tag Amino Acid Seq PHRTAFHFQPPQNWMNDPNGPMYYNGLYHLFYQYNP
Microexon-tag spanning region17083822-17084918
Microexon-tag prediction score0.9638
Overlapped with the annotated transcript (%) 100
New Transcript ID OTF85292x
Reference Transcript ID OTF85292
Gene ID HannXRQ_Chr17g0538391
Gene Name 6FEH
Transcript ID OTF85292
Protein ID OTF85292
Gene ID HannXRQ_Chr17g0538391
Gene Name 6FEH
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.5e-103
Motif start 45
Motif end 361
Protein seq >OTF85292
MMPTATLTLFLIFLTLGFHHVRSNTLTPESLPSSPPEQPHRTAFHFQPPQNWMNDPNGPMYYNGLYHLFYQYNPSGPLFA
EHMYWAHSVSHDLINWTPLDIAIAPAEPFDLISCWSGSATILPGNKPVMLYTGIDSENRQVQNLIEPKNLSDPYLREWVK
YTGNPVINLPDGIQHDDFRDPTTAWLADDGKWRIIVGSQKDKKGIAFLYQSEDFVNWSMHESPLHEVAGTGIWECPDFFP
VWVDSTNGVDTSVMNSRVKHVLKVGLFDYQKDYYMIGDYNFVNENYVPQNELTLGTLRYDYGKYYASKSFFDPVKNRRIL
LAWVNESDSQADDVAKGWSGVHSFPRSIWLDKKQKQLVQWPIKEIETLYENEATVQNKNLEDGSSHEILGITAWQVDVKL
SFKLNNLEEAEKLDPSGVDPQLVCSEMDASKKGKFGPFGLLALASHDLTEQTAIFFRVFQNNGRYIVLMCSDQSRSSTRN
GLDKTTYGAFVDIDPQQDEISLRTLIDHSIVESFGGGGKTCITARVYPTLAIKDEAHLFAFNNGTESVLITELSAWSVKK
ARINTEENVGRASQ*
CDS seq >OTF85292
ATGATGCCAACTGCCACTCTCACTCTCTTTCTAATATTCCTGACGCTTGGCTTCCACCATGTCCGATCAAACACTCTCAC
GCCGGAATCACTTCCTTCTTCTCCGCCGGAGCAGCCTCACCGTACGGCGTTCCACTTTCAACCTCCCCAAAACTGGATGA
ATGATCCTAACGGACCAATGTACTACAATGGACTGTACCATCTGTTTTATCAGTACAATCCATCTGGCCCGCTCTTTGCT
GAGCATATGTATTGGGCACATTCGGTGTCACATGACTTGATCAACTGGACCCCACTCGACATCGCCATTGCCCCAGCCGA
ACCCTTTGACCTCATCAGTTGCTGGTCTGGCTCAGCCACAATCCTCCCTGGAAACAAACCGGTCATGCTATACACCGGAA
TTGACTCCGAAAACCGCCAAGTCCAAAACCTTATTGAACCGAAGAACTTATCGGATCCATATCTTCGAGAATGGGTCAAG
TACACTGGCAATCCGGTCATAAACCTCCCGGATGGGATTCAACATGATGACTTTAGAGACCCAACCACTGCGTGGCTTGC
AGACGACGGAAAATGGAGGATAATTGTTGGAAGTCAGAAAGACAAGAAGGGAATCGCGTTTCTGTACCAATCTGAGGATT
TTGTTAACTGGAGTATGCATGAGTCACCACTGCATGAAGTTGCAGGTACTGGTATATGGGAATGCCCTGACTTTTTTCCG
GTGTGGGTTGATAGCACCAATGGGGTTGATACATCCGTAATGAACTCTAGAGTGAAGCACGTGTTGAAGGTGGGATTGTT
TGATTATCAAAAAGACTACTACATGATCGGGGATTACAATTTTGTGAACGAAAACTATGTTCCTCAAAATGAACTAACGC
TTGGTACATTGAGATATGATTACGGGAAGTATTATGCTTCGAAGTCGTTCTTTGACCCTGTGAAAAATAGAAGGATCTTG
TTGGCTTGGGTGAATGAATCTGATTCTCAAGCAGATGATGTTGCTAAAGGATGGTCCGGAGTTCATTCATTTCCAAGGAG
TATTTGGCTCGATAAAAAGCAGAAGCAGCTCGTACAATGGCCTATCAAGGAGATTGAAACGTTATATGAAAACGAAGCTA
CTGTTCAAAATAAGAATCTTGAAGATGGATCATCACATGAAATTTTGGGCATAACTGCATGGCAGGTGGACGTGAAGCTT
TCGTTCAAACTTAATAATTTAGAAGAGGCCGAGAAACTGGACCCGAGTGGGGTTGACCCGCAACTTGTTTGCAGTGAAAT
GGATGCATCGAAGAAAGGCAAATTCGGCCCGTTTGGTCTGTTAGCTTTGGCTTCCCATGACTTGACTGAACAAACTGCAA
TCTTCTTTCGGGTTTTCCAAAATAATGGACGATACATTGTGCTAATGTGCAGCGATCAAAGCCGGTCTTCTACAAGGAAC
GGGCTTGATAAAACGACATATGGAGCATTTGTCGACATAGATCCTCAACAAGATGAAATTTCACTTAGAACCTTGATAGA
TCACTCGATCGTCGAGAGCTTTGGAGGAGGAGGAAAGACGTGCATCACAGCTAGGGTTTACCCAACATTAGCCATTAAAG
ACGAAGCCCATTTGTTTGCATTTAACAATGGAACAGAAAGCGTGTTGATCACCGAGCTGAGTGCTTGGAGTGTGAAGAAA
GCTCGGATTAACACCGAAGAAAATGTTGGGCGTGCAAGTCAATAA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTCACCGTACGGCGTTCCACTTTCAACCTCCCCAAAACTGGATGAATGATCCTAACGGACCAATGTACTACAATGGACTGTACCATCTGTTTTATCAGTACAATCCA
Microexon-tag Amino Acid seq PHRTAFHFQPPQNWMNDPNGPMYYNGLYHLFYQYNP
Transcript ID Ha.29712.2
Gene ID Ha.29712
Gene Name 6FEH
Pfam domain motif Glyco_hydro_32N
Motif E-value 1.5e-103
Motif start 45
Motif end 361
Protein seq >Ha.29712.2
MMPTATLTLFLIFLTLGFHHVRSNTLTPESLPSSPPEQPHRTAFHFQPPQNWMNDPNGPMYYNGLYHLFYQYNPSGPLFA
EHMYWAHSVSHDLINWTPLDIAIAPAEPFDLISCWSGSATILPGNKPVMLYTGIDSENRQVQNLIEPKNLSDPYLREWVK
YTGNPVINLPDGIQHDDFRDPTTAWLADDGKWRIIVGSQKDKKGIAFLYQSEDFVNWSMHESPLHEVAGTGIWECPDFFP
VWVDSTNGVDTSVMNSRVKHVLKVGLFDYQKDYYMIGDYNFVNENYVPQNELTLGTLRYDYGKYYASKSFFDPVKNRRIL
LAWVNESDSQADDVAKGWSGVHSFPRSIWLDKKQKQLVQWPIKEIETLYENEATVQNKNLEDGSSHEILGITAWQVDVKL
SFKLNNLEEAEKLDPSGVDPQLVCSEMDASKKGKFGPFGLLALASHDLTEQTAIFFRVFQNNGRYIVLMCSDQSRSSTRN
GLDKTTYGAFVDIDPQQDEISLRTLIDHSIVESFGGGGKTCITARVYPTLAIKDEAHLFAFNNGTESVLITELSAWSVKK
ARINTEENVGRASQ*
CDS seq >Ha.29712.2
ATGATGCCAACTGCCACTCTCACTCTCTTTCTAATATTCCTGACGCTTGGCTTCCACCATGTCCGATCAAACACTCTCAC
GCCGGAATCACTTCCTTCTTCTCCGCCGGAGCAGCCTCACCGTACGGCGTTCCACTTTCAACCTCCCCAAAACTGGATGA
ATGATCCTAACGGACCAATGTACTACAATGGACTGTACCATCTGTTTTATCAGTACAATCCATCTGGCCCGCTCTTTGCT
GAGCATATGTATTGGGCACATTCGGTGTCACATGACTTGATCAACTGGACCCCACTCGACATCGCCATTGCCCCAGCCGA
ACCCTTTGACCTCATCAGTTGCTGGTCTGGCTCAGCCACAATCCTCCCTGGAAACAAACCGGTCATGCTATACACCGGAA
TTGACTCCGAAAACCGCCAAGTCCAAAACCTTATTGAACCGAAGAACTTATCGGATCCATATCTTCGAGAATGGGTCAAG
TACACTGGCAATCCGGTCATAAACCTCCCGGATGGGATTCAACATGATGACTTTAGAGACCCAACCACTGCGTGGCTTGC
AGACGACGGAAAATGGAGGATAATTGTTGGAAGTCAGAAAGACAAGAAGGGAATCGCGTTTCTGTACCAATCTGAGGATT
TTGTTAACTGGAGTATGCATGAGTCACCACTGCATGAAGTTGCAGGTACTGGTATATGGGAATGCCCTGACTTTTTTCCG
GTGTGGGTTGATAGCACCAATGGGGTTGATACATCCGTAATGAACTCTAGAGTGAAGCACGTGTTGAAGGTGGGATTGTT
TGATTATCAAAAAGACTACTACATGATCGGGGATTACAATTTTGTGAACGAAAACTATGTTCCTCAAAATGAACTAACGC
TTGGTACATTGAGATATGATTACGGGAAGTATTATGCTTCGAAGTCGTTCTTTGACCCTGTGAAAAATAGAAGGATCTTG
TTGGCTTGGGTGAATGAATCTGATTCTCAAGCAGATGATGTTGCTAAAGGATGGTCCGGAGTTCATTCATTTCCAAGGAG
TATTTGGCTCGATAAAAAGCAGAAGCAGCTCGTACAATGGCCTATCAAGGAGATTGAAACGTTATATGAAAACGAAGCTA
CTGTTCAAAATAAGAATCTTGAAGATGGATCATCACATGAAATTTTGGGCATAACTGCATGGCAGGTGGACGTGAAGCTT
TCGTTCAAACTTAATAATTTAGAAGAGGCCGAGAAACTGGACCCGAGTGGGGTTGACCCGCAACTTGTTTGCAGTGAAAT
GGATGCATCGAAGAAAGGCAAATTCGGCCCGTTTGGTCTGTTAGCTTTGGCTTCCCATGACTTGACTGAACAAACTGCAA
TCTTCTTTCGGGTTTTCCAAAATAATGGACGATACATTGTGCTAATGTGCAGCGATCAAAGCCGGTCTTCTACAAGGAAC
GGGCTTGATAAAACGACATATGGAGCATTTGTCGACATAGATCCTCAACAAGATGAAATTTCACTTAGAACCTTGATAGA
TCACTCGATCGTCGAGAGCTTTGGAGGAGGAGGAAAGACGTGCATCACAGCTAGGGTTTACCCAACATTAGCCATTAAAG
ACGAAGCCCATTTGTTTGCATTTAACAATGGAACAGAAAGCGTGTTGATCACCGAGCTGAGTGCTTGGAGTGTGAAGAAA
GCTCGGATTAACACCGAAGAAAATGTTGGGCGTGCAAGTCAATAA