Microexon ID Ha_8:78366943-78366951:-
Species Helianthus annuus
Coordinates 8:78366943..78366951
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTATAGGACGGCATTTCACTTTCAGCCTCCCCAAAACTGGATGAACGATCCTAACGGACTTATGTACTACAATGGAGTTTACCATCTTTTTTACCAACACAACCCG
Microexon-tag Amino Acid Seq PYRTAFHFQPPQNWMNDPNGLMYYNGVYHLFYQHNP
Microexon-tag spanning region78363871-78367187
Microexon-tag prediction score0.9532
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG18719x
Reference Transcript ID OTG18719
Gene ID HannXRQ_Chr08g0226131
Gene Name NA
Transcript ID OTG18719
Protein ID OTG18719
Gene ID HannXRQ_Chr08g0226131
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 6.2e-102
Motif start 36
Motif end 352
Protein seq >OTG18719
MTTTSAIWFLIFLTVGFHHVRYNAITLEQPYRTAFHFQPPQNWMNDPNGLMYYNGVYHLFYQHNPFGPLFAVQMYWGHSV
SHDLINWTPLEHAFAPTQPFDINGCISGSTTILPGNKPVILYTGIDSQNRQVQNVAVPKDPSDPYLREWVRYTGNPVINV
PEGIQPDEFRDPTTAWLADDGKWRVTIGGQKDKAGIAILFRSEDFVNWTRHEEPLYEVTGSGMWECLDFFPVHVDGTNGV
DTSVTNPAVKHVLKMGVYDYARDYYLIGDYNPVKEKYVSQDELTLDSLRYDYGKYYASKSFYDPVRRRRILMAWVNESDS
DADAIAKGWSGLQSFPRSVWLDQNQKQLVQWPIEEIEMLHENEVSLRNENLEDGSLHEILGITPLQVDVQISFKLTNLEE
AEILDPSRLDPQLICSEMDASKKGIFGPFGLLAFASHDLTEQTAIFFRVFQHNGRYIVLMCSDQSRSSTRKGLDKTTYGA
FVDIDPQRDEISLRALIDHSIVESFGGGGKTCIIARVYPTLATRDEAHLFAFNNGTKSVLITELSAWSVKKARINIDETI
GCADA*
CDS seq >OTG18719
ATGACGACAACAAGTGCCATTTGGTTTCTAATATTTTTGACGGTTGGCTTTCATCATGTCCGATACAACGCTATTACCTT
GGAGCAGCCTTATAGGACGGCATTTCACTTTCAGCCTCCCCAAAACTGGATGAACGATCCTAACGGACTTATGTACTACA
ATGGAGTTTACCATCTTTTTTACCAACACAACCCGTTTGGTCCGCTATTCGCTGTTCAAATGTATTGGGGTCATTCAGTA
TCACATGACTTGATAAACTGGACCCCACTCGAACATGCATTTGCCCCAACCCAACCCTTCGACATTAATGGTTGCATCTC
TGGCTCCACAACAATCCTCCCCGGAAACAAACCTGTTATATTATACACTGGAATCGATTCTCAAAATCGCCAAGTTCAAA
ATGTAGCCGTTCCAAAAGACCCTTCCGATCCATATCTTCGAGAATGGGTTAGGTACACTGGCAATCCCGTCATAAACGTA
CCCGAAGGGATTCAACCTGATGAATTTCGAGATCCCACCACCGCGTGGCTTGCTGACGATGGAAAATGGCGGGTGACCAT
TGGAGGTCAGAAGGATAAGGCAGGAATCGCGATTCTTTTCCGGTCTGAGGACTTTGTAAATTGGACTAGACACGAAGAAC
CACTTTATGAGGTTACAGGCAGTGGTATGTGGGAATGCCTTGACTTTTTTCCGGTGCATGTTGATGGCACCAATGGAGTC
GATACATCTGTAACGAACCCCGCTGTAAAACATGTGTTGAAGATGGGAGTCTATGATTATGCAAGAGACTACTACTTAAT
TGGGGATTACAATCCTGTGAAAGAAAAATATGTTTCGCAAGATGAATTAACACTTGACTCGTTGAGATATGATTACGGAA
AGTATTATGCTTCAAAGTCATTCTATGACCCTGTGAGAAGGAGAAGGATCTTGATGGCTTGGGTAAATGAATCTGATTCC
GACGCTGATGCTATTGCTAAAGGATGGTCTGGACTTCAGTCGTTTCCAAGGAGTGTTTGGCTCGATCAAAACCAGAAGCA
GCTCGTACAATGGCCTATCGAGGAAATTGAAATGTTACATGAAAACGAAGTCAGTCTCCGAAATGAGAATCTTGAAGATG
GATCACTACATGAAATTCTAGGCATAACTCCTTTGCAAGTGGATGTGCAGATATCATTCAAACTAACTAATTTAGAAGAG
GCCGAAATACTAGACCCGAGTAGGCTTGATCCACAACTTATTTGCAGCGAAATGGATGCATCAAAGAAAGGCATATTTGG
CCCATTTGGACTCTTAGCTTTTGCTTCCCATGACTTGACTGAACAAACTGCAATCTTTTTTCGTGTTTTCCAACATAATG
GACGCTACATTGTGCTAATGTGCAGTGATCAAAGTCGGTCTTCTACAAGGAAAGGGCTTGATAAAACTACATATGGAGCA
TTTGTCGACATCGATCCTCAACGAGATGAAATTTCACTTCGAGCATTGATAGATCACTCTATCGTCGAGAGCTTTGGAGG
AGGAGGAAAGACGTGCATCATAGCTAGGGTTTACCCAACATTAGCCACTAGAGACGAAGCCCATTTGTTTGCATTTAACA
ATGGAACAAAAAGTGTGTTGATCACTGAGTTGAGTGCTTGGAGCGTAAAGAAAGCTCGAATTAACATCGATGAAACTATT
GGGTGTGCAGATGCATAA
Microexon DNA seq ATCCTAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq CCTTATAGGACGGCATTTCACTTTCAGCCTCCCCAAAACTGGATGAACGATCCTAACGGACTTATGTACTACAATGGAGTTTACCATCTTTTTTACCAACACAACCCG
Microexon-tag Amino Acid seq PYRTAFHFQPPQNWMNDPNGLMYYNGVYHLFYQHNP
Transcript ID OTG18719
Gene ID Ha.53262
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 6.2e-102
Motif start 36
Motif end 352
Protein seq >OTG18719
MTTTSAIWFLIFLTVGFHHVRYNAITLEQPYRTAFHFQPPQNWMNDPNGLMYYNGVYHLFYQHNPFGPLFAVQMYWGHSV
SHDLINWTPLEHAFAPTQPFDINGCISGSTTILPGNKPVILYTGIDSQNRQVQNVAVPKDPSDPYLREWVRYTGNPVINV
PEGIQPDEFRDPTTAWLADDGKWRVTIGGQKDKAGIAILFRSEDFVNWTRHEEPLYEVTGSGMWECLDFFPVHVDGTNGV
DTSVTNPAVKHVLKMGVYDYARDYYLIGDYNPVKEKYVSQDELTLDSLRYDYGKYYASKSFYDPVRRRRILMAWVNESDS
DADAIAKGWSGLQSFPRSVWLDQNQKQLVQWPIEEIEMLHENEVSLRNENLEDGSLHEILGITPLQVDVQISFKLTNLEE
AEILDPSRLDPQLICSEMDASKKGIFGPFGLLAFASHDLTEQTAIFFRVFQHNGRYIVLMCSDQSRSSTRKGLDKTTYGA
FVDIDPQRDEISLRALIDHSIVESFGGGGKTCIIARVYPTLATRDEAHLFAFNNGTKSVLITELSAWSVKKARINIDETI
GCADA*
CDS seq >OTG18719
ATGACGACAACAAGTGCCATTTGGTTTCTAATATTTTTGACGGTTGGCTTTCATCATGTCCGATACAACGCTATTACCTT
GGAGCAGCCTTATAGGACGGCATTTCACTTTCAGCCTCCCCAAAACTGGATGAACGATCCTAACGGACTTATGTACTACA
ATGGAGTTTACCATCTTTTTTACCAACACAACCCGTTTGGTCCGCTATTCGCTGTTCAAATGTATTGGGGTCATTCAGTA
TCACATGACTTGATAAACTGGACCCCACTCGAACATGCATTTGCCCCAACCCAACCCTTCGACATTAATGGTTGCATCTC
TGGCTCCACAACAATCCTCCCCGGAAACAAACCTGTTATATTATACACTGGAATCGATTCTCAAAATCGCCAAGTTCAAA
ATGTAGCCGTTCCAAAAGACCCTTCCGATCCATATCTTCGAGAATGGGTTAGGTACACTGGCAATCCCGTCATAAACGTA
CCCGAAGGGATTCAACCTGATGAATTTCGAGATCCCACCACCGCGTGGCTTGCTGACGATGGAAAATGGCGGGTGACCAT
TGGAGGTCAGAAGGATAAGGCAGGAATCGCGATTCTTTTCCGGTCTGAGGACTTTGTAAATTGGACTAGACACGAAGAAC
CACTTTATGAGGTTACAGGCAGTGGTATGTGGGAATGCCTTGACTTTTTTCCGGTGCATGTTGATGGCACCAATGGAGTC
GATACATCTGTAACGAACCCCGCTGTAAAACATGTGTTGAAGATGGGAGTCTATGATTATGCAAGAGACTACTACTTAAT
TGGGGATTACAATCCTGTGAAAGAAAAATATGTTTCGCAAGATGAATTAACACTTGACTCGTTGAGATATGATTACGGAA
AGTATTATGCTTCAAAGTCATTCTATGACCCTGTGAGAAGGAGAAGGATCTTGATGGCTTGGGTAAATGAATCTGATTCC
GACGCTGATGCTATTGCTAAAGGATGGTCTGGACTTCAGTCGTTTCCAAGGAGTGTTTGGCTCGATCAAAACCAGAAGCA
GCTCGTACAATGGCCTATCGAGGAAATTGAAATGTTACATGAAAACGAAGTCAGTCTCCGAAATGAGAATCTTGAAGATG
GATCACTACATGAAATTCTAGGCATAACTCCTTTGCAAGTGGATGTGCAGATATCATTCAAACTAACTAATTTAGAAGAG
GCCGAAATACTAGACCCGAGTAGGCTTGATCCACAACTTATTTGCAGCGAAATGGATGCATCAAAGAAAGGCATATTTGG
CCCATTTGGACTCTTAGCTTTTGCTTCCCATGACTTGACTGAACAAACTGCAATCTTTTTTCGTGTTTTCCAACATAATG
GACGCTACATTGTGCTAATGTGCAGTGATCAAAGTCGGTCTTCTACAAGGAAAGGGCTTGATAAAACTACATATGGAGCA
TTTGTCGACATCGATCCTCAACGAGATGAAATTTCACTTCGAGCATTGATAGATCACTCTATCGTCGAGAGCTTTGGAGG
AGGAGGAAAGACGTGCATCATAGCTAGGGTTTACCCAACATTAGCCACTAGAGACGAAGCCCATTTGTTTGCATTTAACA
ATGGAACAAAAAGTGTGTTGATCACTGAGTTGAGTGCTTGGAGCGTAAAGAAAGCTCGAATTAACATCGATGAAACTATT
GGGTGTGCAGATGCATAA