Microexon ID Ha_13:114551943-114551951:-
Species Helianthus annuus
Coordinates 13:114551943..114551951
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TCTTACCGGACAGCATATCACTTCCAGCCTACCAAAAATTGGATGAATGATCCTAATGGACCAATGTATTACAAGGGGGTCTACCATTTTTTCTATCAACACAATCCA
Microexon-tag Amino Acid Seq SYRTAYHFQPTKNWMNDPNGPMYYKGVYHFFYQHNP
Microexon-tag spanning region114551387-114552110
Microexon-tag prediction score0.9595
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG01911x
Reference Transcript ID OTG01911
Gene ID HannXRQ_Chr13g0407191
Gene Name NA
Transcript ID OTG01911
Protein ID OTG01911
Gene ID HannXRQ_Chr13g0407191
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 9.5e-97
Motif start 77
Motif end 395
Protein seq >OTG01911
MLTYLYKAYNTCPTAHSLHFTMLFKTKLSSLNIPMKIYGFYVFSLCCFLIFNGVRVEAFRNHTRLLNVSQSYRTAYHFQP
TKNWMNDPNGPMYYKGVYHFFYQHNPYGPLWGNMSWAHAISHDLINWVHLDIALVPNEPYDINGCFSGSTTILPNGEPAI
LYTGVDANLHQVQNLAFPKNISDPLLNEWVKWPLNPILSPPDEIDPSFYRDPSTGWMGADGEWRVVIGTRIDHQGAAILY
KSKDFRSWNRSLSPLHLSNITTFWECPDFYPVMRNGKSGLDTSVTSVQGKDIRHVLKASFNYQDYYILGKYDPLSDRYDV
DADFMKSKGWLRYDYGRFYASKSFYDGEKKRRILWSWVCEGDTAPDAIEKGWSGLQTIPRSIWLSKNEDQLVQWPVKELE
KLRTRKVHLENKKLKGGSMIEISGITASQADVEITFSFSNLKHAEVLSAEVVDPQILCTQKNATTDGKFGPFGLLVLASK
DLTEHTAVFFRVFRIANSFRVLMCADTSRSSLKPFIDKPIYGAFLALDPRYAKISLRTLIDHSIVESFGGEGLACITSRV
YPVLAVGEQAHLYAFNNGTQSLSISSLSAWSMKKAQIV*
CDS seq >OTG01911
ATGCTCACGTACCTATATAAAGCCTACAACACTTGTCCAACTGCACACAGTTTGCATTTCACGATGCTTTTCAAAACTAA
ACTATCATCTCTGAATATTCCGATGAAGATATATGGTTTTTATGTGTTTTCTTTGTGTTGTTTCCTGATTTTTAATGGAG
TCCGTGTTGAAGCTTTCAGGAACCATACACGGCTACTGAATGTTTCACAGTCTTACCGGACAGCATATCACTTCCAGCCT
ACCAAAAATTGGATGAATGATCCTAATGGACCAATGTATTACAAGGGGGTCTACCATTTTTTCTATCAACACAATCCATA
TGGTCCACTATGGGGTAACATGTCATGGGCTCATGCCATATCACATGATCTTATCAACTGGGTACATCTTGATATTGCTC
TTGTACCAAACGAGCCTTACGATATCAATGGGTGTTTTTCTGGTTCGACAACGATACTACCAAATGGTGAACCGGCGATC
CTATACACAGGTGTTGATGCCAATTTACACCAAGTGCAAAATTTGGCATTTCCCAAAAACATATCGGACCCGCTACTAAA
TGAATGGGTAAAATGGCCACTTAACCCAATATTGAGTCCTCCGGATGAAATTGACCCATCCTTTTATAGAGATCCATCAA
CTGGTTGGATGGGTGCGGATGGAGAATGGAGGGTTGTGATTGGAACTCGGATTGATCATCAAGGAGCAGCTATTCTATAC
AAAAGTAAGGATTTTCGCAGTTGGAATAGGTCTCTGAGCCCGTTACACCTCTCTAACATAACAACATTTTGGGAATGTCC
TGACTTTTATCCTGTTATGCGTAATGGGAAAAGCGGGCTTGATACATCAGTTACATCCGTTCAAGGGAAGGACATTAGGC
ATGTTCTTAAAGCTAGTTTTAATTACCAGGATTATTATATACTGGGAAAATACGATCCACTAAGTGACCGCTACGACGTT
GATGCTGACTTTATGAAGAGTAAAGGGTGGTTACGGTATGATTATGGGAGGTTTTATGCTTCTAAGTCGTTTTATGATGG
TGAGAAGAAGAGAAGGATATTATGGTCATGGGTTTGTGAAGGTGATACTGCACCAGATGCTATCGAAAAAGGTTGGTCTG
GCCTTCAGACGATTCCTAGGAGCATCTGGCTTAGTAAAAATGAAGATCAGTTGGTGCAGTGGCCTGTCAAGGAACTAGAG
AAACTACGAACACGAAAAGTGCATTTGGAGAATAAAAAACTCAAGGGCGGGTCCATGATTGAAATTTCTGGTATCACAGC
TTCACAGGCTGATGTAGAAATCACGTTTAGCTTCTCAAATCTCAAACATGCTGAGGTGTTGAGTGCTGAAGTAGTTGATC
CCCAAATTCTTTGTACACAAAAGAATGCTACAACAGATGGCAAATTTGGGCCTTTCGGTTTGCTGGTTTTGGCTTCGAAG
GACCTTACTGAACACACTGCGGTCTTCTTTCGTGTCTTTAGAATTGCTAACAGTTTCCGAGTGCTCATGTGTGCTGACAC
AAGCAGGTCGTCTTTAAAACCATTTATCGACAAACCCATCTATGGAGCTTTTCTTGCACTTGATCCTCGATACGCAAAGA
TCTCCTTGAGAACCTTGATAGATCACTCCATTGTAGAAAGCTTTGGTGGAGAAGGGTTGGCTTGCATTACATCAAGAGTT
TATCCGGTACTGGCAGTTGGAGAACAAGCACATCTTTATGCATTCAACAATGGCACTCAGAGTTTGAGTATCTCAAGTTT
AAGTGCTTGGAGTATGAAGAAAGCTCAAATTGTCTAA
Microexon DNA seq ATCCTAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TCTTACCGGACAGCATATCACTTCCAGCCTACCAAAAATTGGATGAATGATCCTAATGGACCAATGTATTACAAGGGGGTCTACCATTTTTTCTATCAACACAATCCA
Microexon-tag Amino Acid seq SYRTAYHFQPTKNWMNDPNGPMYYKGVYHFFYQHNP
Transcript ID OTG01911
Gene ID HannXRQ_Chr13g0407191
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 9.5e-97
Motif start 77
Motif end 395
Protein seq >OTG01911
MLTYLYKAYNTCPTAHSLHFTMLFKTKLSSLNIPMKIYGFYVFSLCCFLIFNGVRVEAFRNHTRLLNVSQSYRTAYHFQP
TKNWMNDPNGPMYYKGVYHFFYQHNPYGPLWGNMSWAHAISHDLINWVHLDIALVPNEPYDINGCFSGSTTILPNGEPAI
LYTGVDANLHQVQNLAFPKNISDPLLNEWVKWPLNPILSPPDEIDPSFYRDPSTGWMGADGEWRVVIGTRIDHQGAAILY
KSKDFRSWNRSLSPLHLSNITTFWECPDFYPVMRNGKSGLDTSVTSVQGKDIRHVLKASFNYQDYYILGKYDPLSDRYDV
DADFMKSKGWLRYDYGRFYASKSFYDGEKKRRILWSWVCEGDTAPDAIEKGWSGLQTIPRSIWLSKNEDQLVQWPVKELE
KLRTRKVHLENKKLKGGSMIEISGITASQADVEITFSFSNLKHAEVLSAEVVDPQILCTQKNATTDGKFGPFGLLVLASK
DLTEHTAVFFRVFRIANSFRVLMCADTSRSSLKPFIDKPIYGAFLALDPRYAKISLRTLIDHSIVESFGGEGLACITSRV
YPVLAVGEQAHLYAFNNGTQSLSISSLSAWSMKKAQIV*
CDS seq >OTG01911
ATGCTCACGTACCTATATAAAGCCTACAACACTTGTCCAACTGCACACAGTTTGCATTTCACGATGCTTTTCAAAACTAA
ACTATCATCTCTGAATATTCCGATGAAGATATATGGTTTTTATGTGTTTTCTTTGTGTTGTTTCCTGATTTTTAATGGAG
TCCGTGTTGAAGCTTTCAGGAACCATACACGGCTACTGAATGTTTCACAGTCTTACCGGACAGCATATCACTTCCAGCCT
ACCAAAAATTGGATGAATGATCCTAATGGACCAATGTATTACAAGGGGGTCTACCATTTTTTCTATCAACACAATCCATA
TGGTCCACTATGGGGTAACATGTCATGGGCTCATGCCATATCACATGATCTTATCAACTGGGTACATCTTGATATTGCTC
TTGTACCAAACGAGCCTTACGATATCAATGGGTGTTTTTCTGGTTCGACAACGATACTACCAAATGGTGAACCGGCGATC
CTATACACAGGTGTTGATGCCAATTTACACCAAGTGCAAAATTTGGCATTTCCCAAAAACATATCGGACCCGCTACTAAA
TGAATGGGTAAAATGGCCACTTAACCCAATATTGAGTCCTCCGGATGAAATTGACCCATCCTTTTATAGAGATCCATCAA
CTGGTTGGATGGGTGCGGATGGAGAATGGAGGGTTGTGATTGGAACTCGGATTGATCATCAAGGAGCAGCTATTCTATAC
AAAAGTAAGGATTTTCGCAGTTGGAATAGGTCTCTGAGCCCGTTACACCTCTCTAACATAACAACATTTTGGGAATGTCC
TGACTTTTATCCTGTTATGCGTAATGGGAAAAGCGGGCTTGATACATCAGTTACATCCGTTCAAGGGAAGGACATTAGGC
ATGTTCTTAAAGCTAGTTTTAATTACCAGGATTATTATATACTGGGAAAATACGATCCACTAAGTGACCGCTACGACGTT
GATGCTGACTTTATGAAGAGTAAAGGGTGGTTACGGTATGATTATGGGAGGTTTTATGCTTCTAAGTCGTTTTATGATGG
TGAGAAGAAGAGAAGGATATTATGGTCATGGGTTTGTGAAGGTGATACTGCACCAGATGCTATCGAAAAAGGTTGGTCTG
GCCTTCAGACGATTCCTAGGAGCATCTGGCTTAGTAAAAATGAAGATCAGTTGGTGCAGTGGCCTGTCAAGGAACTAGAG
AAACTACGAACACGAAAAGTGCATTTGGAGAATAAAAAACTCAAGGGCGGGTCCATGATTGAAATTTCTGGTATCACAGC
TTCACAGGCTGATGTAGAAATCACGTTTAGCTTCTCAAATCTCAAACATGCTGAGGTGTTGAGTGCTGAAGTAGTTGATC
CCCAAATTCTTTGTACACAAAAGAATGCTACAACAGATGGCAAATTTGGGCCTTTCGGTTTGCTGGTTTTGGCTTCGAAG
GACCTTACTGAACACACTGCGGTCTTCTTTCGTGTCTTTAGAATTGCTAACAGTTTCCGAGTGCTCATGTGTGCTGACAC
AAGCAGGTCGTCTTTAAAACCATTTATCGACAAACCCATCTATGGAGCTTTTCTTGCACTTGATCCTCGATACGCAAAGA
TCTCCTTGAGAACCTTGATAGATCACTCCATTGTAGAAAGCTTTGGTGGAGAAGGGTTGGCTTGCATTACATCAAGAGTT
TATCCGGTACTGGCAGTTGGAGAACAAGCACATCTTTATGCATTCAACAATGGCACTCAGAGTTTGAGTATCTCAAGTTT
AAGTGCTTGGAGTATGAAGAAAGCTCAAATTGTCTAA