Microexon ID Ha_1:67424447-67424455:+
Species Helianthus annuus
Coordinates 1:67424447..67424455
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCCAATG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq GTTTATAGAACCGCCTTCCATTTCCAGCCCAAACAACATTGGATCAACGATCCCAATGCACCAATGTATTACAAGGGATTGTATCATTTGTTCTACCAACATAATCCA
Microexon-tag Amino Acid Seq VYRTAFHFQPKQHWINDPNAPMYYKGLYHLFYQHNP
Microexon-tag spanning region67414539-67424613
Microexon-tag prediction score0.9483
Overlapped with the annotated transcript (%) 15.73
New Transcript ID OTG36672x
Reference Transcript ID OTG36672
Gene ID HannXRQ_Chr01g0010151
Gene Name NA
Transcript ID OTG36672
Protein ID OTG36672
Gene ID HannXRQ_Chr01g0010151
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.1e-101
Motif start 54
Motif end 373
Protein seq >OTG36672
MRSFIVTVWGFVELMCLFLMLNDFKRIMAFHKVSKHLQHINAFQVLEDHKPAYHFQPKHHWINDPNAPMYYKGLYHLFYQ
HNPKGSVWGNIVWAHSVSKDLINWTPLDPAIEPTKPFDKFGCWSGSATILTDNKPIILYTGMTKEKPEPGYQVQNYAIPE
NYSDPYLTKWIKPDSNPIIKPTKENVSAFRDPTTAWKINDQWEMTIGSKVDMLGISYLYRSKDFINWTLVDHPLHQKENE
GMWECPDFYPVSTKGEKGLDTTVIDGDIKHVFKVSLDLTKYDYYTIGKYDTKQDKYIPDKGMVDSWAGLRYDWGNFYASK
TFFDPVKKRRIIWGWANESSTEDENVKKGWAGILLIPRTVWLDPSGKQLLQWPVVELETLRHKNVQLTDKELKKGDIVEV
AGINVAQVDVDVLFKFSNLDKAEEYNTEWDKTFPPEMLAKNICQVMGTTQQGGLGPFGLLALTSKDFKEYTPIFFRIFKT
PNTKHKVLMCSDAMPSTLNLNEYRPSYGGFVDVDTTENKISLRSLIDHSVVESFAAGGKTVMTSRVYPTLAIGDKAHLHV
FNNGTEIVIVERLDAWSMTKPKMN*
CDS seq >OTG36672
ATGCGGTCTTTCATTGTGACTGTTTGGGGTTTCGTAGAATTGATGTGTTTGTTCTTAATGTTGAATGACTTCAAAAGAAT
CATGGCTTTCCACAAGGTTTCTAAACATTTACAACACATTAACGCTTTTCAAGTGTTAGAAGATCACAAACCTGCCTATC
ATTTTCAACCCAAGCACCATTGGATCAACGATCCCAATGCACCAATGTATTACAAGGGATTGTATCATTTGTTCTACCAA
CATAATCCAAAAGGTTCTGTTTGGGGTAACATTGTGTGGGCTCACTCGGTATCAAAAGACCTAATCAACTGGACCCCACT
AGATCCAGCAATCGAACCAACAAAACCATTTGATAAGTTTGGTTGTTGGTCTGGATCTGCCACAATCCTTACTGATAACA
AGCCGATCATACTGTACACTGGGATGACAAAAGAGAAACCAGAACCAGGTTATCAAGTCCAAAACTATGCAATACCCGAA
AACTATTCAGACCCGTACCTAACAAAATGGATCAAACCTGATAGCAATCCCATCATAAAGCCAACCAAAGAGAACGTGTC
CGCATTTCGTGACCCGACAACAGCTTGGAAGATCAATGACCAGTGGGAAATGACCATCGGTAGTAAGGTGGATATGCTAG
GCATATCATACTTGTATCGAAGCAAAGATTTCATTAACTGGACTTTGGTTGACCACCCTCTACACCAAAAAGAGAACGAG
GGGATGTGGGAATGCCCTGATTTTTACCCGGTATCCACTAAGGGAGAAAAAGGGTTGGACACTACAGTAATAGATGGTGA
TATCAAACATGTTTTTAAAGTAAGTCTTGACCTTACCAAATATGATTACTATACAATTGGAAAATACGATACAAAGCAAG
ATAAATACATCCCGGATAAAGGAATGGTTGATAGTTGGGCTGGTTTAAGATACGATTGGGGAAACTTTTATGCTTCGAAG
ACGTTCTTTGACCCTGTGAAGAAGCGAAGGATAATTTGGGGTTGGGCTAATGAATCTAGCACTGAAGACGAAAATGTCAA
GAAAGGATGGGCTGGAATTCTGTTGATTCCACGAACTGTTTGGTTAGACCCATCTGGAAAGCAATTACTACAATGGCCGG
TTGTTGAACTAGAAACACTAAGACATAAAAATGTTCAACTAACTGATAAGGAGCTCAAGAAGGGAGATATAGTTGAAGTT
GCAGGAATTAATGTTGCTCAGGTGGATGTCGACGTGCTTTTTAAGTTCTCGAATTTGGATAAAGCAGAGGAGTATAATAC
TGAATGGGATAAAACATTCCCGCCAGAAATGCTTGCAAAAAACATATGTCAAGTCATGGGTACGACCCAGCAAGGCGGGT
TAGGACCATTTGGTCTTCTAGCATTGACATCCAAGGACTTTAAAGAATACACTCCCATCTTTTTTAGGATCTTCAAAACT
CCTAACACCAAGCATAAAGTTCTCATGTGCTCAGATGCTATGCCGTCAACATTGAACCTAAACGAGTATAGGCCATCATA
TGGAGGATTTGTGGATGTAGATACTACCGAAAACAAGATTTCTCTTCGGAGTTTGATTGACCATTCGGTTGTGGAAAGCT
TTGCAGCAGGGGGGAAGACTGTAATGACATCTAGGGTTTACCCGACATTAGCAATTGGCGATAAAGCCCATTTGCATGTA
TTTAACAATGGTACTGAAATTGTTATAGTTGAGAGACTTGATGCTTGGTCCATGACGAAACCTAAAATGAACTAA
Microexon DNA seq ATCCCAATG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq GATCACAAACCTGCCTATCATTTTCAACCCAAGCACCATTGGATCAACGATCCCAATGCACCAATGTATTACAAGGGATTGTATCATTTGTTCTACCAACATAATCCA
Microexon-tag Amino Acid seq DHKPAYHFQPKHHWINDPNAPMYYKGLYHLFYQHNP
Transcript ID OTG36672
Gene ID Ha.1084
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.1e-101
Motif start 54
Motif end 373
Protein seq >OTG36672
MRSFIVTVWGFVELMCLFLMLNDFKRIMAFHKVSKHLQHINAFQVLEDHKPAYHFQPKHHWINDPNAPMYYKGLYHLFYQ
HNPKGSVWGNIVWAHSVSKDLINWTPLDPAIEPTKPFDKFGCWSGSATILTDNKPIILYTGMTKEKPEPGYQVQNYAIPE
NYSDPYLTKWIKPDSNPIIKPTKENVSAFRDPTTAWKINDQWEMTIGSKVDMLGISYLYRSKDFINWTLVDHPLHQKENE
GMWECPDFYPVSTKGEKGLDTTVIDGDIKHVFKVSLDLTKYDYYTIGKYDTKQDKYIPDKGMVDSWAGLRYDWGNFYASK
TFFDPVKKRRIIWGWANESSTEDENVKKGWAGILLIPRTVWLDPSGKQLLQWPVVELETLRHKNVQLTDKELKKGDIVEV
AGINVAQVDVDVLFKFSNLDKAEEYNTEWDKTFPPEMLAKNICQVMGTTQQGGLGPFGLLALTSKDFKEYTPIFFRIFKT
PNTKHKVLMCSDAMPSTLNLNEYRPSYGGFVDVDTTENKISLRSLIDHSVVESFAAGGKTVMTSRVYPTLAIGDKAHLHV
FNNGTEIVIVERLDAWSMTKPKMN*
CDS seq >OTG36672
ATGCGGTCTTTCATTGTGACTGTTTGGGGTTTCGTAGAATTGATGTGTTTGTTCTTAATGTTGAATGACTTCAAAAGAAT
CATGGCTTTCCACAAGGTTTCTAAACATTTACAACACATTAACGCTTTTCAAGTGTTAGAAGATCACAAACCTGCCTATC
ATTTTCAACCCAAGCACCATTGGATCAACGATCCCAATGCACCAATGTATTACAAGGGATTGTATCATTTGTTCTACCAA
CATAATCCAAAAGGTTCTGTTTGGGGTAACATTGTGTGGGCTCACTCGGTATCAAAAGACCTAATCAACTGGACCCCACT
AGATCCAGCAATCGAACCAACAAAACCATTTGATAAGTTTGGTTGTTGGTCTGGATCTGCCACAATCCTTACTGATAACA
AGCCGATCATACTGTACACTGGGATGACAAAAGAGAAACCAGAACCAGGTTATCAAGTCCAAAACTATGCAATACCCGAA
AACTATTCAGACCCGTACCTAACAAAATGGATCAAACCTGATAGCAATCCCATCATAAAGCCAACCAAAGAGAACGTGTC
CGCATTTCGTGACCCGACAACAGCTTGGAAGATCAATGACCAGTGGGAAATGACCATCGGTAGTAAGGTGGATATGCTAG
GCATATCATACTTGTATCGAAGCAAAGATTTCATTAACTGGACTTTGGTTGACCACCCTCTACACCAAAAAGAGAACGAG
GGGATGTGGGAATGCCCTGATTTTTACCCGGTATCCACTAAGGGAGAAAAAGGGTTGGACACTACAGTAATAGATGGTGA
TATCAAACATGTTTTTAAAGTAAGTCTTGACCTTACCAAATATGATTACTATACAATTGGAAAATACGATACAAAGCAAG
ATAAATACATCCCGGATAAAGGAATGGTTGATAGTTGGGCTGGTTTAAGATACGATTGGGGAAACTTTTATGCTTCGAAG
ACGTTCTTTGACCCTGTGAAGAAGCGAAGGATAATTTGGGGTTGGGCTAATGAATCTAGCACTGAAGACGAAAATGTCAA
GAAAGGATGGGCTGGAATTCTGTTGATTCCACGAACTGTTTGGTTAGACCCATCTGGAAAGCAATTACTACAATGGCCGG
TTGTTGAACTAGAAACACTAAGACATAAAAATGTTCAACTAACTGATAAGGAGCTCAAGAAGGGAGATATAGTTGAAGTT
GCAGGAATTAATGTTGCTCAGGTGGATGTCGACGTGCTTTTTAAGTTCTCGAATTTGGATAAAGCAGAGGAGTATAATAC
TGAATGGGATAAAACATTCCCGCCAGAAATGCTTGCAAAAAACATATGTCAAGTCATGGGTACGACCCAGCAAGGCGGGT
TAGGACCATTTGGTCTTCTAGCATTGACATCCAAGGACTTTAAAGAATACACTCCCATCTTTTTTAGGATCTTCAAAACT
CCTAACACCAAGCATAAAGTTCTCATGTGCTCAGATGCTATGCCGTCAACATTGAACCTAAACGAGTATAGGCCATCATA
TGGAGGATTTGTGGATGTAGATACTACCGAAAACAAGATTTCTCTTCGGAGTTTGATTGACCATTCGGTTGTGGAAAGCT
TTGCAGCAGGGGGGAAGACTGTAATGACATCTAGGGTTTACCCGACATTAGCAATTGGCGATAAAGCCCATTTGCATGTA
TTTAACAATGGTACTGAAATTGTTATAGTTGAGAGACTTGATGCTTGGTCCATGACGAAACCTAAAATGAACTAA