Microexon ID Ha_1:67420364-67420372:+
Species Helianthus annuus
Coordinates 1:67420364..67420372
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Ha_1:67420364-67420372:+ does not have available information here.
Transcript ID OTG36671
Protein ID OTG36671
Gene ID HannXRQ_Chr01g0010141
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 3.2e-101
Motif start 57
Motif end 374
Protein seq >OTG36671
MDLQIQSKLIRVLLCCIPIIFNTFNGNGVVMASHKVNPNYQSVSAVKVKQVYRTAFHFQPKQHWINDPNAPMYYKGLYHF
FCQYNPKGAVWGNIVWAHSVSTDMINWISLKPALVPSKWFDKYGCWSGSATILPGEKPVILYTGVIKEKPEPGYQVQNYA
IPANYSDPYLQEWVKPDNNPILKPVQVNISSFRDPSTAWYNNGHWKMLVGSRHYSRGIAYLYRSKDFVRWTRARHPFNEK
LGTGMWECPDFYPLSSQGQRNGLDASASGTKYVFKVSLDETRNECYMIGEYDLVQDRFHPDNTSGWTAGLRYDYGNFYAS
KTFFDPIKKRRILWGWANESSTKDEDVAKGWAGIQLIPRMVWLDPSGKQLLQWPIRELETLRGKKKNLKNVKLNKGDIME
IKGITAAQADVDVTFTFSSFSKVEPYDKKWEKFSPQDLCGINGATVQGGLGPFGILALASKNLEEYTPVFFRIFKTHDKN
YKVLMCSDATPSSTNPNEYKPSFGGFVDMDLTNKTLCLRSLIDHSVVESFGEGGKTVITSRVYPELAVYGDAHLHVFNNG
SETITVERLNAWSINAPIMN*
CDS seq >OTG36671
ATGGATCTTCAAATTCAAAGTAAACTAATTAGGGTTTTGCTATGTTGCATCCCCATAATCTTCAATACTTTCAATGGCAA
TGGTGTAGTCATGGCTTCCCACAAGGTTAATCCAAACTACCAGTCTGTAAGTGCTGTCAAAGTCAAGCAGGTTTATAGAA
CCGCCTTCCATTTCCAGCCCAAACAACATTGGATCAACGATCCAAACGCACCAATGTATTACAAAGGACTTTACCATTTC
TTCTGTCAATACAACCCCAAGGGTGCGGTGTGGGGCAACATTGTGTGGGCGCACTCTGTATCAACGGACATGATTAACTG
GATATCGTTAAAACCCGCACTAGTACCATCAAAATGGTTTGATAAGTATGGTTGTTGGTCAGGATCCGCCACAATCCTCC
CAGGTGAAAAACCTGTCATTTTATACACTGGGGTAATAAAAGAGAAGCCAGAACCAGGCTACCAAGTTCAAAACTACGCG
ATACCTGCCAATTATTCAGATCCATACCTCCAAGAATGGGTCAAACCCGACAACAATCCAATCTTGAAGCCAGTTCAAGT
GAACATTTCGTCTTTCCGTGACCCTTCAACGGCTTGGTACAACAATGGTCATTGGAAAATGTTGGTTGGTAGTAGACACT
ATAGTCGAGGGATTGCTTACCTGTACCGAAGCAAAGATTTTGTTAGGTGGACCCGTGCTAGGCACCCGTTCAATGAAAAG
CTAGGTACGGGTATGTGGGAATGCCCAGACTTTTATCCGTTATCATCTCAGGGTCAAAGAAACGGGTTAGACGCTTCAGC
TAGTGGCACTAAATATGTTTTCAAGGTGAGCCTTGATGAGACGAGAAATGAATGCTACATGATTGGAGAATATGATTTGG
TGCAAGACCGGTTTCACCCGGATAATACATCCGGATGGACCGCAGGGTTAAGATATGATTATGGAAACTTCTATGCGTCC
AAGACATTCTTTGATCCTATCAAGAAGCGGAGAATTTTGTGGGGTTGGGCTAATGAGTCCAGCACAAAAGATGAAGATGT
TGCCAAGGGATGGGCAGGAATTCAGTTGATTCCACGTATGGTTTGGCTGGATCCTAGTGGAAAGCAGTTGCTACAATGGC
CAATTCGGGAACTAGAAACCCTAAGGGGTAAGAAGAAGAATCTAAAGAATGTAAAACTCAACAAAGGAGATATCATGGAA
ATAAAAGGAATTACTGCCGCTCAGGCGGATGTAGATGTTACATTTACATTTTCAAGTTTTAGTAAAGTTGAGCCATATGA
TAAAAAGTGGGAAAAATTTTCACCACAAGATCTATGTGGGATCAATGGTGCAACCGTACAAGGGGGACTTGGACCATTCG
GAATTCTAGCATTGGCCTCCAAGAATCTTGAAGAATACACTCCAGTTTTCTTTCGGATTTTCAAGACTCATGACAAGAAT
TATAAAGTTCTCATGTGCTCTGATGCTACTCCTTCCTCAACTAATCCGAATGAATACAAGCCATCATTTGGAGGGTTTGT
CGATATGGATTTAACTAATAAGACACTTTGCCTTAGGAGTTTAATTGATCATTCAGTTGTGGAAAGCTTTGGAGAGGGAG
GGAAAACAGTCATCACATCTAGGGTTTATCCAGAACTCGCAGTGTATGGTGATGCACATTTGCATGTATTTAACAATGGT
AGCGAGACCATAACTGTTGAGAGACTTAATGCTTGGTCCATCAATGCACCTATTATGAACTAA
Microexon DNA seq ATCCAAACG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq GTTTATAGAACCGCCTTCCATTTCCAGCCCAAACAACATTGGATCAACGATCCAAACGCACCAATGTATTACAAAGGACTTTACCATTTCTTCTGTCAATACAACCCC
Microexon-tag Amino Acid seq VYRTAFHFQPKQHWINDPNAPMYYKGLYHFFCQYNP
Transcript ID OTG36671
Gene ID Ha.1083
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 3.2e-101
Motif start 57
Motif end 374
Protein seq >OTG36671
MDLQIQSKLIRVLLCCIPIIFNTFNGNGVVMASHKVNPNYQSVSAVKVKQVYRTAFHFQPKQHWINDPNAPMYYKGLYHF
FCQYNPKGAVWGNIVWAHSVSTDMINWISLKPALVPSKWFDKYGCWSGSATILPGEKPVILYTGVIKEKPEPGYQVQNYA
IPANYSDPYLQEWVKPDNNPILKPVQVNISSFRDPSTAWYNNGHWKMLVGSRHYSRGIAYLYRSKDFVRWTRARHPFNEK
LGTGMWECPDFYPLSSQGQRNGLDASASGTKYVFKVSLDETRNECYMIGEYDLVQDRFHPDNTSGWTAGLRYDYGNFYAS
KTFFDPIKKRRILWGWANESSTKDEDVAKGWAGIQLIPRMVWLDPSGKQLLQWPIRELETLRGKKKNLKNVKLNKGDIME
IKGITAAQADVDVTFTFSSFSKVEPYDKKWEKFSPQDLCGINGATVQGGLGPFGILALASKNLEEYTPVFFRIFKTHDKN
YKVLMCSDATPSSTNPNEYKPSFGGFVDMDLTNKTLCLRSLIDHSVVESFGEGGKTVITSRVYPELAVYGDAHLHVFNNG
SETITVERLNAWSINAPIMN*
CDS seq >OTG36671
ATGGATCTTCAAATTCAAAGTAAACTAATTAGGGTTTTGCTATGTTGCATCCCCATAATCTTCAATACTTTCAATGGCAA
TGGTGTAGTCATGGCTTCCCACAAGGTTAATCCAAACTACCAGTCTGTAAGTGCTGTCAAAGTCAAGCAGGTTTATAGAA
CCGCCTTCCATTTCCAGCCCAAACAACATTGGATCAACGATCCAAACGCACCAATGTATTACAAAGGACTTTACCATTTC
TTCTGTCAATACAACCCCAAGGGTGCGGTGTGGGGCAACATTGTGTGGGCGCACTCTGTATCAACGGACATGATTAACTG
GATATCGTTAAAACCCGCACTAGTACCATCAAAATGGTTTGATAAGTATGGTTGTTGGTCAGGATCCGCCACAATCCTCC
CAGGTGAAAAACCTGTCATTTTATACACTGGGGTAATAAAAGAGAAGCCAGAACCAGGCTACCAAGTTCAAAACTACGCG
ATACCTGCCAATTATTCAGATCCATACCTCCAAGAATGGGTCAAACCCGACAACAATCCAATCTTGAAGCCAGTTCAAGT
GAACATTTCGTCTTTCCGTGACCCTTCAACGGCTTGGTACAACAATGGTCATTGGAAAATGTTGGTTGGTAGTAGACACT
ATAGTCGAGGGATTGCTTACCTGTACCGAAGCAAAGATTTTGTTAGGTGGACCCGTGCTAGGCACCCGTTCAATGAAAAG
CTAGGTACGGGTATGTGGGAATGCCCAGACTTTTATCCGTTATCATCTCAGGGTCAAAGAAACGGGTTAGACGCTTCAGC
TAGTGGCACTAAATATGTTTTCAAGGTGAGCCTTGATGAGACGAGAAATGAATGCTACATGATTGGAGAATATGATTTGG
TGCAAGACCGGTTTCACCCGGATAATACATCCGGATGGACCGCAGGGTTAAGATATGATTATGGAAACTTCTATGCGTCC
AAGACATTCTTTGATCCTATCAAGAAGCGGAGAATTTTGTGGGGTTGGGCTAATGAGTCCAGCACAAAAGATGAAGATGT
TGCCAAGGGATGGGCAGGAATTCAGTTGATTCCACGTATGGTTTGGCTGGATCCTAGTGGAAAGCAGTTGCTACAATGGC
CAATTCGGGAACTAGAAACCCTAAGGGGTAAGAAGAAGAATCTAAAGAATGTAAAACTCAACAAAGGAGATATCATGGAA
ATAAAAGGAATTACTGCCGCTCAGGCGGATGTAGATGTTACATTTACATTTTCAAGTTTTAGTAAAGTTGAGCCATATGA
TAAAAAGTGGGAAAAATTTTCACCACAAGATCTATGTGGGATCAATGGTGCAACCGTACAAGGGGGACTTGGACCATTCG
GAATTCTAGCATTGGCCTCCAAGAATCTTGAAGAATACACTCCAGTTTTCTTTCGGATTTTCAAGACTCATGACAAGAAT
TATAAAGTTCTCATGTGCTCTGATGCTACTCCTTCCTCAACTAATCCGAATGAATACAAGCCATCATTTGGAGGGTTTGT
CGATATGGATTTAACTAATAAGACACTTTGCCTTAGGAGTTTAATTGATCATTCAGTTGTGGAAAGCTTTGGAGAGGGAG
GGAAAACAGTCATCACATCTAGGGTTTATCCAGAACTCGCAGTGTATGGTGATGCACATTTGCATGTATTTAACAATGGT
AGCGAGACCATAACTGTTGAGAGACTTAATGCTTGGTCCATCAATGCACCTATTATGAACTAA