Microexon ID Ha_4:173380502-173380505:+
Species Helianthus annuus
Coordinates 4:173380502..173380505
Microexon Cluster ID MEP05
Size 4
Phase 2
Pfam Domain Motif Helicase_C
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 53,4,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (degenerate, IUPAC ambiguity codes) GATGAYTGGMTTTCWKCSARRAYWCAAGTTGTWGTKGCHACWGTRGCWTTTGGRATGGGWATWGATARRMARGATGTYMGDATTGTKTGYCAYTTYAAYWTKCCWAAR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
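The Microexon ID, Coordinates and Size fields above appear to encode the same locus. As a minimal sketch, assuming the ID follows a `<species code>_<chromosome>:<start>-<end>:<strand>` layout with 1-based inclusive coordinates (an assumption inferred from this record, not a documented format), the ID can be parsed and checked against the Size field. The names `MicroexonLocus` and `parse_microexon_id` are hypothetical.

```python
import re
from typing import NamedTuple


class MicroexonLocus(NamedTuple):
    """Hypothetical container for the parts of a microexon ID."""
    species_code: str
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def size(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


# Assumed layout: <species code>_<chromosome>:<start>-<end>:<strand>
ID_PATTERN = re.compile(
    r"^(?P<sp>[^_]+)_(?P<chrom>[^:]+):(?P<start>\d+)-(?P<end>\d+):(?P<strand>[+-])$"
)


def parse_microexon_id(microexon_id: str) -> MicroexonLocus:
    """Split a microexon ID into its components, or raise ValueError."""
    match = ID_PATTERN.match(microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    return MicroexonLocus(
        species_code=match["sp"],
        chrom=match["chrom"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_microexon_id("Ha_4:173380502-173380505:+")
print(locus, locus.size)
```

Under that assumption, the parsed size agrees with the Size field of this record (4 nt).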
Microexon DNA seq GATG
Microexon Amino Acid seq GM
Microexon-tag DNA Seq GATGATTGGATTTCTGCCAAGACGCAAGTTGTAGTGGCTACAGTGGCGTTTGGGATGGGTATAGATCGGAAAGATGTCAGAATTGTTTGCCATTTCAATATTCCGAAA
Microexon-tag Amino Acid Seq DDWISAKTQVVVATVAFGMGIDRKDVRIVCHFNIPK
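The structure field (53,4,51), the microexon sequence and the tag peptide above can be cross-checked against each other. The sketch below splits the species-specific tag by the three sizes, translates it with the standard genetic code, and reports the residues whose codons overlap the microexon. It assumes the tag starts in reading frame 0, which the record's tag peptide supports, and it reads Phase as the number of codon bases contributed by the upstream flanking exon (an interpretation, not something the record states). The function names are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Values copied from this record.
TAG = (
    "GATGATTGGATTTCTGCCAAGACGCAAGTTGTAGTGGCTACAGTGGCGTTTGGG"
    "ATGGGTATAGATCGGAAAGATGTCAGAATTGTTTGCCATTTCAATATTCCGAAA"
)
SIZES = (53, 4, 51)  # upstream flanking exon, microexon, downstream flanking exon


def split_tag(tag, sizes):
    """Cut the tag into (upstream, microexon, downstream) by the given sizes."""
    up, micro, down = sizes
    if up + micro + down != len(tag):
        raise ValueError("sizes do not add up to the tag length")
    return tag[:up], tag[up:up + micro], tag[up + micro:]


def translate(dna):
    """Translate in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def residues_overlapping_microexon(tag, sizes):
    """Return the residues whose codons contain at least one microexon base."""
    up, micro, _ = sizes
    first_codon = up // 3
    last_codon = (up + micro - 1) // 3
    return translate(tag)[first_codon:last_codon + 1]


upstream, microexon, downstream = split_tag(TAG, SIZES)
print(microexon)                                   # microexon bases
print(len(upstream) % 3)                           # codon bases left in the upstream exon
print(translate(TAG))                              # tag peptide
print(residues_overlapping_microexon(TAG, SIZES))  # residues touching the microexon
```

For this record the four printed values are `GATG`, `2`, the 36-residue tag peptide, and `GM`, matching the Microexon DNA seq, Phase, Microexon-tag Amino Acid Seq and Microexon Amino Acid seq fields.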
Microexon-tag spanning region 173379661-173380654
Microexon-tag prediction score 0.9695
Overlapped with the annotated transcript (%) 100
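The record lists two microexon-tag DNA sequences: one written with IUPAC ambiguity codes (Y, M, W, K, S, R, H, D) and one concrete Helianthus annuus sequence. A short sketch of how the two might be compared position by position is given below. It uses the standard IUPAC nucleotide code table; the function name `iupac_mismatches` is hypothetical, and treating the degenerate sequence as a pattern the species sequence should satisfy is an assumption about how the record is meant to be read.

```python
# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Both sequences are copied from this record.
DEGENERATE_TAG = (
    "GATGAYTGGMTTTCWKCSARRAYWCAAGTTGTWGTKGCHACWGTRGCWTTTGGR"
    "ATGGGWATWGATARRMARGATGTYMGDATTGTKTGYCAYTTYAAYWTKCCWAAR"
)
SPECIES_TAG = (
    "GATGATTGGATTTCTGCCAAGACGCAAGTTGTAGTGGCTACAGTGGCGTTTGGG"
    "ATGGGTATAGATCGGAAAGATGTCAGAATTGTTTGCCATTTCAATATTCCGAAA"
)


def iupac_mismatches(degenerate, sequence):
    """Return 1-based positions where `sequence` is not allowed by `degenerate`."""
    if len(degenerate) != len(sequence):
        raise ValueError("sequences must have the same length")
    pairs = zip(degenerate.upper(), sequence.upper())
    return [
        position
        for position, (code, base) in enumerate(pairs, start=1)
        if base not in IUPAC[code]
    ]


positions = iupac_mismatches(DEGENERATE_TAG, SPECIES_TAG)
print(positions)
# True if every microexon base (tag positions 54-57) is compatible.
print(all(p not in range(54, 58) for p in positions))
```

Any positions reported are places where the species sequence departs from the degenerate sequence. That would be expected if the degenerate sequence summarises several species rather than constraining each one.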
New Transcript ID OTG29781x
Reference Transcript ID OTG29781
Gene ID HannXRQ_Chr04g0125951
Gene Name RecQl3
Transcript ID OTG29781
Protein ID OTG29781
Gene ID HannXRQ_Chr04g0125951
Gene Name RecQl3
Pfam domain motif Helicase_C
Motif E-value 1.7e-14
Motif start 277
Motif end 366
Protein seq >OTG29781
MIKALTSKQSQLRSSGNGFKKALLPVKNISKERKVSGKEALTKLLRWHFGHSEYRGKQLEAIEAVLSGRDCFCLMPTGGG
KSICYQIPALAKPGIVLVVSPLIALMENQVMALKEKGVAAEYLSSTQTAQVRNKIHEELESGNPGLRLLYVTPELIATTG
FMSKLTKIHSRGLLNLIAIDEAHCISTWGHDFRPSYRKLASLRKRLPDVPMLALTATAVPKVQVDVIESLNMENPLVLKS
SFNRPNIYYEVRFKDLLTDPYVDLTDLIKSCGDVCGIVYCLERTTCDDLASHLSKNGISCAAYHAGLNNKLRTSVLDDWI
SAKTQVVVATVAFGMGIDRKDVRIVCHFNIPKSMVSFYQESGRAGRDQQPSGSVLYYGIDDRKKMQFILNNADNKKSQSS
SSQDRSPKKSIVDFNLMVEYCETSGCRRKKILDSFGEQVSTSLCKKTCDACKNPNIVEKYLDELKAMCSLGNRIGSSRIF
ISSSSKPVDDVDFSEYWSRDDEAIVSEDDISDDDDGIDVVEDLTQSALPSKTKFSEKMDILQRAEEKYYQKNDHDKQSNK
LDKNAINETTRESCKQRLLNSLKQTQQRLNNLPIDPEVSSVFFENECFKKYGKTGKSFYLSQVASTVRWLSTVNAADLTS
RLATTNQNSTLKTATEVEPLSSGSPSVSDEVTNTMNEKDREKVSLCSSQITHNDIKLPPILSFSEFVNKKSSKDSKQSAY
GVDKKPDKRSRLQ*
CDS seq >OTG29781
ATGATCAAAGCCCTAACCAGTAAGCAGTCTCAACTCCGATCATCCGGCAACGGATTTAAGAAGGCGCTACTTCCTGTGAA
AAACATTAGCAAAGAGAGAAAAGTTTCTGGAAAAGAGGCTTTAACAAAGCTCTTGAGATGGCATTTCGGTCATTCGGAGT
ACCGAGGGAAGCAATTGGAAGCCATAGAAGCTGTATTATCAGGAAGAGACTGTTTCTGTTTGATGCCAACGGGTGGCGGA
AAATCGATTTGTTATCAAATTCCTGCACTGGCAAAACCAGGCATTGTGCTTGTCGTGTCTCCTTTGATAGCCTTAATGGA
AAATCAAGTTATGGCGTTGAAGGAGAAAGGGGTTGCGGCTGAATATCTCTCATCGACTCAGACTGCACAAGTTAGGAATA
AGATCCATGAAGAACTAGAATCTGGAAATCCCGGGTTGAGGTTGCTCTACGTGACACCGGAACTGATTGCAACAACGGGA
TTTATGTCAAAGTTGACAAAGATTCATTCACGAGGGTTGTTGAATTTGATTGCAATAGATGAGGCGCATTGCATCTCAAC
ATGGGGCCATGATTTTAGGCCTAGCTACAGAAAATTAGCTTCATTGAGGAAACGTCTGCCAGACGTTCCAATGTTAGCAT
TAACAGCTACGGCTGTTCCCAAGGTTCAGGTGGATGTGATAGAGTCTTTGAACATGGAGAACCCTCTAGTCCTGAAATCA
TCATTTAATCGCCCAAATATATATTACGAGGTTCGATTCAAGGATCTTCTAACGGACCCATATGTTGATCTGACTGATCT
AATAAAATCTTGTGGAGATGTATGCGGAATCGTTTACTGCCTTGAACGTACAACTTGTGATGATCTGGCTTCTCATTTAT
CAAAAAACGGCATTTCTTGTGCTGCATATCATGCAGGGTTAAACAATAAATTACGGACCTCTGTTTTGGATGATTGGATT
TCTGCCAAGACGCAAGTTGTAGTGGCTACAGTGGCGTTTGGGATGGGTATAGATCGGAAAGATGTCAGAATTGTTTGCCA
TTTCAATATTCCGAAATCAATGGTATCGTTTTATCAAGAATCGGGTAGAGCTGGTCGTGATCAACAACCTTCTGGAAGTG
TTTTATACTACGGAATAGATGATCGCAAGAAAATGCAATTTATATTGAATAATGCGGATAACAAGAAGTCGCAGTCCTCA
AGCTCACAGGACAGGTCCCCAAAAAAGTCAATAGTTGACTTTAATTTGATGGTCGAGTATTGTGAAACATCTGGTTGTCG
TAGGAAAAAGATTTTAGACAGTTTTGGCGAACAGGTGTCAACATCACTATGTAAGAAAACGTGTGACGCATGCAAAAATC
CAAACATAGTGGAGAAATACTTGGACGAGCTTAAAGCCATGTGTTCCCTTGGTAATCGAATCGGATCGTCACGGATATTT
ATAAGCAGCTCCTCAAAACCTGTTGATGACGTGGATTTCTCAGAGTATTGGTCTCGTGATGACGAGGCAATCGTGTCTGA
GGATGATATATCTGATGATGACGATGGTATTGATGTTGTGGAAGACCTTACACAGTCAGCATTACCTTCAAAAACTAAAT
TTAGTGAGAAGATGGATATTTTGCAACGAGCGGAAGAAAAATACTATCAGAAAAACGATCACGATAAACAGAGCAATAAA
CTCGACAAAAACGCTATAAACGAAACCACACGGGAGTCTTGCAAACAAAGGTTACTCAATTCGCTAAAGCAAACACAGCA
ACGGCTCAACAACTTGCCCATAGACCCTGAAGTGTCTTCCGTATTCTTTGAAAACGAGTGCTTCAAAAAATACGGGAAAA
CTGGAAAATCATTTTATTTATCTCAAGTGGCAAGTACTGTGAGGTGGCTTTCAACAGTCAACGCTGCAGACTTAACATCT
CGGCTTGCAACCACGAATCAGAATTCTACTTTAAAAACTGCAACGGAAGTGGAACCTTTAAGTTCAGGATCACCGTCTGT
TTCAGATGAGGTAACGAACACGATGAATGAAAAAGATCGTGAAAAAGTTTCGTTATGTTCGTCACAGATCACTCACAATG
ATATAAAGCTGCCACCGATCCTGTCTTTCTCTGAGTTTGTTAACAAAAAGAGCAGCAAAGATAGCAAACAGTCAGCATAT
GGAGTTGATAAGAAACCAGATAAAAGAAGCAGACTTCAGTAG
Microexon DNA seq GATG
Microexon Amino Acid seq GM
Microexon-tag DNA Seq GATGATTGGATTTCTGCCAAGACGCAAGTTGTAGTGGCTACAGTGGCGTTTGGGATGGGTATAGATCGGAAAGATGTCAGAATTGTTTGCCATTTCAATATTCCGAAA
Microexon-tag Amino Acid seq DDWISAKTQVVVATVAFGMGIDRKDVRIVCHFNIPK
Transcript ID Ha.42803.1
Gene ID Ha.42803
Gene Name RecQl3
Pfam domain motif Helicase_C
Motif E-value 1.6e-14
Motif start 260
Motif end 349
Protein seq >Ha.42803.1
MMKKALLPVKNISKERKVSGKEALTKLLRWHFGHSEYRGKQLEAIEAVLSGRDCFCLMPTGGGKSICYQIPALAKPGIVL
VVSPLIALMENQVMALKEKGVAAEYLSSTQTAQVRNKIHEELESGNPGLRLLYVTPELIATTGFMSKLTKIHSRGLLNLI
AIDEAHCISTWGHDFRPSYRKLASLRKRLPDVPMLALTATAVPKVQVDVIESLNMENPLVLKSSFNRPNIYYEVRFKDLL
TDPYVDLTDLIKSCGDVCGIVYCLERTTCDDLASHLSKNGISCAAYHAGLNNKLRTSVLDDWISAKTQVVVATVAFGMGI
DRKDVRIVCHFNIPKSMVSFYQESGRAGRDQQPSGSVLYYGIDDRKKMQFILNNADNKKSQSSSSQDRSPKKSIVDFNLM
VEYCETSGCRRKKILDSFGEQVSTSLCKKTCDACKNPNIVEKYLDELKAMCSLGNRIGSSRIFISSSSKPVDDVDFSEYW
SRDDEAIVSEDDISDDDDGIDVVEDLTQSALPSKTKFSEKMDILQRAEEKYYQKNDHDKQSNKLDKNAINETTRESCKQR
LLNSLKQTQQRLNNLPIDPEVSSVFFENECFKKYGKTGKSFYLSQVASTVRWLSTVNAADLTSRLATTNQNSTLKTATEV
EPLSSGSPSVSDEVTNTMNEKDREKVSLCSSQITHNDIKLPPILSFSEFVNKKSSKDSKQSAYGVDKKPDKRSRLQ*
CDS seq >Ha.42803.1
ATGATGAAGAAGGCGCTACTTCCTGTGAAAAACATTAGCAAAGAGAGAAAAGTTTCTGGAAAAGAGGCTTTAACAAAGCT
CTTGAGATGGCATTTCGGTCATTCGGAGTACCGAGGGAAGCAATTGGAAGCCATAGAAGCTGTATTATCAGGAAGAGACT
GTTTCTGTTTGATGCCAACGGGTGGCGGAAAATCGATTTGTTATCAAATTCCTGCACTGGCAAAACCAGGCATTGTGCTT
GTCGTGTCTCCTTTGATAGCCTTAATGGAAAATCAAGTTATGGCGTTGAAGGAGAAAGGGGTTGCGGCTGAATATCTCTC
ATCGACTCAGACTGCACAAGTTAGGAATAAGATCCATGAAGAACTAGAATCTGGAAATCCCGGGTTGAGGTTGCTCTACG
TGACACCGGAACTGATTGCAACAACGGGATTTATGTCAAAGTTGACAAAGATTCATTCACGAGGGTTGTTGAATTTGATT
GCAATAGATGAGGCGCATTGCATCTCAACATGGGGCCATGATTTTAGGCCTAGCTACAGAAAATTAGCTTCATTGAGGAA
ACGTCTGCCAGACGTTCCAATGTTAGCATTAACAGCTACGGCTGTTCCCAAGGTTCAGGTGGATGTGATAGAGTCTTTGA
ACATGGAGAACCCTCTAGTCCTGAAATCATCATTTAATCGCCCAAATATATATTACGAGGTTCGATTCAAGGATCTTCTA
ACGGACCCATATGTTGATCTGACTGATCTAATAAAATCTTGTGGAGATGTATGCGGAATCGTTTACTGCCTTGAACGTAC
AACTTGTGATGATCTGGCTTCTCATTTATCAAAAAACGGCATTTCTTGTGCTGCATATCATGCAGGGTTAAACAATAAAT
TACGGACCTCTGTTTTGGATGATTGGATTTCTGCCAAGACGCAAGTTGTAGTGGCTACAGTGGCGTTTGGGATGGGTATA
GATCGGAAAGATGTCAGAATTGTTTGCCATTTCAATATTCCGAAATCAATGGTATCGTTTTATCAAGAATCGGGTAGAGC
TGGTCGTGATCAACAACCTTCTGGAAGTGTTTTATACTACGGAATAGATGATCGCAAGAAAATGCAATTTATATTGAATA
ATGCGGATAACAAGAAGTCGCAGTCCTCAAGCTCACAGGACAGGTCCCCAAAAAAGTCAATAGTTGACTTTAATTTGATG
GTCGAGTATTGTGAAACATCTGGTTGTCGTAGGAAAAAGATTTTAGACAGTTTTGGCGAACAGGTGTCAACATCACTATG
TAAGAAAACGTGTGACGCATGCAAAAATCCAAACATAGTGGAGAAATACTTGGACGAGCTTAAAGCCATGTGTTCCCTTG
GTAATCGAATCGGATCGTCACGGATATTTATAAGCAGCTCCTCAAAACCTGTTGATGACGTGGATTTCTCAGAGTATTGG
TCTCGTGATGACGAGGCAATCGTGTCTGAGGATGATATATCTGATGATGACGATGGTATTGATGTTGTGGAAGACCTTAC
ACAGTCAGCATTACCTTCAAAAACTAAATTTAGTGAGAAGATGGATATTTTGCAACGAGCGGAAGAAAAATACTATCAGA
AAAACGATCACGATAAACAGAGCAATAAACTCGACAAAAACGCTATAAACGAAACCACACGGGAGTCTTGCAAACAAAGG
TTACTCAATTCGCTAAAGCAAACACAGCAACGGCTCAACAACTTGCCCATAGACCCTGAAGTGTCTTCCGTATTCTTTGA
AAACGAGTGCTTCAAAAAATACGGGAAAACTGGAAAATCATTTTATTTATCTCAAGTGGCAAGTACTGTGAGGTGGCTTT
CAACAGTCAACGCTGCAGACTTAACATCTCGGCTTGCAACCACGAATCAGAATTCTACTTTAAAAACTGCAACGGAAGTG
GAACCTTTAAGTTCAGGATCACCGTCTGTTTCAGATGAGGTAACGAACACGATGAATGAAAAAGATCGTGAAAAAGTTTC
GTTATGTTCGTCACAGATCACTCACAATGATATAAAGCTGCCACCGATCCTGTCTTTCTCTGAGTTTGTTAACAAAAAGA
GCAGCAAAGATAGCAAACAGTCAGCATATGGAGTTGATAAGAAACCAGATAAAAGAAGCAGACTTCAGTAG