Microexon ID Ha_4:1392121-1392129:+
Species Helianthus annuus
Coordinates 4:1392121..1392129
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTATCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGTAGGAAAGAAGGGCAGACAAGAAAGGGTCGTCAAGTTTATCTTGGGGGATATGATAATGAAGAGAAAGCTGCAAGAGCTTATGACTTGGCTGCT
Microexon-tag Amino Acid Seq WDNSCRKEGQTRKGRQVYLGGYDNEEKAARAYDLAA
Microexon-tag spanning region1391932-1393196
Microexon-tag prediction score0.9738
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG26938x
Reference Transcript ID OTG26938
Gene ID HannXRQ_Chr04g0094581
Gene Name NA
Transcript ID OTG26938
Protein ID OTG26938
Gene ID HannXRQ_Chr04g0094581
Gene Name NA
Pfam domain motif AP2
Motif E-value 6.8e-13
Motif start 217
Motif end 276
Protein seq >OTG26938
MSDWLGFSLNHHHPQQDPINPPALHQHDPDSVIPINDFNPSTQTQRWRLEATDQSGPKLEDFLGGGACAGAGADVNLNFH
EYRPTEGDIVPRDGAACYISPPPAPPYLLHYGYPYYTTTSTSTTLTAANQNEQQNGVVSYDDGASAITQVSEIKTWLLPS
SSSTGCDVSDRQECLSLAVVPAGDNRKRSVVQMKSDKSGKGEVVTRKAVDGFGQRTSRFRGVTRHRWTGRYEAHLWDNSC
RKEGQTRKGRQVYLGGYDNEEKAARAYDLAALKYWGPTTHINFPLADYEKELEEMKNMNRQEFVANLRRKSSGFSRGASM
YRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGTSAVTNFDISRYDVKRICSSSTLIAGELAK
RSPRDTGPTMLLENCSHDPGDDFADMVWAGNQEEPPGLEDQQGASNIENMSPSELAGDLDPGVG*
CDS seq >OTG26938
ATGAGCGATTGGCTGGGCTTCTCCCTTAACCACCACCACCCCCAACAAGACCCCATTAATCCACCTGCACTTCATCAACA
TGACCCTGATTCCGTTATTCCTATCAACGATTTCAATCCTTCCACTCAAACACAACGATGGAGATTAGAAGCCACCGACC
AAAGTGGTCCGAAGCTCGAAGATTTTTTGGGCGGCGGTGCTTGTGCTGGTGCTGGTGCTGACGTCAACCTTAACTTTCAC
GAATATCGTCCGACAGAAGGTGATATTGTTCCTAGAGACGGCGCCGCCTGCTATATATCACCACCGCCGGCGCCGCCATA
TTTGTTACACTACGGTTATCCTTACTACACCACCACCAGCACCAGCACAACCCTCACCGCCGCAAACCAAAACGAACAAC
AAAACGGTGTCGTTTCATACGACGACGGTGCAAGTGCAATCACTCAGGTTTCCGAGATCAAAACGTGGTTACTGCCATCG
TCATCATCGACGGGGTGTGACGTCAGCGACCGTCAGGAGTGTTTGTCGCTGGCGGTGGTGCCTGCGGGTGATAATAGGAA
GCGGTCGGTGGTGCAGATGAAGTCGGATAAAAGCGGGAAAGGAGAAGTGGTTACGCGTAAAGCGGTGGATGGTTTCGGGC
AAAGGACTTCACGGTTTCGGGGCGTGACGAGGCACCGGTGGACGGGGAGATATGAAGCTCATTTGTGGGATAATAGTTGT
AGGAAAGAAGGGCAGACAAGAAAGGGTCGTCAAGTTTATCTTGGGGGATATGATAATGAAGAGAAAGCTGCAAGAGCTTA
TGACTTGGCTGCTCTTAAATACTGGGGTCCAACAACTCATATTAATTTCCCTCTAGCCGACTACGAAAAAGAACTCGAGG
AGATGAAAAACATGAACAGGCAAGAATTTGTAGCCAACTTAAGAAGGAAAAGCAGCGGGTTTTCAAGAGGGGCATCTATG
TATAGAGGAGTCACAAGGCATCATCAACATGGAAGGTGGCAAGCTAGAATAGGAAGAGTTGCAGGCAATAAGGACTTGTA
CCTTGGTACCTTTAGTACACAAGAGGAAGCGGCGGAAGCTTACGACATAGCAGCCATAAAATTCCGAGGGACAAGTGCAG
TGACCAACTTCGACATTAGCCGATACGACGTCAAAAGAATATGTTCAAGTTCCACTTTGATAGCCGGCGAGCTGGCAAAA
CGGTCTCCACGAGATACAGGCCCCACAATGCTGCTCGAAAACTGTAGTCATGACCCGGGTGATGATTTCGCTGACATGGT
ATGGGCTGGAAACCAGGAGGAGCCGCCTGGACTCGAAGACCAACAAGGTGCAAGTAATATCGAGAACATGAGCCCGAGTG
AGTTAGCCGGTGATCTTGACCCAGGGGTTGGATGA
Microexon DNA seq TTTATCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGTAGGAAAGAAGGGCAGACAAGAAAGGGTCGTCAAGTTTATCTTGGGGGATATGATAATGAAGAGAAAGCTGCAAGAGCTTATGACTTGGCTGCT
Microexon-tag Amino Acid seq WDNSCRKEGQTRKGRQVYLGGYDNEEKAARAYDLAA
Transcript ID Ha.39638.1
Gene ID Ha.39638
Gene Name NA
Pfam domain motif AP2
Motif E-value 6.8e-13
Motif start 218
Motif end 277
Protein seq >Ha.39638.1
MSDWLGFSLNHHHPQQDPINPPALHQHDPDSVIPINDFNPSTQTQLGWRLEATDQSGPKLEDFLGGGACAGAGADVNLNF
HEYRPTEGDIVPRDGAACYISPPPAPPYLLHYGYPYYTTTSTSTTLTAANQNEQQNGVVSYDDGASAITQVSEIKTWLLP
SSSSTGCDVSDRQECLSLAVVPAGDNRKRSVVQMKSDKSGKGEVVTRKAVDGFGQRTSRFRGVTRHRWTGRYEAHLWDNS
CRKEGQTRKGRQVYLGGYDNEEKAARAYDLAALKYWGPTTHINFPLADYEKELEEMKNMNRQEFVANLRRKSSGFSRGAS
MYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGTSAVTNFDISRYDVKRICSSSTLIAGELA
KRSPRDTGPTMLLENCSHDPGDDFADMVWAGNQEEPPGLEDQQGASNIENMSPSELAGDLDPGVG*
CDS seq >Ha.39638.1
ATGAGCGATTGGCTGGGCTTCTCCCTTAACCACCACCACCCCCAACAAGACCCCATTAATCCACCTGCACTTCATCAACA
TGACCCTGATTCCGTTATTCCTATCAACGATTTCAATCCTTCCACTCAAACACAACTAGGATGGAGATTAGAAGCCACCG
ACCAAAGTGGTCCGAAGCTCGAAGATTTTTTGGGCGGCGGTGCTTGTGCTGGTGCTGGTGCTGACGTCAACCTTAACTTT
CACGAATATCGTCCGACAGAAGGTGATATTGTTCCTAGAGACGGCGCCGCCTGCTATATATCACCACCGCCGGCGCCGCC
ATATTTGTTACACTACGGTTATCCTTACTACACCACCACCAGCACCAGCACAACCCTCACCGCCGCAAACCAAAACGAAC
AACAAAACGGTGTCGTTTCATACGACGACGGTGCAAGTGCAATCACTCAGGTTTCCGAGATCAAAACGTGGTTACTGCCA
TCGTCATCATCGACGGGGTGTGACGTCAGCGACCGTCAGGAGTGTTTGTCGCTGGCGGTGGTGCCTGCGGGTGATAATAG
GAAGCGGTCGGTGGTGCAGATGAAGTCGGATAAAAGCGGGAAAGGAGAAGTGGTTACGCGTAAAGCGGTGGATGGTTTCG
GGCAAAGGACTTCACGGTTTCGGGGCGTGACGAGGCACCGGTGGACGGGGAGATATGAAGCTCATTTGTGGGATAATAGT
TGTAGGAAAGAAGGGCAGACAAGAAAGGGTCGTCAAGTTTATCTTGGGGGATATGATAATGAAGAGAAAGCTGCAAGAGC
TTATGACTTGGCTGCTCTTAAATACTGGGGTCCAACAACTCATATTAATTTCCCTCTAGCCGACTACGAAAAAGAACTCG
AGGAGATGAAAAACATGAACAGGCAAGAATTTGTAGCCAACTTAAGAAGGAAAAGCAGCGGGTTTTCAAGAGGGGCATCT
ATGTATAGAGGAGTCACAAGGCATCATCAACATGGAAGGTGGCAAGCTAGAATAGGAAGAGTTGCAGGCAATAAGGACTT
GTACCTTGGTACCTTTAGTACACAAGAGGAAGCGGCGGAAGCTTACGACATAGCAGCCATAAAATTCCGAGGGACAAGTG
CAGTGACCAACTTCGACATTAGCCGATACGACGTCAAAAGAATATGTTCAAGTTCCACTTTGATAGCCGGCGAGCTGGCA
AAACGGTCTCCACGAGATACAGGCCCCACAATGCTGCTCGAAAACTGTAGTCATGACCCGGGTGATGATTTCGCTGACAT
GGTATGGGCTGGAAACCAGGAGGAGCCGCCTGGACTCGAAGACCAACAAGGTGCAAGTAATATCGAGAACATGAGCCCGA
GTGAGTTAGCCGGTGATCTTGACCCAGGGGTTGGATGA