Microexon ID Ha_5:13612069-13612079:-
Species Helianthus annuus
Coordinates 5:13612069..13612079
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAAATTTGAG
Microexon Amino Acid seq GKFE
Microexon-tag DNA Seq CAAGAAACCGTCCGAGAGCCTCACTTGTTCGCATTTTCATTCAATAAAGGAAAATTTGAGATTGAGGAAATCTACAATTTTTCACAAGATGATCTCTTGACGGAGGAT
Microexon-tag Amino Acid Seq QETVREPHLFAFSFNKGKFEIEEIYNFSQDDLLTED
Microexon-tag spanning region13611947-13612544
Microexon-tag prediction score0.9688
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG24005x
Reference Transcript ID OTG24005
Gene ID HannXRQ_Chr05g0131871
Gene Name NA
Transcript ID OTG24005
Protein ID OTG24005
Gene ID HannXRQ_Chr05g0131871
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 3.8e-08
Motif start 636
Motif end 713
Protein seq >OTG24005
MSGSAKTLEPAFQGAGQRVGTEIWRIENFQPVPLPKSNYGKFYSGDSYIVLQTTSGKGGAYFYDIHFWLGKDTSQDEAGT
AAIKTVELDAVLGGRAVQYREPQGHESDKFLSYFKPCIIPLEGGVASGFKETEEEEFQTRLYTCKGKRVVKLKQVPFSRS
TLNHDDVFILDTKDKIFQFNGANSNIQERAKALEVIQFLKDKYHEGTCDVAIVDDGKLQAEGDSGEFWVIFGGFAPIGKK
VASEDDIIPEKTPPKLYCITEGQVKEVDGELSKSLLENNKCYLLDCGSEVFVWVGRVTQVEERKAAMQAAEEFITSQNRP
KSTRVTRLIQGYETHSFKSNFDSWPSGSAPSAPEEGRGKVAALLKQQGVGLKGLAKASTAVEEVPPLLEENGKIEVWRIN
GSAKTPVAKEDVGKFYSGDCYIVLYTYHSNEKKEDYYLCCWIGKDSIEEDQNMAARLATTMFNSLKGRPVQGRVYQGKEP
PQFVAIFQPMVVLKGGLSSGYKNYIADKGLNDETYSSDEVALIQISGTSLHNNKAVQVEAVATSLNSYDCFLLQSGSSLF
TWHGNQSTVEQHTLAAKIAEFLKPGANVKFAKEGTENQTFWFALGGKQGYTSKKVTQETVREPHLFAFSFNKGKFEIEEI
YNFSQDDLLTEDILILDTHAEVFVWVGQSVDSKEKQSAFEVGQKYIESAASLDGLSPSVPLYRVTEGNEPNFFTTFFSWD
SAKANAHGNSFQKKILLMFGLGGGHSAESQDRSNGNQGGPTQRASALAALNSAFKSSPPAKSSASPKAPSRGSQRAAAVA
ALSSVLTAEKKGPSDASPVRPVRSPPSETVSPAIAKSEEPSNGLESNEGSEVTTETSDPVQETNGEGSATKSTTEEPECV
SVDSRSTFSYEQLRAKSENPVTGIDFKRREAYLSAEEFEAVLGMTKEAFYKIPKWKQDMMKKKVDLF*
CDS seq >OTG24005
ATGTCTGGCTCTGCAAAAACTTTGGAACCAGCATTTCAAGGAGCCGGTCAGAGAGTAGGGACTGAAATATGGAGAATCGA
AAACTTTCAGCCAGTACCGTTGCCCAAGTCTAATTATGGTAAATTCTACTCGGGTGATTCATACATTGTATTGCAGACTA
CCTCTGGCAAGGGAGGTGCGTACTTTTACGACATACACTTCTGGTTGGGAAAAGATACTAGCCAGGATGAAGCTGGAACG
GCAGCTATAAAAACAGTCGAACTTGATGCAGTTCTTGGTGGGCGTGCAGTGCAATATCGGGAACCTCAGGGTCATGAATC
CGATAAATTTTTATCTTATTTCAAACCTTGTATCATACCACTTGAAGGAGGTGTTGCTTCCGGTTTTAAGGAAACCGAAG
AAGAAGAATTCCAAACACGACTATACACATGCAAAGGAAAACGAGTCGTCAAGTTGAAGCAGGTGCCGTTTTCTCGATCC
ACATTGAATCATGATGATGTCTTTATCTTGGATACTAAAGATAAAATCTTTCAATTTAACGGGGCGAACTCGAATATACA
AGAAAGGGCTAAAGCTTTGGAAGTTATTCAGTTTTTAAAGGATAAATATCATGAGGGGACATGCGATGTTGCAATTGTTG
ATGACGGGAAGCTACAAGCAGAGGGTGACTCCGGTGAATTTTGGGTTATATTCGGTGGCTTTGCTCCTATTGGCAAAAAG
GTTGCAAGTGAAGATGATATTATTCCCGAGAAGACCCCTCCGAAACTTTATTGCATTACTGAAGGTCAGGTTAAGGAAGT
GGATGGTGAACTTTCAAAATCTTTACTGGAAAACAACAAATGCTATCTATTGGATTGTGGTTCAGAGGTGTTTGTTTGGG
TCGGGCGAGTAACGCAAGTGGAAGAACGAAAAGCCGCCATGCAGGCTGCTGAGGAGTTTATAACAAGCCAAAATCGGCCC
AAGTCAACTCGTGTAACCCGGCTTATTCAAGGTTATGAAACGCATTCATTCAAGTCAAACTTTGACTCATGGCCATCAGG
CTCAGCACCTTCTGCCCCCGAGGAGGGTAGAGGCAAAGTAGCAGCTCTATTGAAGCAACAAGGTGTCGGTCTTAAAGGCC
TTGCAAAAGCTTCGACAGCTGTTGAAGAAGTCCCACCTTTGCTCGAAGAAAATGGAAAAATCGAGGTGTGGCGGATTAAT
GGCAGTGCTAAAACACCAGTAGCAAAGGAGGACGTCGGTAAATTTTACAGCGGAGATTGCTACATTGTTCTTTACACCTA
CCATTCTAATGAAAAGAAAGAAGACTACTACCTCTGCTGTTGGATCGGCAAAGACAGCATCGAGGAGGACCAGAACATGG
CTGCTCGGCTGGCGACAACGATGTTTAACTCACTCAAAGGAAGACCAGTTCAGGGTCGTGTATATCAAGGGAAAGAACCG
CCACAGTTTGTTGCCATCTTCCAGCCTATGGTTGTGTTGAAGGGTGGGTTAAGCTCTGGTTACAAGAATTACATTGCCGA
CAAAGGATTAAATGACGAAACGTATAGCTCAGATGAAGTAGCCCTCATTCAGATATCTGGAACTTCATTGCATAATAATA
AAGCCGTTCAAGTCGAAGCTGTGGCAACCTCTTTAAATTCGTACGACTGTTTTCTTCTTCAATCTGGTTCATCGTTGTTC
ACCTGGCACGGAAACCAGAGCACGGTTGAGCAACACACTCTAGCTGCTAAAATTGCTGAATTTTTGAAGCCTGGTGCCAA
CGTGAAGTTTGCTAAAGAAGGAACAGAGAACCAAACCTTCTGGTTTGCACTTGGAGGGAAACAAGGGTACACCAGTAAAA
AAGTAACACAAGAAACCGTCCGAGAGCCTCACTTGTTCGCATTTTCATTCAATAAAGGAAAATTTGAGATTGAGGAAATC
TACAATTTTTCACAAGATGATCTCTTGACGGAGGATATTTTAATTTTGGATACACACGCTGAGGTGTTCGTTTGGGTTGG
TCAATCGGTTGACTCCAAGGAAAAGCAAAGTGCCTTTGAAGTTGGCCAGAAATACATAGAGTCGGCTGCATCTCTTGATG
GACTATCCCCATCGGTGCCCTTATACAGAGTTACAGAAGGAAACGAACCGAACTTCTTCACAACGTTTTTCTCATGGGAT
TCCGCAAAAGCCAATGCTCACGGGAACTCGTTCCAGAAGAAGATTTTGCTAATGTTTGGGCTTGGGGGAGGTCATTCTGC
AGAGAGTCAAGATAGGTCAAACGGAAACCAGGGCGGGCCCACTCAAAGAGCTTCAGCGTTGGCGGCGTTGAACTCTGCTT
TCAAATCTTCTCCACCTGCTAAATCTTCAGCTTCTCCAAAGGCACCGAGTCGGGGTTCACAAAGAGCAGCCGCAGTCGCT
GCTTTATCTTCGGTTCTCACTGCTGAAAAGAAGGGCCCGTCTGATGCTTCTCCAGTCCGGCCCGTTAGAAGCCCACCATC
TGAAACCGTCTCACCTGCCATAGCCAAAAGCGAAGAACCTTCCAATGGTTTGGAGAGTAACGAAGGCTCGGAAGTCACAA
CCGAGACATCCGACCCAGTTCAAGAGACGAACGGCGAAGGTTCAGCAACAAAGTCCACAACAGAAGAACCCGAGTGTGTT
AGTGTCGACAGTCGAAGCACTTTCAGTTATGAACAACTTAGGGCTAAATCTGAGAACCCAGTTACAGGAATCGACTTTAA
GAGAAGAGAGGCTTATCTATCTGCTGAAGAATTCGAAGCGGTACTCGGGATGACAAAAGAAGCGTTCTACAAAATACCGA
AGTGGAAGCAAGACATGATGAAGAAGAAAGTTGATCTGTTCTAG
Microexon DNA seq GAAAATTTGAG
Microexon Amino Acid seq GKFE
Microexon-tag DNA Seq CAAGAAACCGTCCGAGAGCCTCACTTGTTCGCATTTTCATTCAATAAAGGAAAATTTGAGATTGAGGAAATCTACAATTTTTCACAAGATGATCTCTTGACGGAGGAT
Microexon-tag Amino Acid seq QETVREPHLFAFSFNKGKFEIEEIYNFSQDDLLTED
Transcript ID Ha.43442.1
Gene ID Ha.43442
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 3.8e-08
Motif start 636
Motif end 713
Protein seq >Ha.43442.1
MSGSAKTLEPAFQGAGQRVGTEIWRIENFQPVPLPKSNYGKFYSGDSYIVLQTTSGKGGAYFYDIHFWLGKDTSQDEAGT
AAIKTVELDAVLGGRAVQYREPQGHESDKFLSYFKPCIIPLEGGVASGFKETEEEEFQTRLYTCKGKRVVKLKQVPFSRS
TLNHDDVFILDTKDKIFQFNGANSNIQERAKALEVIQFLKDKYHEGTCDVAIVDDGKLQAEGDSGEFWVIFGGFAPIGKK
VASEDDIIPEKTPPKLYCITEGQVKEVDGELSKSLLENNKCYLLDCGSEVFVWVGRVTQVEERKAAMQAAEEFITSQNRP
KSTRVTRLIQGYETHSFKSNFDSWPSGSAPSAPEEGRGKVAALLKQQGVGLKGLAKASTAVEEVPPLLEENGKIEVWRIN
GSAKTPVAKEDVGKFYSGDCYIVLYTYHSNEKKEDYYLCCWIGKDSIEEDQNMAARLATTMFNSLKGRPVQGRVYQGKEP
PQFVAIFQPMVVLKGGLSSGYKNYIADKGLNDETYSSDEVALIQISGTSLHNNKAVQVEAVATSLNSYDCFLLQSGSSLF
TWHGNQSTVEQHTLAAKIAEFLKPGANVKFAKEGTENQTFWFALGGKQGYTSKKVTQETVREPHLFAFSFNKGKFEIEEI
YNFSQDDLLTEDILILDTHAEVFVWVGQSVDSKEKQSAFEVGQKYIESAASLDGLSPSVPLYRVTEGNEPNFFTTFFSWD
SAKANAHGNSFQKKILLMFGLGGGHSAESQDRSNGNQGGPTQRASALAALNSAFKSSPPAKSSASPKAPSRGSQRAAAVA
ALSSVLTAEKKGPSDASPVRPVRSPPSETVSPAIAKSEEPSNGLESNEGSEVTTETSDPVQETNGEGSATKSTTEEPECV
SVDSRSTFSYEQLRAKSENPVTGIDFKRREAYLSAEEFEAVLGMTKEAFYKIPKWKQDMMKKKVDLF*
CDS seq >Ha.43442.1
ATGTCTGGCTCTGCAAAAACTTTGGAACCAGCATTTCAAGGAGCCGGTCAGAGAGTAGGGACTGAAATATGGAGAATCGA
AAACTTTCAGCCAGTACCGTTGCCCAAGTCTAATTATGGTAAATTCTACTCGGGTGATTCATACATTGTATTGCAGACTA
CCTCTGGCAAGGGAGGTGCGTACTTTTACGACATACACTTCTGGTTGGGAAAAGATACTAGCCAGGATGAAGCTGGAACG
GCAGCTATAAAAACAGTCGAACTTGATGCAGTTCTTGGTGGGCGTGCAGTGCAATATCGGGAACCTCAGGGTCATGAATC
CGATAAATTTTTATCTTATTTCAAACCTTGTATCATACCACTTGAAGGAGGTGTTGCTTCCGGTTTTAAGGAAACCGAAG
AAGAAGAATTCCAAACACGACTATACACATGCAAAGGAAAACGAGTCGTCAAGTTGAAGCAGGTGCCGTTTTCTCGATCC
ACATTGAATCATGATGATGTCTTTATCTTGGATACTAAAGATAAAATCTTTCAATTTAACGGGGCGAACTCGAATATACA
AGAAAGGGCTAAAGCTTTGGAAGTTATTCAGTTTTTAAAGGATAAATATCATGAGGGGACATGCGATGTTGCAATTGTTG
ATGACGGGAAGCTACAAGCAGAGGGTGACTCCGGTGAATTTTGGGTTATATTCGGTGGCTTTGCTCCTATTGGCAAAAAG
GTTGCAAGTGAAGATGATATTATTCCCGAGAAGACCCCTCCGAAACTTTATTGCATTACTGAAGGTCAGGTTAAGGAAGT
GGATGGTGAACTTTCAAAATCTTTACTGGAAAACAACAAATGCTATCTATTGGATTGTGGTTCAGAGGTGTTTGTTTGGG
TCGGGCGAGTAACGCAAGTGGAAGAACGAAAAGCCGCCATGCAGGCTGCTGAGGAGTTTATAACAAGCCAAAATCGGCCC
AAGTCAACTCGTGTAACCCGGCTTATTCAAGGTTATGAAACGCATTCATTCAAGTCAAACTTTGACTCATGGCCATCAGG
CTCAGCACCTTCTGCCCCCGAGGAGGGTAGAGGCAAAGTAGCAGCTCTATTGAAGCAACAAGGTGTCGGTCTTAAAGGCC
TTGCAAAAGCTTCGACAGCTGTTGAAGAAGTCCCACCTTTGCTCGAAGAAAATGGAAAAATCGAGGTGTGGCGGATTAAT
GGCAGTGCTAAAACACCAGTAGCAAAGGAGGACGTCGGTAAATTTTACAGCGGAGATTGCTACATTGTTCTTTACACCTA
CCATTCTAATGAAAAGAAAGAAGACTACTACCTCTGCTGTTGGATCGGCAAAGACAGCATCGAGGAGGACCAGAACATGG
CTGCTCGGCTGGCGACAACGATGTTTAACTCACTCAAAGGAAGACCAGTTCAGGGTCGTGTATATCAAGGGAAAGAACCG
CCACAGTTTGTTGCCATCTTCCAGCCTATGGTTGTGTTGAAGGGTGGGTTAAGCTCTGGTTACAAGAATTACATTGCCGA
CAAAGGATTAAATGACGAAACGTATAGCTCAGATGAAGTAGCCCTCATTCAGATATCTGGAACTTCATTGCATAATAATA
AAGCCGTTCAAGTCGAAGCTGTGGCAACCTCTTTAAATTCGTACGACTGTTTTCTTCTTCAATCTGGTTCATCGTTGTTC
ACCTGGCACGGAAACCAGAGCACGGTTGAGCAACACACTCTAGCTGCTAAAATTGCTGAATTTTTGAAGCCTGGTGCCAA
CGTGAAGTTTGCTAAAGAAGGAACAGAGAACCAAACCTTCTGGTTTGCACTTGGAGGGAAACAAGGGTACACCAGTAAAA
AAGTAACACAAGAAACCGTCCGAGAGCCTCACTTGTTCGCATTTTCATTCAATAAAGGAAAATTTGAGATTGAGGAAATC
TACAATTTTTCACAAGATGATCTCTTGACGGAGGATATTTTAATTTTGGATACACACGCTGAGGTGTTCGTTTGGGTTGG
TCAATCGGTTGACTCCAAGGAAAAGCAAAGTGCCTTTGAAGTTGGCCAGAAATACATAGAGTCGGCTGCATCTCTTGATG
GACTATCCCCATCGGTGCCCTTATACAGAGTTACAGAAGGAAACGAACCGAACTTCTTCACAACGTTTTTCTCATGGGAT
TCCGCAAAAGCCAATGCTCACGGGAACTCGTTCCAGAAGAAGATTTTGCTAATGTTTGGGCTTGGGGGAGGTCATTCTGC
AGAGAGTCAAGATAGGTCAAACGGAAACCAGGGCGGGCCCACTCAAAGAGCTTCAGCGTTGGCGGCGTTGAACTCTGCTT
TCAAATCTTCTCCACCTGCTAAATCTTCAGCTTCTCCAAAGGCACCGAGTCGGGGTTCACAAAGAGCAGCCGCAGTCGCT
GCTTTATCTTCGGTTCTCACTGCTGAAAAGAAGGGCCCGTCTGATGCTTCTCCAGTCCGGCCCGTTAGAAGCCCACCATC
TGAAACCGTCTCACCTGCCATAGCCAAAAGCGAAGAACCTTCCAATGGTTTGGAGAGTAACGAAGGCTCGGAAGTCACAA
CCGAGACATCCGACCCAGTTCAAGAGACGAACGGCGAAGGTTCAGCAACAAAGTCCACAACAGAAGAACCCGAGTGTGTT
AGTGTCGACAGTCGAAGCACTTTCAGTTATGAACAACTTAGGGCTAAATCTGAGAACCCAGTTACAGGAATCGACTTTAA
GAGAAGAGAGGCTTATCTATCTGCTGAAGAATTCGAAGCGGTACTCGGGATGACAAAAGAAGCGTTCTACAAAATACCGA
AGTGGAAGCAAGACATGATGAAGAAGAAAGTTGATCTGTTCTAG