Microexon ID Ha_10:30371224-30371234:+
Species Helianthus annuus
Coordinates 10:30371224..30371234
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAAATTTGAG
Microexon Amino Acid seq GKFE
Microexon-tag DNA Seq CCGGAAACCAGCAGAGATCCTCACTTGTTCGCATTCTCATTCAACAGAGGAAAATTTGAGATCGAGGAAATCTACAACTTTTCACAAGATGATCTGTTGACCGAGGAT
Microexon-tag Amino Acid Seq PETSRDPHLFAFSFNRGKFEIEEIYNFSQDDLLTED
Microexon-tag spanning region30371078-30371373
Microexon-tag prediction score0.9715
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG10053x
Reference Transcript ID OTG10053
Gene ID HannXRQ_Chr10g0283331
Gene Name VLN3
Transcript ID OTG10053
Protein ID OTG10053
Gene ID HannXRQ_Chr10g0283331
Gene Name VLN3
Pfam domain motif Gelsolin
Motif E-value 2.9e-06
Motif start 636
Motif end 713
Protein seq >OTG10053
MAGSAKALEPAFQGVGQRVGTEIWRIENFQPVPLPKSNYGKFYSGDSYIVLQTTSGKGGAYFYDIHFWLGKDTSQDEAGT
AAIKTVELDAILGGRAVQYREPQGYESDKFLSYFKPCIIPLEGGVASGFKETEEEEFQIRLYTCKGKRAVKLKQVPFSRS
TLNHDDVFILDTKDKIFQFNGANSNIQERAKALEVIQFLKDKYHEGTCDVAIVDDGKLQAEGDSGEFWVIFGGFAPIGKK
VASEDDTIPEKTPPKLYCIADGQVKDVDGELSKSLLENNKCYLLDCGSEVFVWVGRVTQVDERKAAMQAAEEFITSQNRP
KATRVTRLIQGYETHSFKSNFESWPSGSAPSAPEEGRGKVAALLKQQGVGLKGLAKSSTVTEEVPPLLEENGKIEVWRIN
GSAKTPVPKEDIGKFYSGDCYIVLYTYHSNEKKEDYYLCCWIGKDSIEEDQNMAARLATTMFNSLKGRPVQGRVYQGKEP
PQFVAIFQPMVVLKGGLSSGYKNYVADKGLNDETYSSDGVALIQISGTSVHNNKAVQVEPVATSLNSYDCFLLQSGSSLF
TWHGNQSTVEQHNIAAKIAEFLKPGANIKFAKEGTENSTFWFALGGKQGYTSKKTAPETSRDPHLFAFSFNRGKFEIEEI
YNFSQDDLLTEDMLILDTHAEVFVWVGQSVDSKEKQSAFEIGQKYVELAASLDGLSPCVPLYRVSEGNEPNFFTTFFSWD
PAKANVQGNSFQKKVLLLFGFGGGGNSGESQDKSNGNQGGPTQRASALAALNSAFKSSPTTKSSASPKVPSRGSQRAAAV
AALSSVLTAEKKGSSDASPARPVRSTDASPARSIRSPPSETVSPAVAKSEEPSDSNEGSEVTTETSDPVQEANGDGSAPK
PEENECESVNSQSTFSYEQLRAKSENPVKGIDFKRREAYLSAEEFEAVLGMTKEAFYKIPKWKQDMMKKKVDLF*
CDS seq >OTG10053
ATGGCTGGCTCTGCTAAAGCTCTGGAACCAGCATTTCAAGGAGTCGGTCAGAGAGTAGGAACCGAGATATGGAGAATCGA
AAACTTTCAGCCAGTACCATTGCCCAAATCTAATTATGGGAAATTCTATTCTGGTGATTCGTACATCGTATTGCAGACTA
CTTCCGGTAAGGGCGGTGCATACTTTTACGACATACATTTTTGGCTGGGAAAAGATACTAGCCAGGATGAAGCTGGAACA
GCAGCGATAAAAACAGTCGAACTTGATGCGATTCTCGGTGGGCGTGCAGTTCAATATAGGGAACCGCAGGGTTATGAGTC
TGATAAGTTTTTGTCTTATTTCAAACCTTGTATTATACCCCTTGAGGGCGGTGTTGCTTCTGGGTTTAAGGAAACCGAAG
AAGAAGAATTTCAAATACGATTATACACGTGCAAAGGAAAACGAGCTGTCAAGTTGAAGCAGGTCCCTTTTTCTCGATCC
ACATTGAATCACGATGATGTCTTTATCTTGGATACTAAAGATAAGATCTTTCAATTCAATGGGGCAAACTCAAATATCCA
AGAACGGGCTAAAGCGTTGGAAGTTATTCAGTTTTTAAAGGATAAATATCACGAGGGTACATGTGATGTCGCAATTGTTG
ATGATGGGAAACTGCAAGCGGAGGGTGATTCCGGTGAATTTTGGGTGATCTTTGGTGGCTTTGCTCCTATTGGCAAAAAG
GTTGCAAGCGAAGATGATACGATTCCCGAAAAGACTCCACCGAAACTTTATTGCATTGCGGACGGTCAGGTTAAGGATGT
AGATGGCGAACTTTCGAAATCTTTACTGGAAAACAACAAATGCTATCTATTGGACTGCGGTTCCGAGGTGTTTGTTTGGG
TCGGTCGAGTAACACAAGTGGATGAAAGAAAAGCTGCCATGCAGGCTGCTGAGGAGTTCATTACGAGCCAAAATCGGCCC
AAGGCAACGCGTGTAACTCGGCTTATTCAAGGTTACGAAACACATTCATTTAAGTCAAACTTCGAGTCATGGCCATCGGG
TTCAGCACCTTCTGCTCCTGAGGAAGGTAGAGGAAAAGTAGCAGCTCTATTGAAGCAACAAGGTGTTGGTCTCAAAGGGC
TTGCAAAAAGTTCTACGGTTACTGAGGAAGTCCCACCTTTGCTTGAAGAAAATGGAAAAATCGAGGTGTGGCGGATTAAT
GGAAGTGCTAAAACACCTGTACCAAAGGAGGATATCGGTAAATTTTACAGTGGAGATTGCTACATTGTTCTTTATACCTA
CCATTCCAATGAAAAGAAAGAAGATTACTACCTGTGCTGTTGGATCGGTAAAGATAGCATCGAGGAGGACCAAAACATGG
CTGCTCGGCTAGCTACAACAATGTTCAACTCACTAAAAGGAAGACCGGTTCAGGGTCGTGTATATCAAGGGAAAGAACCG
CCACAGTTTGTTGCAATTTTCCAGCCAATGGTGGTGTTGAAGGGTGGATTGAGCTCTGGTTACAAGAACTACGTTGCAGA
CAAGGGATTGAACGATGAAACCTATAGTTCAGATGGTGTAGCCCTCATTCAGATATCGGGTACTTCCGTGCATAATAATA
AAGCCGTCCAAGTAGAACCGGTGGCAACTTCATTGAATTCTTATGACTGCTTTCTTCTTCAATCTGGTTCATCATTATTC
ACCTGGCACGGAAACCAGAGTACGGTTGAGCAGCACAACATAGCTGCTAAAATTGCTGAATTTTTGAAGCCCGGTGCCAA
CATCAAGTTCGCCAAAGAAGGAACCGAGAACTCAACTTTCTGGTTTGCACTCGGAGGGAAACAAGGTTACACCAGCAAAA
AAACCGCACCGGAAACCAGCAGAGATCCTCACTTGTTCGCATTCTCATTCAACAGAGGAAAATTTGAGATCGAGGAAATC
TACAACTTTTCACAAGATGATCTGTTGACCGAGGATATGTTAATATTAGATACACACGCTGAGGTGTTTGTTTGGGTCGG
TCAGTCTGTAGACTCGAAGGAAAAGCAAAGTGCCTTTGAAATTGGACAGAAATACGTAGAGTTGGCTGCATCTCTAGATG
GGCTATCCCCATGTGTGCCTTTATACCGAGTTTCGGAAGGAAACGAACCTAACTTCTTCACAACATTCTTCTCTTGGGAT
CCTGCAAAAGCCAATGTTCAAGGGAACTCATTCCAAAAGAAGGTTTTGCTACTATTCGGGTTCGGTGGAGGCGGCAATTC
CGGAGAGAGTCAGGATAAGTCAAACGGAAACCAGGGTGGGCCCACTCAAAGAGCCTCAGCGTTGGCGGCCTTGAACTCCG
CTTTCAAATCATCACCGACCACTAAATCTTCAGCTTCTCCGAAAGTACCGAGTCGAGGTTCACAGAGAGCAGCTGCAGTT
GCCGCTTTATCTTCGGTTCTCACTGCTGAGAAAAAGGGATCATCCGATGCTTCTCCAGCTCGGCCCGTTAGAAGCACTGA
TGCTTCTCCAGCTCGGTCCATTAGAAGCCCACCGTCTGAAACTGTCTCACCTGCTGTAGCCAAAAGTGAAGAACCTTCTG
ATAGTAATGAAGGTTCGGAAGTTACCACCGAGACATCTGACCCGGTTCAAGAGGCCAATGGAGACGGTTCAGCGCCGAAG
CCAGAAGAAAATGAATGTGAGAGTGTAAACAGTCAAAGCACTTTCAGTTATGAACAACTCAGGGCTAAATCCGAGAATCC
AGTTAAAGGAATCGACTTTAAAAGAAGAGAGGCTTATCTATCTGCTGAAGAATTCGAGGCGGTACTCGGGATGACAAAAG
AGGCGTTCTACAAAATACCGAAATGGAAGCAAGACATGATGAAGAAGAAAGTTGATCTGTTCTAG
Microexon DNA seq GAAAATTTGAG
Microexon Amino Acid seq GKFE
Microexon-tag DNA Seq CCGGAAACCAGCAGAGATCCTCACTTGTTCGCATTCTCATTCAACAGAGGAAAATTTGAGATCGAGGAAATCTACAACTTTTCACAAGATGATCTGTTGACCGAGGAT
Microexon-tag Amino Acid seq PETSRDPHLFAFSFNRGKFEIEEIYNFSQDDLLTED
Transcript ID Ha.3760.1
Gene ID Ha.3760
Gene Name VLN3
Pfam domain motif Gelsolin
Motif E-value 2.9e-06
Motif start 636
Motif end 713
Protein seq >Ha.3760.1
MAGSAKALEPAFQGVGQRVGTEIWRIENFQPVPLPKSNYGKFYSGDSYIVLQTTSGKGGAYFYDIHFWLGKDTSQDEAGT
AAIKTVELDAILGGRAVQYREPQGYESDKFLSYFKPCIIPLEGGVASGFKETEEEEFQIRLYTCKGKRAVKLKQVPFSRS
TLNHDDVFILDTKDKIFQFNGANSNIQERAKALEVIQFLKDKYHEGTCDVAIVDDGKLQAEGDSGEFWVIFGGFAPIGKK
VASEDDTIPEKTPPKLYCIADGQVKDVDGELSKSLLENNKCYLLDCGSEVFVWVGRVTQVDERKAAMQAAEEFITSQNRP
KATRVTRLIQGYETHSFKSNFESWPSGSAPSAPEEGRGKVAALLKQQGVGLKGLAKSSTVTEEVPPLLEENGKIEVWRIN
GSAKTPVPKEDIGKFYSGDCYIVLYTYHSNEKKEDYYLCCWIGKDSIEEDQNMAARLATTMFNSLKGRPVQGRVYQGKEP
PQFVAIFQPMVVLKGGLSSGYKNYVADKGLNDETYSSDGVALIQISGTSVHNNKAVQVEPVATSLNSYDCFLLQSGSSLF
TWHGNQSTVEQHNIAAKIAEFLKPGANIKFAKEGTENSTFWFALGGKQGYTSKKTAPETSRDPHLFAFSFNRGKFEIEEI
YNFSQDDLLTEDMLILDTHAEVFVWVGQSVDSKEKQSAFEIGQKYVELAASLDGLSPCVPLYRVSEGNEPNFFTTFFSWD
PAKANVQGNSFQKKVLLLFGFGGGGNSGESQDKSNGNQGGPTQRASALAALNSAFKSSPTTKSSASPKVPSRGSQRAAAV
AALSSVLTAEKKGSSDASPARPVRSTDASPARSIRSPPSETVSPAVAKSEEPSDSNEGSEVTTETSDPVQEANGDGSAPK
PEENECESVNSQSTFSYEQLRAKSENPVKGIDFKRREAYLSAEEFEAVLGMTKEAFYKIPKWKQDMMKKKVDLF*
CDS seq >Ha.3760.1
ATGGCTGGCTCTGCTAAAGCTCTGGAACCAGCATTTCAAGGAGTCGGTCAGAGAGTAGGAACCGAGATATGGAGAATCGA
AAACTTTCAGCCAGTACCATTGCCCAAATCTAATTATGGGAAATTCTATTCTGGTGATTCGTACATCGTATTGCAGACTA
CTTCCGGTAAGGGCGGTGCATACTTTTACGACATACATTTTTGGCTGGGAAAAGATACTAGCCAGGATGAAGCTGGAACA
GCAGCGATAAAAACAGTCGAACTTGATGCGATTCTCGGTGGGCGTGCAGTTCAATATAGGGAACCGCAGGGTTATGAGTC
TGATAAGTTTTTGTCTTATTTCAAACCTTGTATTATACCCCTTGAGGGCGGTGTTGCTTCTGGGTTTAAGGAAACCGAAG
AAGAAGAATTTCAAATACGATTATACACGTGCAAAGGAAAACGAGCTGTCAAGTTGAAGCAGGTCCCTTTTTCTCGATCC
ACATTGAATCACGATGATGTCTTTATCTTGGATACTAAAGATAAGATCTTTCAATTCAATGGGGCAAACTCAAATATCCA
AGAACGGGCTAAAGCGTTGGAAGTTATTCAGTTTTTAAAGGATAAATATCACGAGGGTACATGTGATGTCGCAATTGTTG
ATGATGGGAAACTGCAAGCGGAGGGTGATTCCGGTGAATTTTGGGTGATCTTTGGTGGCTTTGCTCCTATTGGCAAAAAG
GTTGCAAGCGAAGATGATACGATTCCCGAAAAGACTCCACCGAAACTTTATTGCATTGCGGACGGTCAGGTTAAGGATGT
AGATGGCGAACTTTCGAAATCTTTACTGGAAAACAACAAATGCTATCTATTGGACTGCGGTTCCGAGGTGTTTGTTTGGG
TCGGTCGAGTAACACAAGTGGATGAAAGAAAAGCTGCCATGCAGGCTGCTGAGGAGTTCATTACGAGCCAAAATCGGCCC
AAGGCAACGCGTGTAACTCGGCTTATTCAAGGTTACGAAACACATTCATTTAAGTCAAACTTCGAGTCATGGCCATCGGG
TTCAGCACCTTCTGCTCCTGAGGAAGGTAGAGGAAAAGTAGCAGCTCTATTGAAGCAACAAGGTGTTGGTCTCAAAGGGC
TTGCAAAAAGTTCTACGGTTACTGAGGAAGTCCCACCTTTGCTTGAAGAAAATGGAAAAATCGAGGTGTGGCGGATTAAT
GGAAGTGCTAAAACACCTGTACCAAAGGAGGATATCGGTAAATTTTACAGTGGAGATTGCTACATTGTTCTTTATACCTA
CCATTCCAATGAAAAGAAAGAAGATTACTACCTGTGCTGTTGGATCGGTAAAGATAGCATCGAGGAGGACCAAAACATGG
CTGCTCGGCTAGCTACAACAATGTTCAACTCACTAAAAGGAAGACCGGTTCAGGGTCGTGTATATCAAGGGAAAGAACCG
CCACAGTTTGTTGCAATTTTCCAGCCAATGGTGGTGTTGAAGGGTGGATTGAGCTCTGGTTACAAGAACTACGTTGCAGA
CAAGGGATTGAACGATGAAACCTATAGTTCAGATGGTGTAGCCCTCATTCAGATATCGGGTACTTCCGTGCATAATAATA
AAGCCGTCCAAGTAGAACCGGTGGCAACTTCATTGAATTCTTATGACTGCTTTCTTCTTCAATCTGGTTCATCATTATTC
ACCTGGCACGGAAACCAGAGTACGGTTGAGCAGCACAACATAGCTGCTAAAATTGCTGAATTTTTGAAGCCCGGTGCCAA
CATCAAGTTCGCCAAAGAAGGAACCGAGAACTCAACTTTCTGGTTTGCACTCGGAGGGAAACAAGGTTACACCAGCAAAA
AAACCGCACCGGAAACCAGCAGAGATCCTCACTTGTTCGCATTCTCATTCAACAGAGGAAAATTTGAGATCGAGGAAATC
TACAACTTTTCACAAGATGATCTGTTGACCGAGGATATGTTAATATTAGATACACACGCTGAGGTGTTTGTTTGGGTCGG
TCAGTCTGTAGACTCGAAGGAAAAGCAAAGTGCCTTTGAAATTGGACAGAAATACGTAGAGTTGGCTGCATCTCTAGATG
GGCTATCCCCATGTGTGCCTTTATACCGAGTTTCGGAAGGAAACGAACCTAACTTCTTCACAACATTCTTCTCTTGGGAT
CCTGCAAAAGCCAATGTTCAAGGGAACTCATTCCAAAAGAAGGTTTTGCTACTATTCGGGTTCGGTGGAGGCGGCAATTC
CGGAGAGAGTCAGGATAAGTCAAACGGAAACCAGGGTGGGCCCACTCAAAGAGCCTCAGCGTTGGCGGCCTTGAACTCCG
CTTTCAAATCATCACCGACCACTAAATCTTCAGCTTCTCCGAAAGTACCGAGTCGAGGTTCACAGAGAGCAGCTGCAGTT
GCCGCTTTATCTTCGGTTCTCACTGCTGAGAAAAAGGGATCATCCGATGCTTCTCCAGCTCGGCCCGTTAGAAGCACTGA
TGCTTCTCCAGCTCGGTCCATTAGAAGCCCACCGTCTGAAACTGTCTCACCTGCTGTAGCCAAAAGTGAAGAACCTTCTG
ATAGTAATGAAGGTTCGGAAGTTACCACCGAGACATCTGACCCGGTTCAAGAGGCCAATGGAGACGGTTCAGCGCCGAAG
CCAGAAGAAAATGAATGTGAGAGTGTAAACAGTCAAAGCACTTTCAGTTATGAACAACTCAGGGCTAAATCCGAGAATCC
AGTTAAAGGAATCGACTTTAAAAGAAGAGAGGCTTATCTATCTGCTGAAGAATTCGAGGCGGTACTCGGGATGACAAAAG
AGGCGTTCTACAAAATACCGAAATGGAAGCAAGACATGATGAAGAAGAAAGTTGATCTGTTCTAG