
Microexon ID | Ha_10:30371224-30371234:+ |
Species | Helianthus annuus | Coordinates | 10:30371224..30371234 |
Microexon Cluster ID | MEP27 |
Size | 11 |
Phase | 1 |
Pfam Domain Motif | Gelsolin |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 49,11,48 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq | MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Microexon DNA seq | GAAAATTTGAG |
Microexon Amino Acid seq | GKFE |
Microexon-tag DNA Seq | CCGGAAACCAGCAGAGATCCTCACTTGTTCGCATTCTCATTCAACAGAGGAAAATTTGAGATCGAGGAAATCTACAACTTTTCACAAGATGATCTGTTGACCGAGGAT |
Microexon-tag Amino Acid Seq | PETSRDPHLFAFSFNRGKFEIEEIYNFSQDDLLTED |
Microexon-tag spanning region | 30371078-30371373 |
Microexon-tag prediction score | 0.9715 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | OTG10053x |
Reference Transcript ID | OTG10053 |
Gene ID | HannXRQ_Chr10g0283331 |
Gene Name | VLN3 |
Transcript ID | OTG10053 |
Protein ID | OTG10053 |
Gene ID | HannXRQ_Chr10g0283331 |
Gene Name | VLN3 |
Pfam domain motif | Gelsolin |
Motif E-value | 2.9e-06 |
Motif start | 636 |
Motif end | 713 |
Protein seq | >OTG10053 MAGSAKALEPAFQGVGQRVGTEIWRIENFQPVPLPKSNYGKFYSGDSYIVLQTTSGKGGAYFYDIHFWLGKDTSQDEAGT AAIKTVELDAILGGRAVQYREPQGYESDKFLSYFKPCIIPLEGGVASGFKETEEEEFQIRLYTCKGKRAVKLKQVPFSRS TLNHDDVFILDTKDKIFQFNGANSNIQERAKALEVIQFLKDKYHEGTCDVAIVDDGKLQAEGDSGEFWVIFGGFAPIGKK VASEDDTIPEKTPPKLYCIADGQVKDVDGELSKSLLENNKCYLLDCGSEVFVWVGRVTQVDERKAAMQAAEEFITSQNRP KATRVTRLIQGYETHSFKSNFESWPSGSAPSAPEEGRGKVAALLKQQGVGLKGLAKSSTVTEEVPPLLEENGKIEVWRIN GSAKTPVPKEDIGKFYSGDCYIVLYTYHSNEKKEDYYLCCWIGKDSIEEDQNMAARLATTMFNSLKGRPVQGRVYQGKEP PQFVAIFQPMVVLKGGLSSGYKNYVADKGLNDETYSSDGVALIQISGTSVHNNKAVQVEPVATSLNSYDCFLLQSGSSLF TWHGNQSTVEQHNIAAKIAEFLKPGANIKFAKEGTENSTFWFALGGKQGYTSKKTAPETSRDPHLFAFSFNRGKFEIEEI YNFSQDDLLTEDMLILDTHAEVFVWVGQSVDSKEKQSAFEIGQKYVELAASLDGLSPCVPLYRVSEGNEPNFFTTFFSWD PAKANVQGNSFQKKVLLLFGFGGGGNSGESQDKSNGNQGGPTQRASALAALNSAFKSSPTTKSSASPKVPSRGSQRAAAV AALSSVLTAEKKGSSDASPARPVRSTDASPARSIRSPPSETVSPAVAKSEEPSDSNEGSEVTTETSDPVQEANGDGSAPK PEENECESVNSQSTFSYEQLRAKSENPVKGIDFKRREAYLSAEEFEAVLGMTKEAFYKIPKWKQDMMKKKVDLF* |
CDS seq | >OTG10053 ATGGCTGGCTCTGCTAAAGCTCTGGAACCAGCATTTCAAGGAGTCGGTCAGAGAGTAGGAACCGAGATATGGAGAATCGA AAACTTTCAGCCAGTACCATTGCCCAAATCTAATTATGGGAAATTCTATTCTGGTGATTCGTACATCGTATTGCAGACTA CTTCCGGTAAGGGCGGTGCATACTTTTACGACATACATTTTTGGCTGGGAAAAGATACTAGCCAGGATGAAGCTGGAACA GCAGCGATAAAAACAGTCGAACTTGATGCGATTCTCGGTGGGCGTGCAGTTCAATATAGGGAACCGCAGGGTTATGAGTC TGATAAGTTTTTGTCTTATTTCAAACCTTGTATTATACCCCTTGAGGGCGGTGTTGCTTCTGGGTTTAAGGAAACCGAAG AAGAAGAATTTCAAATACGATTATACACGTGCAAAGGAAAACGAGCTGTCAAGTTGAAGCAGGTCCCTTTTTCTCGATCC ACATTGAATCACGATGATGTCTTTATCTTGGATACTAAAGATAAGATCTTTCAATTCAATGGGGCAAACTCAAATATCCA AGAACGGGCTAAAGCGTTGGAAGTTATTCAGTTTTTAAAGGATAAATATCACGAGGGTACATGTGATGTCGCAATTGTTG ATGATGGGAAACTGCAAGCGGAGGGTGATTCCGGTGAATTTTGGGTGATCTTTGGTGGCTTTGCTCCTATTGGCAAAAAG GTTGCAAGCGAAGATGATACGATTCCCGAAAAGACTCCACCGAAACTTTATTGCATTGCGGACGGTCAGGTTAAGGATGT AGATGGCGAACTTTCGAAATCTTTACTGGAAAACAACAAATGCTATCTATTGGACTGCGGTTCCGAGGTGTTTGTTTGGG TCGGTCGAGTAACACAAGTGGATGAAAGAAAAGCTGCCATGCAGGCTGCTGAGGAGTTCATTACGAGCCAAAATCGGCCC AAGGCAACGCGTGTAACTCGGCTTATTCAAGGTTACGAAACACATTCATTTAAGTCAAACTTCGAGTCATGGCCATCGGG TTCAGCACCTTCTGCTCCTGAGGAAGGTAGAGGAAAAGTAGCAGCTCTATTGAAGCAACAAGGTGTTGGTCTCAAAGGGC TTGCAAAAAGTTCTACGGTTACTGAGGAAGTCCCACCTTTGCTTGAAGAAAATGGAAAAATCGAGGTGTGGCGGATTAAT GGAAGTGCTAAAACACCTGTACCAAAGGAGGATATCGGTAAATTTTACAGTGGAGATTGCTACATTGTTCTTTATACCTA CCATTCCAATGAAAAGAAAGAAGATTACTACCTGTGCTGTTGGATCGGTAAAGATAGCATCGAGGAGGACCAAAACATGG CTGCTCGGCTAGCTACAACAATGTTCAACTCACTAAAAGGAAGACCGGTTCAGGGTCGTGTATATCAAGGGAAAGAACCG CCACAGTTTGTTGCAATTTTCCAGCCAATGGTGGTGTTGAAGGGTGGATTGAGCTCTGGTTACAAGAACTACGTTGCAGA CAAGGGATTGAACGATGAAACCTATAGTTCAGATGGTGTAGCCCTCATTCAGATATCGGGTACTTCCGTGCATAATAATA AAGCCGTCCAAGTAGAACCGGTGGCAACTTCATTGAATTCTTATGACTGCTTTCTTCTTCAATCTGGTTCATCATTATTC ACCTGGCACGGAAACCAGAGTACGGTTGAGCAGCACAACATAGCTGCTAAAATTGCTGAATTTTTGAAGCCCGGTGCCAA CATCAAGTTCGCCAAAGAAGGAACCGAGAACTCAACTTTCTGGTTTGCACTCGGAGGGAAACAAGGTTACACCAGCAAAA AAACCGCACCGGAAACCAGCAGAGATCCTCACTTGTTCGCATTCTCATTCAACAGAGGAAAATTTGAGATCGAGGAAATC TACAACTTTTCACAAGATGATCTGTTGACCGAGGATATGTTAATATTAGATACACACGCTGAGGTGTTTGTTTGGGTCGG TCAGTCTGTAGACTCGAAGGAAAAGCAAAGTGCCTTTGAAATTGGACAGAAATACGTAGAGTTGGCTGCATCTCTAGATG GGCTATCCCCATGTGTGCCTTTATACCGAGTTTCGGAAGGAAACGAACCTAACTTCTTCACAACATTCTTCTCTTGGGAT CCTGCAAAAGCCAATGTTCAAGGGAACTCATTCCAAAAGAAGGTTTTGCTACTATTCGGGTTCGGTGGAGGCGGCAATTC CGGAGAGAGTCAGGATAAGTCAAACGGAAACCAGGGTGGGCCCACTCAAAGAGCCTCAGCGTTGGCGGCCTTGAACTCCG CTTTCAAATCATCACCGACCACTAAATCTTCAGCTTCTCCGAAAGTACCGAGTCGAGGTTCACAGAGAGCAGCTGCAGTT GCCGCTTTATCTTCGGTTCTCACTGCTGAGAAAAAGGGATCATCCGATGCTTCTCCAGCTCGGCCCGTTAGAAGCACTGA TGCTTCTCCAGCTCGGTCCATTAGAAGCCCACCGTCTGAAACTGTCTCACCTGCTGTAGCCAAAAGTGAAGAACCTTCTG ATAGTAATGAAGGTTCGGAAGTTACCACCGAGACATCTGACCCGGTTCAAGAGGCCAATGGAGACGGTTCAGCGCCGAAG CCAGAAGAAAATGAATGTGAGAGTGTAAACAGTCAAAGCACTTTCAGTTATGAACAACTCAGGGCTAAATCCGAGAATCC AGTTAAAGGAATCGACTTTAAAAGAAGAGAGGCTTATCTATCTGCTGAAGAATTCGAGGCGGTACTCGGGATGACAAAAG AGGCGTTCTACAAAATACCGAAATGGAAGCAAGACATGATGAAGAAGAAAGTTGATCTGTTCTAG |
Microexon DNA seq | GAAAATTTGAG |
Microexon Amino Acid seq | GKFE |
Microexon-tag DNA Seq | CCGGAAACCAGCAGAGATCCTCACTTGTTCGCATTCTCATTCAACAGAGGAAAATTTGAGATCGAGGAAATCTACAACTTTTCACAAGATGATCTGTTGACCGAGGAT |
Microexon-tag Amino Acid seq | PETSRDPHLFAFSFNRGKFEIEEIYNFSQDDLLTED |
Transcript ID | Ha.3760.1 |
Gene ID | Ha.3760 |
Gene Name | VLN3 |
Pfam domain motif | Gelsolin |
Motif E-value | 2.9e-06 |
Motif start | 636 |
Motif end | 713 |
Protein seq | >Ha.3760.1 MAGSAKALEPAFQGVGQRVGTEIWRIENFQPVPLPKSNYGKFYSGDSYIVLQTTSGKGGAYFYDIHFWLGKDTSQDEAGT AAIKTVELDAILGGRAVQYREPQGYESDKFLSYFKPCIIPLEGGVASGFKETEEEEFQIRLYTCKGKRAVKLKQVPFSRS TLNHDDVFILDTKDKIFQFNGANSNIQERAKALEVIQFLKDKYHEGTCDVAIVDDGKLQAEGDSGEFWVIFGGFAPIGKK VASEDDTIPEKTPPKLYCIADGQVKDVDGELSKSLLENNKCYLLDCGSEVFVWVGRVTQVDERKAAMQAAEEFITSQNRP KATRVTRLIQGYETHSFKSNFESWPSGSAPSAPEEGRGKVAALLKQQGVGLKGLAKSSTVTEEVPPLLEENGKIEVWRIN GSAKTPVPKEDIGKFYSGDCYIVLYTYHSNEKKEDYYLCCWIGKDSIEEDQNMAARLATTMFNSLKGRPVQGRVYQGKEP PQFVAIFQPMVVLKGGLSSGYKNYVADKGLNDETYSSDGVALIQISGTSVHNNKAVQVEPVATSLNSYDCFLLQSGSSLF TWHGNQSTVEQHNIAAKIAEFLKPGANIKFAKEGTENSTFWFALGGKQGYTSKKTAPETSRDPHLFAFSFNRGKFEIEEI YNFSQDDLLTEDMLILDTHAEVFVWVGQSVDSKEKQSAFEIGQKYVELAASLDGLSPCVPLYRVSEGNEPNFFTTFFSWD PAKANVQGNSFQKKVLLLFGFGGGGNSGESQDKSNGNQGGPTQRASALAALNSAFKSSPTTKSSASPKVPSRGSQRAAAV AALSSVLTAEKKGSSDASPARPVRSTDASPARSIRSPPSETVSPAVAKSEEPSDSNEGSEVTTETSDPVQEANGDGSAPK PEENECESVNSQSTFSYEQLRAKSENPVKGIDFKRREAYLSAEEFEAVLGMTKEAFYKIPKWKQDMMKKKVDLF* |
CDS seq | >Ha.3760.1 ATGGCTGGCTCTGCTAAAGCTCTGGAACCAGCATTTCAAGGAGTCGGTCAGAGAGTAGGAACCGAGATATGGAGAATCGA AAACTTTCAGCCAGTACCATTGCCCAAATCTAATTATGGGAAATTCTATTCTGGTGATTCGTACATCGTATTGCAGACTA CTTCCGGTAAGGGCGGTGCATACTTTTACGACATACATTTTTGGCTGGGAAAAGATACTAGCCAGGATGAAGCTGGAACA GCAGCGATAAAAACAGTCGAACTTGATGCGATTCTCGGTGGGCGTGCAGTTCAATATAGGGAACCGCAGGGTTATGAGTC TGATAAGTTTTTGTCTTATTTCAAACCTTGTATTATACCCCTTGAGGGCGGTGTTGCTTCTGGGTTTAAGGAAACCGAAG AAGAAGAATTTCAAATACGATTATACACGTGCAAAGGAAAACGAGCTGTCAAGTTGAAGCAGGTCCCTTTTTCTCGATCC ACATTGAATCACGATGATGTCTTTATCTTGGATACTAAAGATAAGATCTTTCAATTCAATGGGGCAAACTCAAATATCCA AGAACGGGCTAAAGCGTTGGAAGTTATTCAGTTTTTAAAGGATAAATATCACGAGGGTACATGTGATGTCGCAATTGTTG ATGATGGGAAACTGCAAGCGGAGGGTGATTCCGGTGAATTTTGGGTGATCTTTGGTGGCTTTGCTCCTATTGGCAAAAAG GTTGCAAGCGAAGATGATACGATTCCCGAAAAGACTCCACCGAAACTTTATTGCATTGCGGACGGTCAGGTTAAGGATGT AGATGGCGAACTTTCGAAATCTTTACTGGAAAACAACAAATGCTATCTATTGGACTGCGGTTCCGAGGTGTTTGTTTGGG TCGGTCGAGTAACACAAGTGGATGAAAGAAAAGCTGCCATGCAGGCTGCTGAGGAGTTCATTACGAGCCAAAATCGGCCC AAGGCAACGCGTGTAACTCGGCTTATTCAAGGTTACGAAACACATTCATTTAAGTCAAACTTCGAGTCATGGCCATCGGG TTCAGCACCTTCTGCTCCTGAGGAAGGTAGAGGAAAAGTAGCAGCTCTATTGAAGCAACAAGGTGTTGGTCTCAAAGGGC TTGCAAAAAGTTCTACGGTTACTGAGGAAGTCCCACCTTTGCTTGAAGAAAATGGAAAAATCGAGGTGTGGCGGATTAAT GGAAGTGCTAAAACACCTGTACCAAAGGAGGATATCGGTAAATTTTACAGTGGAGATTGCTACATTGTTCTTTATACCTA CCATTCCAATGAAAAGAAAGAAGATTACTACCTGTGCTGTTGGATCGGTAAAGATAGCATCGAGGAGGACCAAAACATGG CTGCTCGGCTAGCTACAACAATGTTCAACTCACTAAAAGGAAGACCGGTTCAGGGTCGTGTATATCAAGGGAAAGAACCG CCACAGTTTGTTGCAATTTTCCAGCCAATGGTGGTGTTGAAGGGTGGATTGAGCTCTGGTTACAAGAACTACGTTGCAGA CAAGGGATTGAACGATGAAACCTATAGTTCAGATGGTGTAGCCCTCATTCAGATATCGGGTACTTCCGTGCATAATAATA AAGCCGTCCAAGTAGAACCGGTGGCAACTTCATTGAATTCTTATGACTGCTTTCTTCTTCAATCTGGTTCATCATTATTC ACCTGGCACGGAAACCAGAGTACGGTTGAGCAGCACAACATAGCTGCTAAAATTGCTGAATTTTTGAAGCCCGGTGCCAA CATCAAGTTCGCCAAAGAAGGAACCGAGAACTCAACTTTCTGGTTTGCACTCGGAGGGAAACAAGGTTACACCAGCAAAA AAACCGCACCGGAAACCAGCAGAGATCCTCACTTGTTCGCATTCTCATTCAACAGAGGAAAATTTGAGATCGAGGAAATC TACAACTTTTCACAAGATGATCTGTTGACCGAGGATATGTTAATATTAGATACACACGCTGAGGTGTTTGTTTGGGTCGG TCAGTCTGTAGACTCGAAGGAAAAGCAAAGTGCCTTTGAAATTGGACAGAAATACGTAGAGTTGGCTGCATCTCTAGATG GGCTATCCCCATGTGTGCCTTTATACCGAGTTTCGGAAGGAAACGAACCTAACTTCTTCACAACATTCTTCTCTTGGGAT CCTGCAAAAGCCAATGTTCAAGGGAACTCATTCCAAAAGAAGGTTTTGCTACTATTCGGGTTCGGTGGAGGCGGCAATTC CGGAGAGAGTCAGGATAAGTCAAACGGAAACCAGGGTGGGCCCACTCAAAGAGCCTCAGCGTTGGCGGCCTTGAACTCCG CTTTCAAATCATCACCGACCACTAAATCTTCAGCTTCTCCGAAAGTACCGAGTCGAGGTTCACAGAGAGCAGCTGCAGTT GCCGCTTTATCTTCGGTTCTCACTGCTGAGAAAAAGGGATCATCCGATGCTTCTCCAGCTCGGCCCGTTAGAAGCACTGA TGCTTCTCCAGCTCGGTCCATTAGAAGCCCACCGTCTGAAACTGTCTCACCTGCTGTAGCCAAAAGTGAAGAACCTTCTG ATAGTAATGAAGGTTCGGAAGTTACCACCGAGACATCTGACCCGGTTCAAGAGGCCAATGGAGACGGTTCAGCGCCGAAG CCAGAAGAAAATGAATGTGAGAGTGTAAACAGTCAAAGCACTTTCAGTTATGAACAACTCAGGGCTAAATCCGAGAATCC AGTTAAAGGAATCGACTTTAAAAGAAGAGAGGCTTATCTATCTGCTGAAGAATTCGAGGCGGTACTCGGGATGACAAAAG AGGCGTTCTACAAAATACCGAAATGGAAGCAAGACATGATGAAGAAGAAAGTTGATCTGTTCTAG |