Microexon ID Ha_6:55246017-55246027:+
Species Helianthus annuus
Coordinates 6:55246017..55246027
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
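The first Microexon-tag DNA Seq entry above uses degenerate letters (M, R, Y, W, K, D, H), so it is presumably a cluster-level consensus written in IUPAC nucleotide codes rather than a single concrete sequence; that reading is an assumption, not something the record states. Under that assumption, the minimal sketch below lists the positions at which this locus's concrete tag sequence (copied from the record) falls outside the consensus code. The helper name `incompatible_positions` is hypothetical.

```python
# Standard IUPAC nucleotide codes mapped to the bases each one permits.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Degenerate sequence from the record (assumed to be the cluster consensus).
CONSENSUS = (
    "MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGG"
    "RAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGA"
    "CWGARGAT"
)

# Concrete microexon-tag DNA sequence of this locus, copied from the record.
TAG_DNA = (
    "AGAGATGCCGAAAGTGATCCCCATTTGTTCTCATGCACATTTACAAAAGG"
    "AGAACTAAAGGTCATTGAGATCTACAACTTTAACCAAGACGATTTGATGA"
    "CGGAAGAT"
)


def incompatible_positions(seq, consensus):
    """Return 1-based positions where seq's base is not permitted by consensus."""
    if len(seq) != len(consensus):
        raise ValueError("sequences must have the same length")
    return [
        pos
        for pos, (base, code) in enumerate(zip(seq, consensus), start=1)
        if base not in IUPAC[code]
    ]


mismatches = incompatible_positions(TAG_DNA, CONSENSUS)
print(len(TAG_DNA), len(CONSENSUS), mismatches)
```

A non-empty result would suggest the consensus was built with a frequency threshold rather than as a strict union of all member sequences; the record itself does not say which.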
Microexon DNA seq GAGAACTAAAG
Microexon Amino Acid seq GELK
Microexon-tag DNA Seq AGAGATGCCGAAAGTGATCCCCATTTGTTCTCATGCACATTTACAAAAGGAGAACTAAAGGTCATTGAGATCTACAACTTTAACCAAGACGATTTGATGACGGAAGAT
Microexon-tag Amino Acid Seq RDAESDPHLFSCTFTKGELKVIEIYNFNQDDLMTED
Microexon-tag spanning region 55245853-55246166
Microexon-tag prediction score 0.9629
Overlapped with the annotated transcript (%) 100
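The size, phase, tag structure and sequence fields above can be cross-checked against one another. The sketch below is an illustration only: it assumes that the structure 49,11,48 gives nucleotide lengths in 5' to 3' order, that the tag sequence starts in frame, and that "phase" counts the bases of the split codon lying in the upstream exon. All values are copied from the record; the helper `translate` is hypothetical and uses the standard genetic code.

```python
# Values copied from the record.
TAG_DNA = (
    "AGAGATGCCGAAAGTGATCCCCATTTGTTCTCATGCACATTTACAAAAGG"
    "AGAACTAAAGGTCATTGAGATCTACAACTTTAACCAAGACGATTTGATGA"
    "CGGAAGAT"
)
MICROEXON_DNA = "GAGAACTAAAG"
FLANK5, MICRO, FLANK3 = 49, 11, 48      # flanking exon, microexon, flanking exon
START, END = 55246017, 55246027         # genomic coordinates, inclusive

# Standard genetic code, built from the usual compact TCAG-ordered string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Slice the microexon out of the tag using the declared structure.
microexon = TAG_DNA[FLANK5:FLANK5 + MICRO]

# Phase under the assumed convention: bases of the first codon left upstream.
phase = FLANK5 % 3

# Residues encoded by every codon that overlaps the microexon.
tag_protein = translate(TAG_DNA)
first_codon = FLANK5 // 3
last_codon = (FLANK5 + MICRO - 1) // 3
micro_peptide = tag_protein[first_codon:last_codon + 1]

print(END - START + 1, len(TAG_DNA), microexon, phase, micro_peptide)
```

Under these assumptions the 11-nt slice reproduces the Microexon DNA seq entry, and the codons overlapping it give the four-residue Microexon Amino Acid seq, with the first codon split across the upstream exon boundary.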
New Transcript ID OTG23162x
Reference Transcript ID OTG23162
Gene ID HannXRQ_Chr06g0179361
Gene Name VLN5
Transcript ID OTG23162
Protein ID OTG23162
Gene ID HannXRQ_Chr06g0179361
Gene Name VLN5
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >OTG23162
MSVSMRDLDPAFQGAGQKAGIEVWRIENFKPVPVPQSSYGKFFTGDSYLILKTIALKSGALRQDIHYWLGKDTSKDEAGA
AALKTVELDAALGGRAVQYREVQGHETERFLSYFKPCIIPQEGGTASGFKHVESEEHKIRMFTCQGKHVVHVKEVPFARS
SLNHDDIFILDTANKIFQFNGSNSSIQERAKALEVVQHIKDTYHDGKCDIATVEDGKLMSDAETGEFWGFFGGFAPLPRK
TTTDDVKSAVALPTQLFCVEKGQAEPVSADTFTKELLDTNKCYLLDCGAEIYLWMGRSTSLDERKAASGVAEEYLRSQDR
LKSQIIRVIENFETVSFRSKFDAWPQSAEVAVSEDGRGKVAALLKRQGVNVRGLLKAAPEKEEPQPYIDCTGNLQVWRVN
GQEKILLPVPDQSKFYSGECYIFQYNYPGEDQDECLIGTWFGKKSVEEERSSATSHTNKMVESLKFMASQLQVYEGSEPI
LFFAIFQSFLVLKGGLSDGYKTFISENELSDDTYKEDGVALFRVQGSGPENMQAIQVEPVASSLNSSYCYILHSGSSVFT
WIGNLRTPEVEELVERQLDVIKPNMQAKLQKEGSESEQFWEILGGKSEYPSQKIARDAESDPHLFSCTFTKGELKVIEIY
NFNQDDLMTEDIFILDCHSSIFVWVGQQVDQKLKTQALVIGEKFVKHDFLLEKLTVQTPIYIISEGSEPEFFTRFFTWDS
TKSAMHGNSFQRKLSILKNGGRPTLSNKPKRRTPGAHVGRSATNEKPQRARSVSFSPERVRVRGRSPAFNALASAFENPS
ARNLSTPPPQVRKPYPKSDSSNVAPRSTAIASLTSTFEQPPREPLMPRSIKPRPKSPPKAESNSKENIMSSKIETLTIQE
DVKENEVEDEEGLTLYPYERLITVSTDPAADIDVTKRETYLSSAEFREKFGMTKEAFYKLPKWKQNKLKMALQLF*
CDS seq >OTG23162
ATGTCTGTTTCTATGAGAGATTTGGATCCAGCCTTCCAAGGAGCTGGACAAAAAGCAGGAATTGAAGTGTGGCGCATTGA
GAATTTTAAACCAGTGCCCGTTCCACAGTCTTCTTATGGCAAGTTTTTTACAGGGGACTCCTATCTTATATTGAAGACTA
TTGCTTTGAAAAGCGGTGCACTACGCCAAGACATCCATTATTGGTTAGGTAAAGATACCAGCAAGGATGAAGCTGGAGCA
GCAGCACTGAAGACGGTCGAACTCGATGCAGCTCTTGGAGGACGTGCTGTTCAATATCGCGAGGTACAAGGACATGAAAC
CGAAAGATTCTTATCTTACTTTAAACCATGCATCATACCTCAAGAAGGCGGAACTGCATCTGGTTTCAAGCATGTTGAAT
CTGAGGAGCATAAGATCCGTATGTTTACTTGCCAAGGAAAGCATGTAGTTCATGTAAAAGAGGTTCCTTTTGCTCGATCC
TCACTCAATCATGATGATATCTTTATCTTGGATACTGCGAATAAGATATTCCAGTTTAACGGTTCCAATTCAAGCATTCA
AGAAAGGGCTAAAGCGCTAGAGGTTGTGCAACATATCAAAGATACGTATCATGATGGGAAGTGTGACATAGCTACTGTTG
AGGATGGAAAATTAATGTCTGATGCTGAAACCGGAGAGTTCTGGGGTTTCTTTGGTGGCTTTGCTCCGCTTCCAAGGAAA
ACAACAACAGATGACGTCAAAAGTGCCGTTGCTCTTCCCACTCAGCTATTCTGTGTGGAGAAGGGGCAGGCGGAACCGGT
TTCTGCCGATACGTTTACAAAGGAGCTGCTGGATACAAATAAATGCTATCTTCTGGATTGCGGGGCTGAAATATACTTAT
GGATGGGGAGAAGTACTTCTCTTGATGAAAGAAAAGCCGCAAGTGGAGTTGCAGAAGAATACCTGCGTAGCCAGGATAGA
CTGAAGTCTCAAATCATCCGAGTGATTGAAAATTTTGAAACTGTAAGTTTCCGGTCAAAGTTTGATGCTTGGCCTCAATC
AGCTGAGGTGGCCGTCTCTGAGGATGGTAGAGGCAAGGTGGCTGCACTTCTAAAGCGTCAAGGGGTCAATGTGAGGGGTT
TACTTAAAGCAGCTCCAGAAAAGGAGGAACCTCAACCATATATTGATTGCACAGGAAATTTGCAGGTTTGGCGTGTGAAT
GGCCAAGAGAAGATTCTTCTTCCGGTCCCTGATCAGTCGAAGTTTTACAGTGGAGAATGTTATATCTTCCAGTATAATTA
TCCCGGAGAAGATCAAGACGAATGCCTTATTGGGACATGGTTCGGAAAGAAAAGTGTCGAGGAAGAGCGGAGTTCGGCTA
CCTCACATACAAACAAGATGGTTGAGTCGCTCAAATTTATGGCTTCTCAGTTGCAAGTTTATGAAGGAAGCGAGCCTATT
CTATTCTTTGCAATCTTTCAGAGCTTTCTCGTTCTTAAGGGTGGTCTAAGTGATGGATACAAGACTTTTATATCAGAGAA
TGAACTTTCTGACGACACTTACAAAGAAGACGGGGTTGCGTTATTTCGAGTTCAAGGCTCTGGACCTGAAAACATGCAAG
CAATCCAAGTCGAACCCGTTGCGTCATCTTTGAACTCCTCGTACTGCTACATACTACACAGTGGTTCTTCCGTCTTTACA
TGGATTGGAAACCTTAGAACTCCCGAAGTAGAGGAACTCGTCGAGAGGCAACTAGATGTCATAAAGCCAAACATGCAGGC
AAAGTTACAAAAAGAGGGTTCGGAATCCGAACAATTTTGGGAAATTTTAGGTGGAAAATCCGAATACCCGAGTCAGAAGA
TTGCAAGAGATGCCGAAAGTGATCCCCATTTGTTCTCATGCACATTTACAAAAGGAGAACTAAAGGTCATTGAGATCTAC
AACTTTAACCAAGACGATTTGATGACGGAAGATATCTTTATTCTCGATTGTCACTCGAGCATCTTTGTTTGGGTAGGGCA
GCAGGTTGATCAGAAACTGAAAACACAAGCGTTAGTTATTGGGGAGAAATTCGTGAAGCATGATTTTCTTCTTGAGAAAT
TAACGGTTCAAACTCCGATATATATCATATCGGAAGGGAGCGAGCCAGAGTTCTTCACGCGCTTCTTCACATGGGATTCA
ACCAAATCTGCAATGCATGGAAACTCGTTTCAAAGGAAACTATCTATACTAAAAAATGGAGGTCGTCCAACCTTGAGTAA
CAAACCAAAAAGACGAACACCGGGAGCACATGTAGGAAGGTCTGCTACAAACGAAAAACCGCAGCGTGCAAGAAGTGTGT
CTTTTAGCCCAGAGAGGGTTCGTGTTAGAGGAAGATCACCAGCCTTCAATGCTCTTGCTTCTGCATTTGAGAATCCAAGT
GCACGGAACCTGTCAACACCCCCGCCCCAAGTGCGGAAGCCTTATCCAAAATCTGATTCGTCAAATGTTGCTCCGAGGTC
CACTGCTATAGCGTCACTAACCTCCACTTTCGAACAACCTCCACGAGAACCTCTCATGCCCCGTTCCATCAAACCTAGGC
CCAAATCGCCACCAAAGGCTGAGTCGAATTCGAAGGAAAACATAATGAGCAGTAAAATCGAAACCCTAACAATACAAGAA
GATGTTAAAGAAAATGAAGTTGAAGATGAGGAAGGGCTCACTTTGTACCCATACGAACGCCTTATAACAGTATCCACCGA
CCCTGCTGCAGACATCGATGTAACCAAAAGAGAGACATACTTGTCTTCCGCTGAGTTCAGGGAGAAGTTTGGAATGACGA
AAGAAGCCTTCTACAAGCTGCCAAAATGGAAGCAAAATAAACTGAAAATGGCACTTCAACTGTTCTGA
Microexon DNA seq GAGAACTAAAG
Microexon Amino Acid seq GELK
Microexon-tag DNA Seq AGAGATGCCGAAAGTGATCCCCATTTGTTCTCATGCACATTTACAAAAGGAGAACTAAAGGTCATTGAGATCTACAACTTTAACCAAGACGATTTGATGACGGAAGAT
Microexon-tag Amino Acid seq RDAESDPHLFSCTFTKGELKVIEIYNFNQDDLMTED
Transcript ID Ha.48513.2
Gene ID Ha.48513
Gene Name VLN5
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Ha.48513.2
MSVSMRDLDPAFQGAGQKAGIEVWRIENFKPVPVPQSSYGKFFTGDSYLILKTIALKSGALRQDIHYWLGKDTSKDEAGA
AALKTVELDAALGGRAVQYREVQGHETERFLSYFKPCIIPQEGGTASGFKHVESEEHKIRMFTCQGKHVVHVKEVPFARS
SLNHDDIFILDTANKIFQFNGSNSSIQERAKALEVVQHIKDTYHDGKCDIATVEDGKLMSDAETGEFWGFFGGFAPLPRK
TTTDDVKSAVALPTQLFCVEKGQAEPVSADTFTKELLDTNKCYLLDCGAEIYLWMGRSTSLDERKAASGVAEEYLRSQDR
LKSQIIRVIENFETVSFRSKFDAWPQSAEVAVSEDGRGKVAALLKRQGVNVRGLLKAAPEKEEPQPYIDCTGNLQVWRVN
GQEKILLPVPDQSKFYSGECYIFQYNYPGEDQDECLIGTWFGKKSVEEERSSATSHTNKMVESLKFMASQLQVYEGSEPI
LFFAIFQSFLVLKGGLSDGYKTFISENELSDDTYKEDGVALFRVQGSGPENMQAIQVEPVASSLNSSYCYILHSGSSVFT
WIGNLRTPEVEELVERQLDVIKPNMQAKLQKEGSESEQFWEILGGKSEYPSQKIARDAESDPHLFSCTFTKGELKVIEIY
NFNQDDLMTEDIFILDCHSSIFVWVGQQVDQKLKTQALVIGEKFVKHDFLLEKLTVQTPIYIISEGSEPEFFTRFFTWDS
TKSAMHGNSFQRKLSILKNGGRPTLSNKPKRRTPGAHVGRSATNEKPQRARSVSFSPERVRVRGRSPAFNALASAFENPS
ARNLSTPPPQVRKPYPKSDSSNVAPRSTAIASLTSTFEQPPREPLMPRSIKPRPKSPPKAESNSKENIMSSKIETLTIQE
DVKENEVEDEEGLTLYPYERLITVSTDPAADIDVTKRETYLSSAEFREKFGMTKEAFYKLPKWKQNKLKMALQLF*
CDS seq >Ha.48513.2
ATGTCTGTTTCTATGAGAGATTTGGATCCAGCCTTCCAAGGAGCTGGACAAAAAGCAGGAATTGAAGTGTGGCGCATTGA
GAATTTTAAACCAGTGCCCGTTCCACAGTCTTCTTATGGCAAGTTTTTTACAGGGGACTCCTATCTTATATTGAAGACTA
TTGCTTTGAAAAGCGGTGCACTACGCCAAGACATCCATTATTGGTTAGGTAAAGATACCAGCAAGGATGAAGCTGGAGCA
GCAGCACTGAAGACGGTCGAACTCGATGCAGCTCTTGGAGGACGTGCTGTTCAATATCGCGAGGTACAAGGACATGAAAC
CGAAAGATTCTTATCTTACTTTAAACCATGCATCATACCTCAAGAAGGCGGAACTGCATCTGGTTTCAAGCATGTTGAAT
CTGAGGAGCATAAGATCCGTATGTTTACTTGCCAAGGAAAGCATGTAGTTCATGTAAAAGAGGTTCCTTTTGCTCGATCC
TCACTCAATCATGATGATATCTTTATCTTGGATACTGCGAATAAGATATTCCAGTTTAACGGTTCCAATTCAAGCATTCA
AGAAAGGGCTAAAGCGCTAGAGGTTGTGCAACATATCAAAGATACGTATCATGATGGGAAGTGTGACATAGCTACTGTTG
AGGATGGAAAATTAATGTCTGATGCTGAAACCGGAGAGTTCTGGGGTTTCTTTGGTGGCTTTGCTCCGCTTCCAAGGAAA
ACAACAACAGATGACGTCAAAAGTGCCGTTGCTCTTCCCACTCAGCTATTCTGTGTGGAGAAGGGGCAGGCGGAACCGGT
TTCTGCCGATACGTTTACAAAGGAGCTGCTGGATACAAATAAATGCTATCTTCTGGATTGCGGGGCTGAAATATACTTAT
GGATGGGGAGAAGTACTTCTCTTGATGAAAGAAAAGCCGCAAGTGGAGTTGCAGAAGAATACCTGCGTAGCCAGGATAGA
CTGAAGTCTCAAATCATCCGAGTGATTGAAAATTTTGAAACTGTAAGTTTCCGGTCAAAGTTTGATGCTTGGCCTCAATC
AGCTGAGGTGGCCGTCTCTGAGGATGGTAGAGGCAAGGTGGCTGCACTTCTAAAGCGTCAAGGGGTCAATGTGAGGGGTT
TACTTAAAGCAGCTCCAGAAAAGGAGGAACCTCAACCATATATTGATTGCACAGGAAATTTGCAGGTTTGGCGTGTGAAT
GGCCAAGAGAAGATTCTTCTTCCGGTCCCTGATCAGTCGAAGTTTTACAGTGGAGAATGTTATATCTTCCAGTATAATTA
TCCCGGAGAAGATCAAGACGAATGCCTTATTGGGACATGGTTCGGAAAGAAAAGTGTCGAGGAAGAGCGGAGTTCGGCTA
CCTCACATACAAACAAGATGGTTGAGTCGCTCAAATTTATGGCTTCTCAGTTGCAAGTTTATGAAGGAAGCGAGCCTATT
CTATTCTTTGCAATCTTTCAGAGCTTTCTCGTTCTTAAGGGTGGTCTAAGTGATGGATACAAGACTTTTATATCAGAGAA
TGAACTTTCTGACGACACTTACAAAGAAGACGGGGTTGCGTTATTTCGAGTTCAAGGCTCTGGACCTGAAAACATGCAAG
CAATCCAAGTCGAACCCGTTGCGTCATCTTTGAACTCCTCGTACTGCTACATACTACACAGTGGTTCTTCCGTCTTTACA
TGGATTGGAAACCTTAGAACTCCCGAAGTAGAGGAACTCGTCGAGAGGCAACTAGATGTCATAAAGCCAAACATGCAGGC
AAAGTTACAAAAAGAGGGTTCGGAATCCGAACAATTTTGGGAAATTTTAGGTGGAAAATCCGAATACCCGAGTCAGAAGA
TTGCAAGAGATGCCGAAAGTGATCCCCATTTGTTCTCATGCACATTTACAAAAGGAGAACTAAAGGTCATTGAGATCTAC
AACTTTAACCAAGACGATTTGATGACGGAAGATATCTTTATTCTCGATTGTCACTCGAGCATCTTTGTTTGGGTAGGGCA
GCAGGTTGATCAGAAACTGAAAACACAAGCGTTAGTTATTGGGGAGAAATTCGTGAAGCATGATTTTCTTCTTGAGAAAT
TAACGGTTCAAACTCCGATATATATCATATCGGAAGGGAGCGAGCCAGAGTTCTTCACGCGCTTCTTCACATGGGATTCA
ACCAAATCTGCAATGCATGGAAACTCGTTTCAAAGGAAACTATCTATACTAAAAAATGGAGGTCGTCCAACCTTGAGTAA
CAAACCAAAAAGACGAACACCGGGAGCACATGTAGGAAGGTCTGCTACAAACGAAAAACCGCAGCGTGCAAGAAGTGTGT
CTTTTAGCCCAGAGAGGGTTCGTGTTAGAGGAAGATCACCAGCCTTCAATGCTCTTGCTTCTGCATTTGAGAATCCAAGT
GCACGGAACCTGTCAACACCCCCGCCCCAAGTGCGGAAGCCTTATCCAAAATCTGATTCGTCAAATGTTGCTCCGAGGTC
CACTGCTATAGCGTCACTAACCTCCACTTTCGAACAACCTCCACGAGAACCTCTCATGCCCCGTTCCATCAAACCTAGGC
CCAAATCGCCACCAAAGGCTGAGTCGAATTCGAAGGAAAACATAATGAGCAGTAAAATCGAAACCCTAACAATACAAGAA
GATGTTAAAGAAAATGAAGTTGAAGATGAGGAAGGGCTCACTTTGTACCCATACGAACGCCTTATAACAGTATCCACCGA
CCCTGCTGCAGACATCGATGTAACCAAAAGAGAGACATACTTGTCTTCCGCTGAGTTCAGGGAGAAGTTTGGAATGACGA
AAGAAGCCTTCTACAAGCTGCCAAAATGGAAGCAAAATAAACTGAAAATGGCACTTCAACTGTTCTGA