
Microexon ID | Ha_5:102618791-102618801:- |
Species | Helianthus annuus | Coordinates | 5:102618791..102618801 |
Microexon Cluster ID | MEP27 |
Size | 11 |
Phase | 1 |
Pfam Domain Motif | Gelsolin |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 49,11,48 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq | MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Microexon DNA seq | GAGATTTGAAG |
Microexon Amino Acid seq | GDLK |
Microexon-tag DNA Seq | AGAGATGCTGAAAGCGATCCCCATTTGTTCTCATGCACATTTTTAAAAGGAGATTTGAAGGTGACTGAGATCTACAACTTCAACCAAGACGATTTGATGACTGAAGAT |
Microexon-tag Amino Acid Seq | RDAESDPHLFSCTFLKGDLKVTEIYNFNQDDLMTED |
Microexon-tag spanning region | 102618649-102618957 |
Microexon-tag prediction score | 0.968 |
Overlapped with the annotated transcript (%) | 81.51 |
New Transcript ID | OTG24938x |
Reference Transcript ID | OTG24938 |
Gene ID | HannXRQ_Chr05g0142201 |
Gene Name | VLN4 |
Ha_5:102618791-102618801:- does not have available information here.
Microexon DNA seq | GAGATTTGAAG |
Microexon Amino Acid seq | GDLK |
Microexon-tag DNA Seq | AGAGATGCTGAAAGCGATCCCCATTTGTTCTCATGCACATTTTTAAAAGGAGATTTGAAGGTGACTGAGATCTACAACTTCAACCAAGACGATTTGATGACTGAAGAT |
Microexon-tag Amino Acid seq | RDAESDPHLFSCTFLKGDLKVTEIYNFNQDDLMTED |
Transcript ID | Ha.44587.1 |
Gene ID | Ha.44587 |
Gene Name | VLN4 |
Pfam domain motif | Unknown |
Motif E-value | NA |
Motif start | NA |
Motif end | NA |
Protein seq | >Ha.44587.1 MSVSMRDLDPAFQGAGQKAGIEVWRIENFKPVPVPQSSYGKFFTGDSYVILKTVALKSGALRHDIHYWLGKDTSQDEAGA AALKTVELDAALGGRAVQYREVQGHETERFLSYFKPCIIPQEGGTASGFKHVESEEHKIRMFTCQGKHVVHVKEVPFARS SLNHDDIFILDTANKIFQFNGSNSSIQERAKALEVVQHIKDTYHDGKCDIATVEDGKLMSDADTGEFWGFFGGFAPLPRK TVTDDVKIDDAIPTQLLCVEKGQAEPVAADSLTKELLDTNKCYLLDCGSEIYVWMGRSTSLDERKAASGAAEEYLRSKDR LKSNIIRVIENFETVAFRSKFDTWPQSAEVAVSEDGRGKVAALLKRQGLNVRGLLKAAPVNKEEPQPYIDCTGNLQVWRV NGQEKILLPVSDQSKFYSGECYIFQYNYPGEDQEECLIGTWFGKQSVEDERNSATSQANKMVESLKFIASQLQVYEGSEP ILFFAIFQSFLVFKGGLSDGYKNFISEKELPDDTFKDDGISLFRVQGSGPENMQAIQVEPVASSLNSSYCYILLSGSLVF TWIGNLTTPEVQELVERQLDVIKPNMQSKLQKEGSESEQFWEILGGKSEYPSQKIARDAESDPHLFSCTFLKGDLKVTEI YNFNQDDLMTEDIFILDCHSSIFVWVGQEVDQKLKTQALVIGEKFVKRDFLLEKLSPQVPVYIINEGSEPQFFTRFFTWD STKSAMHGNSFQRRLSILKNGGRPTLTNKPKRRTPVAHAGRSATAEKPQQRSRSVSFSPDRVRVRGRSPAFNALASTFEN ASARNLSTPPPQVRKPYPKSDSSNAASRSTALASLAATFEQQPPREPLMPRSIKPRPKSPPKSDSNSKENTMSSKMEALT IQEDVKENEVEDEEGLTLYPYERLITVSTDPAPDIDVTKRETYLSSAEFREKFGMTKEAFYKLPKWKQNKLKMALQLF* |
CDS seq | >Ha.44587.1 ATGTCTGTTTCTATGAGAGATTTGGATCCAGCTTTTCAAGGAGCTGGACAGAAAGCAGGAATTGAAGTGTGGCGCATTGA GAATTTTAAGCCGGTGCCTGTTCCACAGTCTTCTTATGGCAAATTTTTTACAGGGGACTCTTATGTCATTTTAAAGACTG TTGCTTTGAAAAGCGGTGCATTACGTCATGACATCCATTATTGGCTAGGTAAAGATACAAGTCAGGATGAAGCTGGAGCG GCAGCACTAAAAACGGTTGAACTTGATGCAGCTCTCGGAGGACGTGCTGTTCAATATCGAGAAGTACAAGGACATGAAAC GGAGAGATTTTTATCTTACTTTAAACCATGCATCATACCTCAAGAAGGTGGAACAGCATCTGGTTTTAAGCATGTTGAAT CTGAGGAACACAAGATCCGTATGTTTACTTGTCAAGGAAAACACGTGGTTCATGTAAAAGAGGTTCCGTTTGCTCGATCA TCACTCAATCATGATGACATTTTTATCTTGGATACCGCCAATAAGATATTCCAGTTTAATGGATCCAATTCAAGCATTCA AGAAAGGGCTAAAGCACTAGAGGTTGTGCAACATATCAAAGATACATATCATGATGGGAAATGTGACATAGCAACTGTTG AGGATGGAAAATTGATGTCTGATGCTGACACTGGAGAGTTCTGGGGTTTCTTTGGTGGCTTTGCTCCACTTCCAAGGAAA ACAGTTACGGATGACGTCAAGATTGACGATGCAATTCCCACTCAGTTACTTTGTGTTGAGAAGGGGCAGGCAGAACCTGT TGCTGCTGATTCTTTAACAAAGGAGTTGCTAGATACGAATAAATGTTATCTTTTGGATTGTGGGTCAGAAATTTATGTAT GGATGGGGAGAAGCACTTCTCTTGATGAAAGAAAGGCTGCAAGTGGAGCTGCAGAAGAATACCTGCGTAGTAAGGATAGA CTAAAGTCTAATATCATCCGTGTGATTGAAAATTTTGAAACGGTGGCTTTTCGGTCAAAGTTTGACACCTGGCCTCAATC TGCTGAGGTGGCTGTCTCGGAGGATGGTAGAGGCAAGGTGGCTGCACTTTTAAAGCGACAAGGGCTCAACGTGAGGGGCT TACTAAAAGCTGCGCCCGTTAATAAGGAGGAACCTCAACCATATATTGATTGCACTGGAAATTTACAGGTTTGGCGTGTA AATGGCCAAGAGAAGATACTTCTTCCAGTTTCTGATCAGTCAAAGTTCTACAGTGGAGAATGTTATATCTTTCAATATAA TTATCCTGGTGAAGATCAAGAGGAATGCCTTATAGGGACATGGTTTGGAAAGCAGAGTGTTGAGGATGAACGGAATTCAG CTACCTCACAGGCAAACAAGATGGTTGAGTCACTCAAGTTTATAGCTTCTCAGTTGCAAGTTTATGAAGGAAGTGAACCT ATCCTTTTCTTTGCAATCTTTCAGAGCTTTCTGGTTTTTAAGGGTGGTCTAAGTGATGGATACAAGAATTTCATATCAGA GAAGGAACTTCCCGATGACACTTTTAAAGATGACGGGATTTCACTATTTCGAGTTCAAGGTTCTGGACCGGAAAACATGC AAGCAATCCAAGTTGAACCCGTGGCTTCATCGTTAAACTCCTCTTACTGTTACATATTGCTCAGTGGTTCTTTAGTCTTT ACATGGATCGGAAACCTTACAACTCCTGAAGTCCAGGAACTCGTTGAGAGGCAACTTGATGTCATTAAGCCAAATATGCA GTCCAAATTACAAAAAGAGGGTTCAGAGTCTGAACAATTTTGGGAAATTTTAGGTGGAAAATCTGAATACCCTAGTCAGA AGATTGCAAGAGATGCTGAAAGCGATCCCCATTTGTTCTCATGCACATTTTTAAAAGGAGATTTGAAGGTGACTGAGATC TACAACTTCAACCAAGACGATTTGATGACTGAAGATATATTTATTCTTGATTGTCACTCAAGCATCTTTGTTTGGGTAGG GCAGGAGGTTGATCAGAAACTGAAAACACAAGCGTTAGTTATTGGGGAGAAATTTGTGAAACGTGATTTTCTTCTTGAGA AATTATCTCCTCAAGTTCCAGTTTATATCATAAATGAAGGAAGTGAGCCACAGTTCTTCACACGTTTCTTCACATGGGAT TCAACCAAATCTGCAATGCATGGAAACTCGTTCCAAAGGAGACTCTCCATACTAAAAAACGGTGGTCGTCCTACCTTGAC AAATAAACCAAAAAGGCGAACACCTGTAGCACATGCAGGAAGGTCTGCTACAGCCGAAAAACCGCAGCAGCGTTCAAGAA GTGTGTCTTTCAGCCCAGACCGAGTCCGCGTAAGAGGCAGATCACCAGCCTTTAACGCTCTTGCTTCTACATTCGAGAAT GCAAGTGCAAGAAACCTATCAACACCCCCTCCCCAAGTGAGAAAACCTTATCCAAAATCCGATTCTTCTAATGCTGCTTC AAGGTCTACAGCTTTAGCATCACTAGCCGCCACTTTTGAACAGCAACCTCCACGAGAACCCCTCATGCCCCGTTCCATTA AACCTCGGCCCAAATCGCCACCAAAGTCTGATTCAAACTCCAAGGAAAACACGATGAGCAGTAAAATGGAAGCCCTAACG ATACAAGAAGACGTTAAAGAAAATGAAGTTGAAGATGAGGAAGGGCTTACTTTATATCCATATGAACGCCTTATTACTGT ATCCACCGACCCTGCTCCAGATATTGATGTAACCAAGAGAGAGACATACTTGTCTTCTGCTGAGTTCAGGGAGAAGTTTG GAATGACCAAAGAAGCCTTCTACAAGCTGCCAAAATGGAAGCAAAATAAACTGAAAATGGCCCTTCAGTTGTTCTAA |