Microexon ID Ha_5:102618791-102618801:-
Species Helianthus annuus
Coordinates 5:102618791..102618801
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAGATTTGAAG
Microexon Amino Acid seq GDLK
Microexon-tag DNA Seq AGAGATGCTGAAAGCGATCCCCATTTGTTCTCATGCACATTTTTAAAAGGAGATTTGAAGGTGACTGAGATCTACAACTTCAACCAAGACGATTTGATGACTGAAGAT
Microexon-tag Amino Acid Seq RDAESDPHLFSCTFLKGDLKVTEIYNFNQDDLMTED
Microexon-tag spanning region102618649-102618957
Microexon-tag prediction score0.968
Overlapped with the annotated transcript (%) 81.51
New Transcript ID OTG24938x
Reference Transcript ID OTG24938
Gene ID HannXRQ_Chr05g0142201
Gene Name VLN4
Ha_5:102618791-102618801:- does not have available information here.
Microexon DNA seq GAGATTTGAAG
Microexon Amino Acid seq GDLK
Microexon-tag DNA Seq AGAGATGCTGAAAGCGATCCCCATTTGTTCTCATGCACATTTTTAAAAGGAGATTTGAAGGTGACTGAGATCTACAACTTCAACCAAGACGATTTGATGACTGAAGAT
Microexon-tag Amino Acid seq RDAESDPHLFSCTFLKGDLKVTEIYNFNQDDLMTED
Transcript ID Ha.44587.1
Gene ID Ha.44587
Gene Name VLN4
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Ha.44587.1
MSVSMRDLDPAFQGAGQKAGIEVWRIENFKPVPVPQSSYGKFFTGDSYVILKTVALKSGALRHDIHYWLGKDTSQDEAGA
AALKTVELDAALGGRAVQYREVQGHETERFLSYFKPCIIPQEGGTASGFKHVESEEHKIRMFTCQGKHVVHVKEVPFARS
SLNHDDIFILDTANKIFQFNGSNSSIQERAKALEVVQHIKDTYHDGKCDIATVEDGKLMSDADTGEFWGFFGGFAPLPRK
TVTDDVKIDDAIPTQLLCVEKGQAEPVAADSLTKELLDTNKCYLLDCGSEIYVWMGRSTSLDERKAASGAAEEYLRSKDR
LKSNIIRVIENFETVAFRSKFDTWPQSAEVAVSEDGRGKVAALLKRQGLNVRGLLKAAPVNKEEPQPYIDCTGNLQVWRV
NGQEKILLPVSDQSKFYSGECYIFQYNYPGEDQEECLIGTWFGKQSVEDERNSATSQANKMVESLKFIASQLQVYEGSEP
ILFFAIFQSFLVFKGGLSDGYKNFISEKELPDDTFKDDGISLFRVQGSGPENMQAIQVEPVASSLNSSYCYILLSGSLVF
TWIGNLTTPEVQELVERQLDVIKPNMQSKLQKEGSESEQFWEILGGKSEYPSQKIARDAESDPHLFSCTFLKGDLKVTEI
YNFNQDDLMTEDIFILDCHSSIFVWVGQEVDQKLKTQALVIGEKFVKRDFLLEKLSPQVPVYIINEGSEPQFFTRFFTWD
STKSAMHGNSFQRRLSILKNGGRPTLTNKPKRRTPVAHAGRSATAEKPQQRSRSVSFSPDRVRVRGRSPAFNALASTFEN
ASARNLSTPPPQVRKPYPKSDSSNAASRSTALASLAATFEQQPPREPLMPRSIKPRPKSPPKSDSNSKENTMSSKMEALT
IQEDVKENEVEDEEGLTLYPYERLITVSTDPAPDIDVTKRETYLSSAEFREKFGMTKEAFYKLPKWKQNKLKMALQLF*
CDS seq >Ha.44587.1
ATGTCTGTTTCTATGAGAGATTTGGATCCAGCTTTTCAAGGAGCTGGACAGAAAGCAGGAATTGAAGTGTGGCGCATTGA
GAATTTTAAGCCGGTGCCTGTTCCACAGTCTTCTTATGGCAAATTTTTTACAGGGGACTCTTATGTCATTTTAAAGACTG
TTGCTTTGAAAAGCGGTGCATTACGTCATGACATCCATTATTGGCTAGGTAAAGATACAAGTCAGGATGAAGCTGGAGCG
GCAGCACTAAAAACGGTTGAACTTGATGCAGCTCTCGGAGGACGTGCTGTTCAATATCGAGAAGTACAAGGACATGAAAC
GGAGAGATTTTTATCTTACTTTAAACCATGCATCATACCTCAAGAAGGTGGAACAGCATCTGGTTTTAAGCATGTTGAAT
CTGAGGAACACAAGATCCGTATGTTTACTTGTCAAGGAAAACACGTGGTTCATGTAAAAGAGGTTCCGTTTGCTCGATCA
TCACTCAATCATGATGACATTTTTATCTTGGATACCGCCAATAAGATATTCCAGTTTAATGGATCCAATTCAAGCATTCA
AGAAAGGGCTAAAGCACTAGAGGTTGTGCAACATATCAAAGATACATATCATGATGGGAAATGTGACATAGCAACTGTTG
AGGATGGAAAATTGATGTCTGATGCTGACACTGGAGAGTTCTGGGGTTTCTTTGGTGGCTTTGCTCCACTTCCAAGGAAA
ACAGTTACGGATGACGTCAAGATTGACGATGCAATTCCCACTCAGTTACTTTGTGTTGAGAAGGGGCAGGCAGAACCTGT
TGCTGCTGATTCTTTAACAAAGGAGTTGCTAGATACGAATAAATGTTATCTTTTGGATTGTGGGTCAGAAATTTATGTAT
GGATGGGGAGAAGCACTTCTCTTGATGAAAGAAAGGCTGCAAGTGGAGCTGCAGAAGAATACCTGCGTAGTAAGGATAGA
CTAAAGTCTAATATCATCCGTGTGATTGAAAATTTTGAAACGGTGGCTTTTCGGTCAAAGTTTGACACCTGGCCTCAATC
TGCTGAGGTGGCTGTCTCGGAGGATGGTAGAGGCAAGGTGGCTGCACTTTTAAAGCGACAAGGGCTCAACGTGAGGGGCT
TACTAAAAGCTGCGCCCGTTAATAAGGAGGAACCTCAACCATATATTGATTGCACTGGAAATTTACAGGTTTGGCGTGTA
AATGGCCAAGAGAAGATACTTCTTCCAGTTTCTGATCAGTCAAAGTTCTACAGTGGAGAATGTTATATCTTTCAATATAA
TTATCCTGGTGAAGATCAAGAGGAATGCCTTATAGGGACATGGTTTGGAAAGCAGAGTGTTGAGGATGAACGGAATTCAG
CTACCTCACAGGCAAACAAGATGGTTGAGTCACTCAAGTTTATAGCTTCTCAGTTGCAAGTTTATGAAGGAAGTGAACCT
ATCCTTTTCTTTGCAATCTTTCAGAGCTTTCTGGTTTTTAAGGGTGGTCTAAGTGATGGATACAAGAATTTCATATCAGA
GAAGGAACTTCCCGATGACACTTTTAAAGATGACGGGATTTCACTATTTCGAGTTCAAGGTTCTGGACCGGAAAACATGC
AAGCAATCCAAGTTGAACCCGTGGCTTCATCGTTAAACTCCTCTTACTGTTACATATTGCTCAGTGGTTCTTTAGTCTTT
ACATGGATCGGAAACCTTACAACTCCTGAAGTCCAGGAACTCGTTGAGAGGCAACTTGATGTCATTAAGCCAAATATGCA
GTCCAAATTACAAAAAGAGGGTTCAGAGTCTGAACAATTTTGGGAAATTTTAGGTGGAAAATCTGAATACCCTAGTCAGA
AGATTGCAAGAGATGCTGAAAGCGATCCCCATTTGTTCTCATGCACATTTTTAAAAGGAGATTTGAAGGTGACTGAGATC
TACAACTTCAACCAAGACGATTTGATGACTGAAGATATATTTATTCTTGATTGTCACTCAAGCATCTTTGTTTGGGTAGG
GCAGGAGGTTGATCAGAAACTGAAAACACAAGCGTTAGTTATTGGGGAGAAATTTGTGAAACGTGATTTTCTTCTTGAGA
AATTATCTCCTCAAGTTCCAGTTTATATCATAAATGAAGGAAGTGAGCCACAGTTCTTCACACGTTTCTTCACATGGGAT
TCAACCAAATCTGCAATGCATGGAAACTCGTTCCAAAGGAGACTCTCCATACTAAAAAACGGTGGTCGTCCTACCTTGAC
AAATAAACCAAAAAGGCGAACACCTGTAGCACATGCAGGAAGGTCTGCTACAGCCGAAAAACCGCAGCAGCGTTCAAGAA
GTGTGTCTTTCAGCCCAGACCGAGTCCGCGTAAGAGGCAGATCACCAGCCTTTAACGCTCTTGCTTCTACATTCGAGAAT
GCAAGTGCAAGAAACCTATCAACACCCCCTCCCCAAGTGAGAAAACCTTATCCAAAATCCGATTCTTCTAATGCTGCTTC
AAGGTCTACAGCTTTAGCATCACTAGCCGCCACTTTTGAACAGCAACCTCCACGAGAACCCCTCATGCCCCGTTCCATTA
AACCTCGGCCCAAATCGCCACCAAAGTCTGATTCAAACTCCAAGGAAAACACGATGAGCAGTAAAATGGAAGCCCTAACG
ATACAAGAAGACGTTAAAGAAAATGAAGTTGAAGATGAGGAAGGGCTTACTTTATATCCATATGAACGCCTTATTACTGT
ATCCACCGACCCTGCTCCAGATATTGATGTAACCAAGAGAGAGACATACTTGTCTTCTGCTGAGTTCAGGGAGAAGTTTG
GAATGACCAAAGAAGCCTTCTACAAGCTGCCAAAATGGAAGCAAAATAAACTGAAAATGGCCCTTCAGTTGTTCTAA