Microexon ID Ha_1:59927035-59927045:+
Species Helianthus annuus
Coordinates 1:59927035..59927045
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (degenerate, IUPAC ambiguity codes) MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAAAATTTGAG
Microexon Amino Acid seq GKFE
Microexon-tag DNA Seq TTTGATACTATTCGAGATCCTCATTTGTTTGGATTTTCACTTTCTAAAGGAAAATTTGAGGTCGAAGAAGTTTACAACTTTGATCAAGATGACTTGTTGCCGGAGGAT
Microexon-tag Amino Acid Seq FDTIRDPHLFGFSLSKGKFEVEEVYNFDQDDLLPED
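The fields above fit together arithmetically: the tag structure 49,11,48 locates the microexon inside the 108-nt microexon-tag, and Phase 1 explains why an 11-nt exon yields the four residues GKFE. The sketch below illustrates this with the sequences from this record. It assumes that phase 1 means one base of the first microexon codon lies in the upstream flanking exon, and that the standard genetic code applies; the helper names (`translate`, `split_tag`, `microexon_peptide`) are hypothetical and not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons in frame 0; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag_dna, sizes):
    """Cut the tag into (upstream exon, microexon, downstream exon)."""
    if sum(sizes) != len(tag_dna):
        raise ValueError("exon sizes do not add up to the tag length")
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag_dna[pos:pos + size])
        pos += size
    return tuple(parts)


def microexon_peptide(tag_dna, sizes, phase):
    """Translate every codon that overlaps the microexon.

    Assumption: `phase` is the number of bases of the first microexon
    codon that are contributed by the upstream flanking exon.
    """
    start = sizes[0] - phase                 # back up to the codon boundary
    n_bases = phase + sizes[1]               # borrowed bases + microexon
    end = start + -(-n_bases // 3) * 3       # round up to whole codons
    return translate(tag_dna[start:end])


# Values copied from this record.
TAG_DNA = ("TTTGATACTATTCGAGATCCTCATTTGTTTGGATTTTCACTTTCTAAAGGAAAATTTGAG"
           "GTCGAAGAAGTTTACAACTTTGATCAAGATGACTTGTTGCCGGAGGAT")
SIZES = (49, 11, 48)
PHASE = 1

upstream, microexon, downstream = split_tag(TAG_DNA, SIZES)
tag_protein = translate(TAG_DNA)
peptide = microexon_peptide(TAG_DNA, SIZES, PHASE)

print(microexon)    # microexon DNA
print(peptide)      # residues encoded by codons overlapping the microexon
print(tag_protein)  # translation of the full tag
```

Under these assumptions, the last base of the upstream exon (G) combines with the first two microexon bases (GA) to form the glycine codon GGA, which is why the peptide begins with G even though the microexon itself starts with GAA.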
Microexon-tag spanning region 59926876-59927187
Microexon-tag prediction score 0.9412
Overlapped with the annotated transcript (%) 100
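The coordinate fields can be cross-checked against the reported sizes. The following is a minimal sketch, assuming 1-based inclusive coordinates, a plus-strand feature (as the `+` in the Microexon ID suggests), and that the spanning region starts at the first base of the upstream flanking exon and ends at the last base of the downstream one. The function names are hypothetical.

```python
import re


def parse_region(text):
    """Extract (start, end) from 'chrom:start..end' or 'start-end'."""
    match = re.search(r"(\d+)(?:\.\.|-)(\d+)$", text)
    if match is None:
        raise ValueError(f"unrecognised region: {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


# Values copied from this record.
micro_start, micro_end = parse_region("1:59927035..59927045")
span_start, span_end = parse_region("59926876-59927187")
up_size, micro_size, down_size = 49, 11, 48

micro_len = span_length(micro_start, micro_end)
genomic_len = span_length(span_start, span_end)
exonic_len = up_size + micro_size + down_size
intronic_len = genomic_len - exonic_len

# Assumption: flanking exons sit flush against the ends of the spanning region.
upstream_intron = micro_start - (span_start + up_size)
downstream_intron = (span_end - down_size + 1) - micro_end - 1

print(micro_len, genomic_len, exonic_len, intronic_len)
print(upstream_intron, downstream_intron)
```

If the assumptions hold, the microexon length derived from the coordinates agrees with the Size field, and the two intron lengths sum to the difference between the genomic span and the exonic tag length.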
New Transcript ID OTG36546x
Reference Transcript ID OTG36546
Gene ID HannXRQ_Chr01g0008761
Gene Name NA
Transcript ID OTG36546
Protein ID OTG36546
Gene ID HannXRQ_Chr01g0008761
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 9.2e-10
Motif start 642
Motif end 720
Protein seq >OTG36546
MATAKALEPAFQGVGSRPGLEIWRIENFKPVPLPKSDYGTFYAGDSYVVLQTSAGRGGAGVFAHDVHYWLGKDTSQDEAG
TAAIKAVELDAILGGRAVQHREVQNYESDKFISYFKPCIAPREGGVKSGFKKPEEEEFETRLYTCRGKRVVHLKQVPFSR
SVLNHDDVFVLDTKEKIFQFNGANSNIQERAKALEVIQFLKDKYHEGTCNVAIVDDGKLQAEGDSGEFWVIFGGFAPIGK
KVLSDDDVIPERTPGKLYSIDGGKVGDQIEEYSKSTFETDKCYLMDCGFEVFVWVGRATQVDDRKAATQAAEEFLTSNNR
PKATLITRLIQGYETHAFKSNFDSWPSSTAPSAENRGKVAENRGKVSALLKQQGGGPKGKEKNAPVVEEVVPPLLDANGK
LEVWSIDGGAKNPVASEDVGKFYSGDCYIVLYSYHSREKKDDFYLCYWIGKDSTEEDQNTAAKLTTSMFNSLKGRPVQGR
IYQEKEPPQFIAIFQPMVLLKGGLSSSYKSYIAEKGLTDETYSPDNPSIIRISGTAVHNNKTVHLDPVPASLNSHEVFVV
HAGSHLYIWQGTQSTYEQQQWAAKIAEFLKPGVTAKYQKEGTESATFWLGLGGKEDVSTNKVSFDTIRDPHLFGFSLSKG
KFEVEEVYNFDQDDLLPEDILILDTHAEVFVWIGQAVDPKEKKNALDYGQKYIEWAENLDGLNPRVPLYRVPEGNEPNFF
TTYFSWEPLKTQIHGNSFQKKITILFGAGSAEGAGNQGGGNTQRAAAMAALNSTFGSSGGGGGGGKAPGATKNAGSQRRA
AVAALSGVIPDAKIDEPESPEKPEEAPEEPIEPSEPIPDDNDSEPKVAIEEDENGILSSQSTFSYEQVRVKSENPAPDID
LKRREAYLSVEEFESVLGMTREEFYKLPKWKQDLTKKKVDLF*
CDS seq >OTG36546
ATGGCTACTGCAAAAGCTCTTGAACCTGCATTCCAGGGAGTTGGTTCTAGACCAGGGCTTGAAATATGGAGAATTGAAAA
CTTTAAACCCGTCCCTTTGCCAAAATCTGATTATGGTACATTCTACGCTGGGGATTCATATGTTGTCTTGCAGACTTCTG
CTGGTAGGGGAGGTGCAGGTGTATTCGCGCATGACGTACACTACTGGTTAGGAAAAGATACTAGCCAGGATGAAGCTGGA
ACAGCGGCAATAAAAGCAGTGGAACTCGATGCGATTCTCGGTGGGCGCGCAGTGCAACATAGAGAAGTCCAGAATTATGA
ATCTGATAAATTTATATCGTATTTCAAACCGTGTATTGCACCACGAGAAGGTGGCGTTAAATCTGGATTTAAAAAACCCG
AAGAAGAAGAGTTTGAAACACGATTATATACATGCCGAGGAAAACGAGTTGTTCATTTGAAACAGGTCCCGTTTTCTCGA
TCTGTGTTGAATCATGACGATGTGTTTGTCTTGGACACTAAAGAGAAGATCTTTCAATTTAACGGAGCAAATTCAAATAT
TCAAGAAAGGGCTAAGGCTTTGGAGGTTATACAGTTCTTGAAGGATAAATATCATGAGGGGACATGCAATGTTGCAATTG
TCGACGATGGAAAACTACAAGCCGAGGGAGATTCAGGTGAATTTTGGGTTATTTTTGGCGGGTTTGCTCCTATTGGTAAA
AAGGTTCTAAGTGATGATGATGTCATCCCCGAAAGGACGCCTGGCAAACTTTACAGCATTGATGGAGGGAAGGTTGGGGA
TCAAATCGAGGAATATTCAAAATCAACATTCGAAACCGACAAATGCTATCTAATGGATTGTGGTTTCGAGGTGTTTGTTT
GGGTTGGTCGAGCAACGCAGGTGGATGATAGAAAAGCTGCCACACAGGCTGCCGAGGAGTTTCTCACCAGTAATAATCGG
CCGAAAGCCACCCTTATAACCCGACTAATTCAAGGTTATGAGACTCACGCATTCAAGTCAAACTTTGACTCTTGGCCATC
GTCAACCGCACCTTCTGCTGAAAACCGAGGAAAAGTGGCTGAAAATAGAGGAAAAGTTTCAGCTCTACTGAAGCAACAAG
GTGGTGGGCCGAAAGGAAAAGAAAAAAACGCTCCGGTTGTTGAGGAAGTTGTTCCTCCTTTGCTTGATGCAAATGGAAAA
CTCGAGGTATGGTCTATTGACGGCGGTGCTAAAAACCCCGTAGCCAGTGAGGACGTCGGTAAATTCTACAGTGGGGATTG
CTACATTGTTCTTTACAGTTACCATTCTCGAGAGAAAAAAGATGATTTTTATCTTTGTTACTGGATTGGAAAGGATAGTA
CTGAGGAGGACCAAAATACAGCTGCTAAGTTGACTACATCAATGTTCAATTCGCTCAAGGGGAGGCCAGTTCAGGGCCGT
ATATATCAAGAGAAAGAACCGCCACAATTCATTGCTATTTTTCAACCTATGGTTCTTTTGAAGGGTGGATTAAGCTCCAG
TTATAAAAGCTACATTGCGGAAAAAGGATTAACCGACGAAACTTACAGTCCAGATAACCCTTCAATTATTAGGATATCGG
GAACTGCAGTGCATAATAATAAAACTGTTCATCTAGATCCGGTGCCAGCATCTTTGAATTCGCATGAAGTCTTCGTCGTA
CATGCCGGGTCCCACCTATACATCTGGCAAGGAACCCAAAGTACTTATGAACAGCAGCAATGGGCAGCTAAAATTGCTGA
ATTTTTAAAACCTGGAGTAACCGCAAAGTATCAGAAAGAGGGAACCGAAAGCGCAACTTTTTGGCTCGGGCTTGGAGGGA
AAGAAGATGTTTCCACTAACAAAGTATCATTTGATACTATTCGAGATCCTCATTTGTTTGGATTTTCACTTTCTAAAGGA
AAATTTGAGGTCGAAGAAGTTTACAACTTTGATCAAGATGACTTGTTGCCGGAGGATATTTTAATATTAGACACACATGC
TGAGGTTTTTGTTTGGATTGGTCAAGCGGTTGACCCGAAGGAGAAGAAAAACGCTCTTGACTATGGGCAGAAATACATAG
AATGGGCTGAAAATCTGGACGGATTAAACCCGCGTGTGCCGTTATACAGAGTTCCAGAAGGAAATGAACCGAACTTCTTC
ACAACATATTTCTCTTGGGAACCTTTAAAAACCCAAATTCATGGAAACTCTTTCCAAAAGAAGATTACTATACTGTTTGG
AGCAGGTAGTGCTGAGGGTGCTGGGAATCAAGGTGGCGGTAACACTCAAAGAGCTGCAGCGATGGCTGCGTTGAACTCAA
CTTTTGGCTCGTCTGGTGGTGGAGGTGGCGGCGGTAAAGCACCCGGTGCAACAAAAAATGCCGGTTCACAGAGACGTGCC
GCAGTCGCTGCATTATCCGGGGTTATTCCTGATGCCAAAATCGATGAACCTGAGTCACCTGAGAAGCCTGAAGAAGCACC
TGAAGAACCTATAGAACCATCTGAACCCATTCCCGACGACAACGATTCAGAACCAAAAGTGGCGATTGAGGAGGACGAAA
ACGGAATTTTGTCAAGTCAATCCACTTTCAGCTATGAACAAGTTAGGGTTAAGTCAGAGAACCCTGCACCTGATATTGAT
CTTAAGAGGAGAGAGGCTTATCTTTCTGTTGAAGAATTTGAGTCTGTGCTTGGGATGACAAGAGAGGAGTTTTATAAGTT
GCCAAAATGGAAGCAAGATTTGACCAAGAAAAAGGTTGACCTCTTCTAA
Microexon DNA seq GAAAATTTGAG
Microexon Amino Acid seq GKFE
Microexon-tag DNA Seq TTTGATACTATTCGAGATCCTCATTTGTTTGGATTTTCACTTTCTAAAGGAAAATTTGAGGTCGAAGAAGTTTACAACTTTGATCAAGATGACTTGTTGCCGGAGGAT
Microexon-tag Amino Acid seq FDTIRDPHLFGFSLSKGKFEVEEVYNFDQDDLLPED
Transcript ID Ha.926.2
Gene ID Ha.926
Gene Name NA
Pfam domain motif Gelsolin
Motif E-value 9.2e-10
Motif start 642
Motif end 720
Protein seq >Ha.926.2
MATAKALEPAFQGVGSRPGLEIWRIENFKPVPLPKSDYGTFYAGDSYVVLQTSAGRGGAGVFAHDVHYWLGKDTSQDEAG
TAAIKAVELDAILGGRAVQHREVQNYESDKFISYFKPCIAPREGGVKSGFKKPEEEEFETRLYTCRGKRVVHLKQVPFSR
SVLNHDDVFVLDTKEKIFQFNGANSNIQERAKALEVIQFLKDKYHEGTCNVAIVDDGKLQAEGDSGEFWVIFGGFAPIGK
KVLSDDDVIPERTPGKLYSIDGGKVGDQIEEYSKSTFETDKCYLMDCGFEVFVWVGRATQVDDRKAATQAAEEFLTSNNR
PKATLITRLIQGYETHAFKSNFDSWPSSTAPSAENRGKVAENRGKVSALLKQQGGGPKGKEKNAPVVEEVVPPLLDANGK
LEVWSIDGGAKNPVASEDVGKFYSGDCYIVLYSYHSREKKDDFYLCYWIGKDSTEEDQNTAAKLTTSMFNSLKGRPVQGR
IYQEKEPPQFIAIFQPMVLLKGGLSSSYKSYIAEKGLTDETYSPDNPSIIRISGTAVHNNKTVHLDPVPASLNSHEVFVV
HAGSHLYIWQGTQSTYEQQQWAAKIAEFLKPGVTAKYQKEGTESATFWLGLGGKEDVSTNKVSFDTIRDPHLFGFSLSKG
KFEVEEVYNFDQDDLLPEDILILDTHAEVFVWIGQAVDPKEKKNALDYGQKYIEWAENLDGLNPRVPLYRVPEGNEPNFF
TTYFSWEPLKTQIHGNSFQKKITILFGAGSAEGAGNQGGGNTQRAAAMAALNSTFGSSGGGGGGGKAPGATKNAGSQRRA
AVAALSGVIPDAKIDEPESPEKPEEAPEEPIEPSEPIPDDNDSEPKVAIEEDENGILSSQSTFSYEQVRVKSENPAPDID
LKRREAYLSVEEFESVLGMTREEFYKLPKWKQDLTKKKVDLF*
CDS seq >Ha.926.2
ATGGCTACTGCAAAAGCTCTTGAACCTGCATTCCAGGGAGTTGGTTCTAGACCAGGGCTTGAAATATGGAGAATTGAAAA
CTTTAAACCCGTCCCTTTGCCAAAATCTGATTATGGTACATTCTACGCTGGGGATTCATATGTTGTCTTGCAGACTTCTG
CTGGTAGGGGAGGTGCAGGTGTATTCGCGCATGACGTACACTACTGGTTAGGAAAAGATACTAGCCAGGATGAAGCTGGA
ACAGCGGCAATAAAAGCAGTGGAACTCGATGCGATTCTCGGTGGGCGCGCAGTGCAACATAGAGAAGTCCAGAATTATGA
ATCTGATAAATTTATATCGTATTTCAAACCGTGTATTGCACCACGAGAAGGTGGCGTTAAATCTGGATTTAAAAAACCCG
AAGAAGAAGAGTTTGAAACACGATTATATACATGCCGAGGAAAACGAGTTGTTCATTTGAAACAGGTCCCGTTTTCTCGA
TCTGTGTTGAATCATGACGATGTGTTTGTCTTGGACACTAAAGAGAAGATCTTTCAATTTAACGGAGCAAATTCAAATAT
TCAAGAAAGGGCTAAGGCTTTGGAGGTTATACAGTTCTTGAAGGATAAATATCATGAGGGGACATGCAATGTTGCAATTG
TCGACGATGGAAAACTACAAGCCGAGGGAGATTCAGGTGAATTTTGGGTTATTTTTGGCGGGTTTGCTCCTATTGGTAAA
AAGGTTCTAAGTGATGATGATGTCATCCCCGAAAGGACGCCTGGCAAACTTTACAGCATTGATGGAGGGAAGGTTGGGGA
TCAAATCGAGGAATATTCAAAATCAACATTCGAAACCGACAAATGCTATCTAATGGATTGTGGTTTCGAGGTGTTTGTTT
GGGTTGGTCGAGCAACGCAGGTGGATGATAGAAAAGCTGCCACACAGGCTGCCGAGGAGTTTCTCACCAGTAATAATCGG
CCGAAAGCCACCCTTATAACCCGACTAATTCAAGGTTATGAGACTCACGCATTCAAGTCAAACTTTGACTCTTGGCCATC
GTCAACCGCACCTTCTGCTGAAAACCGAGGAAAAGTGGCTGAAAATAGAGGAAAAGTTTCAGCTCTACTGAAGCAACAAG
GTGGTGGGCCGAAAGGAAAAGAAAAAAACGCTCCGGTTGTTGAGGAAGTTGTTCCTCCTTTGCTTGATGCAAATGGAAAA
CTCGAGGTATGGTCTATTGACGGCGGTGCTAAAAACCCCGTAGCCAGTGAGGACGTCGGTAAATTCTACAGTGGGGATTG
CTACATTGTTCTTTACAGTTACCATTCTCGAGAGAAAAAAGATGATTTTTATCTTTGTTACTGGATTGGAAAGGATAGTA
CTGAGGAGGACCAAAATACAGCTGCTAAGTTGACTACATCAATGTTCAATTCGCTCAAGGGGAGGCCAGTTCAGGGCCGT
ATATATCAAGAGAAAGAACCGCCACAATTCATTGCTATTTTTCAACCTATGGTTCTTTTGAAGGGTGGATTAAGCTCCAG
TTATAAAAGCTACATTGCGGAAAAAGGATTAACCGACGAAACTTACAGTCCAGATAACCCTTCAATTATTAGGATATCGG
GAACTGCAGTGCATAATAATAAAACTGTTCATCTAGATCCGGTGCCAGCATCTTTGAATTCGCATGAAGTCTTCGTCGTA
CATGCCGGGTCCCACCTATACATCTGGCAAGGAACCCAAAGTACTTATGAACAGCAGCAATGGGCAGCTAAAATTGCTGA
ATTTTTAAAACCTGGAGTAACCGCAAAGTATCAGAAAGAGGGAACCGAAAGCGCAACTTTTTGGCTCGGGCTTGGAGGGA
AAGAAGATGTTTCCACTAACAAAGTATCATTTGATACTATTCGAGATCCTCATTTGTTTGGATTTTCACTTTCTAAAGGA
AAATTTGAGGTCGAAGAAGTTTACAACTTTGATCAAGATGACTTGTTGCCGGAGGATATTTTAATATTAGACACACATGC
TGAGGTTTTTGTTTGGATTGGTCAAGCGGTTGACCCGAAGGAGAAGAAAAACGCTCTTGACTATGGGCAGAAATACATAG
AATGGGCTGAAAATCTGGACGGATTAAACCCGCGTGTGCCGTTATACAGAGTTCCAGAAGGAAATGAACCGAACTTCTTC
ACAACATATTTCTCTTGGGAACCTTTAAAAACCCAAATTCATGGAAACTCTTTCCAAAAGAAGATTACTATACTGTTTGG
AGCAGGTAGTGCTGAGGGTGCTGGGAATCAAGGTGGCGGTAACACTCAAAGAGCTGCAGCGATGGCTGCGTTGAACTCAA
CTTTTGGCTCGTCTGGTGGTGGAGGTGGCGGCGGTAAAGCACCCGGTGCAACAAAAAATGCCGGTTCACAGAGACGTGCC
GCAGTCGCTGCATTATCCGGGGTTATTCCTGATGCCAAAATCGATGAACCTGAGTCACCTGAGAAGCCTGAAGAAGCACC
TGAAGAACCTATAGAACCATCTGAACCCATTCCCGACGACAACGATTCAGAACCAAAAGTGGCGATTGAGGAGGACGAAA
ACGGAATTTTGTCAAGTCAATCCACTTTCAGCTATGAACAAGTTAGGGTTAAGTCAGAGAACCCTGCACCTGATATTGAT
CTTAAGAGGAGAGAGGCTTATCTTTCTGTTGAAGAATTTGAGTCTGTGCTTGGGATGACAAGAGAGGAGTTTTATAAGTT
GCCAAAATGGAAGCAAGATTTGACCAAGAAAAAGGTTGACCTCTTCTAA