
Microexon ID | Ha_8:728389-728401:+ |
Species | Helianthus annuus | Coordinates | 8:728389..728401 |
Microexon Cluster ID | MEP32 |
Size | 13 |
Phase | 0 |
Pfam Domain Motif | MCM6_C |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 48,13,47 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq | AMYRTTGAWSTKWCWGATGTYCTYTCTARYTTCCCKGACATMTCARTGGHWCTGRYTGAAGAWATYATGGAKARRCTWSTWAAMSAWRRTRTACTRTCAARRRCRGGA |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Microexon DNA seq | GTTTTGCTTGAAG |
Microexon Amino Acid seq | VLLEE |
Microexon-tag DNA Seq | AACGTCGATGTCACTGATGTTCTCTCTAATTTCCCCGACATCTCATTGGTTTTGCTTGAAGAAATTATGGAAAAGCTTGTAACCGAAGGCATTTTAGCAAAGTCTGGA |
Microexon-tag Amino Acid Seq | NVDVTDVLSNFPDISLVLLEEIMEKLVTEGILAKSG |
Microexon-tag spanning region | 728241-728897 |
Microexon-tag prediction score | 0.9392 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | OTG17049x |
Reference Transcript ID | OTG17049 |
Gene ID | HannXRQ_Chr08g0207781 |
Gene Name | ASY1 |
Transcript ID | OTG17049 |
Protein ID | OTG17049 |
Gene ID | HannXRQ_Chr08g0207781 |
Gene Name | ASY1 |
Pfam domain motif | Unknown |
Motif E-value | NA |
Motif start | NA |
Motif end | NA |
Protein seq | >OTG17049 MVVAQKLKEAEITEQDSLLLTRNLLRIAIFNISYIRGLFPEKYFSDKGVPALEMKIKKLMPMDAESRRLIDWMEKGVYDA LQKKYLRTLMFCVCETAEGPMIEEYSFSFSYSNSDSQEVSMNVNRIGNKKRGETFKCNSTTEITPTQMKSSACKMVRTLI QLMRTLDKMPEERTIVMKLHYYDDVTPADYEPPFFRGCTEEEAHHTWTKNPLRMEVGNVNSKHFVLSLKVKSVLDPCEDD NVSLGGDSMQQDGDSEADSEASISDDEYIVAPIDKQKEKQDDTMVDEDDTQDPAEDEQQLNRVKDWIKSYHQENVDVTDV LSNFPDISLVLLEEIMEKLVTEGILAKSGNESFTIKRTKNSEYEFDAVKEETDARQASGKKSLPTTGEAQMYMKALYHAL PMNYITISKLQSKLGGEANQITVRKLMDKMTKEGYIESTSNRRLGKRVVHSDITKKKLAEVQKALDFDAMVEDTHETRNK SNHSVNQAGANCWDTSTMGGLHSIGSDLTRTKGRSDLHQNGSPFSDAAASKLRVNGHNTPTRNTQPIASIESHVAAGVEK GGNENVNPDDKMDTVVCSGQSTLDKRSRKASMVKEPILQFSKRQKSE* |
CDS seq | >OTG17049 ATGGTTGTTGCGCAGAAGTTGAAGGAAGCGGAGATCACCGAGCAGGATTCTCTGCTTCTGACAAGGAACCTACTACGCAT AGCAATATTCAATATCAGTTATATCAGAGGTCTTTTTCCAGAGAAATACTTTAGTGATAAGGGCGTGCCAGCCTTAGAGA TGAAGATAAAAAAGCTTATGCCAATGGATGCAGAGTCTAGAAGACTAATTGACTGGATGGAGAAAGGTGTATATGATGCT TTGCAGAAGAAATATCTCAGGACACTAATGTTCTGTGTGTGTGAGACTGCAGAAGGGCCAATGATTGAAGAATATTCTTT TTCTTTCAGTTACTCAAATTCTGATAGCCAAGAGGTTTCTATGAATGTCAATCGAATTGGAAACAAGAAGCGAGGGGAAA CATTCAAATGTAACTCCACAACAGAGATTACCCCTACTCAAATGAAGAGCTCTGCTTGTAAAATGGTTCGCACACTCATT CAATTGATGAGAACTCTAGATAAGATGCCAGAAGAGCGCACGATTGTAATGAAACTCCACTATTATGATGATGTCACGCC TGCTGATTACGAGCCCCCGTTCTTCAGAGGCTGCACAGAGGAAGAAGCCCATCATACATGGACCAAAAATCCTTTGAGAA TGGAGGTTGGAAATGTCAACAGCAAACATTTTGTTTTATCTCTTAAGGTTAAAAGTGTGCTTGATCCTTGTGAGGATGAT AATGTTAGTTTGGGAGGTGACTCAATGCAGCAAGATGGAGACAGTGAGGCTGATAGTGAGGCCAGCATATCTGATGATGA GTACATAGTGGCACCTATAGATAAGCAAAAGGAGAAACAAGATGATACCATGGTTGATGAAGACGACACTCAGGATCCAG CTGAAGATGAACAGCAGCTAAACCGTGTCAAGGATTGGATCAAATCCTATCATCAAGAAAACGTCGATGTCACTGATGTT CTCTCTAATTTCCCCGACATCTCATTGGTTTTGCTTGAAGAAATTATGGAAAAGCTTGTAACCGAAGGCATTTTAGCAAA GTCTGGAAACGAAAGTTTCACCATCAAGAGGACGAAGAATTCGGAGTATGAATTTGATGCTGTGAAAGAGGAAACTGACG CTCGTCAGGCATCTGGCAAGAAGAGCCTGCCCACTACTGGCGAGGCTCAAATGTACATGAAAGCACTTTATCATGCTCTT CCAATGAACTACATTACTATTTCGAAGCTTCAAAGCAAGCTTGGTGGTGAGGCTAACCAAATTACCGTGCGCAAGTTAAT GGATAAAATGACTAAAGAAGGTTACATCGAATCCACAAGCAATCGTAGGCTAGGAAAGCGTGTGGTGCATTCTGATATAA CCAAGAAGAAACTTGCTGAAGTGCAAAAGGCCTTGGACTTTGATGCTATGGTTGAGGATACACATGAAACTCGTAACAAA TCCAACCATTCGGTGAACCAAGCAGGGGCTAATTGTTGGGACACATCCACAATGGGTGGACTGCATTCCATAGGATCAGA TCTCACACGCACGAAAGGTAGATCCGATTTGCACCAAAACGGTTCCCCCTTCAGTGATGCAGCTGCCTCTAAGTTGAGAG TGAATGGACACAACACTCCCACAAGAAACACTCAGCCAATAGCTTCAATAGAGAGCCACGTGGCAGCAGGGGTTGAGAAG GGTGGAAACGAAAATGTCAATCCGGATGATAAAATGGATACTGTCGTTTGTAGCGGCCAGTCCACCCTTGACAAACGTTC TCGAAAAGCAAGCATGGTGAAGGAGCCAATCCTCCAGTTCTCGAAACGCCAGAAATCTGAGTGA |
Microexon DNA seq | GTTTTGCTTGAAG |
Microexon Amino Acid seq | VLLEE |
Microexon-tag DNA Seq | AACGTCGATGTCACTGATGTTCTCTCTAATTTCCCCGACATCTCATTGGTTTTGCTTGAAGAAATTATGGAAAAGCTTGTAACCGAAGGCATTTTAGCAAAGTCTGGA |
Microexon-tag Amino Acid seq | NVDVTDVLSNFPDISLVLLEEIMEKLVTEGILAKSG |
Transcript ID | OTG17049 |
Gene ID | Ha.51413 |
Gene Name | ASY1 |
Pfam domain motif | Unknown |
Motif E-value | NA |
Motif start | NA |
Motif end | NA |
Protein seq | >OTG17049 MVVAQKLKEAEITEQDSLLLTRNLLRIAIFNISYIRGLFPEKYFSDKGVPALEMKIKKLMPMDAESRRLIDWMEKGVYDA LQKKYLRTLMFCVCETAEGPMIEEYSFSFSYSNSDSQEVSMNVNRIGNKKRGETFKCNSTTEITPTQMKSSACKMVRTLI QLMRTLDKMPEERTIVMKLHYYDDVTPADYEPPFFRGCTEEEAHHTWTKNPLRMEVGNVNSKHFVLSLKVKSVLDPCEDD NVSLGGDSMQQDGDSEADSEASISDDEYIVAPIDKQKEKQDDTMVDEDDTQDPAEDEQQLNRVKDWIKSYHQENVDVTDV LSNFPDISLVLLEEIMEKLVTEGILAKSGNESFTIKRTKNSEYEFDAVKEETDARQASGKKSLPTTGEAQMYMKALYHAL PMNYITISKLQSKLGGEANQITVRKLMDKMTKEGYIESTSNRRLGKRVVHSDITKKKLAEVQKALDFDAMVEDTHETRNK SNHSVNQAGANCWDTSTMGGLHSIGSDLTRTKGRSDLHQNGSPFSDAAASKLRVNGHNTPTRNTQPIASIESHVAAGVEK GGNENVNPDDKMDTVVCSGQSTLDKRSRKASMVKEPILQFSKRQKSE* |
CDS seq | >OTG17049 ATGGTTGTTGCGCAGAAGTTGAAGGAAGCGGAGATCACCGAGCAGGATTCTCTGCTTCTGACAAGGAACCTACTACGCAT AGCAATATTCAATATCAGTTATATCAGAGGTCTTTTTCCAGAGAAATACTTTAGTGATAAGGGCGTGCCAGCCTTAGAGA TGAAGATAAAAAAGCTTATGCCAATGGATGCAGAGTCTAGAAGACTAATTGACTGGATGGAGAAAGGTGTATATGATGCT TTGCAGAAGAAATATCTCAGGACACTAATGTTCTGTGTGTGTGAGACTGCAGAAGGGCCAATGATTGAAGAATATTCTTT TTCTTTCAGTTACTCAAATTCTGATAGCCAAGAGGTTTCTATGAATGTCAATCGAATTGGAAACAAGAAGCGAGGGGAAA CATTCAAATGTAACTCCACAACAGAGATTACCCCTACTCAAATGAAGAGCTCTGCTTGTAAAATGGTTCGCACACTCATT CAATTGATGAGAACTCTAGATAAGATGCCAGAAGAGCGCACGATTGTAATGAAACTCCACTATTATGATGATGTCACGCC TGCTGATTACGAGCCCCCGTTCTTCAGAGGCTGCACAGAGGAAGAAGCCCATCATACATGGACCAAAAATCCTTTGAGAA TGGAGGTTGGAAATGTCAACAGCAAACATTTTGTTTTATCTCTTAAGGTTAAAAGTGTGCTTGATCCTTGTGAGGATGAT AATGTTAGTTTGGGAGGTGACTCAATGCAGCAAGATGGAGACAGTGAGGCTGATAGTGAGGCCAGCATATCTGATGATGA GTACATAGTGGCACCTATAGATAAGCAAAAGGAGAAACAAGATGATACCATGGTTGATGAAGACGACACTCAGGATCCAG CTGAAGATGAACAGCAGCTAAACCGTGTCAAGGATTGGATCAAATCCTATCATCAAGAAAACGTCGATGTCACTGATGTT CTCTCTAATTTCCCCGACATCTCATTGGTTTTGCTTGAAGAAATTATGGAAAAGCTTGTAACCGAAGGCATTTTAGCAAA GTCTGGAAACGAAAGTTTCACCATCAAGAGGACGAAGAATTCGGAGTATGAATTTGATGCTGTGAAAGAGGAAACTGACG CTCGTCAGGCATCTGGCAAGAAGAGCCTGCCCACTACTGGCGAGGCTCAAATGTACATGAAAGCACTTTATCATGCTCTT CCAATGAACTACATTACTATTTCGAAGCTTCAAAGCAAGCTTGGTGGTGAGGCTAACCAAATTACCGTGCGCAAGTTAAT GGATAAAATGACTAAAGAAGGTTACATCGAATCCACAAGCAATCGTAGGCTAGGAAAGCGTGTGGTGCATTCTGATATAA CCAAGAAGAAACTTGCTGAAGTGCAAAAGGCCTTGGACTTTGATGCTATGGTTGAGGATACACATGAAACTCGTAACAAA TCCAACCATTCGGTGAACCAAGCAGGGGCTAATTGTTGGGACACATCCACAATGGGTGGACTGCATTCCATAGGATCAGA TCTCACACGCACGAAAGGTAGATCCGATTTGCACCAAAACGGTTCCCCCTTCAGTGATGCAGCTGCCTCTAAGTTGAGAG TGAATGGACACAACACTCCCACAAGAAACACTCAGCCAATAGCTTCAATAGAGAGCCACGTGGCAGCAGGGGTTGAGAAG GGTGGAAACGAAAATGTCAATCCGGATGATAAAATGGATACTGTCGTTTGTAGCGGCCAGTCCACCCTTGACAAACGTTC TCGAAAAGCAAGCATGGTGAAGGAGCCAATCCTCCAGTTCTCGAAACGCCAGAAATCTGAGTGA |