Microexon ID Ha_8:728389-728401:+
Species Helianthus annuus
Coordinates 8:728389..728401
Microexon Cluster ID MEP32
Size 13
Phase 0
Pfam Domain Motif MCM6_C
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,13,47
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq AMYRTTGAWSTKWCWGATGTYCTYTCTARYTTCCCKGACATMTCARTGGHWCTGRYTGAAGAWATYATGGAKARRCTWSTWAAMSAWRRTRTACTRTCAARRRCRGGA
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTTTTGCTTGAAG
Microexon Amino Acid seq VLLEE
Microexon-tag DNA Seq AACGTCGATGTCACTGATGTTCTCTCTAATTTCCCCGACATCTCATTGGTTTTGCTTGAAGAAATTATGGAAAAGCTTGTAACCGAAGGCATTTTAGCAAAGTCTGGA
Microexon-tag Amino Acid Seq NVDVTDVLSNFPDISLVLLEEIMEKLVTEGILAKSG
Microexon-tag spanning region728241-728897
Microexon-tag prediction score0.9392
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG17049x
Reference Transcript ID OTG17049
Gene ID HannXRQ_Chr08g0207781
Gene Name ASY1
Transcript ID OTG17049
Protein ID OTG17049
Gene ID HannXRQ_Chr08g0207781
Gene Name ASY1
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >OTG17049
MVVAQKLKEAEITEQDSLLLTRNLLRIAIFNISYIRGLFPEKYFSDKGVPALEMKIKKLMPMDAESRRLIDWMEKGVYDA
LQKKYLRTLMFCVCETAEGPMIEEYSFSFSYSNSDSQEVSMNVNRIGNKKRGETFKCNSTTEITPTQMKSSACKMVRTLI
QLMRTLDKMPEERTIVMKLHYYDDVTPADYEPPFFRGCTEEEAHHTWTKNPLRMEVGNVNSKHFVLSLKVKSVLDPCEDD
NVSLGGDSMQQDGDSEADSEASISDDEYIVAPIDKQKEKQDDTMVDEDDTQDPAEDEQQLNRVKDWIKSYHQENVDVTDV
LSNFPDISLVLLEEIMEKLVTEGILAKSGNESFTIKRTKNSEYEFDAVKEETDARQASGKKSLPTTGEAQMYMKALYHAL
PMNYITISKLQSKLGGEANQITVRKLMDKMTKEGYIESTSNRRLGKRVVHSDITKKKLAEVQKALDFDAMVEDTHETRNK
SNHSVNQAGANCWDTSTMGGLHSIGSDLTRTKGRSDLHQNGSPFSDAAASKLRVNGHNTPTRNTQPIASIESHVAAGVEK
GGNENVNPDDKMDTVVCSGQSTLDKRSRKASMVKEPILQFSKRQKSE*
CDS seq >OTG17049
ATGGTTGTTGCGCAGAAGTTGAAGGAAGCGGAGATCACCGAGCAGGATTCTCTGCTTCTGACAAGGAACCTACTACGCAT
AGCAATATTCAATATCAGTTATATCAGAGGTCTTTTTCCAGAGAAATACTTTAGTGATAAGGGCGTGCCAGCCTTAGAGA
TGAAGATAAAAAAGCTTATGCCAATGGATGCAGAGTCTAGAAGACTAATTGACTGGATGGAGAAAGGTGTATATGATGCT
TTGCAGAAGAAATATCTCAGGACACTAATGTTCTGTGTGTGTGAGACTGCAGAAGGGCCAATGATTGAAGAATATTCTTT
TTCTTTCAGTTACTCAAATTCTGATAGCCAAGAGGTTTCTATGAATGTCAATCGAATTGGAAACAAGAAGCGAGGGGAAA
CATTCAAATGTAACTCCACAACAGAGATTACCCCTACTCAAATGAAGAGCTCTGCTTGTAAAATGGTTCGCACACTCATT
CAATTGATGAGAACTCTAGATAAGATGCCAGAAGAGCGCACGATTGTAATGAAACTCCACTATTATGATGATGTCACGCC
TGCTGATTACGAGCCCCCGTTCTTCAGAGGCTGCACAGAGGAAGAAGCCCATCATACATGGACCAAAAATCCTTTGAGAA
TGGAGGTTGGAAATGTCAACAGCAAACATTTTGTTTTATCTCTTAAGGTTAAAAGTGTGCTTGATCCTTGTGAGGATGAT
AATGTTAGTTTGGGAGGTGACTCAATGCAGCAAGATGGAGACAGTGAGGCTGATAGTGAGGCCAGCATATCTGATGATGA
GTACATAGTGGCACCTATAGATAAGCAAAAGGAGAAACAAGATGATACCATGGTTGATGAAGACGACACTCAGGATCCAG
CTGAAGATGAACAGCAGCTAAACCGTGTCAAGGATTGGATCAAATCCTATCATCAAGAAAACGTCGATGTCACTGATGTT
CTCTCTAATTTCCCCGACATCTCATTGGTTTTGCTTGAAGAAATTATGGAAAAGCTTGTAACCGAAGGCATTTTAGCAAA
GTCTGGAAACGAAAGTTTCACCATCAAGAGGACGAAGAATTCGGAGTATGAATTTGATGCTGTGAAAGAGGAAACTGACG
CTCGTCAGGCATCTGGCAAGAAGAGCCTGCCCACTACTGGCGAGGCTCAAATGTACATGAAAGCACTTTATCATGCTCTT
CCAATGAACTACATTACTATTTCGAAGCTTCAAAGCAAGCTTGGTGGTGAGGCTAACCAAATTACCGTGCGCAAGTTAAT
GGATAAAATGACTAAAGAAGGTTACATCGAATCCACAAGCAATCGTAGGCTAGGAAAGCGTGTGGTGCATTCTGATATAA
CCAAGAAGAAACTTGCTGAAGTGCAAAAGGCCTTGGACTTTGATGCTATGGTTGAGGATACACATGAAACTCGTAACAAA
TCCAACCATTCGGTGAACCAAGCAGGGGCTAATTGTTGGGACACATCCACAATGGGTGGACTGCATTCCATAGGATCAGA
TCTCACACGCACGAAAGGTAGATCCGATTTGCACCAAAACGGTTCCCCCTTCAGTGATGCAGCTGCCTCTAAGTTGAGAG
TGAATGGACACAACACTCCCACAAGAAACACTCAGCCAATAGCTTCAATAGAGAGCCACGTGGCAGCAGGGGTTGAGAAG
GGTGGAAACGAAAATGTCAATCCGGATGATAAAATGGATACTGTCGTTTGTAGCGGCCAGTCCACCCTTGACAAACGTTC
TCGAAAAGCAAGCATGGTGAAGGAGCCAATCCTCCAGTTCTCGAAACGCCAGAAATCTGAGTGA
Microexon DNA seq GTTTTGCTTGAAG
Microexon Amino Acid seq VLLEE
Microexon-tag DNA Seq AACGTCGATGTCACTGATGTTCTCTCTAATTTCCCCGACATCTCATTGGTTTTGCTTGAAGAAATTATGGAAAAGCTTGTAACCGAAGGCATTTTAGCAAAGTCTGGA
Microexon-tag Amino Acid seq NVDVTDVLSNFPDISLVLLEEIMEKLVTEGILAKSG
Transcript ID OTG17049
Gene ID Ha.51413
Gene Name ASY1
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >OTG17049
MVVAQKLKEAEITEQDSLLLTRNLLRIAIFNISYIRGLFPEKYFSDKGVPALEMKIKKLMPMDAESRRLIDWMEKGVYDA
LQKKYLRTLMFCVCETAEGPMIEEYSFSFSYSNSDSQEVSMNVNRIGNKKRGETFKCNSTTEITPTQMKSSACKMVRTLI
QLMRTLDKMPEERTIVMKLHYYDDVTPADYEPPFFRGCTEEEAHHTWTKNPLRMEVGNVNSKHFVLSLKVKSVLDPCEDD
NVSLGGDSMQQDGDSEADSEASISDDEYIVAPIDKQKEKQDDTMVDEDDTQDPAEDEQQLNRVKDWIKSYHQENVDVTDV
LSNFPDISLVLLEEIMEKLVTEGILAKSGNESFTIKRTKNSEYEFDAVKEETDARQASGKKSLPTTGEAQMYMKALYHAL
PMNYITISKLQSKLGGEANQITVRKLMDKMTKEGYIESTSNRRLGKRVVHSDITKKKLAEVQKALDFDAMVEDTHETRNK
SNHSVNQAGANCWDTSTMGGLHSIGSDLTRTKGRSDLHQNGSPFSDAAASKLRVNGHNTPTRNTQPIASIESHVAAGVEK
GGNENVNPDDKMDTVVCSGQSTLDKRSRKASMVKEPILQFSKRQKSE*
CDS seq >OTG17049
ATGGTTGTTGCGCAGAAGTTGAAGGAAGCGGAGATCACCGAGCAGGATTCTCTGCTTCTGACAAGGAACCTACTACGCAT
AGCAATATTCAATATCAGTTATATCAGAGGTCTTTTTCCAGAGAAATACTTTAGTGATAAGGGCGTGCCAGCCTTAGAGA
TGAAGATAAAAAAGCTTATGCCAATGGATGCAGAGTCTAGAAGACTAATTGACTGGATGGAGAAAGGTGTATATGATGCT
TTGCAGAAGAAATATCTCAGGACACTAATGTTCTGTGTGTGTGAGACTGCAGAAGGGCCAATGATTGAAGAATATTCTTT
TTCTTTCAGTTACTCAAATTCTGATAGCCAAGAGGTTTCTATGAATGTCAATCGAATTGGAAACAAGAAGCGAGGGGAAA
CATTCAAATGTAACTCCACAACAGAGATTACCCCTACTCAAATGAAGAGCTCTGCTTGTAAAATGGTTCGCACACTCATT
CAATTGATGAGAACTCTAGATAAGATGCCAGAAGAGCGCACGATTGTAATGAAACTCCACTATTATGATGATGTCACGCC
TGCTGATTACGAGCCCCCGTTCTTCAGAGGCTGCACAGAGGAAGAAGCCCATCATACATGGACCAAAAATCCTTTGAGAA
TGGAGGTTGGAAATGTCAACAGCAAACATTTTGTTTTATCTCTTAAGGTTAAAAGTGTGCTTGATCCTTGTGAGGATGAT
AATGTTAGTTTGGGAGGTGACTCAATGCAGCAAGATGGAGACAGTGAGGCTGATAGTGAGGCCAGCATATCTGATGATGA
GTACATAGTGGCACCTATAGATAAGCAAAAGGAGAAACAAGATGATACCATGGTTGATGAAGACGACACTCAGGATCCAG
CTGAAGATGAACAGCAGCTAAACCGTGTCAAGGATTGGATCAAATCCTATCATCAAGAAAACGTCGATGTCACTGATGTT
CTCTCTAATTTCCCCGACATCTCATTGGTTTTGCTTGAAGAAATTATGGAAAAGCTTGTAACCGAAGGCATTTTAGCAAA
GTCTGGAAACGAAAGTTTCACCATCAAGAGGACGAAGAATTCGGAGTATGAATTTGATGCTGTGAAAGAGGAAACTGACG
CTCGTCAGGCATCTGGCAAGAAGAGCCTGCCCACTACTGGCGAGGCTCAAATGTACATGAAAGCACTTTATCATGCTCTT
CCAATGAACTACATTACTATTTCGAAGCTTCAAAGCAAGCTTGGTGGTGAGGCTAACCAAATTACCGTGCGCAAGTTAAT
GGATAAAATGACTAAAGAAGGTTACATCGAATCCACAAGCAATCGTAGGCTAGGAAAGCGTGTGGTGCATTCTGATATAA
CCAAGAAGAAACTTGCTGAAGTGCAAAAGGCCTTGGACTTTGATGCTATGGTTGAGGATACACATGAAACTCGTAACAAA
TCCAACCATTCGGTGAACCAAGCAGGGGCTAATTGTTGGGACACATCCACAATGGGTGGACTGCATTCCATAGGATCAGA
TCTCACACGCACGAAAGGTAGATCCGATTTGCACCAAAACGGTTCCCCCTTCAGTGATGCAGCTGCCTCTAAGTTGAGAG
TGAATGGACACAACACTCCCACAAGAAACACTCAGCCAATAGCTTCAATAGAGAGCCACGTGGCAGCAGGGGTTGAGAAG
GGTGGAAACGAAAATGTCAATCCGGATGATAAAATGGATACTGTCGTTTGTAGCGGCCAGTCCACCCTTGACAAACGTTC
TCGAAAAGCAAGCATGGTGAAGGAGCCAATCCTCCAGTTCTCGAAACGCCAGAAATCTGAGTGA