Microexon ID Ha_9:158773525-158773533:-
Species Helianthus annuus
Coordinates 9:158773525..158773533
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ACCCTGACG
Microexon Amino Acid seq DPDA
Microexon-tag DNA Seq TGGCAGCAGCCTGCTTATCATTTTCGGCCCCATAAGAACTTCATCAGTGACCCTGACGCTCCATTATATTACAAGGGTTGGTACCATTTGTTCTACCAATACAACCCA
Microexon-tag Amino Acid Seq WQQPAYHFRPHKNFISDPDAPLYYKGWYHLFYQYNP
Microexon-tag spanning region158768305-158775404
Microexon-tag prediction score0.9388
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG15588x
Reference Transcript ID OTG15588
Gene ID HannXRQ_Chr09g0262031
Gene Name INV1
Transcript ID OTG15588
Protein ID OTG15588
Gene ID HannXRQ_Chr09g0262031
Gene Name INV1
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.3e-89
Motif start 69
Motif end 388
Protein seq >OTG15588
MASSTTTTTFTSLSTHNHHERSPKSSICRNVLVRLFVSMLVLSSLVVITANSTSPTVEDSVKWQQPAYHFRPHKNFISDP
DAPLYYKGWYHLFYQYNPDSVVWGNVTWAHAVSKNLVNWFILPLAMVPDHWYDINGVMSGSATTLPDGRLMMFYSGMSTN
LSQLQCLAFPKNSSDPLLMDWVKYDGNPVLYPPPNIGPKDFRDPSNLWIGPDGKYRMIMGSKHNNTIGCVLVFHTTDFIH
YKLLDDVLHSVPGTGMWECADIYPVSTMNTNGLDMSVYGSGVKYVLKQSGDEDRHDWYAIGSYDAWNDKWYPDDPEKDLG
IGLRYDYGKFYASKTFYDPLKKRRVLWGYVGETDTVKEDRIKRWANILNFPRTVTLDIKTQTNLIQWPIEEVEMLRFKTH
FEFKDIELRPGSIVPLNIGSTTKLDISALFKLCQASLNATLESDVSYNCTTSEGSAGRGGLGPFGLLVLADASFTEQTPV
YFYIAKNTDGTSRTYFCADESRSSTLSDVGKRVYGSTVPVLHGEDYNMRLLVDHSIVESFAQGGRTVISSRVYPTKAIYK
AAKAFLFNNATDISVKASLKIWKMGRAKFKHFPL*
CDS seq >OTG15588
ATGGCTTCTTCCACCACCACCACCACATTCACCTCTCTCTCTACCCACAATCACCACGAACGCAGCCCGAAATCATCCAT
CTGCCGCAATGTCCTCGTTCGTCTCTTTGTCTCCATGCTAGTCCTTTCATCTTTGGTTGTTATAACAGCTAATTCAACAT
CTCCAACGGTTGAAGATAGCGTAAAATGGCAGCAGCCTGCTTATCATTTTCGGCCCCATAAGAACTTCATCAGTGACCCT
GACGCTCCATTATATTACAAGGGTTGGTACCATTTGTTCTACCAATACAACCCAGATTCAGTAGTTTGGGGCAACGTGAC
ATGGGCTCATGCAGTCTCCAAAAATCTAGTCAACTGGTTCATACTCCCTTTAGCCATGGTTCCTGATCACTGGTACGACA
TCAACGGCGTCATGAGTGGCTCCGCAACCACCCTCCCAGACGGTCGACTGATGATGTTCTATAGCGGCATGTCTACCAAC
TTGTCTCAACTCCAATGCCTTGCATTCCCCAAAAACTCTTCGGATCCCCTTCTTATGGATTGGGTCAAATACGACGGTAA
CCCTGTCCTCTACCCACCTCCCAACATCGGTCCCAAAGACTTTCGGGACCCCTCAAATCTCTGGATTGGACCCGATGGCA
AGTATCGAATGATCATGGGCTCAAAGCATAACAATACCATTGGTTGTGTTCTAGTTTTCCATACAACTGATTTTATTCAT
TATAAATTGTTGGATGACGTGCTCCACTCGGTTCCGGGTACGGGTATGTGGGAGTGTGCAGATATTTACCCGGTATCCAC
TATGAATACAAATGGGTTGGATATGTCGGTTTATGGGTCGGGTGTTAAGTATGTGTTGAAGCAAAGTGGGGATGAAGATA
GACATGATTGGTATGCAATTGGGTCGTATGATGCGTGGAATGATAAATGGTATCCGGATGATCCGGAAAAGGATTTGGGT
ATCGGGTTGAGATACGATTATGGAAAGTTTTATGCTTCGAAAACGTTTTACGACCCGTTAAAGAAAAGACGGGTTCTGTG
GGGGTATGTCGGAGAAACGGATACGGTTAAAGAGGATCGAATAAAGAGATGGGCCAATATTCTGAACTTTCCAAGAACGG
TGACATTGGACATAAAGACTCAAACAAATTTGATTCAATGGCCCATCGAAGAAGTAGAGATGTTGAGGTTTAAAACCCAT
TTCGAATTTAAAGACATTGAATTGAGGCCTGGATCAATTGTTCCGCTCAATATAGGCTCAACCACGAAGTTGGATATAAG
TGCCTTGTTCAAGTTATGTCAAGCATCACTAAATGCAACATTGGAATCGGATGTTTCGTACAACTGTACAACAAGTGAGG
GCTCAGCCGGGAGAGGCGGTTTGGGACCGTTTGGGCTCTTGGTTCTAGCCGATGCATCATTTACCGAACAAACACCAGTG
TATTTTTACATCGCAAAAAACACGGATGGCACCTCACGAACTTATTTTTGTGCTGATGAATCAAGATCATCAACACTTTC
AGATGTGGGCAAAAGGGTATATGGAAGTACTGTTCCTGTGCTTCATGGTGAAGACTACAACATGAGGTTATTGGTGGACC
ATTCAATCGTAGAAAGCTTTGCACAAGGAGGAAGAACGGTTATATCATCAAGAGTCTATCCTACAAAGGCAATCTATAAA
GCAGCCAAAGCGTTCTTGTTCAACAATGCCACTGATATCTCTGTCAAGGCATCTCTCAAGATCTGGAAGATGGGTCGAGC
AAAATTTAAACATTTCCCTCTCTAA
Microexon DNA seq ACCCTGACG
Microexon Amino Acid seq DPDA
Microexon-tag DNA Seq TGGCAGCAGCCTGCTTATCATTTTCGGCCCCATAAGAACTTCATCAGTGACCCTGACGCTCCATTATATTACAAGGGTTGGTACCATTTGTTCTACCAATACAACCCA
Microexon-tag Amino Acid seq WQQPAYHFRPHKNFISDPDAPLYYKGWYHLFYQYNP
Transcript ID OTG15588
Gene ID Ha.57026
Gene Name INV1
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.3e-89
Motif start 69
Motif end 388
Protein seq >OTG15588
MASSTTTTTFTSLSTHNHHERSPKSSICRNVLVRLFVSMLVLSSLVVITANSTSPTVEDSVKWQQPAYHFRPHKNFISDP
DAPLYYKGWYHLFYQYNPDSVVWGNVTWAHAVSKNLVNWFILPLAMVPDHWYDINGVMSGSATTLPDGRLMMFYSGMSTN
LSQLQCLAFPKNSSDPLLMDWVKYDGNPVLYPPPNIGPKDFRDPSNLWIGPDGKYRMIMGSKHNNTIGCVLVFHTTDFIH
YKLLDDVLHSVPGTGMWECADIYPVSTMNTNGLDMSVYGSGVKYVLKQSGDEDRHDWYAIGSYDAWNDKWYPDDPEKDLG
IGLRYDYGKFYASKTFYDPLKKRRVLWGYVGETDTVKEDRIKRWANILNFPRTVTLDIKTQTNLIQWPIEEVEMLRFKTH
FEFKDIELRPGSIVPLNIGSTTKLDISALFKLCQASLNATLESDVSYNCTTSEGSAGRGGLGPFGLLVLADASFTEQTPV
YFYIAKNTDGTSRTYFCADESRSSTLSDVGKRVYGSTVPVLHGEDYNMRLLVDHSIVESFAQGGRTVISSRVYPTKAIYK
AAKAFLFNNATDISVKASLKIWKMGRAKFKHFPL*
CDS seq >OTG15588
ATGGCTTCTTCCACCACCACCACCACATTCACCTCTCTCTCTACCCACAATCACCACGAACGCAGCCCGAAATCATCCAT
CTGCCGCAATGTCCTCGTTCGTCTCTTTGTCTCCATGCTAGTCCTTTCATCTTTGGTTGTTATAACAGCTAATTCAACAT
CTCCAACGGTTGAAGATAGCGTAAAATGGCAGCAGCCTGCTTATCATTTTCGGCCCCATAAGAACTTCATCAGTGACCCT
GACGCTCCATTATATTACAAGGGTTGGTACCATTTGTTCTACCAATACAACCCAGATTCAGTAGTTTGGGGCAACGTGAC
ATGGGCTCATGCAGTCTCCAAAAATCTAGTCAACTGGTTCATACTCCCTTTAGCCATGGTTCCTGATCACTGGTACGACA
TCAACGGCGTCATGAGTGGCTCCGCAACCACCCTCCCAGACGGTCGACTGATGATGTTCTATAGCGGCATGTCTACCAAC
TTGTCTCAACTCCAATGCCTTGCATTCCCCAAAAACTCTTCGGATCCCCTTCTTATGGATTGGGTCAAATACGACGGTAA
CCCTGTCCTCTACCCACCTCCCAACATCGGTCCCAAAGACTTTCGGGACCCCTCAAATCTCTGGATTGGACCCGATGGCA
AGTATCGAATGATCATGGGCTCAAAGCATAACAATACCATTGGTTGTGTTCTAGTTTTCCATACAACTGATTTTATTCAT
TATAAATTGTTGGATGACGTGCTCCACTCGGTTCCGGGTACGGGTATGTGGGAGTGTGCAGATATTTACCCGGTATCCAC
TATGAATACAAATGGGTTGGATATGTCGGTTTATGGGTCGGGTGTTAAGTATGTGTTGAAGCAAAGTGGGGATGAAGATA
GACATGATTGGTATGCAATTGGGTCGTATGATGCGTGGAATGATAAATGGTATCCGGATGATCCGGAAAAGGATTTGGGT
ATCGGGTTGAGATACGATTATGGAAAGTTTTATGCTTCGAAAACGTTTTACGACCCGTTAAAGAAAAGACGGGTTCTGTG
GGGGTATGTCGGAGAAACGGATACGGTTAAAGAGGATCGAATAAAGAGATGGGCCAATATTCTGAACTTTCCAAGAACGG
TGACATTGGACATAAAGACTCAAACAAATTTGATTCAATGGCCCATCGAAGAAGTAGAGATGTTGAGGTTTAAAACCCAT
TTCGAATTTAAAGACATTGAATTGAGGCCTGGATCAATTGTTCCGCTCAATATAGGCTCAACCACGAAGTTGGATATAAG
TGCCTTGTTCAAGTTATGTCAAGCATCACTAAATGCAACATTGGAATCGGATGTTTCGTACAACTGTACAACAAGTGAGG
GCTCAGCCGGGAGAGGCGGTTTGGGACCGTTTGGGCTCTTGGTTCTAGCCGATGCATCATTTACCGAACAAACACCAGTG
TATTTTTACATCGCAAAAAACACGGATGGCACCTCACGAACTTATTTTTGTGCTGATGAATCAAGATCATCAACACTTTC
AGATGTGGGCAAAAGGGTATATGGAAGTACTGTTCCTGTGCTTCATGGTGAAGACTACAACATGAGGTTATTGGTGGACC
ATTCAATCGTAGAAAGCTTTGCACAAGGAGGAAGAACGGTTATATCATCAAGAGTCTATCCTACAAAGGCAATCTATAAA
GCAGCCAAAGCGTTCTTGTTCAACAATGCCACTGATATCTCTGTCAAGGCATCTCTCAAGATCTGGAAGATGGGTCGAGC
AAAATTTAAACATTTCCCTCTCTAA