Microexon ID Ha_14:171011641-171011649:+
Species Helianthus annuus
Coordinates 14:171011641..171011649
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCCAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAAGGACAGGCTTTCATTTTCAACCTCCAAAAAACTGGATGAATGATCCCAATGGTCCATTATATCACATGGGGTGGTACCATTTCTTCTATCAATATAATCCG
Microexon-tag Amino Acid Seq WQRTGFHFQPPKNWMNDPNGPLYHMGWYHFFYQYNP
Microexon-tag spanning region171011430-171014735
Microexon-tag prediction score0.9628
Overlapped with the annotated transcript (%) 100
New Transcript ID OTF99883x
Reference Transcript ID OTF99883
Gene ID HannXRQ_Chr14g0461441
Gene Name INV1
Transcript ID OTF99883
Protein ID OTF99883
Gene ID HannXRQ_Chr14g0461441
Gene Name INV1
Pfam domain motif Glyco_hydro_32N
Motif E-value 3.2e-105
Motif start 117
Motif end 435
Protein seq >OTF99883
MNADLEHTTSTSYSPLPDADNHRHPSSSTRQRPITLLSGIFLSMLFFSSLIALILNQHNQPHLDDSRNKESSRAIPVSRG
PPQGVSEKSNIELASSDVEDIYPWTNAMLSWQRTGFHFQPPKNWMNDPNGPLYHMGWYHFFYQYNPDAAVWGNISWGHAI
SEDLINWFHLPFAMVPDQWYDINGVWTGSATILPDGRIIMLYTGDTNEEVQVQNLAYPANLSDPLLLDWIKYPGNPVLVP
PPGIGTKDFRDPTTAWLGPNGKWRVALGSKVNKTGITLVYETTDFTSYELLENLMHAVPGTGMWECVDFYPVSTTESNGL
DTSFNGPGIKHVLKSSLDDDKHDYYALGTYDPISSKWTPDDPELDVGIGLRLDYGKYYASKTFYDQNTQRRLLWGWIGET
DSEAADILKGWSSVQTIPREVVFDKKTGTNVFQWPIKEVEKLRSKVTVYEKVLLEPGSLVPLDVGLATQLDIIATFEMDK
EAVKTSVEADVGYNCTTSGGASSRGAFGPFGLVVLADEKLTEQTPIYFYIAKGSDGVAQTHFCADQSKSSRAPDVTKLIY
GSSVPVLHEENFSMRLLIDHSIVESFGQGGRTVITSRVYPTKAIYSKAKVFLFNNATGITVTANVNVWNMDSAHIDYFPF
LNLER*
CDS seq >OTF99883
ATGAATGCTGATCTTGAACATACAACTTCTACCTCATATTCTCCGTTGCCGGATGCCGACAACCACCGCCATCCATCGTC
ATCTACACGTCAACGACCCATCACCCTCTTGTCCGGAATCTTTCTCTCCATGCTATTTTTTTCATCTTTAATAGCACTCA
TCCTCAACCAACATAACCAACCACATCTAGATGACTCTCGCAACAAAGAGTCATCGCGTGCAATACCCGTATCAAGAGGA
CCGCCACAAGGGGTGTCGGAGAAGAGTAATATCGAGCTAGCATCCTCCGACGTGGAGGACATATACCCTTGGACCAATGC
TATGCTTTCATGGCAAAGGACAGGCTTTCATTTTCAACCTCCAAAAAACTGGATGAATGATCCCAATGGTCCATTATATC
ACATGGGGTGGTACCATTTCTTCTATCAATATAATCCGGATGCGGCTGTTTGGGGGAACATATCGTGGGGACATGCTATC
TCGGAGGACTTGATTAACTGGTTTCATCTACCGTTTGCTATGGTGCCAGATCAATGGTATGATATCAACGGCGTTTGGAC
AGGATCGGCTACAATCCTCCCTGATGGTAGGATCATCATGTTATACACTGGGGACACTAATGAGGAAGTGCAGGTCCAAA
ACTTAGCATACCCTGCCAACCTATCTGATCCCCTCCTCCTGGATTGGATCAAGTATCCTGGTAATCCAGTTTTGGTCCCT
CCACCCGGTATTGGCACTAAGGACTTTAGGGATCCCACAACAGCATGGCTTGGGCCAAATGGAAAATGGCGGGTCGCATT
AGGGTCAAAGGTTAATAAAACGGGCATAACACTAGTTTACGAAACAACAGATTTTACAAGCTACGAGTTATTAGAGAATT
TGATGCATGCTGTTCCAGGTACGGGTATGTGGGAGTGTGTTGACTTTTATCCCGTGTCAACAACTGAGTCAAACGGGTTG
GACACGTCTTTCAATGGACCTGGTATTAAGCATGTGTTGAAATCAAGCTTGGATGATGATAAACATGATTATTATGCATT
AGGGACGTATGACCCGATTAGTAGCAAATGGACGCCTGATGATCCGGAGTTAGATGTGGGTATCGGGTTACGACTGGATT
ATGGAAAGTACTATGCATCGAAAACATTTTATGACCAAAATACGCAAAGGCGACTTCTTTGGGGTTGGATTGGAGAAACA
GATAGTGAAGCTGCTGACATCTTGAAGGGATGGTCCTCTGTTCAGACCATTCCAAGAGAAGTGGTGTTTGACAAAAAGAC
GGGGACAAATGTTTTTCAATGGCCGATTAAAGAGGTAGAGAAGTTGAGGTCCAAAGTTACGGTATACGAGAAAGTGTTGC
TGGAACCAGGATCTCTCGTGCCTCTTGACGTAGGCTTGGCTACACAGTTAGATATTATAGCGACATTTGAAATGGATAAG
GAAGCAGTGAAGACATCAGTTGAGGCAGATGTGGGTTACAATTGCACCACAAGTGGCGGTGCCTCTTCAAGAGGTGCCTT
TGGACCATTTGGGTTAGTAGTGCTTGCAGATGAAAAACTTACCGAGCAAACTCCTATATATTTCTATATTGCTAAAGGTA
GTGATGGGGTTGCTCAAACTCATTTTTGTGCCGACCAATCCAAATCATCAAGAGCTCCAGATGTCACTAAATTAATCTAC
GGAAGCAGTGTTCCTGTGCTACATGAAGAAAACTTTAGCATGCGATTATTGATTGATCATTCAATAGTGGAAAGCTTTGG
GCAAGGTGGAAGAACGGTTATAACATCAAGGGTGTATCCTACAAAAGCAATTTACTCGAAAGCAAAGGTGTTTCTGTTTA
ACAATGCAACAGGGATAACCGTAACAGCAAATGTAAATGTTTGGAATATGGATTCTGCACATATCGACTATTTTCCATTC
CTCAATTTGGAACGTTGA
Microexon DNA seq ATCCCAATG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAAGGACAGGCTTTCATTTTCAACCTCCAAAAAACTGGATGAATGATCCCAATGGTCCATTATATCACATGGGGTGGTACCATTTCTTCTATCAATATAATCCG
Microexon-tag Amino Acid seq WQRTGFHFQPPKNWMNDPNGPLYHMGWYHFFYQYNP
Transcript ID OTF99883
Gene ID Ha.21778
Gene Name INV1
Pfam domain motif Glyco_hydro_32N
Motif E-value 3.2e-105
Motif start 117
Motif end 435
Protein seq >OTF99883
MNADLEHTTSTSYSPLPDADNHRHPSSSTRQRPITLLSGIFLSMLFFSSLIALILNQHNQPHLDDSRNKESSRAIPVSRG
PPQGVSEKSNIELASSDVEDIYPWTNAMLSWQRTGFHFQPPKNWMNDPNGPLYHMGWYHFFYQYNPDAAVWGNISWGHAI
SEDLINWFHLPFAMVPDQWYDINGVWTGSATILPDGRIIMLYTGDTNEEVQVQNLAYPANLSDPLLLDWIKYPGNPVLVP
PPGIGTKDFRDPTTAWLGPNGKWRVALGSKVNKTGITLVYETTDFTSYELLENLMHAVPGTGMWECVDFYPVSTTESNGL
DTSFNGPGIKHVLKSSLDDDKHDYYALGTYDPISSKWTPDDPELDVGIGLRLDYGKYYASKTFYDQNTQRRLLWGWIGET
DSEAADILKGWSSVQTIPREVVFDKKTGTNVFQWPIKEVEKLRSKVTVYEKVLLEPGSLVPLDVGLATQLDIIATFEMDK
EAVKTSVEADVGYNCTTSGGASSRGAFGPFGLVVLADEKLTEQTPIYFYIAKGSDGVAQTHFCADQSKSSRAPDVTKLIY
GSSVPVLHEENFSMRLLIDHSIVESFGQGGRTVITSRVYPTKAIYSKAKVFLFNNATGITVTANVNVWNMDSAHIDYFPF
LNLER*
CDS seq >OTF99883
ATGAATGCTGATCTTGAACATACAACTTCTACCTCATATTCTCCGTTGCCGGATGCCGACAACCACCGCCATCCATCGTC
ATCTACACGTCAACGACCCATCACCCTCTTGTCCGGAATCTTTCTCTCCATGCTATTTTTTTCATCTTTAATAGCACTCA
TCCTCAACCAACATAACCAACCACATCTAGATGACTCTCGCAACAAAGAGTCATCGCGTGCAATACCCGTATCAAGAGGA
CCGCCACAAGGGGTGTCGGAGAAGAGTAATATCGAGCTAGCATCCTCCGACGTGGAGGACATATACCCTTGGACCAATGC
TATGCTTTCATGGCAAAGGACAGGCTTTCATTTTCAACCTCCAAAAAACTGGATGAATGATCCCAATGGTCCATTATATC
ACATGGGGTGGTACCATTTCTTCTATCAATATAATCCGGATGCGGCTGTTTGGGGGAACATATCGTGGGGACATGCTATC
TCGGAGGACTTGATTAACTGGTTTCATCTACCGTTTGCTATGGTGCCAGATCAATGGTATGATATCAACGGCGTTTGGAC
AGGATCGGCTACAATCCTCCCTGATGGTAGGATCATCATGTTATACACTGGGGACACTAATGAGGAAGTGCAGGTCCAAA
ACTTAGCATACCCTGCCAACCTATCTGATCCCCTCCTCCTGGATTGGATCAAGTATCCTGGTAATCCAGTTTTGGTCCCT
CCACCCGGTATTGGCACTAAGGACTTTAGGGATCCCACAACAGCATGGCTTGGGCCAAATGGAAAATGGCGGGTCGCATT
AGGGTCAAAGGTTAATAAAACGGGCATAACACTAGTTTACGAAACAACAGATTTTACAAGCTACGAGTTATTAGAGAATT
TGATGCATGCTGTTCCAGGTACGGGTATGTGGGAGTGTGTTGACTTTTATCCCGTGTCAACAACTGAGTCAAACGGGTTG
GACACGTCTTTCAATGGACCTGGTATTAAGCATGTGTTGAAATCAAGCTTGGATGATGATAAACATGATTATTATGCATT
AGGGACGTATGACCCGATTAGTAGCAAATGGACGCCTGATGATCCGGAGTTAGATGTGGGTATCGGGTTACGACTGGATT
ATGGAAAGTACTATGCATCGAAAACATTTTATGACCAAAATACGCAAAGGCGACTTCTTTGGGGTTGGATTGGAGAAACA
GATAGTGAAGCTGCTGACATCTTGAAGGGATGGTCCTCTGTTCAGACCATTCCAAGAGAAGTGGTGTTTGACAAAAAGAC
GGGGACAAATGTTTTTCAATGGCCGATTAAAGAGGTAGAGAAGTTGAGGTCCAAAGTTACGGTATACGAGAAAGTGTTGC
TGGAACCAGGATCTCTCGTGCCTCTTGACGTAGGCTTGGCTACACAGTTAGATATTATAGCGACATTTGAAATGGATAAG
GAAGCAGTGAAGACATCAGTTGAGGCAGATGTGGGTTACAATTGCACCACAAGTGGCGGTGCCTCTTCAAGAGGTGCCTT
TGGACCATTTGGGTTAGTAGTGCTTGCAGATGAAAAACTTACCGAGCAAACTCCTATATATTTCTATATTGCTAAAGGTA
GTGATGGGGTTGCTCAAACTCATTTTTGTGCCGACCAATCCAAATCATCAAGAGCTCCAGATGTCACTAAATTAATCTAC
GGAAGCAGTGTTCCTGTGCTACATGAAGAAAACTTTAGCATGCGATTATTGATTGATCATTCAATAGTGGAAAGCTTTGG
GCAAGGTGGAAGAACGGTTATAACATCAAGGGTGTATCCTACAAAAGCAATTTACTCGAAAGCAAAGGTGTTTCTGTTTA
ACAATGCAACAGGGATAACCGTAACAGCAAATGTAAATGTTTGGAATATGGATTCTGCACATATCGACTATTTTCCATTC
CTCAATTTGGAACGTTGA