Microexon ID Ha_7:99553858-99553866:-
Species Helianthus annuus
Coordinates 7:99553858..99553866
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAAGAACAGGATTTCATTTTCAACCACCGAAAAATTGGATGAATGATCCAAACGGTCCATTATATCATATGGGGTGGTACCATTTGTTCTATCAATATAACCGA
Microexon-tag Amino Acid Seq WQRTGFHFQPPKNWMNDPNGPLYHMGWYHLFYQYNR
Microexon-tag spanning region99549903-99554384
Microexon-tag prediction score0.9437
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG21599x
Reference Transcript ID OTG21599
Gene ID HannXRQ_Chr07g0205891
Gene Name INVB
Transcript ID OTG21599
Protein ID OTG21599
Gene ID HannXRQ_Chr07g0205891
Gene Name INVB
Pfam domain motif Glyco_hydro_32N
Motif E-value 9.4e-106
Motif start 140
Motif end 458
Protein seq >OTG21599
MSICCYKYSKTHPPHYQHSSNILSATKMATDLEHATSTIYSPLPDSDNGYHPPSSERRRPIALLSGIFLSMLLFSSLIAL
ILNQHNQHDIDHVKSPLVKPVSRGVSHGVSEKSNSQLLSSDVEVYPWTNAMLSWQRTGFHFQPPKNWMNDPNGPLYHMGW
YHLFYQYNRDAAVWGNITWGHAISTDLINWYHLPFAMVPDQWYDINGVWTGSATILPDGRIVMLYTGDTNEEVQVQNLAY
PANLSDPLLLDWIKYSNNPVMVPPPGIGTKDFRDPTTAWLGPNGKWRVAIGSKVNKTGITLVYETTDFTSYALLDEVMHA
VPGTGMWECVDFYPVSTTESNGLDTSVNGPSVKHVLKSSLDDDKNDYYALGTYDSISNKWTPDDPNLDVGIGLRVDYGKY
YASKTFYDQNKQRRLLWGWIGETDSEAADILKGWASVQSIPREVVFDRKTGTNILQWPIKEVEKLRSKSWVYQKVLLEPG
CLVSLDVGLATQLDIIATFDIETQTIEEADAGYNCTTSGGSFSRGAFGPFGLVVLADETRTEQTPVYFYITKGSDGVART
HFCADQTKSSTASDVTKLVYGSNVHVLEGETLSMRLLVDHSIVESFAQGGRTVITSRVYPTKAIYTSAKVFLFNNATGVS
VTANVNVWNMDSARIDHFPLGQH*
CDS seq >OTG21599
ATGTCCATTTGTTGCTATAAATACTCCAAAACTCACCCCCCTCATTACCAACATTCCTCCAATATTCTCTCAGCCACCAA
AATGGCTACCGACCTTGAACATGCCACTTCCACCATATATTCCCCTTTGCCGGACTCCGACAACGGCTACCATCCACCGT
CGTCGGAACGAAGGCGACCCATCGCCCTCTTGTCCGGAATCTTTCTCTCCATGTTGCTTTTTTCATCATTGATAGCTCTC
ATCCTCAACCAACATAACCAACATGATATTGACCATGTAAAGTCACCGCTCGTGAAGCCCGTTTCACGAGGGGTGTCACA
CGGGGTGTCGGAGAAGAGTAATTCCCAGCTCTTGTCCTCCGATGTGGAGGTATACCCTTGGACTAATGCTATGCTTTCTT
GGCAAAGAACAGGATTTCATTTTCAACCACCGAAAAATTGGATGAATGATCCAAACGGTCCATTATATCATATGGGGTGG
TACCATTTGTTCTATCAATATAACCGAGATGCAGCCGTTTGGGGTAATATTACATGGGGACATGCAATTTCAACGGATCT
GATCAATTGGTATCATCTTCCTTTTGCTATGGTCCCGGATCAATGGTATGATATCAACGGTGTTTGGACTGGATCCGCTA
CGATCCTTCCTGACGGTAGGATCGTCATGCTTTATACCGGAGACACTAATGAGGAAGTGCAGGTGCAAAACTTAGCGTAC
CCCGCCAACCTATCTGATCCTCTCCTCCTAGATTGGATAAAGTATTCAAACAATCCGGTTATGGTCCCTCCACCGGGTAT
TGGTACTAAGGATTTTAGGGACCCGACAACCGCTTGGCTAGGGCCAAATGGAAAATGGCGGGTCGCGATAGGGTCAAAGG
TTAATAAAACGGGTATTACACTTGTTTACGAAACGACAGATTTCACGAGCTATGCGTTATTGGATGAGGTGATGCATGCT
GTTCCAGGTACGGGTATGTGGGAATGTGTCGACTTTTATCCGGTATCAACAACTGAGTCAAACGGGTTGGACACGTCGGT
TAATGGGCCTAGTGTTAAACATGTGTTGAAATCGAGTTTGGATGATGATAAAAATGACTATTATGCACTAGGGACGTATG
ACTCTATTAGTAACAAGTGGACACCCGATGATCCGAATTTGGATGTGGGTATCGGGTTACGAGTTGATTATGGAAAGTAC
TACGCATCTAAGACGTTTTACGACCAAAACAAGCAAAGACGACTTCTTTGGGGTTGGATCGGAGAAACCGATAGTGAAGC
TGCTGATATTTTGAAGGGATGGGCCTCCGTTCAGAGCATTCCAAGAGAAGTGGTCTTTGACAGAAAGACTGGAACAAATA
TTCTTCAATGGCCAATCAAAGAGGTAGAGAAGTTAAGATCCAAAAGTTGGGTATATCAGAAAGTGCTTCTTGAACCAGGA
TGTCTTGTTTCACTTGATGTAGGCTTAGCTACACAGTTAGACATAATAGCAACATTTGACATCGAAACGCAAACCATAGA
AGAGGCGGATGCGGGTTACAATTGCACCACAAGTGGCGGGTCTTTTTCACGAGGAGCTTTCGGACCATTTGGATTGGTGG
TTCTTGCGGACGAAACGCGTACTGAACAAACTCCTGTGTATTTCTACATTACCAAAGGATCTGATGGGGTTGCTCGTACA
CACTTTTGTGCCGATCAAACCAAATCATCAACAGCTTCAGATGTAACCAAATTAGTTTATGGAAGTAATGTTCATGTGCT
AGAGGGAGAAACTTTAAGCATGCGATTATTGGTTGATCATTCAATAGTCGAAAGCTTTGCTCAAGGTGGAAGGACGGTTA
TAACCTCAAGGGTATATCCGACAAAAGCAATTTACACGTCAGCAAAGGTTTTCCTCTTCAACAATGCAACAGGAGTAAGC
GTTACCGCAAATGTAAATGTGTGGAACATGGATTCAGCACGTATTGACCATTTTCCCCTTGGTCAACATTGA
Microexon DNA seq ATCCAAACG
Microexon Amino Acid seq DPNG
Microexon-tag DNA Seq TGGCAAAGAACAGGATTTCATTTTCAACCACCGAAAAATTGGATGAATGATCCAAACGGTCCATTATATCATATGGGGTGGTACCATTTGTTCTATCAATATAACCGA
Microexon-tag Amino Acid seq WQRTGFHFQPPKNWMNDPNGPLYHMGWYHLFYQYNR
Transcript ID OTG21599
Gene ID Ha.51225
Gene Name INVB
Pfam domain motif Glyco_hydro_32N
Motif E-value 9.4e-106
Motif start 140
Motif end 458
Protein seq >OTG21599
MSICCYKYSKTHPPHYQHSSNILSATKMATDLEHATSTIYSPLPDSDNGYHPPSSERRRPIALLSGIFLSMLLFSSLIAL
ILNQHNQHDIDHVKSPLVKPVSRGVSHGVSEKSNSQLLSSDVEVYPWTNAMLSWQRTGFHFQPPKNWMNDPNGPLYHMGW
YHLFYQYNRDAAVWGNITWGHAISTDLINWYHLPFAMVPDQWYDINGVWTGSATILPDGRIVMLYTGDTNEEVQVQNLAY
PANLSDPLLLDWIKYSNNPVMVPPPGIGTKDFRDPTTAWLGPNGKWRVAIGSKVNKTGITLVYETTDFTSYALLDEVMHA
VPGTGMWECVDFYPVSTTESNGLDTSVNGPSVKHVLKSSLDDDKNDYYALGTYDSISNKWTPDDPNLDVGIGLRVDYGKY
YASKTFYDQNKQRRLLWGWIGETDSEAADILKGWASVQSIPREVVFDRKTGTNILQWPIKEVEKLRSKSWVYQKVLLEPG
CLVSLDVGLATQLDIIATFDIETQTIEEADAGYNCTTSGGSFSRGAFGPFGLVVLADETRTEQTPVYFYITKGSDGVART
HFCADQTKSSTASDVTKLVYGSNVHVLEGETLSMRLLVDHSIVESFAQGGRTVITSRVYPTKAIYTSAKVFLFNNATGVS
VTANVNVWNMDSARIDHFPLGQH*
CDS seq >OTG21599
ATGTCCATTTGTTGCTATAAATACTCCAAAACTCACCCCCCTCATTACCAACATTCCTCCAATATTCTCTCAGCCACCAA
AATGGCTACCGACCTTGAACATGCCACTTCCACCATATATTCCCCTTTGCCGGACTCCGACAACGGCTACCATCCACCGT
CGTCGGAACGAAGGCGACCCATCGCCCTCTTGTCCGGAATCTTTCTCTCCATGTTGCTTTTTTCATCATTGATAGCTCTC
ATCCTCAACCAACATAACCAACATGATATTGACCATGTAAAGTCACCGCTCGTGAAGCCCGTTTCACGAGGGGTGTCACA
CGGGGTGTCGGAGAAGAGTAATTCCCAGCTCTTGTCCTCCGATGTGGAGGTATACCCTTGGACTAATGCTATGCTTTCTT
GGCAAAGAACAGGATTTCATTTTCAACCACCGAAAAATTGGATGAATGATCCAAACGGTCCATTATATCATATGGGGTGG
TACCATTTGTTCTATCAATATAACCGAGATGCAGCCGTTTGGGGTAATATTACATGGGGACATGCAATTTCAACGGATCT
GATCAATTGGTATCATCTTCCTTTTGCTATGGTCCCGGATCAATGGTATGATATCAACGGTGTTTGGACTGGATCCGCTA
CGATCCTTCCTGACGGTAGGATCGTCATGCTTTATACCGGAGACACTAATGAGGAAGTGCAGGTGCAAAACTTAGCGTAC
CCCGCCAACCTATCTGATCCTCTCCTCCTAGATTGGATAAAGTATTCAAACAATCCGGTTATGGTCCCTCCACCGGGTAT
TGGTACTAAGGATTTTAGGGACCCGACAACCGCTTGGCTAGGGCCAAATGGAAAATGGCGGGTCGCGATAGGGTCAAAGG
TTAATAAAACGGGTATTACACTTGTTTACGAAACGACAGATTTCACGAGCTATGCGTTATTGGATGAGGTGATGCATGCT
GTTCCAGGTACGGGTATGTGGGAATGTGTCGACTTTTATCCGGTATCAACAACTGAGTCAAACGGGTTGGACACGTCGGT
TAATGGGCCTAGTGTTAAACATGTGTTGAAATCGAGTTTGGATGATGATAAAAATGACTATTATGCACTAGGGACGTATG
ACTCTATTAGTAACAAGTGGACACCCGATGATCCGAATTTGGATGTGGGTATCGGGTTACGAGTTGATTATGGAAAGTAC
TACGCATCTAAGACGTTTTACGACCAAAACAAGCAAAGACGACTTCTTTGGGGTTGGATCGGAGAAACCGATAGTGAAGC
TGCTGATATTTTGAAGGGATGGGCCTCCGTTCAGAGCATTCCAAGAGAAGTGGTCTTTGACAGAAAGACTGGAACAAATA
TTCTTCAATGGCCAATCAAAGAGGTAGAGAAGTTAAGATCCAAAAGTTGGGTATATCAGAAAGTGCTTCTTGAACCAGGA
TGTCTTGTTTCACTTGATGTAGGCTTAGCTACACAGTTAGACATAATAGCAACATTTGACATCGAAACGCAAACCATAGA
AGAGGCGGATGCGGGTTACAATTGCACCACAAGTGGCGGGTCTTTTTCACGAGGAGCTTTCGGACCATTTGGATTGGTGG
TTCTTGCGGACGAAACGCGTACTGAACAAACTCCTGTGTATTTCTACATTACCAAAGGATCTGATGGGGTTGCTCGTACA
CACTTTTGTGCCGATCAAACCAAATCATCAACAGCTTCAGATGTAACCAAATTAGTTTATGGAAGTAATGTTCATGTGCT
AGAGGGAGAAACTTTAAGCATGCGATTATTGGTTGATCATTCAATAGTCGAAAGCTTTGCTCAAGGTGGAAGGACGGTTA
TAACCTCAAGGGTATATCCGACAAAAGCAATTTACACGTCAGCAAAGGTTTTCCTCTTCAACAATGCAACAGGAGTAAGC
GTTACCGCAAATGTAAATGTGTGGAACATGGATTCAGCACGTATTGACCATTTTCCCCTTGGTCAACATTGA