Microexon ID Ha_16:3215381-3215389:-
Species Helianthus annuus
Coordinates 16:3215381..3215389
Microexon Cluster ID MEP22
Size 9
Phase 1
Pfam Domain Motif Glyco_hydro_32N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (degenerate, IUPAC ambiguity codes) YSKYAMMGRACYGSYTWYCAYTTYCARCCYSMCAARAAYTGGATGAAYGATCCYAAYGGTCCAATGTWYTACAAGGGATKGTACCAYYTSTTCTAYCARTACAAYCCV
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATCCAAACG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq GTTTACAGAACTGGTTATCACTTTCAACCCAAGAAAAATTGGATGAACGATCCAAACGCACCAATGTACTACAAAGGATTTTACCATTTGTTTTATCAATACAATCCA
Microexon-tag Amino Acid Seq VYRTGYHFQPKKNWMNDPNAPMYYKGFYHLFYQYNP
Microexon-tag spanning region 3215252-3221500
Microexon-tag prediction score 0.953
Overlapped with the annotated transcript (%) 100
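The fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database itself: it assumes the microexon-tag starts on a codon boundary, that the structure field (49,9,50) lists exon sizes in 5' to 3' order, that the location field (2) is 1-based, and that the standard genetic code applies. The helper names (translate, exon_slice, overlapping_residues) are hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Values copied from the record above.
TAG = "GTTTACAGAACTGGTTATCACTTTCAACCCAAGAAAAATTGGATGAACGATCCAAACGCACCAATGTACTACAAAGGATTTTACCATTTGTTTTATCAATACAATCCA"
SIZES = (49, 9, 50)        # flanking exon, microexon, flanking exon
MICROEXON_POSITION = 2     # assumed 1-based index into SIZES


def translate(dna):
    """Translate in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def exon_slice(sizes, position):
    """Return the 0-based half-open (start, end) of an exon inside the tag."""
    start = sum(sizes[:position - 1])
    return start, start + sizes[position - 1]


def overlapping_residues(tag, sizes, position):
    """Amino acids whose codons overlap the exon by at least one base."""
    start, end = exon_slice(sizes, position)
    return translate(tag)[start // 3:(end - 1) // 3 + 1]


start, end = exon_slice(SIZES, MICROEXON_POSITION)
print(TAG[start:end])                                       # ATCCAAACG
print(translate(TAG))                                       # VYRTGYHFQPKKNWMNDPNAPMYYKGFYHLFYQYNP
print(overlapping_residues(TAG, SIZES, MICROEXON_POSITION)) # DPNA
print(start % 3)                                            # 1
```

Under these assumptions the 9-nt microexon touches four codons, which is consistent with the four-residue Microexon Amino Acid seq, and the offset of 49 bases leaves one base of a split codon before the microexon, which agrees with the listed Phase of 1 if phase is counted that way.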
New Transcript ID OTF90346x
Reference Transcript ID OTF90346
Gene ID HannXRQ_Chr16g0498631
Gene Name NA
Transcript ID OTF90346
Protein ID OTF90346
Gene ID HannXRQ_Chr16g0498631
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.5e-103
Motif start 57
Motif end 376
Protein seq >OTF90346
MMRVFNRCVWSLALLICLLSMLSNFHGIGVMASHNVHLRYQTNNVDKVKQVYRTGYHFQPKKNWMNDPNAPMYYKGFYHL
FYQYNPKGAVWGNIVWGHSVSKDMINWIPLEPAIVPSKPFDKYGCWSGSATILPGDKPVIIYTGIINEKPEPGHQAQSYA
IPANYSDPYLRKWIKPDNNPIIKPIHENVSSFRDPTTAWFNNGHWKLIVGSKNNRRGITYLYRSRDFIKWTKAKHPFHTK
KDVGMWECPDFYPISSQGTNGLDTSTLGDNVKYVFKASLDITRFDYYTIGSYNLVKDKYVPDSTSVDGWAGLRYDYGNFY
ASKTFFDPIKKRRILWGWANESSTRSEDIAKGWAGIQLIPRKVWLDPSGHKLLQWPINELERLRDQKIELSNVKINKGDT
VEVKGINVVQADVDVTFTFSSLDKAELYDKKWENFPPEDLAKSICGIKGATVQGGLGPFGLLTLASSKLEEYTPVFFRIF
KTTGKKHKVLFCSDATPSSINKNEYKPSFGGFVDVDLTTKKLSLRSLIDHSVVESFAEGGKTVISSRVYPTLAINENAHL
HLFNNGSENVMVEKLNAWSMKTAHIN*
CDS seq >OTF90346
ATGATGAGGGTTTTCAATAGGTGTGTATGGAGTTTAGCATTGTTAATATGTCTTTTGTCGATGCTGAGCAACTTCCATGG
AATAGGAGTTATGGCGTCCCATAACGTGCATCTTAGGTATCAAACGAATAACGTCGACAAGGTTAAGCAAGTTTACAGAA
CTGGTTATCACTTTCAACCCAAGAAAAATTGGATGAACGATCCAAACGCACCAATGTACTACAAAGGATTTTACCATTTG
TTTTATCAATACAATCCAAAGGGTGCAGTATGGGGCAATATTGTGTGGGGGCATTCGGTGTCGAAAGACATGATAAACTG
GATCCCCTTGGAGCCAGCAATTGTACCATCGAAACCATTCGATAAATATGGATGTTGGTCTGGATCCGCTACAATCCTAC
CAGGTGATAAACCTGTGATCATATACACAGGAATAATTAACGAAAAACCAGAACCTGGCCACCAAGCCCAAAGCTATGCT
ATACCTGCCAACTATTCTGATCCTTACCTTCGAAAATGGATCAAACCTGACAACAACCCGATCATTAAGCCAATACACGA
GAATGTATCATCTTTTCGTGATCCTACAACAGCTTGGTTCAATAATGGTCATTGGAAACTTATTGTGGGTAGTAAGAATA
ACCGTAGAGGCATTACCTATTTGTACCGAAGCCGAGATTTCATTAAGTGGACCAAGGCAAAACATCCATTTCACACGAAA
AAAGATGTTGGTATGTGGGAATGCCCGGACTTTTACCCAATATCATCTCAAGGAACAAATGGGTTAGACACTTCAACATT
AGGGGATAATGTTAAATATGTTTTCAAGGCTAGCCTAGATATAACTAGGTTTGATTATTACACGATTGGAAGTTACAACC
TCGTTAAGGACAAATATGTCCCGGATAGCACGTCAGTTGATGGGTGGGCAGGGTTAAGATATGATTATGGGAACTTTTAT
GCCTCTAAGACATTCTTTGATCCTATCAAGAAACGAAGGATTTTGTGGGGTTGGGCTAACGAGTCAAGCACTAGAAGTGA
AGACATTGCTAAAGGATGGGCCGGAATTCAATTGATTCCACGTAAGGTGTGGTTGGATCCTAGCGGACACAAATTGCTAC
AATGGCCGATTAACGAATTGGAAAGATTAAGGGATCAAAAGATAGAACTAAGCAATGTGAAAATCAACAAAGGTGATACC
GTAGAAGTAAAGGGAATTAATGTAGTACAGGCAGATGTCGATGTAACTTTTACGTTTTCAAGTTTGGATAAAGCTGAGCT
GTACGACAAGAAATGGGAAAATTTTCCTCCTGAAGATCTCGCTAAAAGTATTTGCGGAATCAAGGGTGCAACCGTACAAG
GCGGGCTAGGGCCATTTGGGCTTTTGACACTTGCTTCAAGCAAGCTTGAAGAGTATACACCGGTTTTCTTTAGGATTTTC
AAGACAACCGGCAAAAAACATAAAGTTCTCTTTTGTTCTGATGCTACCCCTTCATCAATAAACAAGAATGAATACAAGCC
ATCATTTGGAGGCTTTGTGGATGTCGACCTAACGACGAAGAAGCTTTCTCTTAGGAGCTTGATTGACCACTCAGTCGTTG
AAAGTTTTGCCGAGGGAGGAAAAACAGTTATCTCATCTAGAGTGTACCCGACATTAGCAATTAATGAGAATGCACATTTG
CATCTATTTAACAATGGTTCTGAGAATGTAATGGTTGAAAAGCTGAACGCGTGGTCAATGAAGACGGCTCACATCAACTA
G
Microexon DNA seq ATCCAAACG
Microexon Amino Acid seq DPNA
Microexon-tag DNA Seq GTTTACAGAACTGGTTATCACTTTCAACCCAAGAAAAATTGGATGAACGATCCAAACGCACCAATGTACTACAAAGGATTTTACCATTTGTTTTATCAATACAATCCA
Microexon-tag Amino Acid seq VYRTGYHFQPKKNWMNDPNAPMYYKGFYHLFYQYNP
Transcript ID OTF90346
Gene ID Ha.25638
Gene Name NA
Pfam domain motif Glyco_hydro_32N
Motif E-value 2.5e-103
Motif start 57
Motif end 376
Protein seq >OTF90346
MMRVFNRCVWSLALLICLLSMLSNFHGIGVMASHNVHLRYQTNNVDKVKQVYRTGYHFQPKKNWMNDPNAPMYYKGFYHL
FYQYNPKGAVWGNIVWGHSVSKDMINWIPLEPAIVPSKPFDKYGCWSGSATILPGDKPVIIYTGIINEKPEPGHQAQSYA
IPANYSDPYLRKWIKPDNNPIIKPIHENVSSFRDPTTAWFNNGHWKLIVGSKNNRRGITYLYRSRDFIKWTKAKHPFHTK
KDVGMWECPDFYPISSQGTNGLDTSTLGDNVKYVFKASLDITRFDYYTIGSYNLVKDKYVPDSTSVDGWAGLRYDYGNFY
ASKTFFDPIKKRRILWGWANESSTRSEDIAKGWAGIQLIPRKVWLDPSGHKLLQWPINELERLRDQKIELSNVKINKGDT
VEVKGINVVQADVDVTFTFSSLDKAELYDKKWENFPPEDLAKSICGIKGATVQGGLGPFGLLTLASSKLEEYTPVFFRIF
KTTGKKHKVLFCSDATPSSINKNEYKPSFGGFVDVDLTTKKLSLRSLIDHSVVESFAEGGKTVISSRVYPTLAINENAHL
HLFNNGSENVMVEKLNAWSMKTAHIN*
CDS seq >OTF90346
ATGATGAGGGTTTTCAATAGGTGTGTATGGAGTTTAGCATTGTTAATATGTCTTTTGTCGATGCTGAGCAACTTCCATGG
AATAGGAGTTATGGCGTCCCATAACGTGCATCTTAGGTATCAAACGAATAACGTCGACAAGGTTAAGCAAGTTTACAGAA
CTGGTTATCACTTTCAACCCAAGAAAAATTGGATGAACGATCCAAACGCACCAATGTACTACAAAGGATTTTACCATTTG
TTTTATCAATACAATCCAAAGGGTGCAGTATGGGGCAATATTGTGTGGGGGCATTCGGTGTCGAAAGACATGATAAACTG
GATCCCCTTGGAGCCAGCAATTGTACCATCGAAACCATTCGATAAATATGGATGTTGGTCTGGATCCGCTACAATCCTAC
CAGGTGATAAACCTGTGATCATATACACAGGAATAATTAACGAAAAACCAGAACCTGGCCACCAAGCCCAAAGCTATGCT
ATACCTGCCAACTATTCTGATCCTTACCTTCGAAAATGGATCAAACCTGACAACAACCCGATCATTAAGCCAATACACGA
GAATGTATCATCTTTTCGTGATCCTACAACAGCTTGGTTCAATAATGGTCATTGGAAACTTATTGTGGGTAGTAAGAATA
ACCGTAGAGGCATTACCTATTTGTACCGAAGCCGAGATTTCATTAAGTGGACCAAGGCAAAACATCCATTTCACACGAAA
AAAGATGTTGGTATGTGGGAATGCCCGGACTTTTACCCAATATCATCTCAAGGAACAAATGGGTTAGACACTTCAACATT
AGGGGATAATGTTAAATATGTTTTCAAGGCTAGCCTAGATATAACTAGGTTTGATTATTACACGATTGGAAGTTACAACC
TCGTTAAGGACAAATATGTCCCGGATAGCACGTCAGTTGATGGGTGGGCAGGGTTAAGATATGATTATGGGAACTTTTAT
GCCTCTAAGACATTCTTTGATCCTATCAAGAAACGAAGGATTTTGTGGGGTTGGGCTAACGAGTCAAGCACTAGAAGTGA
AGACATTGCTAAAGGATGGGCCGGAATTCAATTGATTCCACGTAAGGTGTGGTTGGATCCTAGCGGACACAAATTGCTAC
AATGGCCGATTAACGAATTGGAAAGATTAAGGGATCAAAAGATAGAACTAAGCAATGTGAAAATCAACAAAGGTGATACC
GTAGAAGTAAAGGGAATTAATGTAGTACAGGCAGATGTCGATGTAACTTTTACGTTTTCAAGTTTGGATAAAGCTGAGCT
GTACGACAAGAAATGGGAAAATTTTCCTCCTGAAGATCTCGCTAAAAGTATTTGCGGAATCAAGGGTGCAACCGTACAAG
GCGGGCTAGGGCCATTTGGGCTTTTGACACTTGCTTCAAGCAAGCTTGAAGAGTATACACCGGTTTTCTTTAGGATTTTC
AAGACAACCGGCAAAAAACATAAAGTTCTCTTTTGTTCTGATGCTACCCCTTCATCAATAAACAAGAATGAATACAAGCC
ATCATTTGGAGGCTTTGTGGATGTCGACCTAACGACGAAGAAGCTTTCTCTTAGGAGCTTGATTGACCACTCAGTCGTTG
AAAGTTTTGCCGAGGGAGGAAAAACAGTTATCTCATCTAGAGTGTACCCGACATTAGCAATTAATGAGAATGCACATTTG
CATCTATTTAACAATGGTTCTGAGAATGTAATGGTTGAAAAGCTGAACGCGTGGTCAATGAAGACGGCTCACATCAACTA
G