Microexon ID Ha_6:4559285-4559299:+
Species Helianthus annuus
Coordinates 6:4559285..4559299
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AGTGCATCAATTCAG
Microexon Amino Acid seq SASIQ
Microexon-tag DNA Seq GCTCACCTTGCCGATAGCAAAGTCTTCAGCCGCTCTCTTCTAGCAAAAAGTGCATCAATTCAGACCGTAGTATGCTTTCCATATTTAGAAGGTCTTGTTGAGTTCGGC
Microexon-tag Amino Acid Seq AHLADSKVFSRSLLAKSASIQTVVCFPYLEGLVEFG
Microexon-tag spanning region4559125-4559440
Microexon-tag prediction score0.9456
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG21925x
Reference Transcript ID OTG21925
Gene ID HannXRQ_Chr06g0165851
Gene Name GL3
Transcript ID OTG21925
Protein ID OTG21925
Gene ID HannXRQ_Chr06g0165851
Gene Name GL3
Pfam domain motif bHLH-MYC_N
Motif E-value 1.5e-45
Motif start 7
Motif end 202
Protein seq >OTG21925
MAAKEHLRRKLAMAVKSINWSYAIFWSISSTEPGVLTWCDGYYNGGIKTRKTIQADEMEEEEEEDGGDDDEAGLRRTEQL
RQLYESLSIASESHGYDPQPRRPPAALSPEDLTDAEWYFLVCMTYEFTNGQGLPGRTLAKNTVSWLSDAHLADSKVFSRS
LLAKSASIQTVVCFPYLEGLVEFGVAEKVLEEQNIVKQIKDLLFDVPSQKVREIPLESCSNMLDHHDIVINNLDNMLEYD
QNQEQSWRFVDDDDDDEGEISFHQNNSMGSSDCISQNLASGHGDVWSDVDDRYHCVLLRIFKNTPRLVLGPRFRNCDSKE
SAFVSWKNCDGMESNGSCSQVLLKSVLYKVPKMHENRLVWSRCEDGNVDRMMKLEDDEVKDIDHRFSVLSALVPSRGKVD
KVSLLDDTIDYLKTLERKVESLQSDRKSHDVRERTCDNYCNKRKASCDPTDLQEERFSDCITVSAIEKDVTIEIRCRWRD
NMIVQVFDAMSSLNLESHSVHSYTVDGILTLTIESKLKSCTGSTAKMIRQALQRVIGSKFGVD*
CDS seq >OTG21925
ATGGCTGCAAAGGAACACTTAAGAAGAAAACTGGCTATGGCTGTCAAAAGCATTAACTGGAGCTATGCAATCTTCTGGTC
CATTTCTTCAACAGAACCAGGGGTATTGACATGGTGTGATGGATACTACAATGGAGGCATCAAAACAAGGAAAACAATAC
AAGCAGATGAAATGGAAGAAGAAGAAGAAGAAGATGGCGGTGATGATGATGAAGCGGGATTGCGAAGGACCGAGCAATTA
AGACAACTTTATGAATCACTTTCCATAGCTTCCGAATCACACGGTTATGATCCACAACCGAGAAGGCCCCCAGCTGCATT
ATCCCCTGAAGATCTCACGGACGCCGAGTGGTATTTCTTAGTTTGCATGACATATGAGTTCACTAATGGTCAAGGATTGC
CAGGAAGAACATTGGCAAAGAACACAGTTAGTTGGCTGTCTGATGCTCACCTTGCCGATAGCAAAGTCTTCAGCCGCTCT
CTTCTAGCAAAAAGTGCATCAATTCAGACCGTAGTATGCTTTCCATATTTAGAAGGTCTTGTTGAGTTCGGCGTAGCAGA
GAAGGTTTTAGAAGAACAAAACATCGTTAAACAGATTAAAGATTTACTCTTTGATGTTCCATCGCAAAAAGTTCGTGAAA
TACCCTTAGAAAGTTGTTCCAACATGCTTGATCATCATGATATAGTCATCAACAATCTCGACAACATGCTTGAATACGAT
CAAAATCAAGAACAAAGTTGGCGGTTTGTGGATGATGATGATGACGACGAAGGAGAGATTAGTTTCCACCAAAATAATTC
CATGGGTTCGAGTGATTGTATATCTCAAAACTTAGCAAGTGGTCATGGCGATGTTTGGAGCGACGTCGATGATCGCTATC
ATTGTGTTCTTTTGAGGATATTCAAGAACACCCCGAGATTGGTTTTGGGACCTCGTTTTAGAAACTGCGATTCGAAAGAG
TCTGCTTTTGTTAGCTGGAAAAATTGTGATGGAATGGAATCGAACGGAAGTTGTTCACAAGTGTTATTAAAAAGTGTGCT
TTATAAAGTCCCGAAAATGCACGAAAACCGCTTAGTTTGGTCTCGATGTGAAGATGGAAATGTGGATCGAATGATGAAAC
TCGAAGATGACGAAGTGAAGGATATTGATCATAGGTTTTCGGTGCTAAGCGCATTGGTCCCTTCTAGAGGAAAGGTTGAC
AAAGTGTCTCTACTCGATGACACTATCGACTACTTAAAAACTCTGGAGAGAAAGGTGGAATCGTTACAATCCGACAGAAA
ATCTCATGATGTGCGAGAAAGAACGTGTGATAACTATTGCAACAAACGAAAAGCATCTTGCGATCCTACAGATTTACAAG
AAGAACGTTTTTCGGATTGTATCACAGTCAGCGCGATAGAAAAAGATGTGACGATTGAGATACGATGTAGGTGGCGAGAT
AACATGATAGTTCAAGTGTTTGATGCAATGAGCAGCCTAAACTTGGAATCTCATTCTGTTCATTCATATACTGTTGATGG
AATTCTCACATTAACCATTGAATCAAAGTTAAAGAGTTGTACAGGATCAACAGCAAAGATGATCAGGCAGGCACTTCAAA
GAGTAATTGGTAGTAAATTTGGTGTAGATTAG
Microexon DNA seq AGTGCATCAATTCAG
Microexon Amino Acid seq SASIQ
Microexon-tag DNA Seq GCTCACCTTGCCGATAGCAAAGTCTTCAGCCGCTCTCTTCTAGCAAAAAGTGCATCAATTCAGACCGTAGTATGCTTTCCATATTTAGAAGGTCTTGTTGAGTTCGGC
Microexon-tag Amino Acid seq AHLADSKVFSRSLLAKSASIQTVVCFPYLEGLVEFG
Transcript ID OTG21925
Gene ID Ha.47104
Gene Name GL3
Pfam domain motif bHLH-MYC_N
Motif E-value 1.5e-45
Motif start 7
Motif end 202
Protein seq >OTG21925
MAAKEHLRRKLAMAVKSINWSYAIFWSISSTEPGVLTWCDGYYNGGIKTRKTIQADEMEEEEEEDGGDDDEAGLRRTEQL
RQLYESLSIASESHGYDPQPRRPPAALSPEDLTDAEWYFLVCMTYEFTNGQGLPGRTLAKNTVSWLSDAHLADSKVFSRS
LLAKSASIQTVVCFPYLEGLVEFGVAEKVLEEQNIVKQIKDLLFDVPSQKVREIPLESCSNMLDHHDIVINNLDNMLEYD
QNQEQSWRFVDDDDDDEGEISFHQNNSMGSSDCISQNLASGHGDVWSDVDDRYHCVLLRIFKNTPRLVLGPRFRNCDSKE
SAFVSWKNCDGMESNGSCSQVLLKSVLYKVPKMHENRLVWSRCEDGNVDRMMKLEDDEVKDIDHRFSVLSALVPSRGKVD
KVSLLDDTIDYLKTLERKVESLQSDRKSHDVRERTCDNYCNKRKASCDPTDLQEERFSDCITVSAIEKDVTIEIRCRWRD
NMIVQVFDAMSSLNLESHSVHSYTVDGILTLTIESKLKSCTGSTAKMIRQALQRVIGSKFGVD*
CDS seq >OTG21925
ATGGCTGCAAAGGAACACTTAAGAAGAAAACTGGCTATGGCTGTCAAAAGCATTAACTGGAGCTATGCAATCTTCTGGTC
CATTTCTTCAACAGAACCAGGGGTATTGACATGGTGTGATGGATACTACAATGGAGGCATCAAAACAAGGAAAACAATAC
AAGCAGATGAAATGGAAGAAGAAGAAGAAGAAGATGGCGGTGATGATGATGAAGCGGGATTGCGAAGGACCGAGCAATTA
AGACAACTTTATGAATCACTTTCCATAGCTTCCGAATCACACGGTTATGATCCACAACCGAGAAGGCCCCCAGCTGCATT
ATCCCCTGAAGATCTCACGGACGCCGAGTGGTATTTCTTAGTTTGCATGACATATGAGTTCACTAATGGTCAAGGATTGC
CAGGAAGAACATTGGCAAAGAACACAGTTAGTTGGCTGTCTGATGCTCACCTTGCCGATAGCAAAGTCTTCAGCCGCTCT
CTTCTAGCAAAAAGTGCATCAATTCAGACCGTAGTATGCTTTCCATATTTAGAAGGTCTTGTTGAGTTCGGCGTAGCAGA
GAAGGTTTTAGAAGAACAAAACATCGTTAAACAGATTAAAGATTTACTCTTTGATGTTCCATCGCAAAAAGTTCGTGAAA
TACCCTTAGAAAGTTGTTCCAACATGCTTGATCATCATGATATAGTCATCAACAATCTCGACAACATGCTTGAATACGAT
CAAAATCAAGAACAAAGTTGGCGGTTTGTGGATGATGATGATGACGACGAAGGAGAGATTAGTTTCCACCAAAATAATTC
CATGGGTTCGAGTGATTGTATATCTCAAAACTTAGCAAGTGGTCATGGCGATGTTTGGAGCGACGTCGATGATCGCTATC
ATTGTGTTCTTTTGAGGATATTCAAGAACACCCCGAGATTGGTTTTGGGACCTCGTTTTAGAAACTGCGATTCGAAAGAG
TCTGCTTTTGTTAGCTGGAAAAATTGTGATGGAATGGAATCGAACGGAAGTTGTTCACAAGTGTTATTAAAAAGTGTGCT
TTATAAAGTCCCGAAAATGCACGAAAACCGCTTAGTTTGGTCTCGATGTGAAGATGGAAATGTGGATCGAATGATGAAAC
TCGAAGATGACGAAGTGAAGGATATTGATCATAGGTTTTCGGTGCTAAGCGCATTGGTCCCTTCTAGAGGAAAGGTTGAC
AAAGTGTCTCTACTCGATGACACTATCGACTACTTAAAAACTCTGGAGAGAAAGGTGGAATCGTTACAATCCGACAGAAA
ATCTCATGATGTGCGAGAAAGAACGTGTGATAACTATTGCAACAAACGAAAAGCATCTTGCGATCCTACAGATTTACAAG
AAGAACGTTTTTCGGATTGTATCACAGTCAGCGCGATAGAAAAAGATGTGACGATTGAGATACGATGTAGGTGGCGAGAT
AACATGATAGTTCAAGTGTTTGATGCAATGAGCAGCCTAAACTTGGAATCTCATTCTGTTCATTCATATACTGTTGATGG
AATTCTCACATTAACCATTGAATCAAAGTTAAAGAGTTGTACAGGATCAACAGCAAAGATGATCAGGCAGGCACTTCAAA
GAGTAATTGGTAGTAAATTTGGTGTAGATTAG