Microexon ID Ha_6:45078673-45078679:+
Species Helianthus annuus
Coordinates 6:45078673..45078679
Microexon Cluster ID MEP15
Size 7
Phase 2
Pfam Domain Motif GBP
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 50,7,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq ATATCTMGRYTSTCATTTGCTGTTGARMTTGCTGAAGARTTYTATGGAAGAGTGAAGGGRCAAGATGTTGCWTTTGARCCWGCWAARCTYYTGTGGCTTATCCARMGT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AGTAAAG
Microexon Amino Acid seq RVK
Microexon-tag DNA Seq ATATCCCGATTGTCATTTGCTGTTGAGCTTGCTGAAGAGTTCTATGGAAGAGTAAAGGGGCAAGATGTGGCGTTTGAACCAGCAAAACTCTTGTGGCTTATACAACGT
Microexon-tag Amino Acid Seq ISRLSFAVELAEEFYGRVKGQDVAFEPAKLLWLIQR
Microexon-tag spanning region45078506-45078812
Microexon-tag prediction score0.974
Overlapped with the annotated transcript (%) 100
New Transcript ID OTG23032x
Reference Transcript ID OTG23032
Gene ID HannXRQ_Chr06g0177911
Gene Name NA
Transcript ID OTG23032
Protein ID OTG23032
Gene ID HannXRQ_Chr06g0177911
Gene Name NA
Pfam domain motif GBP
Motif E-value 1.9e-38
Motif start 43
Motif end 282
Protein seq >OTG23032
MSKSLWWIGCFFIILHLSSFWVSGIDDFNQAFPIIEPDPGHTKLRLSSEGLEAIRRITNPIAAVAVIGPYRSGKSFLLNQ
LLSLSCYEGFGVGHMRDTKTKGIWVWGTPVEMDIDGVKTSVFYLDTEGFESIGKSNVYDDRIFALATVLSSVLIYNLPET
IREADISRLSFAVELAEEFYGRVKGQDVAFEPAKLLWLIQRDFLQGKSVKEMVDEALRHVPNHDGDRNIDQVNQIRDSLA
IMGDNSTAFSLPQPHLQRTKLCDLKDGELDQSYVQKREQLKEVVKSVIRPKIVQGKSLNGNEFVSFLEKILEALNKGEIP
STGSLVEVFNKKILEGCLKLYEDSMTKVILPIHEESLQSIHEASKAESLKSFDEQHFGRHHAKKSVETLDEDIEKMFKNF
VMANQYQSSKLCEALYTKCEDQMDQLQVLRLPSMAKFNAGFLHCNQSFGKECVGPSKTIYEQRMVKMMGKSKSSFIKEYN
HRLFNWLVAFSLVMVIVGRFIIKFILIEIGAWILFIFLETYTRMFWSPESLYYNPVWHGIVATWETLVYSPVLDLDRWAI
QIGFVSVVLLVYWRCCARRKHKPKHGPHSVLPLYNNASRGRDRSE*
CDS seq >OTG23032
ATGTCGAAGAGTTTGTGGTGGATCGGTTGCTTTTTTATTATTCTACACTTATCCTCCTTTTGGGTCTCTGGGATTGATGA
TTTCAATCAAGCGTTTCCTATCATTGAACCGGATCCTGGTCATACAAAACTCCGTCTTTCTAGTGAAGGACTGGAGGCTA
TTCGTAGAATAACTAATCCTATTGCAGCCGTGGCCGTAATTGGACCATATCGATCTGGAAAGTCTTTTTTGCTTAACCAG
CTTCTCTCACTTTCTTGTTATGAAGGTTTTGGTGTTGGCCACATGCGCGATACTAAAACTAAAGGAATATGGGTATGGGG
AACTCCGGTTGAGATGGATATTGATGGAGTGAAAACATCTGTCTTTTATCTTGACACAGAGGGGTTCGAAAGTATAGGGA
AATCAAACGTCTATGATGACAGAATATTTGCTCTCGCAACGGTTTTGAGTTCTGTTCTTATATACAATTTGCCTGAGACG
ATACGTGAAGCTGATATATCCCGATTGTCATTTGCTGTTGAGCTTGCTGAAGAGTTCTATGGAAGAGTAAAGGGGCAAGA
TGTGGCGTTTGAACCAGCAAAACTCTTGTGGCTTATACAACGTGACTTCTTACAGGGGAAGTCTGTAAAAGAAATGGTAG
ATGAAGCTCTACGACATGTTCCTAACCATGATGGTGACAGAAATATTGACCAGGTCAACCAAATTCGGGACTCTCTAGCT
ATCATGGGCGACAATAGCACAGCTTTCAGCTTACCACAACCGCATCTCCAAAGGACAAAGCTGTGTGACTTGAAAGACGG
GGAGCTTGACCAGAGTTATGTCCAAAAGAGAGAGCAGTTGAAAGAAGTTGTTAAATCTGTTATTCGTCCGAAGATTGTAC
AGGGAAAATCACTTAATGGAAACGAGTTTGTATCCTTTCTTGAAAAGATTTTAGAAGCATTAAACAAGGGGGAGATTCCC
TCAACAGGGTCTCTTGTGGAGGTATTCAACAAGAAAATATTGGAGGGATGTCTGAAGCTATACGAGGATAGCATGACGAA
AGTGATTTTACCGATTCACGAGGAATCTTTACAATCCATCCATGAAGCTTCAAAAGCCGAATCTCTGAAATCTTTTGACG
AACAACATTTTGGCCGTCACCATGCCAAGAAATCCGTTGAAACACTCGATGAAGATATCGAAAAGATGTTTAAGAATTTT
GTGATGGCAAACCAATATCAGTCGTCAAAACTGTGTGAGGCACTCTATACAAAGTGCGAGGACCAAATGGACCAACTTCA
AGTACTCAGACTACCGTCGATGGCCAAGTTTAACGCAGGCTTCCTTCACTGCAACCAAAGTTTTGGAAAAGAGTGCGTCG
GGCCATCTAAAACCATATATGAGCAACGAATGGTGAAGATGATGGGAAAGTCGAAGTCTTCGTTTATAAAGGAATACAAT
CACAGACTATTTAATTGGTTGGTGGCCTTTTCACTTGTAATGGTGATCGTGGGTCGTTTTATCATAAAGTTCATTTTGAT
TGAGATCGGGGCGTGGATACTCTTCATCTTTTTGGAGACGTACACACGGATGTTCTGGTCACCCGAGTCTCTTTATTACA
ATCCCGTCTGGCATGGTATAGTAGCCACTTGGGAAACACTTGTTTACAGCCCTGTTCTCGATTTGGACAGATGGGCGATA
CAGATTGGATTCGTCTCAGTAGTATTGTTGGTATACTGGCGGTGCTGTGCAAGGAGGAAACACAAGCCAAAACACGGGCC
CCACTCAGTGTTACCATTGTATAACAACGCTTCAAGAGGTCGGGATAGATCAGAATAA
Microexon DNA seq AGTAAAG
Microexon Amino Acid seq RVK
Microexon-tag DNA Seq ATATCCCGATTGTCATTTGCTGTTGAGCTTGCTGAAGAGTTCTATGGAAGAGTAAAGGGGCAAGATGTGGCGTTTGAACCAGCAAAACTCTTGTGGCTTATACAACGT
Microexon-tag Amino Acid seq ISRLSFAVELAEEFYGRVKGQDVAFEPAKLLWLIQR
Transcript ID OTG23032
Gene ID Ha.48355
Gene Name NA
Pfam domain motif GBP
Motif E-value 1.9e-38
Motif start 43
Motif end 282
Protein seq >OTG23032
MSKSLWWIGCFFIILHLSSFWVSGIDDFNQAFPIIEPDPGHTKLRLSSEGLEAIRRITNPIAAVAVIGPYRSGKSFLLNQ
LLSLSCYEGFGVGHMRDTKTKGIWVWGTPVEMDIDGVKTSVFYLDTEGFESIGKSNVYDDRIFALATVLSSVLIYNLPET
IREADISRLSFAVELAEEFYGRVKGQDVAFEPAKLLWLIQRDFLQGKSVKEMVDEALRHVPNHDGDRNIDQVNQIRDSLA
IMGDNSTAFSLPQPHLQRTKLCDLKDGELDQSYVQKREQLKEVVKSVIRPKIVQGKSLNGNEFVSFLEKILEALNKGEIP
STGSLVEVFNKKILEGCLKLYEDSMTKVILPIHEESLQSIHEASKAESLKSFDEQHFGRHHAKKSVETLDEDIEKMFKNF
VMANQYQSSKLCEALYTKCEDQMDQLQVLRLPSMAKFNAGFLHCNQSFGKECVGPSKTIYEQRMVKMMGKSKSSFIKEYN
HRLFNWLVAFSLVMVIVGRFIIKFILIEIGAWILFIFLETYTRMFWSPESLYYNPVWHGIVATWETLVYSPVLDLDRWAI
QIGFVSVVLLVYWRCCARRKHKPKHGPHSVLPLYNNASRGRDRSE*
CDS seq >OTG23032
ATGTCGAAGAGTTTGTGGTGGATCGGTTGCTTTTTTATTATTCTACACTTATCCTCCTTTTGGGTCTCTGGGATTGATGA
TTTCAATCAAGCGTTTCCTATCATTGAACCGGATCCTGGTCATACAAAACTCCGTCTTTCTAGTGAAGGACTGGAGGCTA
TTCGTAGAATAACTAATCCTATTGCAGCCGTGGCCGTAATTGGACCATATCGATCTGGAAAGTCTTTTTTGCTTAACCAG
CTTCTCTCACTTTCTTGTTATGAAGGTTTTGGTGTTGGCCACATGCGCGATACTAAAACTAAAGGAATATGGGTATGGGG
AACTCCGGTTGAGATGGATATTGATGGAGTGAAAACATCTGTCTTTTATCTTGACACAGAGGGGTTCGAAAGTATAGGGA
AATCAAACGTCTATGATGACAGAATATTTGCTCTCGCAACGGTTTTGAGTTCTGTTCTTATATACAATTTGCCTGAGACG
ATACGTGAAGCTGATATATCCCGATTGTCATTTGCTGTTGAGCTTGCTGAAGAGTTCTATGGAAGAGTAAAGGGGCAAGA
TGTGGCGTTTGAACCAGCAAAACTCTTGTGGCTTATACAACGTGACTTCTTACAGGGGAAGTCTGTAAAAGAAATGGTAG
ATGAAGCTCTACGACATGTTCCTAACCATGATGGTGACAGAAATATTGACCAGGTCAACCAAATTCGGGACTCTCTAGCT
ATCATGGGCGACAATAGCACAGCTTTCAGCTTACCACAACCGCATCTCCAAAGGACAAAGCTGTGTGACTTGAAAGACGG
GGAGCTTGACCAGAGTTATGTCCAAAAGAGAGAGCAGTTGAAAGAAGTTGTTAAATCTGTTATTCGTCCGAAGATTGTAC
AGGGAAAATCACTTAATGGAAACGAGTTTGTATCCTTTCTTGAAAAGATTTTAGAAGCATTAAACAAGGGGGAGATTCCC
TCAACAGGGTCTCTTGTGGAGGTATTCAACAAGAAAATATTGGAGGGATGTCTGAAGCTATACGAGGATAGCATGACGAA
AGTGATTTTACCGATTCACGAGGAATCTTTACAATCCATCCATGAAGCTTCAAAAGCCGAATCTCTGAAATCTTTTGACG
AACAACATTTTGGCCGTCACCATGCCAAGAAATCCGTTGAAACACTCGATGAAGATATCGAAAAGATGTTTAAGAATTTT
GTGATGGCAAACCAATATCAGTCGTCAAAACTGTGTGAGGCACTCTATACAAAGTGCGAGGACCAAATGGACCAACTTCA
AGTACTCAGACTACCGTCGATGGCCAAGTTTAACGCAGGCTTCCTTCACTGCAACCAAAGTTTTGGAAAAGAGTGCGTCG
GGCCATCTAAAACCATATATGAGCAACGAATGGTGAAGATGATGGGAAAGTCGAAGTCTTCGTTTATAAAGGAATACAAT
CACAGACTATTTAATTGGTTGGTGGCCTTTTCACTTGTAATGGTGATCGTGGGTCGTTTTATCATAAAGTTCATTTTGAT
TGAGATCGGGGCGTGGATACTCTTCATCTTTTTGGAGACGTACACACGGATGTTCTGGTCACCCGAGTCTCTTTATTACA
ATCCCGTCTGGCATGGTATAGTAGCCACTTGGGAAACACTTGTTTACAGCCCTGTTCTCGATTTGGACAGATGGGCGATA
CAGATTGGATTCGTCTCAGTAGTATTGTTGGTATACTGGCGGTGCTGTGCAAGGAGGAAACACAAGCCAAAACACGGGCC
CCACTCAGTGTTACCATTGTATAACAACGCTTCAAGAGGTCGGGATAGATCAGAATAA