Microexon ID Pp_14:4075744-4075750:+
Species Physcomitrium patens
Coordinates 14:4075744..4075750
Microexon Cluster ID MEP13
Size 7
Phase 0
Pfam Domain Motif DNA_ligase_A_M
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 13,38,7,50
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq (degenerate consensus, IUPAC codes) AAYCAGGAAATWGCMAAGGCWGCAARRGAKGGRYTKGMSASTGATMGACAGTTRTGYTATGTTGCWTTTGAYRTTCTKTATGYTGGAGAYACYAGYGTYATYCAYCAR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Pp_14:4075744-4075750:+ does not have available information here.
Transcript ID Pp3c14_5920V3.1
Protein ID Pp3c14_5920V3.1
Gene ID Pp3c14_5920
Gene Name NA
Pfam domain motif DNA_ligase_A_M
Motif E-value 7.3e-42
Motif start 229
Motif end 445
Protein seq >Pp3c14_5920V3.1
MKRMEETVEFGVLCSMFEAIVRCKKGALKRKHVRTFLEHVYNGQEHFSAMRLILPDLDKERANYGLREAVLAKFLADALG
LSKESEDAKKLINWRKGGQRAGSNAGNFSMVASEVLCRRQKTAPGGLMIKEVNDLLDRLAAAQDKEEKTAVLAELINKTN
AQEMRWIIMIILKDLKLGISEKTVFSEFHPDAEDYFNVTCDLKLVCEKLRDRSIRYKRQDIEVGKAVRPQLAARAANVED
AWKKMRGKDVVVECKFDGDRIQVHKNGNNLNFWSRTFNDHPEFKEAIGDVLCQRIIPEKCILDGEMLVWDRVSQRFAEFG
SNRGVAKAAKDGLDSDLQICYVAFDILYDGDSSVIHRPLRERQRLLQNAIRPLSGRLEVLIPAAGGLNAKHAVGDPRWSI
LATSPEEVEQFFQETIDNREEGVVLKDLDSKWEPSDRSGKWLKLKPDYVHSESDLDALIIGGYFGTGRRGGEVAQFLLGL
AEPSEVGGYPTKFRSFCRVGTGLTDDEAEQLVHKLKPHFRRNHKSTKPPSCYVLTNSSKERPDVWIERPEKSVILQITSD
IRTIRSEVFATPYSLRFPRIQRIRYDKPWYDCLDVQTLVDTVHAKSQNDGQAEGGNQFKYRARRAKQEKPERASLVPSHM
LVTDVSHVKQATRIFKGLVFYVANTSSEYPVEKIHKLVVENGGTFSMNLSSTVTHAVAFEKKGLKYQASFMNGDVIHLSW
LLDCMAQKVLLVVSPKYYLNMSDATKERLKGEVDEFGDFYCNEVDETDLTQIFENIDVNKQYHDIDRVNYYIKKYSPSPT
WCLFSGCCVFFLRPLHSANENTLQVANLTLRRLALEVEMHEGTVSDKLTRSITHVVMYVPTESPVSFQIILRSASSEERR
LLLSKRVKVVSHRWIEDSVNRSIYRHPADEEYDLRTGEVLLSDEEEEELVSKTSVPIKADSMVPITPDKRVVPRSLRKPA
TRRRALNLGRPSRRSKGTEEELPRVEAQKSGKEEHVEDDRAPTEITNKGTRNDPKAVEPFQLDDSASRTPEKRVLRVTRS
RKKDKPSSGNERLPLKRRLIHDEGEASLTQPSDKALTGAEPFIPRMSTRLAEKRKAFESKREADTYKLECTAADLREEDQ
EEPVEASEFEGLQNTSSLKEPTGRDDSRLHFNDFDHQSPAIDDMINTLLPNYASSQQANLSQQTNRESALPSRSNSFFIK
PTLAPQPTQGVLGSYSRPYLPSHCASEALQPVPDAPDNSDLFSSVHRLASSQDHLTGTVTVESSSQGLATYLPPTNNPEK
KKRSFKDLVNQMLNGN*
CDS seq >Pp3c14_5920V3.1
ATGAAGAGGATGGAGGAAACTGTGGAGTTCGGAGTATTGTGTAGCATGTTTGAGGCCATCGTTAGGTGTAAGAAGGGAGC
TCTGAAGCGGAAACATGTGCGGACGTTTTTGGAGCATGTTTACAATGGCCAAGAGCATTTTAGCGCTATGCGTTTGATTC
TTCCCGACTTGGATAAAGAGCGTGCCAACTATGGACTCCGCGAGGCTGTGTTGGCCAAATTCTTAGCCGATGCTCTGGGG
CTGTCTAAAGAATCTGAGGACGCTAAGAAACTTATCAATTGGCGTAAAGGAGGCCAAAGAGCCGGCAGCAATGCCGGCAA
TTTCTCTATGGTTGCGTCTGAGGTGCTCTGCAGGAGGCAGAAAACAGCACCTGGTGGTCTAATGATTAAAGAGGTCAATG
ATCTTCTCGATCGCTTAGCGGCAGCACAAGACAAGGAAGAGAAAACAGCTGTGCTAGCTGAGCTCATCAACAAGACAAAT
GCTCAGGAAATGAGGTGGATTATTATGATTATTCTGAAAGATTTGAAACTCGGGATTAGTGAGAAGACAGTTTTCAGCGA
GTTTCATCCCGACGCGGAAGATTATTTCAATGTCACATGCGATCTGAAGTTGGTTTGTGAAAAGCTTCGTGATCGCAGCA
TACGTTACAAGCGCCAGGATATCGAAGTGGGGAAGGCAGTGCGACCACAATTAGCCGCAAGGGCTGCCAATGTCGAAGAT
GCCTGGAAGAAGATGCGAGGCAAAGACGTTGTGGTAGAATGCAAGTTTGATGGAGACCGTATTCAAGTGCACAAAAATGG
AAATAATCTGAACTTCTGGTCAAGGACGTTTAATGACCATCCAGAGTTCAAGGAGGCTATTGGTGACGTATTGTGTCAAC
GTATCATTCCCGAGAAGTGTATACTTGACGGAGAGATGTTGGTTTGGGATCGAGTGTCGCAAAGATTTGCTGAGTTTGGC
TCTAATCGTGGAGTTGCAAAGGCTGCTAAGGATGGCTTGGATTCTGATCTTCAGATTTGTTATGTTGCATTCGATATTCT
GTATGATGGTGACAGCAGTGTTATCCATCGTCCGTTGCGTGAACGGCAACGGCTCCTTCAGAACGCTATTCGTCCTCTCA
GTGGCCGTCTGGAGGTTCTTATACCTGCAGCCGGAGGGCTTAATGCAAAGCATGCTGTTGGAGATCCGAGGTGGTCAATC
TTGGCCACAAGTCCCGAGGAGGTGGAACAGTTTTTTCAAGAGACGATCGACAATAGGGAGGAAGGGGTTGTGTTAAAAGA
CCTAGATTCCAAATGGGAGCCCAGTGATCGTAGTGGAAAATGGCTTAAACTCAAGCCTGATTACGTTCACTCTGAATCTG
ACCTTGATGCGCTGATAATAGGTGGTTATTTTGGAACTGGACGGCGTGGTGGTGAGGTTGCTCAATTTTTATTGGGATTG
GCTGAGCCGTCAGAAGTGGGTGGCTACCCTACAAAATTTCGTTCCTTCTGCAGAGTGGGTACAGGCCTCACTGATGATGA
GGCTGAGCAGTTAGTGCACAAGCTCAAACCCCACTTCAGGAGGAATCATAAAAGCACCAAACCACCGAGCTGTTATGTGC
TGACTAACTCATCCAAAGAGCGGCCTGATGTCTGGATTGAACGACCTGAAAAGTCCGTAATACTTCAAATCACAAGTGAT
ATCCGAACCATACGCTCTGAGGTATTTGCAACGCCGTATAGTCTACGGTTTCCTCGAATCCAAAGAATTCGATATGATAA
ACCTTGGTATGACTGCCTCGATGTTCAGACATTAGTAGATACGGTACATGCAAAGAGCCAGAACGATGGTCAAGCGGAAG
GTGGGAATCAATTCAAATATCGAGCAAGACGGGCCAAACAGGAAAAGCCTGAACGAGCATCATTGGTCCCCTCCCACATG
TTGGTAACAGACGTATCGCATGTAAAGCAAGCCACTCGCATCTTCAAGGGCCTTGTCTTCTACGTTGCCAACACTTCCAG
TGAGTACCCAGTTGAGAAAATTCACAAATTGGTTGTAGAGAATGGAGGCACCTTCTCTATGAACTTGAGCAGTACTGTTA
CTCACGCGGTCGCATTCGAAAAGAAAGGACTCAAGTATCAGGCATCTTTTATGAATGGGGATGTCATTCATCTATCATGG
CTTTTAGACTGCATGGCTCAGAAAGTACTCCTTGTAGTAAGCCCCAAGTATTACTTGAATATGTCAGATGCCACAAAGGA
AAGACTGAAAGGTGAAGTTGACGAATTTGGAGATTTTTACTGCAACGAAGTTGATGAAACTGACCTTACACAGATCTTCG
AAAACATAGATGTGAACAAGCAGTACCACGACATTGACAGAGTGAATTACTACATCAAGAAATATTCCCCATCTCCAACT
TGGTGCCTTTTCTCCGGCTGCTGCGTCTTCTTTCTCCGTCCTCTTCACTCCGCGAATGAAAATACTCTACAGGTGGCAAA
CCTAACATTGAGAAGACTTGCACTCGAGGTGGAGATGCACGAAGGCACCGTTTCTGACAAATTGACGCGTAGCATCACAC
ACGTTGTGATGTATGTACCTACGGAAAGTCCCGTATCGTTCCAGATTATATTACGAAGTGCGTCATCGGAGGAAAGAAGG
CTGCTATTATCGAAGCGGGTCAAAGTTGTAAGCCATCGCTGGATAGAGGATTCTGTTAACAGAAGTATCTACCGTCACCC
CGCAGACGAAGAGTATGATCTCAGGACTGGAGAAGTGCTTTTGTCCGATGAGGAAGAAGAAGAGCTGGTGTCAAAAACTT
CAGTCCCCATAAAGGCTGACTCGATGGTACCCATCACCCCAGACAAGAGGGTTGTACCTAGATCTCTTCGAAAACCAGCT
ACGCGGAGACGAGCCCTGAACCTTGGAAGGCCTAGTAGAAGATCCAAAGGAACAGAGGAGGAACTTCCAAGAGTAGAGGC
CCAAAAGTCGGGCAAAGAAGAGCACGTAGAGGATGACAGAGCACCAACAGAAATAACCAATAAGGGAACCAGGAATGATC
CGAAGGCAGTGGAACCTTTTCAATTAGATGATTCTGCTTCTCGTACACCAGAAAAGAGAGTTCTTAGAGTTACTCGAAGC
AGGAAAAAAGACAAACCATCATCTGGGAATGAGCGCCTACCTTTAAAACGACGTCTCATTCATGATGAAGGTGAGGCATC
TCTGACCCAACCTTCAGACAAGGCACTCACAGGAGCGGAACCTTTCATACCAAGAATGTCGACCAGATTGGCTGAGAAAA
GAAAAGCATTCGAAAGCAAGAGAGAAGCCGACACATATAAGCTAGAATGTACTGCAGCGGATCTTCGTGAGGAAGACCAA
GAAGAGCCCGTGGAAGCTTCAGAATTCGAGGGGTTGCAGAACACCAGTTCTTTGAAAGAGCCAACTGGAAGGGACGACAG
CAGATTACATTTCAACGATTTCGACCATCAAAGCCCAGCCATTGATGATATGATCAATACTCTACTTCCAAATTACGCCT
CATCACAACAGGCTAACTTATCTCAACAAACGAACAGAGAGAGTGCCTTGCCCTCAAGGTCGAACTCATTTTTTATCAAA
CCGACGTTGGCCCCACAACCCACTCAGGGAGTTCTAGGTAGCTACTCTCGACCATATCTCCCTTCACATTGTGCATCTGA
AGCCCTCCAGCCGGTACCAGATGCTCCAGATAACTCTGATTTATTTTCATCAGTGCATCGTTTGGCAAGTTCACAAGACC
ACTTGACTGGGACTGTGACGGTTGAATCATCCTCACAGGGGTTGGCAACTTATCTACCTCCGACCAACAACCCTGAGAAG
AAGAAAAGAAGCTTTAAGGATCTGGTCAACCAGATGCTCAACGGTAACTGA
Microexon DNA seq ATTTGTT
Microexon Amino Acid seq ICY
Microexon-tag DNA Seq AATCGTGGAGTTGCAAAGGCTGCTAAGGATGGCTTGGATTCTGATCTTCAGATTTGTTATGTTGCATTCGATATTCTGTATGATGGTGACAGCAGTGTTATCCATCGT
Microexon-tag Amino Acid seq NRGVAKAAKDGLDSDLQICYVAFDILYDGDSSVIHR
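The fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it splits the species-specific microexon-tag by the listed exon sizes (13,38,7,50), picks the exon at position 3, and translates the tag with the standard genetic code. The helper names (`split_exons`, `translate`, `microexon_peptide`) are hypothetical. It assumes the tag starts on a codon boundary and reads "Phase 0" as the microexon beginning at a codon start.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Values copied from the record above.
TAG_DNA = (
    "AATCGTGGAGTTGCAAAGGCTGCTAAGGATGGCTTGGATTCTGATCTTCAG"
    "ATTTGTT"
    "ATGTTGCATTCGATATTCTGTATGATGGTGACAGCAGTGTTATCCATCGT"
)
EXON_SIZES = [13, 38, 7, 50]   # "Structure of Microexon-tag"
MICROEXON_POSITION = 3         # 1-based "Microexon location in the Microexon-tag"


def split_exons(tag, sizes):
    """Cut the tag into consecutive exon segments of the given sizes."""
    if sum(sizes) != len(tag):
        raise ValueError("exon sizes do not add up to the tag length")
    exons, start = [], 0
    for size in sizes:
        exons.append(tag[start:start + size])
        start += size
    return exons


def translate(dna):
    """Translate complete codons only (assumes the tag starts in frame)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def microexon_peptide(tag, sizes, position):
    """Return the residues of every codon that overlaps the microexon."""
    start = sum(sizes[:position - 1])       # 0-based offset of the microexon
    end = start + sizes[position - 1]       # exclusive end
    protein = translate(tag)
    return protein[start // 3:(end - 1) // 3 + 1]


exons = split_exons(TAG_DNA, EXON_SIZES)
microexon = exons[MICROEXON_POSITION - 1]
frame_offset = sum(EXON_SIZES[:MICROEXON_POSITION - 1]) % 3
tag_protein = translate(TAG_DNA)
peptide = microexon_peptide(TAG_DNA, EXON_SIZES, MICROEXON_POSITION)

print(microexon, frame_offset, peptide)
print(tag_protein)
```

Under these assumptions the sketch reproduces the record's microexon DNA, microexon amino acid and microexon-tag amino acid fields. The last residue of the microexon peptide comes from a codon that is completed by the downstream flanking exon, because the 7-nt microexon is not a multiple of three.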
Transcript ID Pp3c14_5920V3.1
Gene ID Pp.5522