Microexon ID Pp_15:8955260-8955273:+
Species Physcomitrium patens
Coordinates 15:8955260..8955273
Microexon Cluster ID MEP38
Size 14
Phase 1
Pfam Domain Motif Myosin_head
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,14,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CAYTTYARTRMAACTGGRAARATATSTGGTGCYAADATTCAAACWTTTYTRCTTGARAAGTCWAGAGTWGTYCARYKTGCWGAWGGWGARAGRTCATAYCATATWTTT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATTTGCTTGAGAAG
Microexon Amino Acid seq YLLEK
Microexon-tag DNA Seq CATTTTGACGAAAGTGGGAAAATATGTGGAGCAATTATTGAAACATATTTGCTTGAGAAGTCGAGAGTCGTACAGCAAGCAGAAGGTGAAAGGTCATATCACGTTTTT
Microexon-tag Amino Acid Seq HFDESGKICGAIIETYLLEKSRVVQQAEGERSYHVF
Microexon-tag spanning region8955038-8955503
Microexon-tag prediction score0.9483
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c15_13490V3.1x
Reference Transcript ID Pp3c15_13490V3.1
Gene ID Pp3c15_13490
Gene Name NA
Transcript ID Pp3c15_13490V3.1
Protein ID Pp3c15_13490V3.1
Gene ID Pp3c15_13490
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 1.4e-252
Motif start 308
Motif end 963
Protein seq >Pp3c15_13490V3.1
MFPLKSSGPRSTLEEMLDLFRTDDKQEKESMDNGEEARPPPLPVRPASRARLPSSVRARKHQVVAMVKLISSSERSVDVV
QGDVAVDAPAVNTVLCPVTDCLDVDDDQVSKPERCQERPVNHPPAIVIPDDQALQNGHMAFETNVTATNCTKMSESQEIT
ASLTNPAAISVGKQRVGEEEAELHSNKLLVDVRGTANGSVVDDGAAVEQSDFSALPPEQLALLQIPSLQSPARTPTSPAP
SRKWIDDGVLRLRKNLRVWCLTSENIWICGTIISVEDAEAVVWTSDREEIQVSVTKLLPANPAFLEGVDDLIKLSYLNEP
SVLHDLDYRYSKDQIYTKAGPVLIAVNPFKKIHIYGEDIMQAYRDRTSASSQPHVYMIAGSAFGAMMKEGINQSIIISGE
SGAGKTETAKIAMQYLAALGGGSGIEDEILQTNPILEAFGNAKTSKNDNSSRFGKLIDIHFDESGKICGAIIETYLLEKS
RVVQQAEGERSYHVFYQLCAGADESLRDLLRLRSAKEYRYLSQSSCMSIDNVDDAEQFQRLRKAMNVVQICKEDQQKVFE
LLSAVLWLGNIVFRVSEPDNHVVVVDNEAVEIAAALLGCEVDKLVTALYSRRIRAGGDTIVQRLTLSQATDSRDALAKAI
YSYLFDWLVERVNKSLEAGKLRTGRSISILDIYGFETFKRNSFEQLCINYANERLQQHFNRHLFKLEQEEYTSEDIDWTR
IEFQDNQQCLDLIEKRPVGLISLLDEECMFPRATDFTLANKLKDHLKKNASFRGERDKKFRVYHYAGEVLYEADGFLEKN
RDLLHADLVELLESCDCALIFDFLASAGQGSGKSNGSEYQKQSVASKFKGQLNKLLQRLEATEPHFIRCIKPNTQQLPNV
IDQKLVLQQLRCCGVLEVVRISRSGYPTRYTHNEFASRYAFLLPRDVSEQEDVLSVCVAILEHFRKFITSEMYQVGITKL
FFRAGQIGMLEDVRVRTLRSIDRAQAVYKGYKVRREYKKKRKAVVFLQSLVRAAIARRHFEKRKERHRAVVFIQKNVRGW
IARCAYQAKKEKVILIQSVVRMSLAKGQLNDLQKEAEEKRAVERKLAEEKRASELQLAAEIQEKEAAEEKVRIEAVLQEE
VRMRRQAEEGTGSADEEQESIKEICETITTKPPESEEQNESTIRVRPSHILELQQRAVIAERTLLEKEEDNALLRQRIQH
YENQWVEYEAKMSSMEEMWQKQMSTLQLSLAAAKKSIATEESATLQTSSKDGSEDQKTVAGKHNRNTRPLLPTEEEKFHK
VIQDLDDEAAKVPENVENNSNKFLHAGSELGSSQGEVAAGHSYVTQLDREFDHRKQVFTDDIDFLVEVKSGQTTAHLSPE
DELRKLKTRFDAWKKDFKVRLRETKAVLSKLGHTDSSDKWIRGKKWHWVKLGKQITPP*
CDS seq >Pp3c15_13490V3.1
ATGTTTCCTTTAAAGAGCTCCGGTCCTCGGAGCACTTTAGAAGAGATGTTGGATTTATTTAGAACGGATGACAAGCAAGA
GAAAGAATCTATGGACAATGGAGAGGAGGCTAGGCCACCCCCTCTTCCTGTGAGGCCAGCATCTCGAGCTCGACTACCAT
CCTCTGTACGTGCAAGGAAACATCAAGTTGTAGCTATGGTAAAGCTCATCTCGTCTAGCGAAAGATCCGTTGATGTAGTC
CAGGGGGACGTTGCAGTGGATGCTCCTGCTGTAAACACAGTTTTGTGTCCTGTAACAGATTGCTTGGATGTAGATGATGA
CCAGGTATCCAAACCCGAGAGATGCCAGGAACGACCTGTTAATCATCCTCCAGCAATCGTAATCCCCGATGATCAGGCAC
TTCAAAATGGTCACATGGCTTTTGAGACAAATGTGACTGCTACGAACTGCACGAAAATGTCGGAGTCGCAAGAAATTACA
GCTTCATTGACCAACCCAGCTGCTATCTCGGTTGGAAAGCAAAGGGTGGGCGAGGAAGAAGCAGAATTGCATTCTAACAA
GCTTTTGGTTGATGTGAGAGGGACTGCAAATGGGTCTGTTGTTGATGATGGGGCTGCGGTGGAGCAATCAGATTTCTCTG
CCCTTCCACCTGAACAGCTTGCTTTGCTTCAAATTCCCAGCTTACAATCTCCAGCTCGTACTCCTACCTCACCTGCACCA
AGCAGGAAATGGATCGACGATGGAGTATTACGGTTAAGAAAGAATTTGCGAGTATGGTGCTTGACCTCGGAAAATATATG
GATTTGTGGAACCATTATATCTGTCGAAGACGCGGAGGCTGTTGTCTGGACCTCTGATCGAGAGGAGATTCAAGTGAGTG
TAACAAAGTTGCTACCTGCAAACCCCGCCTTTCTGGAGGGAGTGGATGACCTCATAAAGTTGAGCTACCTCAACGAGCCT
TCTGTTCTCCATGACTTGGATTACCGATATTCGAAAGATCAGATATATACAAAGGCTGGTCCTGTATTGATTGCGGTTAA
TCCGTTCAAGAAAATTCATATATATGGAGAGGATATAATGCAAGCTTATCGGGATAGAACTTCTGCAAGCTCTCAGCCTC
ATGTTTATATGATAGCTGGCAGTGCCTTTGGCGCAATGATGAAAGAGGGGATTAATCAGTCCATTATTATCAGTGGTGAG
AGTGGTGCAGGGAAAACAGAAACGGCAAAGATTGCCATGCAGTATTTAGCTGCTCTAGGAGGAGGTAGTGGGATAGAAGA
TGAAATTTTGCAAACCAACCCAATCTTAGAAGCCTTCGGCAACGCAAAAACTTCAAAGAATGACAATTCGAGTCGCTTTG
GAAAGCTCATTGACATACATTTTGACGAAAGTGGGAAAATATGTGGAGCAATTATTGAAACATATTTGCTTGAGAAGTCG
AGAGTCGTACAGCAAGCAGAAGGTGAAAGGTCATATCACGTTTTTTATCAACTGTGTGCTGGAGCTGATGAATCTTTACG
AGATCTTCTAAGACTAAGATCCGCCAAGGAGTATCGGTATCTAAGCCAAAGTAGCTGTATGTCTATCGATAATGTTGATG
ATGCGGAGCAATTCCAACGTTTGAGGAAAGCCATGAACGTGGTGCAAATTTGTAAAGAAGATCAGCAGAAAGTTTTTGAA
CTGCTCTCCGCCGTCCTATGGCTTGGAAATATTGTGTTTCGCGTTTCAGAGCCTGATAATCATGTCGTGGTTGTGGACAA
TGAAGCTGTGGAAATAGCAGCAGCCTTGTTGGGTTGCGAGGTTGATAAACTTGTAACAGCATTATACAGTCGGAGGATCC
GTGCAGGGGGTGATACTATTGTACAGAGACTGACACTTTCTCAGGCAACTGACTCAAGAGATGCGCTTGCTAAAGCAATT
TACTCCTACTTGTTTGATTGGCTGGTTGAACGTGTAAACAAATCACTGGAAGCTGGCAAGTTGCGAACTGGAAGATCAAT
CAGCATTCTGGATATCTATGGATTTGAAACTTTCAAGAGAAATAGTTTTGAGCAACTGTGTATAAACTATGCAAATGAGA
GGCTGCAGCAGCATTTCAACCGTCATCTTTTTAAGCTCGAACAAGAAGAATATACTTCTGAAGATATTGATTGGACCAGA
ATAGAATTTCAAGACAATCAACAATGCCTTGATCTTATCGAGAAGAGACCTGTGGGATTAATATCGTTACTTGATGAGGA
GTGTATGTTTCCACGAGCAACTGATTTTACGCTGGCGAATAAGTTGAAGGATCATCTGAAAAAAAATGCTTCTTTCAGAG
GGGAGCGGGACAAAAAATTTCGTGTTTACCACTATGCTGGAGAGGTGCTCTATGAGGCGGATGGGTTTCTTGAGAAGAAT
AGAGACTTACTACATGCAGACCTGGTGGAGCTTCTGGAGTCATGTGATTGTGCGTTGATTTTTGATTTTCTAGCATCTGC
TGGTCAAGGGTCTGGAAAGTCCAATGGCTCAGAATATCAAAAGCAAAGTGTTGCATCCAAGTTCAAGGGCCAATTGAACA
AGCTTCTGCAAAGATTGGAGGCCACTGAACCTCATTTTATACGGTGCATCAAACCAAATACCCAGCAGCTTCCAAATGTC
ATCGACCAGAAACTGGTCCTGCAGCAGCTTCGTTGTTGTGGAGTCCTGGAGGTGGTCCGCATCTCTCGCTCTGGTTACCC
AACTCGGTATACTCATAATGAATTTGCGAGCAGATATGCTTTCCTCCTTCCAAGAGACGTCTCTGAACAAGAGGATGTGT
TGAGCGTATGCGTGGCTATTCTTGAGCATTTCAGGAAGTTTATCACTTCCGAAATGTATCAAGTTGGTATTACCAAATTA
TTTTTCCGCGCTGGACAGATTGGAATGCTGGAGGATGTGAGAGTAAGAACTCTCCGCAGTATTGACCGAGCCCAAGCTGT
GTACAAAGGGTACAAGGTTCGACGTGAGTACAAAAAGAAGCGCAAGGCAGTAGTTTTCTTGCAATCCTTGGTAAGAGCGG
CTATAGCAAGAAGACATTTCGAGAAAAGAAAAGAACGGCACAGAGCAGTTGTGTTCATTCAAAAGAATGTCCGGGGGTGG
ATTGCACGTTGTGCATATCAAGCTAAGAAGGAGAAGGTTATCCTGATCCAATCAGTGGTGCGCATGTCGTTGGCCAAGGG
ACAGTTAAACGACCTCCAGAAAGAGGCCGAGGAGAAGAGAGCCGTAGAAAGAAAGTTGGCCGAGGAGAAGAGGGCGTCTG
AGTTACAATTGGCTGCCGAGATTCAGGAAAAAGAGGCAGCCGAGGAAAAAGTACGTATCGAAGCCGTCTTACAAGAAGAA
GTCAGGATGAGGCGACAAGCGGAGGAGGGTACTGGATCAGCTGATGAAGAGCAAGAAAGCATCAAGGAGATATGTGAAAC
GATTACAACGAAACCTCCTGAATCTGAGGAACAGAATGAATCAACCATCAGGGTGAGACCTTCTCACATTTTGGAGCTGC
AACAGAGGGCGGTGATTGCGGAGAGAACCCTTCTAGAGAAAGAGGAAGACAACGCGCTACTGCGACAGAGGATCCAACAC
TACGAGAATCAATGGGTGGAATATGAAGCCAAGATGTCATCTATGGAGGAGATGTGGCAAAAGCAGATGTCTACATTGCA
GCTAAGCTTAGCAGCTGCAAAGAAGAGTATTGCCACTGAAGAGAGCGCCACCTTGCAGACTTCTTCTAAAGATGGCTCTG
AGGACCAAAAGACTGTAGCTGGGAAACACAACCGTAACACTCGGCCTCTGCTTCCTACTGAGGAAGAGAAATTTCACAAA
GTTATACAAGATCTGGATGACGAAGCGGCGAAAGTTCCGGAGAATGTAGAAAACAATTCCAACAAGTTCTTGCATGCAGG
ATCTGAACTTGGCTCGAGCCAGGGTGAGGTGGCTGCTGGGCATTCATATGTAACCCAATTGGACCGAGAATTCGACCATC
GGAAGCAAGTTTTCACCGACGACATCGACTTCCTTGTGGAAGTGAAGTCTGGTCAGACTACGGCGCATTTGAGTCCTGAA
GACGAGCTCCGAAAACTCAAAACTAGATTCGATGCGTGGAAGAAAGACTTCAAGGTGAGGTTGCGTGAGACCAAGGCGGT
GCTCAGCAAACTTGGTCACACGGATTCGAGCGACAAATGGATCCGAGGGAAAAAATGGCACTGGGTTAAACTGGGGAAAC
AAATCACACCCCCTTGA
Microexon DNA seq ATTTGCTTGAGAAG
Microexon Amino Acid seq YLLEK
Microexon-tag DNA Seq CATTTTGACGAAAGTGGGAAAATATGTGGAGCAATTATTGAAACATATTTGCTTGAGAAGTCGAGAGTCGTACAGCAAGCAGAAGGTGAAAGGTCATATCACGTTTTT
Microexon-tag Amino Acid seq HFDESGKICGAIIETYLLEKSRVVQQAEGERSYHVF
Transcript ID Pp3c15_13490V3.1
Gene ID Pp.6869
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 1.4e-252
Motif start 308
Motif end 963
Protein seq >Pp3c15_13490V3.1
MFPLKSSGPRSTLEEMLDLFRTDDKQEKESMDNGEEARPPPLPVRPASRARLPSSVRARKHQVVAMVKLISSSERSVDVV
QGDVAVDAPAVNTVLCPVTDCLDVDDDQVSKPERCQERPVNHPPAIVIPDDQALQNGHMAFETNVTATNCTKMSESQEIT
ASLTNPAAISVGKQRVGEEEAELHSNKLLVDVRGTANGSVVDDGAAVEQSDFSALPPEQLALLQIPSLQSPARTPTSPAP
SRKWIDDGVLRLRKNLRVWCLTSENIWICGTIISVEDAEAVVWTSDREEIQVSVTKLLPANPAFLEGVDDLIKLSYLNEP
SVLHDLDYRYSKDQIYTKAGPVLIAVNPFKKIHIYGEDIMQAYRDRTSASSQPHVYMIAGSAFGAMMKEGINQSIIISGE
SGAGKTETAKIAMQYLAALGGGSGIEDEILQTNPILEAFGNAKTSKNDNSSRFGKLIDIHFDESGKICGAIIETYLLEKS
RVVQQAEGERSYHVFYQLCAGADESLRDLLRLRSAKEYRYLSQSSCMSIDNVDDAEQFQRLRKAMNVVQICKEDQQKVFE
LLSAVLWLGNIVFRVSEPDNHVVVVDNEAVEIAAALLGCEVDKLVTALYSRRIRAGGDTIVQRLTLSQATDSRDALAKAI
YSYLFDWLVERVNKSLEAGKLRTGRSISILDIYGFETFKRNSFEQLCINYANERLQQHFNRHLFKLEQEEYTSEDIDWTR
IEFQDNQQCLDLIEKRPVGLISLLDEECMFPRATDFTLANKLKDHLKKNASFRGERDKKFRVYHYAGEVLYEADGFLEKN
RDLLHADLVELLESCDCALIFDFLASAGQGSGKSNGSEYQKQSVASKFKGQLNKLLQRLEATEPHFIRCIKPNTQQLPNV
IDQKLVLQQLRCCGVLEVVRISRSGYPTRYTHNEFASRYAFLLPRDVSEQEDVLSVCVAILEHFRKFITSEMYQVGITKL
FFRAGQIGMLEDVRVRTLRSIDRAQAVYKGYKVRREYKKKRKAVVFLQSLVRAAIARRHFEKRKERHRAVVFIQKNVRGW
IARCAYQAKKEKVILIQSVVRMSLAKGQLNDLQKEAEEKRAVERKLAEEKRASELQLAAEIQEKEAAEEKVRIEAVLQEE
VRMRRQAEEGTGSADEEQESIKEICETITTKPPESEEQNESTIRVRPSHILELQQRAVIAERTLLEKEEDNALLRQRIQH
YENQWVEYEAKMSSMEEMWQKQMSTLQLSLAAAKKSIATEESATLQTSSKDGSEDQKTVAGKHNRNTRPLLPTEEEKFHK
VIQDLDDEAAKVPENVENNSNKFLHAGSELGSSQGEVAAGHSYVTQLDREFDHRKQVFTDDIDFLVEVKSGQTTAHLSPE
DELRKLKTRFDAWKKDFKVRLRETKAVLSKLGHTDSSDKWIRGKKWHWVKLGKQITPP*
CDS seq >Pp3c15_13490V3.1
ATGTTTCCTTTAAAGAGCTCCGGTCCTCGGAGCACTTTAGAAGAGATGTTGGATTTATTTAGAACGGATGACAAGCAAGA
GAAAGAATCTATGGACAATGGAGAGGAGGCTAGGCCACCCCCTCTTCCTGTGAGGCCAGCATCTCGAGCTCGACTACCAT
CCTCTGTACGTGCAAGGAAACATCAAGTTGTAGCTATGGTAAAGCTCATCTCGTCTAGCGAAAGATCCGTTGATGTAGTC
CAGGGGGACGTTGCAGTGGATGCTCCTGCTGTAAACACAGTTTTGTGTCCTGTAACAGATTGCTTGGATGTAGATGATGA
CCAGGTATCCAAACCCGAGAGATGCCAGGAACGACCTGTTAATCATCCTCCAGCAATCGTAATCCCCGATGATCAGGCAC
TTCAAAATGGTCACATGGCTTTTGAGACAAATGTGACTGCTACGAACTGCACGAAAATGTCGGAGTCGCAAGAAATTACA
GCTTCATTGACCAACCCAGCTGCTATCTCGGTTGGAAAGCAAAGGGTGGGCGAGGAAGAAGCAGAATTGCATTCTAACAA
GCTTTTGGTTGATGTGAGAGGGACTGCAAATGGGTCTGTTGTTGATGATGGGGCTGCGGTGGAGCAATCAGATTTCTCTG
CCCTTCCACCTGAACAGCTTGCTTTGCTTCAAATTCCCAGCTTACAATCTCCAGCTCGTACTCCTACCTCACCTGCACCA
AGCAGGAAATGGATCGACGATGGAGTATTACGGTTAAGAAAGAATTTGCGAGTATGGTGCTTGACCTCGGAAAATATATG
GATTTGTGGAACCATTATATCTGTCGAAGACGCGGAGGCTGTTGTCTGGACCTCTGATCGAGAGGAGATTCAAGTGAGTG
TAACAAAGTTGCTACCTGCAAACCCCGCCTTTCTGGAGGGAGTGGATGACCTCATAAAGTTGAGCTACCTCAACGAGCCT
TCTGTTCTCCATGACTTGGATTACCGATATTCGAAAGATCAGATATATACAAAGGCTGGTCCTGTATTGATTGCGGTTAA
TCCGTTCAAGAAAATTCATATATATGGAGAGGATATAATGCAAGCTTATCGGGATAGAACTTCTGCAAGCTCTCAGCCTC
ATGTTTATATGATAGCTGGCAGTGCCTTTGGCGCAATGATGAAAGAGGGGATTAATCAGTCCATTATTATCAGTGGTGAG
AGTGGTGCAGGGAAAACAGAAACGGCAAAGATTGCCATGCAGTATTTAGCTGCTCTAGGAGGAGGTAGTGGGATAGAAGA
TGAAATTTTGCAAACCAACCCAATCTTAGAAGCCTTCGGCAACGCAAAAACTTCAAAGAATGACAATTCGAGTCGCTTTG
GAAAGCTCATTGACATACATTTTGACGAAAGTGGGAAAATATGTGGAGCAATTATTGAAACATATTTGCTTGAGAAGTCG
AGAGTCGTACAGCAAGCAGAAGGTGAAAGGTCATATCACGTTTTTTATCAACTGTGTGCTGGAGCTGATGAATCTTTACG
AGATCTTCTAAGACTAAGATCCGCCAAGGAGTATCGGTATCTAAGCCAAAGTAGCTGTATGTCTATCGATAATGTTGATG
ATGCGGAGCAATTCCAACGTTTGAGGAAAGCCATGAACGTGGTGCAAATTTGTAAAGAAGATCAGCAGAAAGTTTTTGAA
CTGCTCTCCGCCGTCCTATGGCTTGGAAATATTGTGTTTCGCGTTTCAGAGCCTGATAATCATGTCGTGGTTGTGGACAA
TGAAGCTGTGGAAATAGCAGCAGCCTTGTTGGGTTGCGAGGTTGATAAACTTGTAACAGCATTATACAGTCGGAGGATCC
GTGCAGGGGGTGATACTATTGTACAGAGACTGACACTTTCTCAGGCAACTGACTCAAGAGATGCGCTTGCTAAAGCAATT
TACTCCTACTTGTTTGATTGGCTGGTTGAACGTGTAAACAAATCACTGGAAGCTGGCAAGTTGCGAACTGGAAGATCAAT
CAGCATTCTGGATATCTATGGATTTGAAACTTTCAAGAGAAATAGTTTTGAGCAACTGTGTATAAACTATGCAAATGAGA
GGCTGCAGCAGCATTTCAACCGTCATCTTTTTAAGCTCGAACAAGAAGAATATACTTCTGAAGATATTGATTGGACCAGA
ATAGAATTTCAAGACAATCAACAATGCCTTGATCTTATCGAGAAGAGACCTGTGGGATTAATATCGTTACTTGATGAGGA
GTGTATGTTTCCACGAGCAACTGATTTTACGCTGGCGAATAAGTTGAAGGATCATCTGAAAAAAAATGCTTCTTTCAGAG
GGGAGCGGGACAAAAAATTTCGTGTTTACCACTATGCTGGAGAGGTGCTCTATGAGGCGGATGGGTTTCTTGAGAAGAAT
AGAGACTTACTACATGCAGACCTGGTGGAGCTTCTGGAGTCATGTGATTGTGCGTTGATTTTTGATTTTCTAGCATCTGC
TGGTCAAGGGTCTGGAAAGTCCAATGGCTCAGAATATCAAAAGCAAAGTGTTGCATCCAAGTTCAAGGGCCAATTGAACA
AGCTTCTGCAAAGATTGGAGGCCACTGAACCTCATTTTATACGGTGCATCAAACCAAATACCCAGCAGCTTCCAAATGTC
ATCGACCAGAAACTGGTCCTGCAGCAGCTTCGTTGTTGTGGAGTCCTGGAGGTGGTCCGCATCTCTCGCTCTGGTTACCC
AACTCGGTATACTCATAATGAATTTGCGAGCAGATATGCTTTCCTCCTTCCAAGAGACGTCTCTGAACAAGAGGATGTGT
TGAGCGTATGCGTGGCTATTCTTGAGCATTTCAGGAAGTTTATCACTTCCGAAATGTATCAAGTTGGTATTACCAAATTA
TTTTTCCGCGCTGGACAGATTGGAATGCTGGAGGATGTGAGAGTAAGAACTCTCCGCAGTATTGACCGAGCCCAAGCTGT
GTACAAAGGGTACAAGGTTCGACGTGAGTACAAAAAGAAGCGCAAGGCAGTAGTTTTCTTGCAATCCTTGGTAAGAGCGG
CTATAGCAAGAAGACATTTCGAGAAAAGAAAAGAACGGCACAGAGCAGTTGTGTTCATTCAAAAGAATGTCCGGGGGTGG
ATTGCACGTTGTGCATATCAAGCTAAGAAGGAGAAGGTTATCCTGATCCAATCAGTGGTGCGCATGTCGTTGGCCAAGGG
ACAGTTAAACGACCTCCAGAAAGAGGCCGAGGAGAAGAGAGCCGTAGAAAGAAAGTTGGCCGAGGAGAAGAGGGCGTCTG
AGTTACAATTGGCTGCCGAGATTCAGGAAAAAGAGGCAGCCGAGGAAAAAGTACGTATCGAAGCCGTCTTACAAGAAGAA
GTCAGGATGAGGCGACAAGCGGAGGAGGGTACTGGATCAGCTGATGAAGAGCAAGAAAGCATCAAGGAGATATGTGAAAC
GATTACAACGAAACCTCCTGAATCTGAGGAACAGAATGAATCAACCATCAGGGTGAGACCTTCTCACATTTTGGAGCTGC
AACAGAGGGCGGTGATTGCGGAGAGAACCCTTCTAGAGAAAGAGGAAGACAACGCGCTACTGCGACAGAGGATCCAACAC
TACGAGAATCAATGGGTGGAATATGAAGCCAAGATGTCATCTATGGAGGAGATGTGGCAAAAGCAGATGTCTACATTGCA
GCTAAGCTTAGCAGCTGCAAAGAAGAGTATTGCCACTGAAGAGAGCGCCACCTTGCAGACTTCTTCTAAAGATGGCTCTG
AGGACCAAAAGACTGTAGCTGGGAAACACAACCGTAACACTCGGCCTCTGCTTCCTACTGAGGAAGAGAAATTTCACAAA
GTTATACAAGATCTGGATGACGAAGCGGCGAAAGTTCCGGAGAATGTAGAAAACAATTCCAACAAGTTCTTGCATGCAGG
ATCTGAACTTGGCTCGAGCCAGGGTGAGGTGGCTGCTGGGCATTCATATGTAACCCAATTGGACCGAGAATTCGACCATC
GGAAGCAAGTTTTCACCGACGACATCGACTTCCTTGTGGAAGTGAAGTCTGGTCAGACTACGGCGCATTTGAGTCCTGAA
GACGAGCTCCGAAAACTCAAAACTAGATTCGATGCGTGGAAGAAAGACTTCAAGGTGAGGTTGCGTGAGACCAAGGCGGT
GCTCAGCAAACTTGGTCACACGGATTCGAGCGACAAATGGATCCGAGGGAAAAAATGGCACTGGGTTAAACTGGGGAAAC
AAATCACACCCCCTTGA