Microexon ID Pp_21:13629054-13629067:-
Species Physcomitrium patens
Coordinates 21:13629054..13629067
Microexon Cluster ID MEP38
Size 14
Phase 1
Pfam Domain Motif Myosin_head
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,14,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CAYTTYARTRMAACTGGRAARATATSTGGTGCYAADATTCAAACWTTTYTRCTTGARAAGTCWAGAGTWGTYCARYKTGCWGAWGGWGARAGRTCATAYCATATWTTT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATTTATTAGAAAAG
Microexon Amino Acid seq YLLEK
Microexon-tag DNA Seq CATTTTGATAGAGCCGGTAAAATATGTGGGGCAAAGATTCAAACTTATTTATTAGAAAAGTCTCGAGTTGTACAGCAGGCTGAAGGCGAAAGATCTTACCATATTTTT
Microexon-tag Amino Acid Seq HFDRAGKICGAKIQTYLLEKSRVVQQAEGERSYHIF
Microexon-tag spanning region13628728-13629315
Microexon-tag prediction score0.9431
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c21_20730V3.1x
Reference Transcript ID Pp3c21_20730V3.1
Gene ID Pp3c21_20730
Gene Name NA
Transcript ID Pp3c21_20730V3.1
Protein ID Pp3c21_20730V3.1
Gene ID Pp3c21_20730
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 5.4e-251
Motif start 276
Motif end 934
Protein seq >Pp3c21_20730V3.1
MYSTNGIEGRSTLEKMLDFMKTGDIEESEITSDETYADLPPLPLRPSSRARLPSSMGAKKALGACLDSIVLSSNESEAFK
ENIAVGSPIVNLVAPADPAALASKSVTCTNVHTPLAERVGTALNESFASPQLASSPSIIPDVFTPADQVRSSGTLSFDQR
LDACGAQESSFSFLTAQESSTPETPLPQTPVLENTALPVTTPSSGKKWKDDGTLRLKKNLRVWCLTSEYNWIAGTVVSAE
DKDTEAMVRTADHKVIRVNVTRLQPANPDILEGVYDLIKLSYLNEPSVLHNLDFRYEQDKIYTKAGPVLIAVNPFKEISI
YGPNNILAYRNRTSESTYPHVYMTADTAFKAMIRDGINQSVIISGESGAGKTETAKITMQYLAALGGGGGLEDEILQTNP
ILEAFGNAKTLRNDNSSRFGKLIDIHFDRAGKICGAKIQTYLLEKSRVVQQAEGERSYHIFYQLCAGADTALRERLHLKS
AKEYKYLNQSRCLYIDNVDDAKNFQHMKSAMDVVQISVEDQEQAFKMLAAVLWIGNITFHVVENDSYVVVDESEAVNVAA
GLLHCKSNALVAALSTRRIRVGGEEIVQRLTFAQANDSRDALAKAIYASLFDWLVGRINKSLEVGKKPTGRSISILDIYG
FESFKKNSFEQLCINYANERLQQHFNRHLFKLEQEEYTSENIDWTRVDFEDNQECLDLIEKRPLGLISLLDEECMFPRAS
DATLANKLKEHLKGNDCFKGERDKAFRICHYAGEVVYETSAFLEKNRDLLHADLLQLLASCDCALPKLFGASIEDGAQKL
LSPNRRANGMESQKQSVAAKFKGQLNKLMQRLESTEPHFIRCIKPNTSQLPNIFEQDLVLHQLRCCGVLEVVRISRSGYP
TRHSHHEFAKRYGFLLPRNLSNQEDMLSICVSILHQFGIAPDMYQVGITKLFFRAGQIGHLEDVRLRTLQGITRVQALYK
GYKVRCNYKHRRATTIFLQSLVRGAIARRRFELLRERHRAAVTIQKYARRQVACRRYRSVKENIVILQSVVRMWLSRKQS
LARKKEANEAKRAMESKLSEEARVAETEANVKEDAVDDGRECIKEVASSTRAESAKELKEATIKVAPSYLLELQRRAVMA
EKALREKEEDNAMLRQRLLHYEARWMEYEAKMSSMEDMWQKQMSSLQLSLAAAKKSLATDEFLPQTPGKHDNGRISAGKH
RHSTKRQLLPSDDEEFDWDDVATNGMKSPDDFTNKYLVTGSGNGASRGDVEAARSVVSHLTREYDHRTQVFNDDVDFLIE
VKSGLTEANLNPEEELRKLKVRFDTWRRDFKARLRETRLVLNKLCSLDSAEKDGDRMLCALDSLEKEGDRTRKKWWGKKT
TSSRALQG*
CDS seq >Pp3c21_20730V3.1
ATGTATTCTACGAATGGCATTGAGGGGCGTAGTACATTAGAGAAGATGCTGGATTTCATGAAGACCGGTGACATTGAGGA
AAGTGAAATCACAAGTGATGAGACTTATGCGGACTTGCCTCCCTTGCCTTTAAGACCATCATCCAGGGCACGCTTGCCAT
CATCCATGGGAGCTAAGAAAGCGCTGGGCGCGTGTTTAGATAGCATTGTGCTTTCGAGCAATGAGTCTGAAGCGTTCAAA
GAAAACATCGCCGTGGGATCACCCATTGTAAATCTGGTGGCACCAGCTGATCCCGCAGCGCTAGCTTCGAAGTCCGTGAC
ATGTACGAACGTTCACACACCTCTTGCTGAAAGAGTGGGCACTGCTCTCAATGAGAGCTTCGCCAGTCCTCAGCTCGCCT
CCTCCCCTTCCATCATACCAGATGTTTTCACGCCGGCAGACCAAGTGCGCAGTAGTGGAACACTGAGCTTTGATCAAAGA
CTCGACGCCTGTGGCGCACAGGAATCAAGTTTCAGTTTTCTAACTGCACAAGAATCCTCCACACCTGAAACACCTCTGCC
TCAGACTCCTGTCCTGGAAAATACAGCTCTGCCTGTAACAACTCCATCGTCTGGCAAGAAGTGGAAGGATGACGGTACAC
TGCGCCTGAAGAAGAATTTGCGAGTATGGTGCTTAACTTCTGAGTACAATTGGATTGCTGGAACGGTAGTTTCTGCTGAG
GATAAGGATACAGAGGCTATGGTGCGCACTGCTGATCACAAGGTTATCAGAGTGAATGTCACCAGACTTCAACCAGCAAA
CCCTGACATATTAGAAGGAGTTTATGACCTCATCAAACTCAGCTACTTGAATGAGCCTTCAGTTCTGCATAATTTAGACT
TTCGGTATGAGCAAGATAAGATTTATACTAAGGCCGGTCCCGTCTTGATTGCTGTTAATCCGTTCAAGGAAATTTCCATC
TATGGTCCAAACAACATCCTTGCCTACAGAAATAGAACCTCCGAAAGCACTTACCCTCATGTGTATATGACAGCGGACAC
TGCATTCAAAGCCATGATTCGAGATGGTATTAATCAGTCTGTCATCATCAGCGGTGAGAGCGGTGCGGGGAAGACGGAAA
CAGCGAAAATTACCATGCAGTATCTAGCTGCACTTGGTGGCGGTGGTGGATTAGAAGACGAAATTTTGCAAACTAACCCG
ATTTTGGAAGCGTTCGGGAATGCCAAGACCTTAAGAAATGACAACTCCAGTCGCTTTGGAAAGCTGATTGATATTCATTT
TGATAGAGCCGGTAAAATATGTGGGGCAAAGATTCAAACTTATTTATTAGAAAAGTCTCGAGTTGTACAGCAGGCTGAAG
GCGAAAGATCTTACCATATTTTTTATCAACTTTGCGCTGGAGCCGACACAGCTTTGAGAGAGCGGTTGCATTTAAAATCT
GCGAAAGAGTACAAGTATTTGAATCAAAGCAGGTGTTTGTACATTGATAACGTTGACGATGCAAAAAATTTCCAACATAT
GAAGAGTGCTATGGATGTGGTGCAAATCAGCGTGGAAGACCAGGAGCAAGCTTTTAAGATGCTTGCTGCAGTCCTTTGGA
TCGGGAACATCACGTTTCACGTTGTTGAGAATGATTCTTATGTCGTTGTGGATGAAAGTGAAGCTGTGAATGTGGCGGCT
GGACTGCTTCACTGCAAGTCCAATGCGCTGGTTGCAGCACTTTCTACTCGAAGGATCCGCGTTGGAGGCGAAGAAATTGT
ACAGAGATTGACATTTGCGCAGGCAAATGATTCCAGAGATGCCCTTGCTAAAGCTATCTATGCTAGCCTGTTCGACTGGT
TGGTGGGACGTATCAACAAGTCTTTAGAAGTTGGCAAGAAGCCGACAGGAAGGTCAATAAGCATCCTGGACATTTATGGA
TTTGAATCTTTTAAGAAAAACAGTTTTGAGCAATTATGTATAAATTATGCGAATGAAAGGTTGCAACAACATTTCAATCG
TCATCTGTTCAAGCTTGAGCAAGAGGAGTATACGTCTGAAAATATTGATTGGACGAGGGTGGATTTTGAAGACAATCAAG
AATGTCTTGATCTTATTGAGAAGAGACCATTAGGATTGATTTCTTTACTCGATGAGGAGTGCATGTTCCCGCGAGCTTCA
GATGCGACTCTTGCAAATAAGCTGAAAGAGCATCTGAAAGGAAACGACTGCTTTAAAGGCGAGCGAGATAAGGCATTCCG
AATCTGTCACTATGCTGGAGAGGTTGTCTATGAAACATCTGCGTTTCTTGAGAAGAACAGGGACCTGCTACACGCAGATT
TGTTGCAGCTGCTAGCATCTTGTGACTGTGCATTGCCAAAACTGTTTGGTGCCTCTATTGAAGATGGTGCTCAGAAGTTG
CTGAGCCCCAATCGAAGGGCCAATGGCATGGAATCTCAAAAGCAGAGTGTGGCTGCGAAGTTTAAGGGCCAATTGAACAA
GCTGATGCAAAGACTAGAGAGCACTGAACCTCACTTTATCAGGTGCATCAAACCCAATACCTCGCAGCTTCCTAATATCT
TCGAGCAGGATCTAGTATTACATCAGCTCCGGTGTTGTGGCGTCCTGGAGGTGGTTCGCATTTCACGTTCTGGCTACCCG
ACTCGCCATTCACATCATGAGTTTGCAAAGAGGTATGGCTTCCTGCTTCCAAGGAATCTGTCTAATCAAGAGGACATGCT
AAGCATATGTGTCTCTATTCTTCATCAATTTGGCATTGCTCCAGATATGTATCAAGTAGGCATCACGAAGTTGTTCTTTC
GGGCTGGACAGATAGGACATTTGGAGGACGTTCGACTGAGAACCCTTCAGGGTATTACACGAGTTCAAGCTTTATATAAA
GGCTATAAAGTCCGATGCAATTACAAACATCGACGAGCAACCACAATTTTCTTACAATCCTTGGTCAGAGGAGCCATCGC
AAGGAGGCGATTTGAGTTGTTGCGAGAGAGGCATCGTGCGGCTGTAACGATTCAAAAGTATGCAAGAAGGCAGGTTGCTT
GTCGTAGATATCGCTCAGTGAAGGAAAACATTGTAATCCTTCAATCAGTCGTTCGCATGTGGCTGTCTAGAAAGCAGTCG
CTTGCTCGGAAAAAGGAGGCCAATGAGGCGAAGCGAGCTATGGAATCTAAACTAAGTGAAGAAGCTAGAGTTGCCGAGAC
TGAGGCAAATGTAAAGGAAGATGCTGTTGATGATGGTCGCGAATGTATAAAGGAGGTGGCTAGTTCAACACGTGCAGAAT
CTGCTAAGGAATTAAAAGAGGCCACTATCAAGGTTGCACCGTCATACCTTCTTGAGTTGCAGCGACGGGCAGTCATGGCG
GAGAAGGCACTGAGGGAGAAAGAGGAAGACAATGCAATGCTGCGACAGAGGCTTCTGCACTACGAGGCACGGTGGATGGA
GTATGAAGCCAAGATGTCGTCCATGGAGGACATGTGGCAAAAGCAGATGTCTTCATTGCAACTTAGCTTAGCAGCTGCCA
AGAAGAGCTTAGCAACAGATGAATTTCTGCCGCAAACTCCTGGCAAGCACGACAATGGTCGCATCTCAGCCGGGAAGCAC
CGGCATAGTACTAAGCGGCAACTGCTGCCCTCCGATGACGAAGAGTTCGATTGGGACGATGTCGCAACGAACGGCATGAA
GAGCCCGGATGACTTCACCAACAAATATTTGGTAACTGGCTCTGGAAATGGCGCGTCACGTGGTGACGTCGAAGCTGCAC
GGTCTGTCGTCAGCCACCTGACGAGAGAGTACGACCACCGAACGCAGGTATTCAACGATGATGTTGATTTTCTCATCGAA
GTTAAATCTGGCTTGACTGAGGCTAACTTGAACCCTGAAGAAGAGTTGAGGAAGCTGAAGGTGAGGTTTGACACATGGAG
GAGAGACTTCAAAGCCAGATTGCGAGAGACCAGGCTTGTGCTGAACAAGCTCTGTTCTTTAGACTCGGCTGAAAAAGACG
GAGATAGGATGCTGTGTGCTTTAGACTCGCTGGAAAAAGAGGGAGATCGGACGCGCAAGAAGTGGTGGGGAAAGAAAACC
ACCTCCTCAAGAGCGCTCCAAGGTTAG
Microexon DNA seq ATTTATTAGAAAAG
Microexon Amino Acid seq YLLEK
Microexon-tag DNA Seq CATTTTGATAGAGCCGGTAAAATATGTGGGGCAAAGATTCAAACTTATTTATTAGAAAAGTCTCGAGTTGTACAGCAGGCTGAAGGCGAAAGATCTTACCATATTTTT
Microexon-tag Amino Acid seq HFDRAGKICGAKIQTYLLEKSRVVQQAEGERSYHIF
Transcript ID Pp3c21_20730V3.2
Gene ID Pp.13886
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 5.4e-251
Motif start 276
Motif end 934
Protein seq >Pp3c21_20730V3.2
MYSTNGIEGRSTLEKMLDFMKTGDIEESEITSDETYADLPPLPLRPSSRARLPSSMGAKKALGACLDSIVLSSNESEAFK
ENIAVGSPIVNLVAPADPAALASKSVTCTNVHTPLAERVGTALNESFASPQLASSPSIIPDVFTPADQVRSSGTLSFDQR
LDACGAQESSFSFLTAQESSTPETPLPQTPVLENTALPVTTPSSGKKWKDDGTLRLKKNLRVWCLTSEYNWIAGTVVSAE
DKDTEAMVRTADHKVIRVNVTRLQPANPDILEGVYDLIKLSYLNEPSVLHNLDFRYEQDKIYTKAGPVLIAVNPFKEISI
YGPNNILAYRNRTSESTYPHVYMTADTAFKAMIRDGINQSVIISGESGAGKTETAKITMQYLAALGGGGGLEDEILQTNP
ILEAFGNAKTLRNDNSSRFGKLIDIHFDRAGKICGAKIQTYLLEKSRVVQQAEGERSYHIFYQLCAGADTALRERLHLKS
AKEYKYLNQSRCLYIDNVDDAKNFQHMKSAMDVVQISVEDQEQAFKMLAAVLWIGNITFHVVENDSYVVVDESEAVNVAA
GLLHCKSNALVAALSTRRIRVGGEEIVQRLTFAQANDSRDALAKAIYASLFDWLVGRINKSLEVGKKPTGRSISILDIYG
FESFKKNSFEQLCINYANERLQQHFNRHLFKLEQEEYTSENIDWTRVDFEDNQECLDLIEKRPLGLISLLDEECMFPRAS
DATLANKLKEHLKGNDCFKGERDKAFRICHYAGEVVYETSAFLEKNRDLLHADLLQLLASCDCALPKLFGASIEDGAQKL
LSPNRRANGMESQKQSVAAKFKGQLNKLMQRLESTEPHFIRCIKPNTSQLPNIFEQDLVLHQLRCCGVLEVVRISRSGYP
TRHSHHEFAKRYGFLLPRNLSNQEDMLSICVSILHQFGIAPDMYQVGITKLFFRAGQIGHLEDVRLRTLQGITRVQALYK
GYKVRCNYKHRRATTIFLQSLVRGAIARRRFELLRERHRAAVTIQKYARRQVACRRYRSVKENIVILQSVVRMWLSRKQS
LARKKEANEAKRAMESKLSEEARVAETEANVKEDAVDDGRECIKEVASSTRAESAKELKEATIKVAPSYLLELQRRAVMA
EKALREKEEDNAMLRQRLLHYEARWMEYEAKMSSMEDMWQKQMSSLQLSLAAAKKSLATDEFLPQTPGKHDNGRISAGKH
RHSTKRQLLPSDDEEFDWDDVATNGMKSPDDFTNKYLVTGSGNGASRGDVEAARSVVSHLTREYDHRTQVFNDDVDFLIE
VKSGLTEANLNPEEELRKLKVRFDTWRRDFKARLRETRLVLNKLCSLDSAEKDGDRMLCALDSLEKEGDRTRKKWWGKKT
TSSRALQG*
CDS seq >Pp3c21_20730V3.2
ATGTATTCTACGAATGGCATTGAGGGGCGTAGTACATTAGAGAAGATGCTGGATTTCATGAAGACCGGTGACATTGAGGA
AAGTGAAATCACAAGTGATGAGACTTATGCGGACTTGCCTCCCTTGCCTTTAAGACCATCATCCAGGGCACGCTTGCCAT
CATCCATGGGAGCTAAGAAAGCGCTGGGCGCGTGTTTAGATAGCATTGTGCTTTCGAGCAATGAGTCTGAAGCGTTCAAA
GAAAACATCGCCGTGGGATCACCCATTGTAAATCTGGTGGCACCAGCTGATCCCGCAGCGCTAGCTTCGAAGTCCGTGAC
ATGTACGAACGTTCACACACCTCTTGCTGAAAGAGTGGGCACTGCTCTCAATGAGAGCTTCGCCAGTCCTCAGCTCGCCT
CCTCCCCTTCCATCATACCAGATGTTTTCACGCCGGCAGACCAAGTGCGCAGTAGTGGAACACTGAGCTTTGATCAAAGA
CTCGACGCCTGTGGCGCACAGGAATCAAGTTTCAGTTTTCTAACTGCACAAGAATCCTCCACACCTGAAACACCTCTGCC
TCAGACTCCTGTCCTGGAAAATACAGCTCTGCCTGTAACAACTCCATCGTCTGGCAAGAAGTGGAAGGATGACGGTACAC
TGCGCCTGAAGAAGAATTTGCGAGTATGGTGCTTAACTTCTGAGTACAATTGGATTGCTGGAACGGTAGTTTCTGCTGAG
GATAAGGATACAGAGGCTATGGTGCGCACTGCTGATCACAAGGTTATCAGAGTGAATGTCACCAGACTTCAACCAGCAAA
CCCTGACATATTAGAAGGAGTTTATGACCTCATCAAACTCAGCTACTTGAATGAGCCTTCAGTTCTGCATAATTTAGACT
TTCGGTATGAGCAAGATAAGATTTATACTAAGGCCGGTCCCGTCTTGATTGCTGTTAATCCGTTCAAGGAAATTTCCATC
TATGGTCCAAACAACATCCTTGCCTACAGAAATAGAACCTCCGAAAGCACTTACCCTCATGTGTATATGACAGCGGACAC
TGCATTCAAAGCCATGATTCGAGATGGTATTAATCAGTCTGTCATCATCAGCGGTGAGAGCGGTGCGGGGAAGACGGAAA
CAGCGAAAATTACCATGCAGTATCTAGCTGCACTTGGTGGCGGTGGTGGATTAGAAGACGAAATTTTGCAAACTAACCCG
ATTTTGGAAGCGTTCGGGAATGCCAAGACCTTAAGAAATGACAACTCCAGTCGCTTTGGAAAGCTGATTGATATTCATTT
TGATAGAGCCGGTAAAATATGTGGGGCAAAGATTCAAACTTATTTATTAGAAAAGTCTCGAGTTGTACAGCAGGCTGAAG
GCGAAAGATCTTACCATATTTTTTATCAACTTTGCGCTGGAGCCGACACAGCTTTGAGAGAGCGGTTGCATTTAAAATCT
GCGAAAGAGTACAAGTATTTGAATCAAAGCAGGTGTTTGTACATTGATAACGTTGACGATGCAAAAAATTTCCAACATAT
GAAGAGTGCTATGGATGTGGTGCAAATCAGCGTGGAAGACCAGGAGCAAGCTTTTAAGATGCTTGCTGCAGTCCTTTGGA
TCGGGAACATCACGTTTCACGTTGTTGAGAATGATTCTTATGTCGTTGTGGATGAAAGTGAAGCTGTGAATGTGGCGGCT
GGACTGCTTCACTGCAAGTCCAATGCGCTGGTTGCAGCACTTTCTACTCGAAGGATCCGCGTTGGAGGCGAAGAAATTGT
ACAGAGATTGACATTTGCGCAGGCAAATGATTCCAGAGATGCCCTTGCTAAAGCTATCTATGCTAGCCTGTTCGACTGGT
TGGTGGGACGTATCAACAAGTCTTTAGAAGTTGGCAAGAAGCCGACAGGAAGGTCAATAAGCATCCTGGACATTTATGGA
TTTGAATCTTTTAAGAAAAACAGTTTTGAGCAATTATGTATAAATTATGCGAATGAAAGGTTGCAACAACATTTCAATCG
TCATCTGTTCAAGCTTGAGCAAGAGGAGTATACGTCTGAAAATATTGATTGGACGAGGGTGGATTTTGAAGACAATCAAG
AATGTCTTGATCTTATTGAGAAGAGACCATTAGGATTGATTTCTTTACTCGATGAGGAGTGCATGTTCCCGCGAGCTTCA
GATGCGACTCTTGCAAATAAGCTGAAAGAGCATCTGAAAGGAAACGACTGCTTTAAAGGCGAGCGAGATAAGGCATTCCG
AATCTGTCACTATGCTGGAGAGGTTGTCTATGAAACATCTGCGTTTCTTGAGAAGAACAGGGACCTGCTACACGCAGATT
TGTTGCAGCTGCTAGCATCTTGTGACTGTGCATTGCCAAAACTGTTTGGTGCCTCTATTGAAGATGGTGCTCAGAAGTTG
CTGAGCCCCAATCGAAGGGCCAATGGCATGGAATCTCAAAAGCAGAGTGTGGCTGCGAAGTTTAAGGGCCAATTGAACAA
GCTGATGCAAAGACTAGAGAGCACTGAACCTCACTTTATCAGGTGCATCAAACCCAATACCTCGCAGCTTCCTAATATCT
TCGAGCAGGATCTAGTATTACATCAGCTCCGGTGTTGTGGCGTCCTGGAGGTGGTTCGCATTTCACGTTCTGGCTACCCG
ACTCGCCATTCACATCATGAGTTTGCAAAGAGGTATGGCTTCCTGCTTCCAAGGAATCTGTCTAATCAAGAGGACATGCT
AAGCATATGTGTCTCTATTCTTCATCAATTTGGCATTGCTCCAGATATGTATCAAGTAGGCATCACGAAGTTGTTCTTTC
GGGCTGGACAGATAGGACATTTGGAGGACGTTCGACTGAGAACCCTTCAGGGTATTACACGAGTTCAAGCTTTATATAAA
GGCTATAAAGTCCGATGCAATTACAAACATCGACGAGCAACCACAATTTTCTTACAATCCTTGGTCAGAGGAGCCATCGC
AAGGAGGCGATTTGAGTTGTTGCGAGAGAGGCATCGTGCGGCTGTAACGATTCAAAAGTATGCAAGAAGGCAGGTTGCTT
GTCGTAGATATCGCTCAGTGAAGGAAAACATTGTAATCCTTCAATCAGTCGTTCGCATGTGGCTGTCTAGAAAGCAGTCG
CTTGCTCGGAAAAAGGAGGCCAATGAGGCGAAGCGAGCTATGGAATCTAAACTAAGTGAAGAAGCTAGAGTTGCCGAGAC
TGAGGCAAATGTAAAGGAAGATGCTGTTGATGATGGTCGCGAATGTATAAAGGAGGTGGCTAGTTCAACACGTGCAGAAT
CTGCTAAGGAATTAAAAGAGGCCACTATCAAGGTTGCACCGTCATACCTTCTTGAGTTGCAGCGACGGGCAGTCATGGCG
GAGAAGGCACTGAGGGAGAAAGAGGAAGACAATGCAATGCTGCGACAGAGGCTTCTGCACTACGAGGCACGGTGGATGGA
GTATGAAGCCAAGATGTCGTCCATGGAGGACATGTGGCAAAAGCAGATGTCTTCATTGCAACTTAGCTTAGCAGCTGCCA
AGAAGAGCTTAGCAACAGATGAATTTCTGCCGCAAACTCCTGGCAAGCACGACAATGGTCGCATCTCAGCCGGGAAGCAC
CGGCATAGTACTAAGCGGCAACTGCTGCCCTCCGATGACGAAGAGTTCGATTGGGACGATGTCGCAACGAACGGCATGAA
GAGCCCGGATGACTTCACCAACAAATATTTGGTAACTGGCTCTGGAAATGGCGCGTCACGTGGTGACGTCGAAGCTGCAC
GGTCTGTCGTCAGCCACCTGACGAGAGAGTACGACCACCGAACGCAGGTATTCAACGATGATGTTGATTTTCTCATCGAA
GTTAAATCTGGCTTGACTGAGGCTAACTTGAACCCTGAAGAAGAGTTGAGGAAGCTGAAGGTGAGGTTTGACACATGGAG
GAGAGACTTCAAAGCCAGATTGCGAGAGACCAGGCTTGTGCTGAACAAGCTCTGTTCTTTAGACTCGGCTGAAAAAGACG
GAGATAGGATGCTGTGTGCTTTAGACTCGCTGGAAAAAGAGGGAGATCGGACGCGCAAGAAGTGGTGGGGAAAGAAAACC
ACCTCCTCAAGAGCGCTCCAAGGTTAG