Microexon ID Pp_19:6027017-6027030:+
Species Physcomitrium patens
Coordinates 19:6027017..6027030
Microexon Cluster ID MEP38
Size 14
Phase 1
Pfam Domain Motif Myosin_head
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,14,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CAYTTYARTRMAACTGGRAARATATSTGGTGCYAADATTCAAACWTTTYTRCTTGARAAGTCWAGAGTWGTYCARYKTGCWGAWGGWGARAGRTCATAYCATATWTTT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq ATTTACTAGAAAAG
Microexon Amino Acid seq YLLEK
Microexon-tag DNA Seq CACTTTGATAGATCAGGGAGAATATGTGGTGCATATATTCACACTTATTTACTAGAAAAGTCACGAGTTGTCAAGCAGGCTGAAGGCGAAAGGTCTTATCATGTCTTT
Microexon-tag Amino Acid Seq HFDRSGRICGAYIHTYLLEKSRVVKQAEGERSYHVF
Microexon-tag spanning region6026761-6027347
Microexon-tag prediction score0.9305
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c19_9660V3.1x
Reference Transcript ID Pp3c19_9660V3.1
Gene ID Pp3c19_9660
Gene Name NA
Transcript ID Pp3c19_9660V3.1
Protein ID Pp3c19_9660V3.1
Gene ID Pp3c19_9660
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 2.2e-249
Motif start 284
Motif end 944
Protein seq >Pp3c19_9660V3.1
MFSSVGNDSRTPVRGVQDFTTAEYIEEDDMLSDETENDVVLSAPSMPTSRARLSSSMRAKKAPGACIDNLAIPSKGSGML
LKENIALGSPISNLITAVDPAGLASRCLSKPFDLASENSHAPLAEREIVANYGSLPNPQFVSSPTIMPTLFTPAKESNAQ
MGSIFDDRINVSGDNRGFSREQSTFSFLTAQEPPAPQTPGAENFALHPATTPSSGKKWRDDGTLRLKKNLRVWFLSSDYN
WIAGTVITIEDTEAVVRTPDQLMIKVNASSLQPANPEILEGVFDLIKLSYLNEPSVLHNLAFRYAKDKIYTRAGPVLIAV
NPFKKVPIYGPDSVQAYQKRTPESSHPHVYMTADTAFNAMMRDGINQSIIISGESGAGKTETAKIAMQYLAALGGGGGLE
DEILQTNPILEAFGNAKTLRNDNSSRFGKLIDIHFDRSGRICGAYIHTYLLEKSRVVKQAEGERSYHVFYQLCAGANRPL
QERLHLKSAKEYRYLSQSNCLSIDNVDDAEKFQNLRSAMNVVDISKEDQEQSFEMLSAVLWLGNITFSVVEYDNHVVVDE
NEAVKVAAALLHCECSDLIAALSTRRIRAGGDHIIQRLTLTQATDSRDALAKAIYANLFDWLVERINKSLEVGKKRTGRS
ISILDIYGFESFQKNSFEQLCINYANERLQQHFNRHLFKLEQEEYTSENIDWTRVDFEDNQECLDLIEKRPLGLISLLDE
ECMFPRSSDLTLANKWKEHLKGNVCFKCERDKAFRVCHYAGEVVYETNGFLEKNRDLLHADLLQLLASCDCALSQLFAAS
IGDGVQKLISPTRRSFNGSTESQKQSVATKFKGQLNKLMQRLESTEPHFIRCIKPNTSQLPDIFEQGLVLQQLRCCGVLE
VVRISRSGYPNRHSHDEFASRYGFLLPRSLSNQEDVLDICVSILHQFGIPPDMYQVGISKLFFRAGQIGHLEDVRLRTLQ
GVTRVQAVYKGYKARCIYKQRRMTTIILQCMVRGAIARKRFGRLLERHRAAVIVQKYARQQSACRKYQSIKEKIVKVQAV
IRMWLARKQFLAQRREAEERLATEAKLRVEAQAREEARIKEETKLKKERMIHEQHTFADDERDEEPELIKVVAAEELQEV
TIKVRPSYLLELQRRAVMAEKALREKEEENASMRQKILHYEARWMEYEAKMTSMEEMWQKQMSSLQLSLSAAKRSLATDD
YSMLQTPTKDHDSINDRFSAGKHQRTKRQLLPPPDDEEFDWDDATTNGTRSPDQFYNRYLLPGRECSTPRGDVDAARSVV
NHLVREFDHRTQVFNDDADFLIEVKSGLTEAPLDPEEELRKLRMRFDTWKKDFKTRLRETKLVLQRLCNVDSAEKEKTRK
KWWSKRTTP*
CDS seq >Pp3c19_9660V3.1
ATGTTTTCCTCGGTTGGAAACGATTCTAGGACCCCCGTGCGGGGAGTGCAAGATTTTACGACGGCTGAATACATTGAGGA
GGATGACATGTTGAGTGATGAGACGGAGAACGATGTGGTTCTGTCTGCACCTTCGATGCCCACATCCAGAGCGCGCTTAT
CATCATCAATGCGAGCTAAGAAAGCTCCGGGCGCGTGTATAGACAATCTCGCCATCCCGAGTAAAGGTTCCGGGATGCTG
TTGAAGGAAAATATCGCCCTCGGGTCCCCCATTTCAAATCTGATTACAGCAGTCGATCCTGCAGGCTTGGCTTCAAGATG
CCTGTCAAAACCATTCGACCTTGCATCTGAAAATAGTCATGCCCCTCTTGCGGAAAGAGAGATAGTTGCGAACTATGGAA
GCTTGCCCAATCCCCAGTTCGTATCGTCCCCTACCATCATGCCTACTCTTTTCACGCCGGCAAAGGAAAGCAATGCACAA
ATGGGCAGTATCTTTGATGACAGAATAAATGTCTCCGGCGATAACCGGGGATTTTCTCGCGAACAATCAACTTTCAGTTT
TCTAACTGCACAAGAACCTCCTGCACCTCAAACTCCTGGCGCGGAAAATTTTGCCCTTCATCCTGCCACTACTCCATCAT
CTGGCAAGAAATGGAGGGATGATGGTACATTGCGCTTGAAGAAGAATCTACGGGTGTGGTTTTTATCTTCAGATTACAAT
TGGATTGCTGGAACAGTAATTACTATCGAGGATACGGAGGCCGTGGTTCGCACTCCTGATCAACTGATGATCAAAGTAAA
TGCGTCAAGTCTACAGCCAGCAAACCCCGAAATATTAGAAGGAGTCTTCGATTTGATCAAACTCAGCTACCTGAACGAGC
CTTCTGTTCTGCATAATTTAGCCTTCCGGTATGCGAAGGACAAAATATATACTAGAGCAGGTCCTGTCTTGATTGCAGTC
AATCCTTTTAAGAAAGTTCCCATCTATGGTCCAGACAGTGTACAAGCTTATCAAAAGAGAACTCCAGAGAGCTCTCATCC
TCATGTATATATGACAGCGGACACTGCTTTCAATGCCATGATGCGAGATGGAATCAATCAGTCTATTATCATTAGTGGTG
AGAGCGGTGCAGGGAAGACAGAAACCGCCAAAATTGCCATGCAGTATCTAGCTGCACTTGGAGGCGGTGGTGGGTTGGAA
GATGAGATCTTGCAAACAAACCCTATTTTGGAAGCATTCGGCAACGCTAAAACGTTACGAAATGACAATTCCAGTCGCTT
TGGCAAGCTGATTGACATCCACTTTGATAGATCAGGGAGAATATGTGGTGCATATATTCACACTTATTTACTAGAAAAGT
CACGAGTTGTCAAGCAGGCTGAAGGCGAAAGGTCTTATCATGTCTTTTATCAGCTTTGCGCTGGAGCTAATAGACCTTTA
CAAGAACGGTTACATCTAAAATCTGCGAAAGAGTATCGGTATTTGAGCCAAAGCAACTGTTTGTCTATCGACAACGTTGA
CGATGCAGAGAAATTCCAAAATTTGAGGAGTGCCATGAATGTCGTGGATATCAGCAAGGAAGACCAAGAGCAATCTTTTG
AAATGCTTTCAGCTGTGCTGTGGCTTGGGAATATCACTTTCTCCGTTGTTGAATATGATAATCATGTCGTTGTTGATGAA
AATGAAGCGGTAAAAGTGGCCGCAGCATTGCTTCATTGCGAGTGCAGCGATCTTATTGCAGCCCTCTCCACCCGAAGGAT
CCGTGCAGGAGGTGATCATATTATACAGAGATTGACCCTAACTCAGGCAACCGATTCCAGAGACGCCCTTGCTAAAGCTA
TCTATGCGAACTTGTTTGACTGGTTGGTGGAACGTATTAACAAGTCCTTGGAGGTTGGCAAGAAGCGAACAGGAAGGTCA
ATCAGCATACTAGATATTTATGGTTTTGAATCTTTTCAGAAGAACAGTTTTGAGCAGTTATGCATAAACTATGCAAACGA
AAGGTTGCAACAACATTTCAATCGCCATCTGTTTAAGCTTGAGCAAGAGGAGTACACCTCAGAAAACATTGATTGGACCA
GAGTAGATTTTGAAGATAATCAAGAGTGTCTTGATCTTATTGAGAAGAGACCATTGGGATTGATTTCACTACTGGATGAA
GAGTGCATGTTTCCACGATCTTCAGATTTAACACTTGCAAATAAGTGGAAGGAGCATCTGAAAGGGAATGTTTGTTTCAA
ATGCGAACGAGATAAGGCATTCCGCGTTTGTCACTACGCCGGGGAGGTGGTGTATGAAACCAATGGGTTTCTCGAGAAAA
ACAGGGACCTGCTTCATGCAGATCTGTTGCAGTTGTTAGCTTCCTGTGATTGCGCATTATCACAGCTCTTTGCTGCCTCT
ATCGGAGATGGTGTGCAGAAGCTGATCAGTCCCACCCGCAGGAGTTTCAATGGCAGTACAGAATCACAAAAGCAGAGTGT
GGCTACAAAGTTCAAGGGTCAACTGAACAAGCTGATGCAAAGGCTGGAGAGCACTGAGCCCCACTTCATCAGGTGTATCA
AACCCAATACATCACAGCTTCCTGATATATTCGAGCAGGGGCTGGTATTACAGCAGCTTCGGTGTTGTGGAGTCCTTGAG
GTCGTTCGCATCTCACGATCTGGCTATCCAAACCGTCATTCGCATGATGAATTTGCAAGCCGGTATGGCTTTCTTCTTCC
AAGAAGCCTATCCAACCAGGAGGATGTGCTAGATATATGTGTTTCCATACTACATCAATTTGGCATTCCTCCAGATATGT
ATCAAGTTGGCATCTCAAAGCTATTCTTTCGTGCTGGACAGATAGGACATTTGGAGGACGTTAGATTGAGGACCCTTCAG
GGTGTCACTCGAGTTCAAGCAGTGTACAAGGGTTACAAGGCCCGGTGCATTTACAAACAACGACGCATGACCACAATCAT
TTTACAATGCATGGTGAGAGGGGCAATAGCAAGAAAGCGATTCGGGAGATTGCTAGAGAGGCATCGTGCAGCTGTAATTG
TACAAAAGTATGCACGACAGCAGTCTGCCTGTCGTAAATATCAATCAATAAAAGAAAAAATCGTCAAGGTTCAAGCAGTC
ATTCGTATGTGGTTGGCTAGGAAACAATTTCTTGCCCAAAGAAGGGAGGCTGAAGAGAGGCTGGCAACTGAGGCGAAGCT
AAGAGTCGAGGCCCAAGCTAGGGAAGAAGCTAGGATCAAAGAAGAAACTAAGTTAAAGAAAGAACGCATGATTCACGAAC
AACATACTTTTGCTGATGATGAGCGTGATGAAGAACCAGAACTCATTAAGGTGGTGGCAGCAGAGGAATTGCAAGAGGTC
ACCATCAAGGTGCGGCCCTCATACCTTCTGGAGCTGCAGCGGCGTGCAGTAATGGCAGAGAAAGCACTAAGAGAGAAGGA
GGAAGAAAATGCCTCGATGCGGCAGAAGATCCTGCATTACGAGGCACGGTGGATGGAGTATGAAGCCAAGATGACGTCCA
TGGAAGAGATGTGGCAGAAGCAGATGTCGTCCCTGCAGCTCAGCTTATCAGCTGCTAAGAGGAGCTTAGCAACGGATGAC
TATTCAATGCTGCAAACTCCAACGAAGGATCATGATTCTATCAATGATCGCTTTTCTGCCGGTAAGCATCAGCGCACCAA
ACGGCAACTCCTGCCCCCCCCTGACGACGAAGAATTTGATTGGGATGACGCCACCACTAACGGCACAAGGAGCCCGGACC
AGTTCTATAACAGGTACTTGCTACCAGGCCGTGAGTGTAGCACCCCACGTGGTGATGTGGACGCTGCTCGGTCTGTGGTC
AATCACCTAGTCAGAGAATTTGATCACCGAACACAGGTCTTCAATGACGACGCTGATTTTCTCATAGAGGTTAAGTCAGG
CCTTACTGAAGCGCCCCTGGACCCAGAAGAAGAACTCCGGAAGCTGAGAATGAGATTTGATACCTGGAAGAAAGACTTCA
AAACCAGGCTGCGAGAGACCAAGTTGGTCTTGCAGAGATTGTGCAACGTAGATTCGGCTGAGAAAGAGAAGACACGCAAG
AAGTGGTGGAGCAAAAGAACTACTCCTTGA
Microexon DNA seq ATTTACTAGAAAAG
Microexon Amino Acid seq YLLEK
Microexon-tag DNA Seq CACTTTGATAGATCAGGGAGAATATGTGGTGCATATATTCACACTTATTTACTAGAAAAGTCACGAGTTGTCAAGCAGGCTGAAGGCGAAAGGTCTTATCATGTCTTT
Microexon-tag Amino Acid seq HFDRSGRICGAYIHTYLLEKSRVVKQAEGERSYHVF
Transcript ID Pp3c19_9660V3.1
Gene ID Pp.10324
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 2.2e-249
Motif start 284
Motif end 944
Protein seq >Pp3c19_9660V3.1
MFSSVGNDSRTPVRGVQDFTTAEYIEEDDMLSDETENDVVLSAPSMPTSRARLSSSMRAKKAPGACIDNLAIPSKGSGML
LKENIALGSPISNLITAVDPAGLASRCLSKPFDLASENSHAPLAEREIVANYGSLPNPQFVSSPTIMPTLFTPAKESNAQ
MGSIFDDRINVSGDNRGFSREQSTFSFLTAQEPPAPQTPGAENFALHPATTPSSGKKWRDDGTLRLKKNLRVWFLSSDYN
WIAGTVITIEDTEAVVRTPDQLMIKVNASSLQPANPEILEGVFDLIKLSYLNEPSVLHNLAFRYAKDKIYTRAGPVLIAV
NPFKKVPIYGPDSVQAYQKRTPESSHPHVYMTADTAFNAMMRDGINQSIIISGESGAGKTETAKIAMQYLAALGGGGGLE
DEILQTNPILEAFGNAKTLRNDNSSRFGKLIDIHFDRSGRICGAYIHTYLLEKSRVVKQAEGERSYHVFYQLCAGANRPL
QERLHLKSAKEYRYLSQSNCLSIDNVDDAEKFQNLRSAMNVVDISKEDQEQSFEMLSAVLWLGNITFSVVEYDNHVVVDE
NEAVKVAAALLHCECSDLIAALSTRRIRAGGDHIIQRLTLTQATDSRDALAKAIYANLFDWLVERINKSLEVGKKRTGRS
ISILDIYGFESFQKNSFEQLCINYANERLQQHFNRHLFKLEQEEYTSENIDWTRVDFEDNQECLDLIEKRPLGLISLLDE
ECMFPRSSDLTLANKWKEHLKGNVCFKCERDKAFRVCHYAGEVVYETNGFLEKNRDLLHADLLQLLASCDCALSQLFAAS
IGDGVQKLISPTRRSFNGSTESQKQSVATKFKGQLNKLMQRLESTEPHFIRCIKPNTSQLPDIFEQGLVLQQLRCCGVLE
VVRISRSGYPNRHSHDEFASRYGFLLPRSLSNQEDVLDICVSILHQFGIPPDMYQVGISKLFFRAGQIGHLEDVRLRTLQ
GVTRVQAVYKGYKARCIYKQRRMTTIILQCMVRGAIARKRFGRLLERHRAAVIVQKYARQQSACRKYQSIKEKIVKVQAV
IRMWLARKQFLAQRREAEERLATEAKLRVEAQAREEARIKEETKLKKERMIHEQHTFADDERDEEPELIKVVAAEELQEV
TIKVRPSYLLELQRRAVMAEKALREKEEENASMRQKILHYEARWMEYEAKMTSMEEMWQKQMSSLQLSLSAAKRSLATDD
YSMLQTPTKDHDSINDRFSAGKHQRTKRQLLPPPDDEEFDWDDATTNGTRSPDQFYNRYLLPGRECSTPRGDVDAARSVV
NHLVREFDHRTQVFNDDADFLIEVKSGLTEAPLDPEEELRKLRMRFDTWKKDFKTRLRETKLVLQRLCNVDSAEKEKTRK
KWWSKRTTP*
CDS seq >Pp3c19_9660V3.1
ATGTTTTCCTCGGTTGGAAACGATTCTAGGACCCCCGTGCGGGGAGTGCAAGATTTTACGACGGCTGAATACATTGAGGA
GGATGACATGTTGAGTGATGAGACGGAGAACGATGTGGTTCTGTCTGCACCTTCGATGCCCACATCCAGAGCGCGCTTAT
CATCATCAATGCGAGCTAAGAAAGCTCCGGGCGCGTGTATAGACAATCTCGCCATCCCGAGTAAAGGTTCCGGGATGCTG
TTGAAGGAAAATATCGCCCTCGGGTCCCCCATTTCAAATCTGATTACAGCAGTCGATCCTGCAGGCTTGGCTTCAAGATG
CCTGTCAAAACCATTCGACCTTGCATCTGAAAATAGTCATGCCCCTCTTGCGGAAAGAGAGATAGTTGCGAACTATGGAA
GCTTGCCCAATCCCCAGTTCGTATCGTCCCCTACCATCATGCCTACTCTTTTCACGCCGGCAAAGGAAAGCAATGCACAA
ATGGGCAGTATCTTTGATGACAGAATAAATGTCTCCGGCGATAACCGGGGATTTTCTCGCGAACAATCAACTTTCAGTTT
TCTAACTGCACAAGAACCTCCTGCACCTCAAACTCCTGGCGCGGAAAATTTTGCCCTTCATCCTGCCACTACTCCATCAT
CTGGCAAGAAATGGAGGGATGATGGTACATTGCGCTTGAAGAAGAATCTACGGGTGTGGTTTTTATCTTCAGATTACAAT
TGGATTGCTGGAACAGTAATTACTATCGAGGATACGGAGGCCGTGGTTCGCACTCCTGATCAACTGATGATCAAAGTAAA
TGCGTCAAGTCTACAGCCAGCAAACCCCGAAATATTAGAAGGAGTCTTCGATTTGATCAAACTCAGCTACCTGAACGAGC
CTTCTGTTCTGCATAATTTAGCCTTCCGGTATGCGAAGGACAAAATATATACTAGAGCAGGTCCTGTCTTGATTGCAGTC
AATCCTTTTAAGAAAGTTCCCATCTATGGTCCAGACAGTGTACAAGCTTATCAAAAGAGAACTCCAGAGAGCTCTCATCC
TCATGTATATATGACAGCGGACACTGCTTTCAATGCCATGATGCGAGATGGAATCAATCAGTCTATTATCATTAGTGGTG
AGAGCGGTGCAGGGAAGACAGAAACCGCCAAAATTGCCATGCAGTATCTAGCTGCACTTGGAGGCGGTGGTGGGTTGGAA
GATGAGATCTTGCAAACAAACCCTATTTTGGAAGCATTCGGCAACGCTAAAACGTTACGAAATGACAATTCCAGTCGCTT
TGGCAAGCTGATTGACATCCACTTTGATAGATCAGGGAGAATATGTGGTGCATATATTCACACTTATTTACTAGAAAAGT
CACGAGTTGTCAAGCAGGCTGAAGGCGAAAGGTCTTATCATGTCTTTTATCAGCTTTGCGCTGGAGCTAATAGACCTTTA
CAAGAACGGTTACATCTAAAATCTGCGAAAGAGTATCGGTATTTGAGCCAAAGCAACTGTTTGTCTATCGACAACGTTGA
CGATGCAGAGAAATTCCAAAATTTGAGGAGTGCCATGAATGTCGTGGATATCAGCAAGGAAGACCAAGAGCAATCTTTTG
AAATGCTTTCAGCTGTGCTGTGGCTTGGGAATATCACTTTCTCCGTTGTTGAATATGATAATCATGTCGTTGTTGATGAA
AATGAAGCGGTAAAAGTGGCCGCAGCATTGCTTCATTGCGAGTGCAGCGATCTTATTGCAGCCCTCTCCACCCGAAGGAT
CCGTGCAGGAGGTGATCATATTATACAGAGATTGACCCTAACTCAGGCAACCGATTCCAGAGACGCCCTTGCTAAAGCTA
TCTATGCGAACTTGTTTGACTGGTTGGTGGAACGTATTAACAAGTCCTTGGAGGTTGGCAAGAAGCGAACAGGAAGGTCA
ATCAGCATACTAGATATTTATGGTTTTGAATCTTTTCAGAAGAACAGTTTTGAGCAGTTATGCATAAACTATGCAAACGA
AAGGTTGCAACAACATTTCAATCGCCATCTGTTTAAGCTTGAGCAAGAGGAGTACACCTCAGAAAACATTGATTGGACCA
GAGTAGATTTTGAAGATAATCAAGAGTGTCTTGATCTTATTGAGAAGAGACCATTGGGATTGATTTCACTACTGGATGAA
GAGTGCATGTTTCCACGATCTTCAGATTTAACACTTGCAAATAAGTGGAAGGAGCATCTGAAAGGGAATGTTTGTTTCAA
ATGCGAACGAGATAAGGCATTCCGCGTTTGTCACTACGCCGGGGAGGTGGTGTATGAAACCAATGGGTTTCTCGAGAAAA
ACAGGGACCTGCTTCATGCAGATCTGTTGCAGTTGTTAGCTTCCTGTGATTGCGCATTATCACAGCTCTTTGCTGCCTCT
ATCGGAGATGGTGTGCAGAAGCTGATCAGTCCCACCCGCAGGAGTTTCAATGGCAGTACAGAATCACAAAAGCAGAGTGT
GGCTACAAAGTTCAAGGGTCAACTGAACAAGCTGATGCAAAGGCTGGAGAGCACTGAGCCCCACTTCATCAGGTGTATCA
AACCCAATACATCACAGCTTCCTGATATATTCGAGCAGGGGCTGGTATTACAGCAGCTTCGGTGTTGTGGAGTCCTTGAG
GTCGTTCGCATCTCACGATCTGGCTATCCAAACCGTCATTCGCATGATGAATTTGCAAGCCGGTATGGCTTTCTTCTTCC
AAGAAGCCTATCCAACCAGGAGGATGTGCTAGATATATGTGTTTCCATACTACATCAATTTGGCATTCCTCCAGATATGT
ATCAAGTTGGCATCTCAAAGCTATTCTTTCGTGCTGGACAGATAGGACATTTGGAGGACGTTAGATTGAGGACCCTTCAG
GGTGTCACTCGAGTTCAAGCAGTGTACAAGGGTTACAAGGCCCGGTGCATTTACAAACAACGACGCATGACCACAATCAT
TTTACAATGCATGGTGAGAGGGGCAATAGCAAGAAAGCGATTCGGGAGATTGCTAGAGAGGCATCGTGCAGCTGTAATTG
TACAAAAGTATGCACGACAGCAGTCTGCCTGTCGTAAATATCAATCAATAAAAGAAAAAATCGTCAAGGTTCAAGCAGTC
ATTCGTATGTGGTTGGCTAGGAAACAATTTCTTGCCCAAAGAAGGGAGGCTGAAGAGAGGCTGGCAACTGAGGCGAAGCT
AAGAGTCGAGGCCCAAGCTAGGGAAGAAGCTAGGATCAAAGAAGAAACTAAGTTAAAGAAAGAACGCATGATTCACGAAC
AACATACTTTTGCTGATGATGAGCGTGATGAAGAACCAGAACTCATTAAGGTGGTGGCAGCAGAGGAATTGCAAGAGGTC
ACCATCAAGGTGCGGCCCTCATACCTTCTGGAGCTGCAGCGGCGTGCAGTAATGGCAGAGAAAGCACTAAGAGAGAAGGA
GGAAGAAAATGCCTCGATGCGGCAGAAGATCCTGCATTACGAGGCACGGTGGATGGAGTATGAAGCCAAGATGACGTCCA
TGGAAGAGATGTGGCAGAAGCAGATGTCGTCCCTGCAGCTCAGCTTATCAGCTGCTAAGAGGAGCTTAGCAACGGATGAC
TATTCAATGCTGCAAACTCCAACGAAGGATCATGATTCTATCAATGATCGCTTTTCTGCCGGTAAGCATCAGCGCACCAA
ACGGCAACTCCTGCCCCCCCCTGACGACGAAGAATTTGATTGGGATGACGCCACCACTAACGGCACAAGGAGCCCGGACC
AGTTCTATAACAGGTACTTGCTACCAGGCCGTGAGTGTAGCACCCCACGTGGTGATGTGGACGCTGCTCGGTCTGTGGTC
AATCACCTAGTCAGAGAATTTGATCACCGAACACAGGTCTTCAATGACGACGCTGATTTTCTCATAGAGGTTAAGTCAGG
CCTTACTGAAGCGCCCCTGGACCCAGAAGAAGAACTCCGGAAGCTGAGAATGAGATTTGATACCTGGAAGAAAGACTTCA
AAACCAGGCTGCGAGAGACCAAGTTGGTCTTGCAGAGATTGTGCAACGTAGATTCGGCTGAGAAAGAGAAGACACGCAAG
AAGTGGTGGAGCAAAAGAACTACTCCTTGA