Microexon ID Pp_3:21098970-21098977:+
Species Physcomitrium patens
Coordinates 3:21098970..21098977
Microexon Cluster ID Unclassified
Size 8
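The ID, coordinates and size fields above are mutually consistent if the coordinates are read as 1-based and inclusive, which is an assumption here, not something the record states. A minimal Python sketch of that check follows; the helper name `parse_microexon_id` and the ID layout (`<prefix>_<chromosome>:<start>-<end>:<strand>`) are inferred from this single record and are hypothetical.

```python
import re

def parse_microexon_id(microexon_id):
    """Split an ID such as 'Pp_3:21098970-21098977:+' into its parts.

    The layout is inferred from one record, so treat it as an assumption.
    """
    m = re.fullmatch(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])", microexon_id)
    if m is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    prefix, chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "prefix": prefix,
        "chromosome": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes 1-based inclusive coordinates: size = end - start + 1.
        "size": end - start + 1,
    }

info = parse_microexon_id("Pp_3:21098970-21098977:+")
print(info["chromosome"], info["start"], info["end"], info["strand"], info["size"])
```

Under that assumption the computed size is 21098977 - 21098970 + 1 = 8, matching the Size field.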
Transcript ID Pp3c3_30960V3.1
Protein ID Pp3c3_30960V3.1
Gene ID Pp3c3_30960
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c3_30960V3.1
MNTAVKILGSGSVEGKSDREGWDPMGMGLERKASEEKECMDAFKECSSSKELLGRVNVKGGERGESVGEAGQELLDNTCR
DGNFDGLSKKCPREGEGKVAMTEGPTDVKWEEQHEKAKEKEKKDEKRSEGKEDVLPSLQLMQAHLDVNSEKDNEKDEKEK
EKERGDKEKERARQRERSRDKEKEAVRSEKREKREKERDRDSHRREREQRDGILVVKGEKGEGKGVGVDNMDVDRKPVRE
VDERKLVEKEDHERVRDRDRDKEGDEAGEGDKRKKRGREQENDRDSLSNIGTEGAGGDKEKNADHGQGVHQRKRMLRPRA
QSNPANRDPRSRFRPEDSEGTQGRPESLFINYRPGEGIPELAKFRKEYESGERISSANGSGPTVEIRIPAEYATTNNRQI
RGSQLWGTDIYTDDSDIVAVLLHTGYYSSSPVLPPNSTLELRATIRILESQNMYISTLRNGLRSRAWGGGSGCSYTVERC
RIMKQGGFTVELKPCLTRTPPIAPTLAPAASERTVTTRAASSNAYRQQRFMQEVTIQYNLCNEPWAKYCMNNVADRGLKK
SQYTSARLKKGEVLYVETLVHRYELAYDGERATCNGAATATSLPQDNSGMEKGKEKWGAAERASAVVEKEPILRTHSVEK
SHGNHPTNGYHGNHNGEPYEYYRWSKCKRPLSLSCMKRKGVPLPEDFIEVLEEGLTWEEVQWSPTSVWIRGAEYVLSRAH
FFSFDKDDMED*
CDS seq >Pp3c3_30960V3.1
ATGAATACTGCTGTAAAGATCCTTGGAAGTGGGAGTGTAGAGGGGAAGAGCGATCGGGAGGGGTGGGATCCAATGGGGAT
GGGTTTGGAGAGGAAGGCAAGTGAGGAGAAGGAATGCATGGATGCTTTTAAGGAGTGCAGTAGCAGTAAGGAGCTTCTTG
GTAGGGTTAACGTGAAGGGGGGGGAGCGAGGAGAGAGTGTTGGGGAAGCAGGGCAAGAGTTGCTGGATAACACCTGTAGG
GACGGAAATTTTGACGGCTTGAGTAAGAAGTGCCCTCGCGAGGGTGAAGGTAAGGTGGCCATGACGGAGGGCCCTACGGA
TGTGAAGTGGGAGGAGCAGCATGAGAAAGCGAAAGAGAAAGAGAAGAAGGATGAGAAGCGTAGTGAGGGAAAAGAGGATG
TGCTGCCGTCACTGCAGTTGATGCAAGCGCACTTGGATGTAAATAGTGAGAAAGACAATGAGAAGGATGAGAAGGAGAAG
GAGAAGGAGAGAGGCGACAAGGAGAAAGAGAGAGCGCGGCAACGGGAGCGGTCGCGTGACAAGGAGAAAGAAGCTGTGCG
CTCGGAGAAACGGGAGAAGAGAGAGAAGGAGCGTGATCGGGATTCTCATCGGCGAGAGCGAGAACAACGAGATGGTATCC
TCGTAGTCAAAGGGGAGAAAGGGGAAGGAAAGGGAGTTGGGGTCGACAACATGGATGTGGATCGTAAGCCTGTGAGGGAA
GTCGATGAGCGAAAATTGGTCGAGAAGGAAGACCACGAGCGGGTGAGGGACCGAGATCGTGACAAGGAGGGAGATGAAGC
GGGCGAGGGTGACAAGCGCAAGAAACGGGGTCGCGAGCAGGAGAACGATCGTGATTCTTTGAGTAATATTGGCACTGAAG
GTGCAGGAGGGGATAAAGAAAAGAATGCTGATCATGGCCAAGGAGTACATCAGCGGAAGAGGATGTTGCGCCCCAGAGCG
CAGTCGAATCCTGCTAATCGAGATCCCCGATCACGATTTCGGCCCGAAGACAGTGAAGGGACTCAAGGCAGGCCAGAAAG
TTTATTCATAAATTACAGACCTGGAGAAGGAATTCCAGAACTTGCGAAATTTCGTAAGGAGTATGAGTCTGGAGAGCGAA
TCAGTTCAGCTAATGGTTCGGGACCTACTGTCGAAATACGCATTCCTGCTGAATACGCCACTACTAATAACCGCCAGATT
CGAGGCAGTCAACTATGGGGAACAGACATATATACAGATGACTCAGACATTGTTGCAGTCCTACTGCATACAGGATACTA
CTCATCTTCACCTGTTCTACCTCCAAATTCAACATTAGAACTGCGGGCCACCATTCGTATTCTTGAATCTCAAAATATGT
ATATTTCTACGTTAAGAAACGGTCTGCGGTCGCGTGCCTGGGGAGGTGGAAGTGGGTGCAGCTACACCGTCGAGAGGTGC
CGTATAATGAAGCAAGGAGGGTTCACTGTTGAGCTCAAGCCATGTTTGACTCGCACTCCTCCAATCGCTCCAACTCTCGC
ACCCGCAGCATCAGAGCGAACTGTTACGACGAGAGCCGCATCTTCTAATGCGTACCGGCAGCAAAGATTTATGCAGGAAG
TGACAATACAGTACAATTTATGCAATGAACCGTGGGCTAAGTATTGCATGAACAACGTGGCTGACCGTGGGTTGAAGAAA
TCTCAATATACGTCTGCCCGACTGAAAAAAGGGGAAGTGCTATATGTGGAGACCCTGGTTCATCGGTACGAATTAGCATA
TGATGGAGAGCGTGCGACGTGCAATGGTGCAGCTACTGCTACTTCTCTCCCTCAAGATAATTCAGGGATGGAAAAGGGTA
AGGAGAAATGGGGTGCGGCAGAACGAGCTTCGGCAGTTGTAGAAAAAGAACCAATTTTGCGAACTCATAGCGTCGAAAAG
TCACATGGAAACCATCCAACCAATGGGTACCACGGCAACCACAATGGTGAGCCATATGAGTATTATCGATGGTCGAAGTG
TAAACGACCTCTTTCACTTTCATGCATGAAAAGAAAAGGTGTACCCTTGCCAGAAGATTTTATTGAGGTCCTGGAAGAAG
GTTTGACATGGGAAGAAGTTCAGTGGTCTCCAACAAGTGTGTGGATTCGAGGAGCAGAATATGTTCTCAGTAGAGCACAT
TTTTTTTCTTTCGATAAGGATGATATGGAGGATTAG
Microexon DNA seq GACTCAAG
Microexon Amino Acid seq GTQG
Microexon-tag DNA Seq CCTGCTAATCGAGATCCCCGATCACGATTTCGGCCCGAAGACAGTGAAGGGACTCAAGGCAGGCCAGAAAGTTTATTCATAAATTACAGACCTGGAGAAGGAATTCCA
Microexon-tag Amino Acid seq PANRDPRSRFRPEDSEGTQGRPESLFINYRPGEGIP
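The relationship between the four microexon fields above can be checked directly. The sketch below assumes the standard genetic code and that the microexon-tag is in frame from its first base, both of which agree with the listed translation but are not stated in the record. It translates the tag, locates the 8-nt microexon inside it, and reports which codons the microexon overlaps. The function names are illustrative only.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def translate(dna):
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def locate_microexon(tag_dna, microexon_dna):
    """Return (upstream flank, downstream flank, overlapped residues)."""
    start = tag_dna.find(microexon_dna)  # 0-based offset of first occurrence
    if start < 0:
        raise ValueError("microexon not found in tag")
    end = start + len(microexon_dna)     # exclusive end
    protein = translate(tag_dna)
    # Every codon touched by the microexon, including partially covered ones.
    residues = protein[start // 3:(end - 1) // 3 + 1]
    return start, len(tag_dna) - end, residues

tag = ("CCTGCTAATCGAGATCCCCGATCACGATTTCGGCCCGAAGACAGTGAAGGGACTCAAGGCAGGCCAGAAAG"
       "TTTATTCATAAATTACAGACCTGGAGAAGGAATTCCA")
microexon = "GACTCAAG"

print(translate(tag))                    # PANRDPRSRFRPEDSEGTQGRPESLFINYRPGEGIP
print(locate_microexon(tag, microexon))  # (50, 50, 'GTQG')
```

For this record the tag is 108 nt long, with the microexon flanked by 50 nt on each side. The 8 nt overlap four codons, two of them only partially, which is why an 8-nt microexon is listed with the four-residue sequence GTQG.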
Transcript ID Pp3c3_30960V3.2
Gene ID Pp.18857
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c3_30960V3.2
MSLPAKWQKEETGAVVGKVLEGSGGGTGKQQHVGVGSSMIAESASAYGGTGELRPAKVARHGDRGVEAGEKAELVNNGSY
ANFKPGRDVGCSGSFEGAVADGNNLQEVQVQTMNTAVKILGSGSVEGKSDREGWDPMGMGLERKASEEKECMDAFKECSS
SKELLGRVNVKGGERGESVGEAGQELLDNTCRDGNFDGLSKKCPREGEGKVAMTEGPTDVKWEEQHEKAKEKEKKDEKRS
EGKEDVLPSLQLMQAHLDVNSEKDNEKDEKEKEKERGDKEKERARQRERSRDKEKEAVRSEKREKREKERDRDSHRRERE
QRDGILVVKGEKGEGKGVGVDNMDVDRKPVREVDERKLVEKEDHERVRDRDRDKEGDEAGEGDKRKKRGREQENDRDSLS
NIGTEGAGGDKEKNADHGQGVHQRKRMLRPRAQSNPANRDPRSRFRPEDSEGTQGRPESLFINYRPGEGIPELAKFRKEY
ESGERISSANGSGPTVEIRIPAEYATTNNRQIRGSQLWGTDIYTDDSDIVAVLLHTGYYSSSPVLPPNSTLELRATIRIL
ESQNMYISTLRNGLRSRAWGGGSGCSYTVERCRIMKQGGFTVELKPCLTRTPPIAPTLAPAASERTVTTRAASSNAYRQQ
RFMQEVTIQYNLCNEPWAKYCMNNVADRGLKKSQYTSARLKKGEVLYVETLVHRYELAYDGERATCNGAATATSLPQDNS
GMEKGKEKWGAAERASAVVEKEPILRTHSVEKSHGNHPTNGYHGNHNGEPYEYYRWSKCKRPLSLSCMKRKGVPLPEDFI
EVLEEGLTWEEVQWSPTSVWIRGAEYVLSRAHFFSFDKDDMED*
CDS seq >Pp3c3_30960V3.2
ATGAGTCTTCCGGCGAAGTGGCAGAAGGAGGAGACAGGGGCCGTTGTCGGGAAGGTGCTCGAAGGGAGTGGGGGTGGTAC
TGGGAAGCAGCAACATGTAGGTGTGGGGAGCAGCATGATAGCGGAGTCTGCTTCTGCATACGGAGGGACAGGGGAGTTGC
GTCCCGCTAAAGTGGCGCGACATGGTGATCGTGGAGTTGAGGCTGGAGAGAAGGCCGAGCTTGTCAACAACGGTTCCTAT
GCGAATTTCAAACCTGGGAGAGATGTTGGCTGCTCCGGGTCGTTTGAGGGTGCCGTTGCAGACGGGAATAATCTTCAGGA
AGTGCAAGTTCAGACAATGAATACTGCTGTAAAGATCCTTGGAAGTGGGAGTGTAGAGGGGAAGAGCGATCGGGAGGGGT
GGGATCCAATGGGGATGGGTTTGGAGAGGAAGGCAAGTGAGGAGAAGGAATGCATGGATGCTTTTAAGGAGTGCAGTAGC
AGTAAGGAGCTTCTTGGTAGGGTTAACGTGAAGGGGGGGGAGCGAGGAGAGAGTGTTGGGGAAGCAGGGCAAGAGTTGCT
GGATAACACCTGTAGGGACGGAAATTTTGACGGCTTGAGTAAGAAGTGCCCTCGCGAGGGTGAAGGTAAGGTGGCCATGA
CGGAGGGCCCTACGGATGTGAAGTGGGAGGAGCAGCATGAGAAAGCGAAAGAGAAAGAGAAGAAGGATGAGAAGCGTAGT
GAGGGAAAAGAGGATGTGCTGCCGTCACTGCAGTTGATGCAAGCGCACTTGGATGTAAATAGTGAGAAAGACAATGAGAA
GGATGAGAAGGAGAAGGAGAAGGAGAGAGGCGACAAGGAGAAAGAGAGAGCGCGGCAACGGGAGCGGTCGCGTGACAAGG
AGAAAGAAGCTGTGCGCTCGGAGAAACGGGAGAAGAGAGAGAAGGAGCGTGATCGGGATTCTCATCGGCGAGAGCGAGAA
CAACGAGATGGTATCCTCGTAGTCAAAGGGGAGAAAGGGGAAGGAAAGGGAGTTGGGGTCGACAACATGGATGTGGATCG
TAAGCCTGTGAGGGAAGTCGATGAGCGAAAATTGGTCGAGAAGGAAGACCACGAGCGGGTGAGGGACCGAGATCGTGACA
AGGAGGGAGATGAAGCGGGCGAGGGTGACAAGCGCAAGAAACGGGGTCGCGAGCAGGAGAACGATCGTGATTCTTTGAGT
AATATTGGCACTGAAGGTGCAGGAGGGGATAAAGAAAAGAATGCTGATCATGGCCAAGGAGTACATCAGCGGAAGAGGAT
GTTGCGCCCCAGAGCGCAGTCGAATCCTGCTAATCGAGATCCCCGATCACGATTTCGGCCCGAAGACAGTGAAGGGACTC
AAGGCAGGCCAGAAAGTTTATTCATAAATTACAGACCTGGAGAAGGAATTCCAGAACTTGCGAAATTTCGTAAGGAGTAT
GAGTCTGGAGAGCGAATCAGTTCAGCTAATGGTTCGGGACCTACTGTCGAAATACGCATTCCTGCTGAATACGCCACTAC
TAATAACCGCCAGATTCGAGGCAGTCAACTATGGGGAACAGACATATATACAGATGACTCAGACATTGTTGCAGTCCTAC
TGCATACAGGATACTACTCATCTTCACCTGTTCTACCTCCAAATTCAACATTAGAACTGCGGGCCACCATTCGTATTCTT
GAATCTCAAAATATGTATATTTCTACGTTAAGAAACGGTCTGCGGTCGCGTGCCTGGGGAGGTGGAAGTGGGTGCAGCTA
CACCGTCGAGAGGTGCCGTATAATGAAGCAAGGAGGGTTCACTGTTGAGCTCAAGCCATGTTTGACTCGCACTCCTCCAA
TCGCTCCAACTCTCGCACCCGCAGCATCAGAGCGAACTGTTACGACGAGAGCCGCATCTTCTAATGCGTACCGGCAGCAA
AGATTTATGCAGGAAGTGACAATACAGTACAATTTATGCAATGAACCGTGGGCTAAGTATTGCATGAACAACGTGGCTGA
CCGTGGGTTGAAGAAATCTCAATATACGTCTGCCCGACTGAAAAAAGGGGAAGTGCTATATGTGGAGACCCTGGTTCATC
GGTACGAATTAGCATATGATGGAGAGCGTGCGACGTGCAATGGTGCAGCTACTGCTACTTCTCTCCCTCAAGATAATTCA
GGGATGGAAAAGGGTAAGGAGAAATGGGGTGCGGCAGAACGAGCTTCGGCAGTTGTAGAAAAAGAACCAATTTTGCGAAC
TCATAGCGTCGAAAAGTCACATGGAAACCATCCAACCAATGGGTACCACGGCAACCACAATGGTGAGCCATATGAGTATT
ATCGATGGTCGAAGTGTAAACGACCTCTTTCACTTTCATGCATGAAAAGAAAAGGTGTACCCTTGCCAGAAGATTTTATT
GAGGTCCTGGAAGAAGGTTTGACATGGGAAGAAGTTCAGTGGTCTCCAACAAGTGTGTGGATTCGAGGAGCAGAATATGT
TCTCAGTAGAGCACATTTTTTTTCTTTCGATAAGGATGATATGGAGGATTAG