Microexon ID Pp_3:22205074-22205081:-
Species Physcomitrium patens
Coordinates 3:22205074..22205081
Microexon Cluster ID Unclassified
Size 8
No additional information is available for Pp_3:22205074-22205081:-.
Transcript ID Pp3c3_32800V3.1
Protein ID Pp3c3_32800V3.1
Gene ID Pp3c3_32800
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c3_32800V3.1
MVAEGKEVQVQTLKIKLKDLGGGSAEGKSDREGRDPRRMDLERKTSGGKDYGDALRESSGNKELLGKVNMKGEEQLLQQG
GSGREVGQELVINSRMEGNPEGLRKKSPREGEVKAAAKDVPMDVKKEELREKEKEKKDDTLSERKDGVLPTLLHPHLDAN
AEKDYERDEKERETGDKEKEKMRERERSRDKEKEAVRSEKREKREKERDRDSHWREREERDDNLGDKAEKEEGKGVKVED
VEMDRKPVREVDERKLFEKEVRERMRERDREKERDEEGDGEKRKKRGRELDKDRDSLSNMDSERPEGDKEKDAVHGYGVQ
QRKRMLRPRGQSNPSNRDPRSRFRPKDNEGTQGRAESSFITYRIGEGMPELAKLWKEYESGERNSPDNGSGPTVEIRIPA
EHATTNNRQIRGSQLWGTDIYTNDSDIVAVLLHTGYYSPSPSPPPNSILELRATVRILESQNMYMSTLRNSIRSRAWGGG
SGCSYSVERCRLVKQGGGTVELEPSLTRTPPFVPTLAPAASERTVTTRAASSSAYRQQRFMQEVTIQYNLCNEPWAKYSM
SIVADRGLKNSQYTSARLKKGEVLYVETQTHRYELAYDGERMTSNGATTATSFSQGNSGTEKGKDKWSLGVCDKEPISQI
YNGELPHGNHPSNGQTGGHHCHHNSEPHEYYRWSKCKRPLSLSSMKRKGVPLPGEFVEVLEEGLGWEEIQWSPTSVWVRG
TEYILSRAQFFSFEKDDMEE*
CDS seq >Pp3c3_32800V3.1
ATGGTTGCAGAGGGGAAGGAAGTGCAAGTGCAGACACTGAAGATCAAGCTGAAGGACCTTGGTGGTGGGAGTGCAGAGGG
GAAGAGTGATCGAGAGGGGCGGGATCCAAGGCGTATGGATTTGGAGAGGAAGACAAGTGGCGGGAAGGATTATGGGGATG
CGCTAAGGGAGAGTAGTGGCAACAAGGAACTTCTTGGTAAGGTAAATATGAAGGGCGAGGAGCAGTTGCTGCAGCAGGGA
GGGAGTGGTCGGGAAGTAGGGCAAGAATTGGTGATTAACAGTCGGATGGAAGGAAATCCTGAAGGTTTGAGGAAAAAGAG
CCCTCGCGAGGGCGAAGTTAAAGCAGCCGCGAAGGATGTGCCGATGGATGTGAAGAAGGAAGAGCTGCGTGAGAAGGAGA
AAGAGAAGAAGGATGACACGCTTAGTGAGCGGAAAGATGGTGTGCTGCCGACATTACTGCATCCGCACTTGGATGCTAAT
GCTGAGAAAGATTATGAGAGAGATGAGAAGGAGAGAGAGACGGGCGACAAAGAGAAAGAGAAGATGCGGGAGCGGGAGCG
GTCGCGTGACAAGGAGAAAGAGGCAGTACGCTCGGAGAAACGGGAGAAGAGGGAGAAAGAGCGTGATCGGGATTCCCATT
GGCGAGAGCGAGAGGAGCGAGATGATAATCTCGGTGACAAAGCGGAGAAAGAGGAGGGAAAAGGGGTCAAGGTTGAGGAC
GTGGAGATGGACAGAAAGCCTGTGCGCGAGGTCGACGAGCGTAAATTGTTCGAAAAAGAAGTGCGCGAGCGTATGCGGGA
GAGAGATAGAGAGAAGGAGAGAGATGAAGAGGGGGATGGTGAGAAGCGTAAGAAACGGGGTCGCGAGTTGGACAAAGACC
GTGATTCGTTGAGCAATATGGACAGTGAACGTCCGGAAGGGGATAAAGAAAAGGATGCCGTTCATGGCTATGGAGTACAG
CAGCGTAAGAGGATGTTGCGTCCCAGGGGCCAGTCAAATCCTTCTAACCGAGACCCCCGGTCACGATTTCGGCCCAAAGA
CAATGAAGGGACTCAAGGTAGGGCAGAAAGTTCGTTCATTACTTACAGAATTGGAGAAGGGATGCCGGAACTCGCGAAAC
TTTGGAAGGAATACGAGTCTGGTGAACGAAACAGTCCAGACAATGGGTCGGGTCCTACCGTTGAAATTCGTATTCCTGCT
GAGCACGCCACTACTAATAACCGTCAGATACGGGGTAGTCAGCTATGGGGAACAGATATATACACAAATGACTCAGACAT
TGTTGCAGTCCTGTTACATACAGGATACTACTCACCTTCTCCGTCTCCGCCTCCAAATTCTATATTAGAGCTGCGAGCCA
CCGTTCGAATTCTTGAATCTCAAAATATGTACATGTCTACGCTACGAAATAGTATTCGATCGCGTGCCTGGGGAGGCGGA
AGTGGGTGTAGCTACAGCGTTGAGAGGTGCCGATTAGTGAAGCAAGGAGGGGGTACTGTAGAGCTGGAGCCATCTTTGAC
TCGAACTCCTCCATTTGTCCCAACGCTTGCGCCGGCTGCATCAGAGCGAACTGTCACCACGAGAGCTGCATCTTCTAGTG
CATATCGGCAGCAAAGATTTATGCAGGAAGTGACAATACAATACAATCTATGCAATGAACCCTGGGCGAAATACAGCATG
AGCATCGTGGCTGATCGTGGACTGAAGAATTCACAATACACGTCTGCTCGACTCAAAAAAGGGGAAGTATTATATGTAGA
AACCCAGACTCATAGGTATGAATTGGCGTATGATGGAGAACGTATGACGAGTAATGGAGCAACTACTGCCACCTCTTTCT
CTCAAGGCAACTCAGGGACGGAAAAAGGTAAAGATAAATGGAGTTTGGGAGTTTGTGACAAAGAACCTATTTCGCAAATT
TACAATGGTGAGTTACCGCATGGGAATCATCCAAGTAATGGACAAACAGGTGGCCATCACTGCCACCATAATAGCGAGCC
ACATGAGTACTACAGATGGTCTAAGTGTAAACGACCTCTGTCGCTATCGTCCATGAAACGAAAGGGTGTACCCTTGCCAG
GAGAATTTGTTGAGGTTTTGGAAGAAGGTCTGGGATGGGAGGAAATTCAGTGGTCTCCAACGAGTGTGTGGGTCCGGGGA
ACAGAGTACATCCTCAGTAGAGCACAATTCTTTTCTTTTGAGAAGGATGATATGGAGGAATAG
Microexon DNA seq GACTCAAG
Microexon Amino Acid seq GTQG
Microexon-tag DNA Seq CCTTCTAACCGAGACCCCCGGTCACGATTTCGGCCCAAAGACAATGAAGGGACTCAAGGTAGGGCAGAAAGTTCGTTCATTACTTACAGAATTGGAGAAGGGATGCCG
Microexon-tag Amino Acid seq PSNRDPRSRFRPKDNEGTQGRAESSFITYRIGEGMP
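The microexon fields above can be cross-checked against one another. The following is a minimal sketch, not the database's own method: it assumes the tag is translated in reading frame 0 with the standard genetic code, and that the microexon amino acid sequence lists the residues whose codons overlap the microexon. The names `translate`, `tag_dna`, `microexon_dna`, and `tag_protein` are hypothetical and introduced only for illustration; the sequences and coordinates are copied from this record.

```python
# Standard genetic code, built in TCAG codon order (assumed to apply here).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# Values copied from the record.
coord_start, coord_end = 22205074, 22205081
microexon_dna = "GACTCAAG"
tag_dna = ("CCTTCTAACCGAGACCCCCGGTCACGATTTCGGCCCAAAGACAATGAAGG"
           "GACTCAAG"
           "GTAGGGCAGAAAGTTCGTTCATTACTTACAGAATTGGAGAAGGGATGCCG")
tag_protein = "PSNRDPRSRFRPKDNEGTQGRAESSFITYRIGEGMP"

# Size follows from the inclusive coordinate range.
size = coord_end - coord_start + 1

# Locate the microexon inside the tag (0-based, end-exclusive).
start = tag_dna.find(microexon_dna)
end = start + len(microexon_dna)

# Residues whose codons overlap the microexon (frame 0 assumed).
first_codon = start // 3
last_codon = (end - 1) // 3
microexon_aa = translate(tag_dna)[first_codon:last_codon + 1]

print(size, start, microexon_aa)
```

In this record the microexon sits between two 50-nt flanks in the tag; whether a 50-nt flank is a general convention of the database is not stated here and should not be assumed for other entries.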
Transcript ID Pp.18945.1
Gene ID Pp.18945
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp.18945.1
MSLPVKREHEETAAVVEGASTRTGKPQHVGGGSGVIMESAFAFGGGGESRSTKVARHGDRGVDSGEKVELFNNSAHTNYQ
PATDGGGSGPIEDMVAEGKEVQVQTLKIKLKDLGGGSAEGKSDREGRDPRRMDLERKTSGGKDYGDALRESSGNKELLGK
VNMKGEEQLLQQGGSGREVGQELVINSRMEGNPEGLRKKSPREGEVKAAAKDVPMDVKKEELREKEKEKKDDTLSERKDG
VLPTLLHPHLDANAEKDYERDEKERETGDKEKEKMRERERSRDKEKEAVRSEKREKREKERDRDSHWREREERDDNLGDK
AEKEEGKGVKVEDVEMDRKPVREVDERKLFEKEVRERMRERDREKERDEEGDGEKRKKRGRELDKDRDSLSNMDSERPEG
DKEKDAVHGYGVQQRKRMLRPRGQSNPSNRDPRSRFRPKDNEGTQGRAESSFITYRIGEGMPELAKLWKEYESGERNSPD
NGSGPTVEIRIPAEHATTNNRQIRGSQLWGTDIYTNDSDIVAVLLHTGYYSPSPSPPPNSILELRATVRILESQNMYMST
LRNSIRSRAWGGGSGCSYSVERCRLVKQGGGTVELEPSLTRTPPFVPTLAPAASERTVTTRAASSSAYRQQRFMQEVTIQ
YNLCNEPWAKYSMSIVADRGLKNSQYTSARLKKGEVLYVETQTHRYELAYDGERMTSNGATTATSFSQGNSGTEKGKDKW
SLGVCDKEPISQIYNGELPHGNHPSNGQTGGHHCHHNSEPHEYYRWSKCKRPLSLSSMKRKGVPLPGEFVEVLEEGLGWE
EIQWSPTSVWVRGTEYILSRAQFFSFEKDDMEE*
CDS seq >Pp.18945.1
ATGAGTCTTCCGGTGAAGCGGGAGCACGAGGAGACGGCGGCAGTAGTCGAGGGGGCCAGCACTCGGACTGGGAAGCCGCA
ACATGTGGGAGGTGGGAGCGGAGTGATTATGGAGTCTGCGTTCGCGTTTGGAGGAGGAGGTGAGTCTCGTTCTACTAAAG
TAGCGCGCCATGGTGATCGTGGAGTGGACTCTGGAGAGAAGGTAGAGCTTTTCAACAACAGTGCGCATACGAATTATCAA
CCTGCGACTGATGGAGGTGGCTCCGGGCCGATTGAGGATATGGTTGCAGAGGGGAAGGAAGTGCAAGTGCAGACACTGAA
GATCAAGCTGAAGGACCTTGGTGGTGGGAGTGCAGAGGGGAAGAGTGATCGAGAGGGGCGGGATCCAAGGCGTATGGATT
TGGAGAGGAAGACAAGTGGCGGGAAGGATTATGGGGATGCGCTAAGGGAGAGTAGTGGCAACAAGGAACTTCTTGGTAAG
GTAAATATGAAGGGCGAGGAGCAGTTGCTGCAGCAGGGAGGGAGTGGTCGGGAAGTAGGGCAAGAATTGGTGATTAACAG
TCGGATGGAAGGAAATCCTGAAGGTTTGAGGAAAAAGAGCCCTCGCGAGGGCGAAGTTAAAGCAGCCGCGAAGGATGTGC
CGATGGATGTGAAGAAGGAAGAGCTGCGTGAGAAGGAGAAAGAGAAGAAGGATGACACGCTTAGTGAGCGGAAAGATGGT
GTGCTGCCGACATTACTGCATCCGCACTTGGATGCTAATGCTGAGAAAGATTATGAGAGAGATGAGAAGGAGAGAGAGAC
GGGCGACAAAGAGAAAGAGAAGATGCGGGAGCGGGAGCGGTCGCGTGACAAGGAGAAAGAGGCAGTACGCTCGGAGAAAC
GGGAGAAGAGGGAGAAAGAGCGTGATCGGGATTCCCATTGGCGAGAGCGAGAGGAGCGAGATGATAATCTCGGTGACAAA
GCGGAGAAAGAGGAGGGAAAAGGGGTCAAGGTTGAGGACGTGGAGATGGACAGAAAGCCTGTGCGCGAGGTCGACGAGCG
TAAATTGTTCGAAAAAGAAGTGCGCGAGCGTATGCGGGAGAGAGATAGAGAGAAGGAGAGAGATGAAGAGGGGGATGGTG
AGAAGCGTAAGAAACGGGGTCGCGAGTTGGACAAAGACCGTGATTCGTTGAGCAATATGGACAGTGAACGTCCGGAAGGG
GATAAAGAAAAGGATGCCGTTCATGGCTATGGAGTACAGCAGCGTAAGAGGATGTTGCGTCCCAGGGGCCAGTCAAATCC
TTCTAACCGAGACCCCCGGTCACGATTTCGGCCCAAAGACAATGAAGGGACTCAAGGTAGGGCAGAAAGTTCGTTCATTA
CTTACAGAATTGGAGAAGGGATGCCGGAACTCGCGAAACTTTGGAAGGAATACGAGTCTGGTGAACGAAACAGTCCAGAC
AATGGGTCGGGTCCTACCGTTGAAATTCGTATTCCTGCTGAGCACGCCACTACTAATAACCGTCAGATACGGGGTAGTCA
GCTATGGGGAACAGATATATACACAAATGACTCAGACATTGTTGCAGTCCTGTTACATACAGGATACTACTCACCTTCTC
CGTCTCCGCCTCCAAATTCTATATTAGAGCTGCGAGCCACCGTTCGAATTCTTGAATCTCAAAATATGTACATGTCTACG
CTACGAAATAGTATTCGATCGCGTGCCTGGGGAGGCGGAAGTGGGTGTAGCTACAGCGTTGAGAGGTGCCGATTAGTGAA
GCAAGGAGGGGGTACTGTAGAGCTGGAGCCATCTTTGACTCGAACTCCTCCATTTGTCCCAACGCTTGCGCCGGCTGCAT
CAGAGCGAACTGTCACCACGAGAGCTGCATCTTCTAGTGCATATCGGCAGCAAAGATTTATGCAGGAAGTGACAATACAA
TACAATCTATGCAATGAACCCTGGGCGAAATACAGCATGAGCATCGTGGCTGATCGTGGACTGAAGAATTCACAATACAC
GTCTGCTCGACTCAAAAAAGGGGAAGTATTATATGTAGAAACCCAGACTCATAGGTATGAATTGGCGTATGATGGAGAAC
GTATGACGAGTAATGGAGCAACTACTGCCACCTCTTTCTCTCAAGGCAACTCAGGGACGGAAAAAGGTAAAGATAAATGG
AGTTTGGGAGTTTGTGACAAAGAACCTATTTCGCAAATTTACAATGGTGAGTTACCGCATGGGAATCATCCAAGTAATGG
ACAAACAGGTGGCCATCACTGCCACCATAATAGCGAGCCACATGAGTACTACAGATGGTCTAAGTGTAAACGACCTCTGT
CGCTATCGTCCATGAAACGAAAGGGTGTACCCTTGCCAGGAGAATTTGTTGAGGTTTTGGAAGAAGGTCTGGGATGGGAG
GAAATTCAGTGGTCTCCAACGAGTGTGTGGGTCCGGGGAACAGAGTACATCCTCAGTAGAGCACAATTCTTTTCTTTTGA
GAAGGATGATATGGAGGAATAG