Microexon ID Pp_6:17814810-17814819:-
Species Physcomitrium patens
Coordinates 6:17814810..17814819
Microexon Cluster ID Unclassified
Size 10
Pp_6:17814810-17814819:- does not have available information here.
Microexon DNA seq ATCATGCCAG
Microexon Amino Acid seq IMPD
Microexon-tag DNA Seq ATGGCACAGGCGGGGAGAGCATTTACGGTGGCAAGTTTGATGATCATGCCAGATGAAAATTTCAAACTTGGGCATGATGGAGCAGGGATACTTTCGATGGCA
Microexon-tag Amino Acid seq MAQAGRAFTVASLMIMPDENFKLGHDGAGILSMA
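The fields above can be cross-checked with a short script. The following is a minimal sketch, not part of the database: it assumes the standard genetic code, that the microexon-tag DNA starts in reading frame 0, and that the coordinates are 1-based and inclusive. The names `CODON_TABLE`, `translate`, and `microexon_residues` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Values copied from the record above.
MICROEXON_DNA = "ATCATGCCAG"
TAG_DNA = (
    "ATGGCACAGGCGGGGAGAGCATTTACGGTGGCAAGTTTGATGATCATGCCAGATGAAAATTTC"
    "AAACTTGGGCATGATGGAGCAGGGATACTTTCGATGGCA"
)
COORD_START, COORD_END = 17814810, 17814819

# Standard genetic code in TCAG order (assumption: no organism-specific table).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def microexon_residues(tag_dna: str, microexon_dna: str) -> str:
    """Return the residues whose codons overlap the microexon within the tag."""
    start = tag_dna.find(microexon_dna)  # 0-based nucleotide offset
    if start < 0:
        raise ValueError("microexon not found in tag sequence")
    end = start + len(microexon_dna)     # exclusive end
    protein = translate(tag_dna)
    return protein[start // 3:(end - 1) // 3 + 1]


# Size implied by the inclusive coordinates.
size = COORD_END - COORD_START + 1
tag_protein = translate(TAG_DNA)
exon_protein = microexon_residues(TAG_DNA, MICROEXON_DNA)
```

With the values in this record, `size`, `tag_protein`, and `exon_protein` should agree with the Size, Microexon-tag Amino Acid seq, and Microexon Amino Acid seq fields. The last residue of the microexon translation comes from a codon that is only partly inside the microexon, which is why a 10-nucleotide exon maps to four residues.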
Transcript ID Pp.22358.13
Gene ID Pp.22358
Gene Name NA
Pfam domain motif Pro_isomerase
Motif E-value 5.6e-24
Motif start 15
Motif end 97
Protein seq >Pp.22358.13
MAQAGRAFTVASLMIMPDENFKLGHDGAGILSMANAGPNTNGSQFFLTFKSQPHLNGKHVVFGKIVEGLDILKKIEAVPS
SGQRNKPDVPIKIVDCGEVLRDKDNGSVPEKNVKRKIKKHKDSRDDFSSDDDREPVRSKRKPRKVVKDRRRKRRRYSSVS
SEDSSDSDSYSDNSSYTDSDSDSDSSSELSSSSEDERRRRKRKPVKKEKKRTHKRRREKRKDRKRKRRTRTRRRSKWSSD
SDSSDSDSETSSSENESDTGSDVSARGVKSKGSQKAQIRGKIKQTSATVENSSQKDDGDLVEKQVEVEIKALKEVQEFPK
RIKSQKAVSDISLSPPPSPRQSRSITRSRSPEKRRSLTRSPSPIRSHIQIKRSLSRNSSLSGAPRSPGSVKRKSRSMNMS
LSPDPRSVQPQRGSAQRQSLNRSSSKSRSPVCPPAEKARSPTPAELPRRSRSRSLVHSPSREGTPKRIRRGRGFSQQYSY
ARRYRTPDRSPPRTYGYGGRGDRDRHDRGRNYRYGGGYKGNRDWSPRRYRSPPRARSPPRYRRHVSRSRSRSPPARRDLS
RRDISRSISRSASPSGGRAPISNDLRNRLRPRRTSSPKDGPNNGAAGAAIRRSPSATSSISHSPSRSSSKSLSPGRKKLV
SYARDKPDRRRRRSTTPSVSVSPSRSRSSGENAGLVSYGRDVSPGPRSP*
CDS seq >Pp.22358.13
ATGGCACAGGCGGGGAGAGCATTTACGGTGGCAAGTTTGATGATCATGCCAGATGAAAATTTCAAACTTGGGCATGATGG
AGCAGGGATACTTTCGATGGCAAATGCCGGCCCAAACACAAACGGGTCACAGTTCTTTCTTACTTTTAAATCGCAGCCAC
ATTTGAACGGGAAGCATGTGGTGTTTGGTAAGATAGTTGAAGGACTGGATATCTTGAAGAAGATCGAGGCTGTCCCTTCA
TCTGGTCAAAGAAACAAGCCAGATGTGCCTATAAAGATTGTGGATTGTGGAGAGGTGCTCCGAGACAAAGACAATGGTTC
TGTTCCCGAGAAAAATGTCAAAAGGAAGATAAAAAAACACAAAGATAGTAGAGATGATTTCTCAAGCGATGACGATAGAG
AGCCTGTTCGTTCCAAGCGAAAACCCAGGAAGGTTGTGAAAGACCGAAGGAGGAAGAGGAGGCGCTACTCTTCGGTATCT
TCTGAAGACTCATCAGATTCAGATTCCTATTCAGATAATTCATCTTATACCGATTCCGACTCTGATTCAGACTCATCTTC
TGAATTAAGCTCTTCAAGCGAAGATGAGAGACGGAGAAGAAAGAGGAAACCTGTGAAGAAGGAGAAGAAAAGAACGCATA
AGAGAAGAAGAGAGAAACGCAAGGATAGGAAGCGAAAGAGGAGGACAAGAACAAGAAGAAGATCGAAGTGGAGTTCTGAT
AGTGATAGTAGTGATTCTGACAGTGAAACGTCGAGCTCAGAGAATGAGAGTGATACGGGTAGCGATGTGAGTGCACGTGG
TGTCAAGTCTAAAGGCTCTCAGAAAGCTCAAATTCGAGGGAAAATCAAGCAAACCTCAGCAACAGTGGAAAATTCTAGTC
AGAAGGATGATGGTGATCTTGTCGAGAAACAAGTAGAAGTTGAGATAAAAGCCTTGAAGGAAGTTCAGGAATTTCCAAAA
CGTATCAAATCCCAGAAGGCCGTTTCAGACATAAGTTTATCGCCTCCACCCAGTCCCCGTCAAAGCCGATCCATTACTCG
GAGCAGAAGCCCTGAGAAGAGGAGAAGCTTGACTCGTAGTCCAAGTCCCATCAGGAGCCACATTCAGATTAAAAGAAGCC
TGAGCAGGAATTCAAGCTTGAGTGGAGCCCCTCGCAGTCCTGGATCTGTCAAGAGGAAAAGTCGGAGTATGAACATGAGC
TTAAGTCCTGATCCACGAAGTGTTCAACCTCAACGAGGTAGTGCACAGCGGCAGAGTCTGAATAGGAGCTCGAGTAAGAG
TAGGAGCCCTGTTTGTCCCCCTGCTGAAAAGGCGCGAAGCCCCACACCTGCTGAACTTCCCCGTAGATCTCGTAGTCGGA
GTTTGGTGCACAGTCCCAGTCGAGAAGGTACTCCGAAGAGGATCAGAAGAGGTCGTGGATTTAGCCAACAGTACTCATAC
GCTCGTCGGTACCGCACACCAGACCGGTCTCCACCGCGGACATATGGCTATGGTGGGCGTGGCGATCGGGATAGGCACGA
TCGAGGTAGAAATTACAGGTATGGAGGAGGTTACAAAGGTAATCGCGATTGGAGCCCAAGACGATATAGAAGTCCTCCAA
GAGCCCGATCACCACCGAGATACAGGAGGCATGTCAGCAGGAGTCGAAGTCGTAGTCCTCCTGCCCGCCGTGATCTTAGC
CGAAGGGATATTTCTCGTAGCATCAGTCGTAGTGCATCCCCGTCTGGAGGTCGTGCTCCCATCAGCAATGACCTCCGTAA
TCGCCTTAGGCCCCGACGTACTAGCTCACCTAAAGATGGACCAAACAACGGAGCCGCTGGAGCAGCGATACGTCGTTCTC
CAAGTGCCACCTCTTCCATATCACACTCTCCAAGTCGATCGTCGTCCAAAAGTCTCTCACCTGGCAGGAAGAAATTAGTC
TCCTATGCTAGGGATAAACCTGATCGCCGACGACGCAGGTCAACGACACCATCAGTCAGTGTTTCTCCATCGCGGAGCAG
ATCTTCAGGTGAGAATGCAGGATTAGTGTCATATGGAAGGGATGTAAGTCCTGGTCCACGATCACCCTAA