Microexon ID Pp_2:6593477-6593487:+
Species Physcomitrium patens
Coordinates 2:6593477..6593487
Microexon Cluster ID Unclassified
Size 11
Pp_2:6593477-6593487:+ does not have available information here.
Transcript ID Pp3c2_9530V3.1
Protein ID Pp3c2_9530V3.1
Gene ID Pp3c2_9530
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp3c2_9530V3.1
MSFTSSRRIPNDSVCEVEEPRLDREYDSKVYMACILQGSRLGMSYFDTEANAMFVLETWEDGSEEFPSMQLIKYQAQPLV
IYTSTKMDDTFVTALQKKAVKEEDEPHEVKIVKSSHFSYQQARHRLAYLRVTGMEEDFNDRERMHLLNSMMNLDSEMQVR
AAGGLLGILQQEMLIDDMVVEEYRISPLQIGSICELSLYPFIFSPGNSILDRSSNGFLNVDASTHDALHIFQVDKHASSM
GIGKAKEGFSLFGLLNKVCNHRASSPLNRLFHLFSLINSGPTFNHRGLLDQCLTQPGRKLLRLWFLRPILDLSVLNDRLD
TRQVSAFRLSQDLVSALRNTLKGIKDIPRLIKKISSPSSVVTSKDWAVFTQQSVSALIHIRAMFEVAVSQRHQSQISLID
LQIVKKVMSQITEDLMYVNSIVAGVIDFDQRCGDSSKTMVAYGVCEELDELKNIFEGLPDFLNQITGVELNDLKLQVGSE
EAAICYIPQVGYLLKFESKMLDEYTLESRPDIQLAFEGGIEGGYFYHTSKTRELDEVIGDILHKILGTRIHFEDMEHAIL
RDLERRVLIYASALRHAADVAAEIDCLVALSTVAHDQKYVRPQLSENNVLMIKAGRHPLQECIVNTFVPNDTNIDQSSRI
NIISGPNYSGKSIYAKQVALIVFLAHIGSFVPAEAAVIGLTDRHATSRSLCVIDEFGKGTLVPDGVGLLCATLHHLTTQE
EKPKVLACTHFCEVFDENYLPKSQNISFYTMSVLEPDGHKNAVRGVDEIVFLYRLVPGRVVPSYGVPAEILKRCVEISEC
TKEGRPIERLGTRRTTAMDQIYKTIADLLQKFDCEKDDLNKFLSEVFSLLAS*
CDS seq >Pp3c2_9530V3.1
ATGAGTTTTACATCCTCGCGGCGAATTCCGAACGACAGTGTTTGTGAGGTTGAAGAACCGAGACTCGATAGAGAATATGA
CTCAAAGGTGTACATGGCTTGCATTCTACAAGGTAGCAGGTTAGGCATGTCATATTTTGACACAGAAGCAAACGCGATGT
TTGTTTTGGAGACTTGGGAGGATGGAAGTGAGGAGTTTCCGTCCATGCAGCTTATAAAATATCAGGCTCAGCCTCTTGTC
ATCTATACCAGTACGAAAATGGATGACACGTTTGTAACTGCACTTCAAAAGAAAGCAGTCAAGGAGGAAGATGAACCGCA
TGAAGTGAAGATAGTAAAAAGTTCTCATTTTAGTTACCAGCAAGCACGTCACAGATTGGCTTATCTGCGTGTCACTGGAA
TGGAAGAAGATTTCAACGACAGGGAAAGAATGCACCTGTTGAACTCTATGATGAATCTTGACAGTGAAATGCAAGTTCGG
GCTGCTGGTGGTCTTCTTGGTATTCTTCAGCAAGAAATGCTCATTGATGATATGGTAGTTGAAGAATATAGAATTTCTCC
TCTTCAAATTGGAAGCATCTGCGAGCTCTCTCTGTATCCTTTTATATTCAGCCCAGGCAATTCGATCCTTGACAGAAGTA
GTAATGGTTTTCTGAATGTTGATGCGTCTACGCATGATGCACTACATATTTTCCAGGTTGATAAACATGCCAGTTCGATG
GGCATTGGAAAAGCAAAAGAAGGGTTTTCCCTTTTCGGTCTCCTCAACAAGGTATGCAATCATCGTGCTTCTTCGCCCTT
GAATAGGCTCTTCCACTTATTTTCATTAATTAATTCTGGCCCAACGTTTAATCATCGTGGACTTCTTGATCAGTGTTTAA
CACAACCAGGGCGGAAACTTTTAAGGTTATGGTTTCTTAGACCAATTTTGGACTTGAGCGTCCTCAATGACCGGCTTGAT
ACGCGCCAGGTTTCAGCTTTCAGACTCTCCCAGGACTTGGTGTCTGCTTTGCGGAATACATTGAAAGGCATTAAAGATAT
TCCTAGGCTCATCAAGAAAATAAGTTCACCAAGTTCAGTAGTGACCTCAAAGGATTGGGCAGTTTTCACACAGCAGAGTG
TAAGCGCTCTTATCCACATTCGAGCCATGTTTGAGGTGGCAGTGTCCCAGAGACATCAAAGTCAAATTTCTTTGATCGAT
CTTCAAATTGTGAAGAAGGTCATGAGTCAAATTACCGAGGATCTCATGTACGTCAACAGCATAGTAGCAGGCGTCATTGA
CTTCGATCAGAGATGTGGTGACTCCTCCAAAACGATGGTTGCATATGGAGTTTGTGAAGAGCTTGATGAACTCAAGAATA
TTTTTGAGGGGCTCCCCGACTTCCTCAATCAGATCACTGGGGTTGAGTTGAATGATTTGAAACTTCAAGTCGGGTCCGAA
GAAGCTGCAATATGTTACATCCCTCAAGTTGGTTACCTGTTAAAATTCGAGTCAAAGATGCTTGATGAGTATACCCTTGA
GAGCCGTCCAGACATCCAGCTGGCATTCGAAGGAGGGATTGAAGGGGGTTACTTTTATCACACCTCGAAGACCAGGGAGC
TGGATGAAGTGATTGGAGACATTCTACACAAGATTTTAGGTACGAGGATTCATTTTGAAGATATGGAGCATGCAATTCTG
AGGGATCTGGAGCGCAGAGTATTAATTTATGCTTCGGCATTGCGCCATGCAGCGGATGTTGCAGCAGAAATCGATTGCTT
GGTTGCACTGAGTACTGTGGCTCACGACCAAAAGTACGTGCGTCCCCAGTTGTCCGAGAATAATGTTCTCATGATAAAAG
CTGGAAGGCATCCTCTCCAGGAGTGTATTGTCAACACGTTTGTGCCAAATGACACCAACATTGACCAAAGTAGTCGTATA
AACATTATATCTGGCCCAAACTACTCAGGCAAGAGCATTTACGCGAAGCAGGTTGCTCTCATCGTATTTCTAGCACATAT
TGGAAGCTTTGTTCCAGCAGAAGCGGCAGTAATTGGTCTTACTGACAGACATGCTACTTCGAGGTCCTTGTGCGTGATTG
ACGAGTTCGGCAAGGGCACCTTGGTCCCAGATGGTGTTGGTCTCCTTTGTGCAACCTTGCATCACCTTACAACCCAAGAA
GAAAAGCCTAAGGTTTTGGCATGTACACACTTCTGTGAAGTCTTCGATGAGAATTATCTACCCAAGTCGCAGAATATCTC
ATTTTACACCATGAGTGTCCTCGAGCCGGATGGACATAAAAATGCTGTCAGAGGCGTTGATGAGATTGTGTTCCTTTACA
GGCTCGTTCCTGGTCGTGTGGTCCCAAGTTATGGTGTTCCAGCCGAGATTTTGAAGAGATGCGTTGAAATCTCAGAATGT
ACCAAAGAAGGGAGACCAATTGAGCGCCTTGGTACAAGACGTACCACTGCCATGGATCAAATTTATAAGACGATTGCGGA
TCTTCTACAGAAGTTCGACTGCGAGAAGGATGATTTGAATAAGTTCCTATCTGAAGTTTTTTCCCTTCTCGCTAGTTAG
Microexon DNA seq TAAAATATCAG
Microexon Amino Acid seq IKYQ
Microexon-tag DNA Seq TTGGAGACTTGGGAGGATGGAAGTGAGGAGTTTCCGTCCATGCAGCTTATAAAATATCAGGCTCAGCCTCTTGTCATCTATACCAGTACGAAAATGGATGACACGTTT
Microexon-tag Amino Acid seq METWEDGSEEFPSMQLIKYQAQPLVIYTSTKMDDTF
Transcript ID Pp.11108.1
Gene ID Pp.11108
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Pp.11108.1
MSFTSSRRIPNDSVCEVEEPRLDREYDSKVYMACILQGSRLGMSYFDTEANAMFVLETWEDGSEEFPSMQLIKYQAQPLV
IYTSTKMDDTFVTALQKKVKEEDEPHEVKIVKSSHFSYQQARHRLAYLRVTGMEEDFNDRERMHLLNSMMNLDSEMQVRA
AGGLLGILQQEMLIDDMVVEEYRISPLQIGSICELSLNGFLNVDASTHDALHIFQVDKHASSMGIGKAKEGFSLFGLLNK
CLTQPGRKLLRLWFLRPILDLSVLNDRLDTVSAFRLSQDLVSALRNTLKGIKDIPRLIKKISSPSSVVTSKDWAVFTQSV
SALIHIRAMFEVAVSQRHQSQISLIDLQIVKKVMSQITEDLMYVNSIVAGVIDFDQRCGDSSKTMVAYGVCEELDELKNI
FEGLPDFLNQITGVELNDLKLQVGSEEAAICYIPQVGYLLKFESKMLDEYTLESRPDIQLAFEGGIEGGYFYHTSKTREL
DEVIGDILHKILDMEHAILRDLERRVLIYASALRHAADVAAEIDCLVALSTVAHDQKYVRPQLSENNVLMIKAGRHPLQE
CIVNTFVPNDTNIDQSSRINIISGPNYSGKSIYAKQVALIVFLAHIGSFVPAEAAVIGLTDRIFTRVSSKETMAVQQSTF
MIDLHQIAFMLRHATSRSLCVIDEFGKGTLVPDGVGLLCATLHHLTTQEEKPKVLACTHFCEVFDENYLPKSQNISFYTM
SVLEPDGHKNAVRGVDEIVFLYRLVPGRVVPSYGIHCAELAGVPAEILKRCVEISECTKEGRPIERLGTRRTTAMDQIYK
TIADLLQKFDCEKDDLNKFLSEVFSLLAS*
CDS seq >Pp.11108.1
ATGAGTTTTACATCCTCGCGGCGAATTCCGAACGACAGTGTTTGTGAGGTTGAAGAACCGAGACTCGATAGAGAATATGA
CTCAAAGGTGTACATGGCTTGCATTCTACAAGGTAGCAGGTTAGGCATGTCATATTTTGACACAGAAGCAAACGCGATGT
TTGTTTTGGAGACTTGGGAGGATGGAAGTGAGGAGTTTCCGTCCATGCAGCTTATAAAATATCAGGCTCAGCCTCTTGTC
ATCTATACCAGTACGAAAATGGATGACACGTTTGTAACTGCACTTCAAAAGAAAGTCAAGGAGGAAGATGAACCGCATGA
AGTGAAGATAGTAAAAAGTTCTCATTTTAGTTACCAGCAAGCACGTCACAGATTGGCTTATCTGCGTGTCACTGGAATGG
AAGAAGATTTCAACGACAGGGAAAGAATGCACCTGTTGAACTCTATGATGAATCTTGACAGTGAAATGCAAGTTCGGGCT
GCTGGTGGTCTTCTTGGTATTCTTCAGCAAGAAATGCTCATTGATGATATGGTAGTTGAAGAATATAGAATTTCTCCTCT
TCAAATTGGAAGCATCTGCGAGCTCTCTCTTAATGGTTTTCTGAATGTTGATGCGTCTACGCATGATGCACTACATATTT
TCCAGGTTGATAAACATGCCAGTTCGATGGGCATTGGAAAAGCAAAAGAAGGGTTTTCCCTTTTCGGTCTCCTCAACAAG
TGTTTAACACAACCAGGGCGGAAACTTTTAAGGTTATGGTTTCTTAGACCAATTTTGGACTTGAGCGTCCTCAATGACCG
GCTTGATACGGTTTCAGCTTTCAGACTCTCCCAGGACTTGGTGTCTGCTTTGCGGAATACATTGAAAGGCATTAAAGATA
TTCCTAGGCTCATCAAGAAAATAAGTTCACCAAGTTCAGTAGTGACCTCAAAGGATTGGGCAGTTTTCACACAGAGTGTA
AGCGCTCTTATCCACATTCGAGCCATGTTTGAGGTGGCAGTGTCCCAGAGACATCAAAGTCAAATTTCTTTGATCGATCT
TCAAATTGTGAAGAAGGTCATGAGTCAAATTACCGAGGATCTCATGTACGTCAACAGCATAGTAGCAGGCGTCATTGACT
TCGATCAGAGATGTGGTGACTCCTCCAAAACGATGGTTGCATATGGAGTTTGTGAAGAGCTTGATGAACTCAAGAATATT
TTTGAGGGGCTCCCCGACTTCCTCAATCAGATCACTGGGGTTGAGTTGAATGATTTGAAACTTCAAGTCGGGTCCGAAGA
AGCTGCAATATGTTACATCCCTCAAGTTGGTTACCTGTTAAAATTCGAGTCAAAGATGCTTGATGAGTATACCCTTGAGA
GCCGTCCAGACATCCAGCTGGCATTCGAAGGAGGGATTGAAGGGGGTTACTTTTATCACACCTCGAAGACCAGGGAGCTG
GATGAAGTGATTGGAGACATTCTACACAAGATTTTAGATATGGAGCATGCAATTCTGAGGGATCTGGAGCGCAGAGTATT
AATTTATGCTTCGGCATTGCGCCATGCAGCGGATGTTGCAGCAGAAATCGATTGCTTGGTTGCACTGAGTACTGTGGCTC
ACGACCAAAAGTACGTGCGTCCCCAGTTGTCCGAGAATAATGTTCTCATGATAAAAGCTGGAAGGCATCCTCTCCAGGAG
TGTATTGTCAACACGTTTGTGCCAAATGACACCAACATTGACCAAAGTAGTCGTATAAACATTATATCTGGCCCAAACTA
CTCAGGCAAGAGCATTTACGCGAAGCAGGTTGCTCTCATCGTATTTCTAGCACATATTGGAAGCTTTGTTCCAGCAGAAG
CGGCAGTAATTGGTCTTACTGACAGAATCTTCACAAGAGTCTCCAGCAAAGAGACAATGGCTGTTCAACAGTCAACGTTC
ATGATTGACTTGCATCAGATAGCCTTCATGCTGAGACATGCTACTTCGAGGTCCTTGTGCGTGATTGACGAGTTCGGCAA
GGGCACCTTGGTCCCAGATGGTGTTGGTCTCCTTTGTGCAACCTTGCATCACCTTACAACCCAAGAAGAAAAGCCTAAGG
TTTTGGCATGTACACACTTCTGTGAAGTCTTCGATGAGAATTATCTACCCAAGTCGCAGAATATCTCATTTTACACCATG
AGTGTCCTCGAGCCGGATGGACATAAAAATGCTGTCAGAGGCGTTGATGAGATTGTGTTCCTTTACAGGCTCGTTCCTGG
TCGTGTGGTCCCAAGTTATGGTATACATTGTGCGGAGCTTGCAGGTGTTCCAGCCGAGATTTTGAAGAGATGCGTTGAAA
TCTCAGAATGTACCAAAGAAGGGAGACCAATTGAGCGCCTTGGTACAAGACGTACCACTGCCATGGATCAAATTTATAAG
ACGATTGCGGATCTTCTACAGAAGTTCGACTGCGAGAAGGATGATTTGAATAAGTTCCTATCTGAAGTTTTTTCCCTTCT
CGCTAGTTAG