Microexon ID Pp_4:14906193-14906202:-
Species Physcomitrium patens
Coordinates 4:14906193..14906202
Microexon Cluster ID MEP23
Size 10
Phase 0
Pfam Domain Motif AAA
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,10,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TCVTTTGARYAYCTKCARGGMGATTAYTACATYGCTCCTSYYTTCMTGGATAAAGTTGYRKKYCACATTGTGAAGAACTAYMTTGCTMATCTTCTYAATRYYAAARTT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GATAAAGTGG
Microexon Amino Acid seq DKVV
Microexon-tag DNA Seq TCATTTGAGCATATTACAGGAGATTACTACATCGCCCCTGCGTTTATGGATAAAGTGGTGACGCACATTGTCAAGAACTATCTGGCTGCACAAATTGACGGCAAAGTT
Microexon-tag Amino Acid Seq SFEHITGDYYIAPAFMDKVVTHIVKNYLAAQIDGKV
Microexon-tag spanning region14905827-14906441
Microexon-tag prediction score0.9166
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c4_22100V3.1x
Reference Transcript ID Pp3c4_22100V3.1
Gene ID Pp3c4_22100
Gene Name NA
Transcript ID Pp3c4_22100V3.1
Protein ID Pp3c4_22100V3.1
Gene ID Pp3c4_22100
Gene Name NA
Pfam domain motif Torsin
Motif E-value 0.0059
Motif start 166
Motif end 212
Protein seq >Pp3c4_22100V3.1
MEVAMACTTMGPFKIEALAPKQSSFNSRISVVNLSNAGFFSIRNGNRHSQPHKFSGVRACLPVDSSSGDTQADEGLADAG
SKNETKKLSNQSSWEATDVLGNDYLYRLGKESDNLNITVGARTGMIDSLFTGDFLGKEADIVFKYRQKVTRSFEHITGDY
YIAPAFMDKVVTHIVKNYLAAQIDGKVPLILGVWGGKGQGKSFQTELIFKAMGIEPIIMSAGEMESEWAGEPGKLIRERY
RAAHLVINNQGKMSCLMINDLDAGIGRFENTQMTVNNQMVVGTLMNLADNPNRVSVGQAWREADIVNRVPIIVTGNDFST
IWAPLIRDGRMDKFYWQPTRDDLVKIVYQMYKKDGLSEADIGFIIDTFPNQALDFYGALRSRTYDKHVLEWVNEIGGAEQ
IGPKLLRRKKGDAPLPEFIAPEQNVDDLIKAGYELVEEQNMVNNMKLSDEYMKKQTGPGGGLSISAS*
CDS seq >Pp3c4_22100V3.1
ATGGAGGTCGCCATGGCCTGCACTACTATGGGACCTTTTAAAATAGAAGCACTGGCACCAAAACAATCTTCGTTCAACAG
CCGAATTTCAGTTGTGAATCTTTCCAATGCCGGTTTCTTCTCTATTCGCAATGGAAATCGCCATTCTCAACCCCATAAAT
TCTCTGGAGTCCGTGCGTGCCTGCCAGTTGATTCTTCTTCAGGCGACACCCAAGCGGACGAAGGCTTAGCGGACGCTGGA
TCCAAGAATGAGACCAAGAAGTTGTCGAATCAATCGTCGTGGGAAGCTACAGACGTGTTGGGCAATGACTACCTCTACCG
CCTCGGCAAGGAATCGGACAATCTCAATATCACTGTGGGGGCGCGTACAGGAATGATCGACAGCCTCTTCACGGGAGATT
TCTTGGGTAAAGAAGCGGACATCGTCTTCAAGTATCGCCAGAAAGTGACGCGTTCATTTGAGCATATTACAGGAGATTAC
TACATCGCCCCTGCGTTTATGGATAAAGTGGTGACGCACATTGTCAAGAACTATCTGGCTGCACAAATTGACGGCAAAGT
TCCTCTCATTTTGGGTGTCTGGGGAGGAAAGGGTCAAGGCAAGTCTTTTCAGACAGAGCTAATATTCAAGGCAATGGGTA
TTGAACCCATTATCATGTCTGCCGGCGAGATGGAATCAGAATGGGCAGGAGAACCTGGTAAGCTGATTCGAGAACGTTAC
CGAGCCGCACATCTTGTCATAAACAACCAGGGAAAAATGAGCTGTCTGATGATTAATGACCTCGATGCTGGAATAGGACG
ATTTGAAAATACGCAAATGACGGTCAATAACCAGATGGTGGTTGGCACACTTATGAATTTAGCTGACAACCCCAATCGTG
TGAGCGTTGGGCAGGCATGGCGAGAGGCTGATATTGTGAATCGTGTGCCAATCATTGTTACTGGGAATGACTTCTCTACA
ATTTGGGCACCTCTGATTCGAGATGGTCGAATGGATAAATTCTACTGGCAGCCAACGAGGGATGATTTAGTCAAAATTGT
GTACCAAATGTATAAGAAAGATGGTCTTAGTGAAGCTGATATTGGTTTCATCATCGACACATTCCCTAACCAAGCATTGG
ACTTTTATGGCGCTCTCAGGTCGAGGACATACGATAAGCACGTCCTGGAGTGGGTCAACGAGATTGGAGGAGCTGAACAA
ATAGGACCAAAGCTTCTTCGGCGTAAAAAAGGCGATGCACCTCTCCCAGAATTTATAGCACCAGAGCAAAACGTGGACGA
TCTTATCAAGGCTGGTTATGAGCTTGTGGAAGAGCAGAACATGGTGAACAACATGAAGCTCTCAGATGAGTACATGAAAA
AACAAACAGGTCCAGGGGGTGGTCTTTCTATTTCAGCTTCATAA
Microexon DNA seq GATAAAGTGG
Microexon Amino Acid seq DKVV
Microexon-tag DNA Seq TCATTTGAGCATATTACAGGAGATTACTACATCGCCCCTGCGTTTATGGATAAAGTGGTGACGCACATTGTCAAGAACTATCTGGCTGCACAAATTGACGGCAAAGTT
Microexon-tag Amino Acid seq SFEHITGDYYIAPAFMDKVVTHIVKNYLAAQIDGKV
Transcript ID Pp3c4_22100V3.3
Gene ID Pp.19998
Gene Name NA
Pfam domain motif Torsin
Motif E-value 0.0059
Motif start 166
Motif end 212
Protein seq >Pp3c4_22100V3.3
MEVAMACTTMGPFKIEALAPKQSSFNSRISVVNLSNAGFFSIRNGNRHSQPHKFSGVRACLPVDSSSGDTQADEGLADAG
SKNETKKLSNQSSWEATDVLGNDYLYRLGKESDNLNITVGARTGMIDSLFTGDFLGKEADIVFKYRQKVTRSFEHITGDY
YIAPAFMDKVVTHIVKNYLAAQIDGKVPLILGVWGGKGQGKSFQTELIFKAMGIEPIIMSAGEMESEWAGEPGKLIRERY
RAAHLVINNQGKMSCLMINDLDAGIGRFENTQMTVNNQMVVGTLMNLADNPNRVSVGQAWREADIVNRVPIIVTGNDFST
IWAPLIRDGRMDKFYWQPTRDDLVKIVYQMYKKDGLSEADIGFIIDTFPNQALDFYGALRSRTYDKHVLEWVNEIGGAEQ
IGPKLLRRKKGDAPLPEFIAPEQNVDDLIKAGYELVEEQNMVNNMKLSDEYMKKQTGPGGGLSISAS*
CDS seq >Pp3c4_22100V3.3
ATGGAGGTCGCCATGGCCTGCACTACTATGGGACCTTTTAAAATAGAAGCACTGGCACCAAAACAATCTTCGTTCAACAG
CCGAATTTCAGTTGTGAATCTTTCCAATGCCGGTTTCTTCTCTATTCGCAATGGAAATCGCCATTCTCAACCCCATAAAT
TCTCTGGAGTCCGTGCGTGCCTGCCAGTTGATTCTTCTTCAGGCGACACCCAAGCGGACGAAGGCTTAGCGGACGCTGGA
TCCAAGAATGAGACCAAGAAGTTGTCGAATCAATCGTCGTGGGAAGCTACAGACGTGTTGGGCAATGACTACCTCTACCG
CCTCGGCAAGGAATCGGACAATCTCAATATCACTGTGGGGGCGCGTACAGGAATGATCGACAGCCTCTTCACGGGAGATT
TCTTGGGTAAAGAAGCGGACATCGTCTTCAAGTATCGCCAGAAAGTGACGCGTTCATTTGAGCATATTACAGGAGATTAC
TACATCGCCCCTGCGTTTATGGATAAAGTGGTGACGCACATTGTCAAGAACTATCTGGCTGCACAAATTGACGGCAAAGT
TCCTCTCATTTTGGGTGTCTGGGGAGGAAAGGGTCAAGGCAAGTCTTTTCAGACAGAGCTAATATTCAAGGCAATGGGTA
TTGAACCCATTATCATGTCTGCCGGCGAGATGGAATCAGAATGGGCAGGAGAACCTGGTAAGCTGATTCGAGAACGTTAC
CGAGCCGCACATCTTGTCATAAACAACCAGGGAAAAATGAGCTGTCTGATGATTAATGACCTCGATGCTGGAATAGGACG
ATTTGAAAATACGCAAATGACGGTCAATAACCAGATGGTGGTTGGCACACTTATGAATTTAGCTGACAACCCCAATCGTG
TGAGCGTTGGGCAGGCATGGCGAGAGGCTGATATTGTGAATCGTGTGCCAATCATTGTTACTGGGAATGACTTCTCTACA
ATTTGGGCACCTCTGATTCGAGATGGTCGAATGGATAAATTCTACTGGCAGCCAACGAGGGATGATTTAGTCAAAATTGT
GTACCAAATGTATAAGAAAGATGGTCTTAGTGAAGCTGATATTGGTTTCATCATCGACACATTCCCTAACCAAGCATTGG
ACTTTTATGGCGCTCTCAGGTCGAGGACATACGATAAGCACGTCCTGGAGTGGGTCAACGAGATTGGAGGAGCTGAACAA
ATAGGACCAAAGCTTCTTCGGCGTAAAAAAGGCGATGCACCTCTCCCAGAATTTATAGCACCAGAGCAAAACGTGGACGA
TCTTATCAAGGCTGGTTATGAGCTTGTGGAAGAGCAGAACATGGTGAACAACATGAAGCTCTCAGATGAGTACATGAAAA
AACAAACAGGTCCAGGGGGTGGTCTTTCTATTTCAGCTTCATAA