Microexon ID Pp_10:4721193-4721200:-
Species Physcomitrium patens
Coordinates 10:4721193..4721200
Microexon Cluster ID MEP16
Size 8
Phase 1
Pfam Domain Motif SNF2_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,8,51
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq BTWWMHTGGCTYATACGAHKRTATSAGMWYGGYRTMAAYGKMATTCTYGSWGATGARATGGGWCTKGGRAARACWYTKCAARCYATHTCHTTSYTGRGYTAYYTRMAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTGATGAG
Microexon Amino Acid seq GDE
Microexon-tag DNA Seq GTGTCATGGCTCATTGGCCGTTACGTGCTTGGCGTCAATGTCATTCTGGGTGATGAGATGGGATTAGGGAAGACGCTGCAGTCCATTGTGTTGCTTGCGTACTTGAAA
Microexon-tag Amino Acid Seq VSWLIGRYVLGVNVILGDEMGLGKTLQSIVLLAYLK
Microexon-tag spanning region4720842-4721593
Microexon-tag prediction score0.9131
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c10_6710V3.1x
Reference Transcript ID Pp3c10_6710V3.1
Gene ID Pp3c10_6710
Gene Name NA
Transcript ID Pp3c10_6710V3.1
Protein ID Pp3c10_6710V3.1
Gene ID Pp3c10_6710
Gene Name NA
Pfam domain motif SNF2_N
Motif E-value 1.7e-46
Motif start 60
Motif end 351
Protein seq >Pp3c10_6710V3.1
MEALKHIQEAVGVVTRFDGRGVCDAGDVLAQCKNWGITAELKPHQAEGVSWLIGRYVLGVNVILGDEMGLGKTLQSIVLL
AYLKFARSCKGPFLVICPLSVTEGWALEMARFAPKLRVLRYVGNKGEREELRRSISEHVNKQAPSARTNPDLPFDVLLST
YELAMADVSFLSRIRWSYAIIDEAQRLKNADSVLFKTLDEQYMLPRRLLLTGTPVQNNLSELWALLHFCMPLIFTNVEEF
LDAFGPAAAATHKDGDKIVDKEDSILTILRETVKVFMLRRTKAALVHSKALVLPPLTEVTIFSPMANIQKQVYVSVLKKE
SPKIIGDSPGTATSLQNIVIQLRKACSHPYLFDGIEPEPFQEGEHIVEASGKLKMLDVILKKLHASGHRVLIFAQMTRTL
DILQDYLEYRSYSYERLDGSVRAEERFNAVHSFSAGHSKCGGDAKSGSAFVFLLTTRAGGVGLNLIGADTVIFYEQDWNP
QADKQALQRAHRIGQISPVLAINLITRHTIDEVIMRRAKKKLELTCNVIGRDDADLETHGPASAVSTDLRSMIMFGINTL
HSSDATQDTEVGRSSELERIVEAALNQRGKAGPGLSNATEFGNVELVTLDAQDDGAENFYNYEGKDFAKPADKALLESWA
VHANASNGDEPSTRRTRKVKASYLSDDADKDNSRAQKRQKAEDKKHAKWKSLGYHTLAIDDPGAGLQPALSSDNVGSDVH
FVIGDCTVPVTTRPDEPCIILSFVDDSGVWGSGGMFNAVAKLSFKIPEAYEAAHEAEDLHLGDFHLVPLPGDRRFVGVAV
IQTYSRRRKVPRSDISISAFETCLRKVVTAAKARSASVHVPRIGSRGQGKNEWYAVERLLRKYATGYDVPFYVYYYRRP*
CDS seq >Pp3c10_6710V3.1
ATGGAGGCGTTGAAGCACATCCAAGAGGCTGTGGGAGTGGTGACGAGGTTTGATGGAAGAGGAGTGTGTGATGCTGGCGA
CGTGTTGGCCCAGTGCAAGAATTGGGGTATTACTGCGGAGCTGAAGCCCCATCAAGCTGAGGGGGTGTCATGGCTCATTG
GCCGTTACGTGCTTGGCGTCAATGTCATTCTGGGTGATGAGATGGGATTAGGGAAGACGCTGCAGTCCATTGTGTTGCTT
GCGTACTTGAAATTCGCACGCTCTTGCAAAGGCCCATTTTTGGTGATATGCCCATTGAGCGTAACAGAAGGATGGGCTCT
GGAGATGGCAAGATTTGCGCCAAAGTTGAGGGTATTGCGGTATGTTGGAAACAAAGGAGAGAGAGAAGAGTTACGCAGAA
GTATCAGCGAGCACGTTAATAAGCAGGCCCCGTCTGCTCGGACAAATCCAGATTTGCCGTTTGATGTACTTCTATCGACT
TATGAGCTTGCGATGGCTGATGTATCATTCTTAAGCCGTATACGCTGGAGTTATGCTATTATTGATGAAGCTCAACGCCT
TAAAAATGCTGACAGTGTGCTTTTCAAAACCCTTGACGAGCAGTATATGTTACCCCGACGCTTGCTGCTCACCGGAACCC
CAGTGCAAAACAACCTTTCAGAACTTTGGGCATTGCTACACTTTTGCATGCCTTTAATCTTTACCAATGTAGAAGAGTTT
CTTGATGCTTTCGGGCCAGCTGCCGCTGCTACACATAAAGATGGAGACAAAATTGTTGATAAGGAAGACTCAATTCTGAC
AATTCTACGGGAAACTGTCAAAGTTTTTATGCTGAGACGTACAAAAGCTGCTCTTGTTCACTCAAAGGCTCTTGTTCTTC
CCCCTCTTACTGAAGTTACCATATTTTCTCCTATGGCTAATATTCAGAAGCAAGTGTACGTCTCAGTATTGAAGAAGGAA
TCTCCAAAGATTATCGGTGACAGTCCTGGAACAGCAACGTCACTGCAGAATATTGTTATCCAGTTAAGAAAAGCTTGCAG
CCATCCCTATCTCTTTGATGGGATCGAACCAGAACCATTTCAAGAGGGTGAGCACATTGTTGAGGCTAGTGGAAAGCTGA
AGATGCTCGATGTGATCCTGAAAAAGCTGCATGCTAGTGGTCATCGTGTTCTTATTTTTGCTCAAATGACCCGCACGCTG
GACATACTTCAGGACTACCTGGAGTATCGGAGCTATTCGTATGAACGCCTTGATGGTTCAGTTCGTGCAGAGGAGCGGTT
TAACGCTGTTCACAGCTTTAGCGCAGGTCACTCGAAATGTGGTGGTGATGCCAAGTCTGGCAGCGCATTTGTGTTTCTGC
TCACTACCCGTGCAGGAGGTGTTGGCTTGAACCTCATCGGTGCAGACACTGTTATCTTTTACGAGCAAGACTGGAACCCT
CAAGCTGACAAGCAGGCGTTGCAACGAGCACATAGAATAGGGCAGATATCACCTGTATTGGCTATTAATCTTATTACACG
TCACACAATTGACGAGGTGATAATGAGGAGGGCAAAAAAGAAACTAGAGCTCACTTGCAATGTGATTGGGCGGGACGATG
CGGATTTGGAAACTCATGGTCCTGCAAGTGCCGTCTCGACAGACTTGAGATCCATGATAATGTTTGGGATTAATACGCTT
CACTCGTCCGATGCTACCCAAGATACTGAGGTTGGTCGTAGCTCAGAGCTGGAAAGAATTGTGGAAGCTGCGTTGAACCA
GCGTGGCAAAGCAGGCCCTGGCTTGTCAAATGCCACTGAATTCGGCAATGTCGAACTCGTTACATTAGATGCCCAAGACG
ATGGCGCTGAAAACTTTTACAATTATGAAGGCAAGGACTTTGCGAAGCCGGCTGATAAAGCACTGCTGGAATCTTGGGCT
GTGCATGCGAATGCAAGTAATGGAGATGAACCTAGCACTCGGCGAACACGCAAAGTAAAAGCTTCTTATTTGAGTGACGA
TGCAGACAAGGACAACTCCAGGGCTCAGAAGAGGCAGAAAGCAGAAGATAAGAAACACGCAAAATGGAAATCACTTGGAT
ATCATACATTAGCCATTGACGATCCTGGTGCAGGACTTCAGCCAGCTCTTTCAAGTGACAACGTGGGCAGCGATGTTCAT
TTTGTGATTGGCGATTGCACAGTTCCTGTGACAACGCGCCCAGACGAACCCTGCATCATATTATCATTCGTCGATGACTC
TGGAGTGTGGGGTAGTGGAGGAATGTTTAATGCTGTTGCAAAGCTTTCCTTCAAGATTCCAGAAGCTTATGAAGCAGCTC
ATGAAGCTGAGGATTTGCACCTTGGTGATTTTCATCTAGTTCCTTTACCTGGAGATAGGAGATTTGTTGGAGTAGCAGTA
ATTCAAACTTACAGCCGCAGAAGGAAGGTCCCTCGCAGTGACATCTCCATCTCAGCCTTCGAAACGTGCCTTCGCAAAGT
TGTTACCGCCGCCAAGGCGCGCTCTGCGTCTGTGCACGTGCCAAGAATTGGGTCGCGGGGGCAAGGGAAGAACGAGTGGT
ATGCGGTGGAGCGGCTGCTGCGAAAGTATGCGACGGGCTACGACGTGCCGTTCTATGTGTACTACTACAGAAGACCTTGA
Microexon DNA seq GTGATGAG
Microexon Amino Acid seq GDE
Microexon-tag DNA Seq GTGTCATGGCTCATTGGCCGTTACGTGCTTGGCGTCAATGTCATTCTGGGTGATGAGATGGGATTAGGGAAGACGCTGCAGTCCATTGTGTTGCTTGCGTACTTGAAA
Microexon-tag Amino Acid seq VSWLIGRYVLGVNVILGDEMGLGKTLQSIVLLAYLK
Transcript ID Pp3c10_6710V3.1
Gene ID Pp.1878
Gene Name NA
Pfam domain motif SNF2_N
Motif E-value 1.7e-46
Motif start 60
Motif end 351
Protein seq >Pp3c10_6710V3.1
MEALKHIQEAVGVVTRFDGRGVCDAGDVLAQCKNWGITAELKPHQAEGVSWLIGRYVLGVNVILGDEMGLGKTLQSIVLL
AYLKFARSCKGPFLVICPLSVTEGWALEMARFAPKLRVLRYVGNKGEREELRRSISEHVNKQAPSARTNPDLPFDVLLST
YELAMADVSFLSRIRWSYAIIDEAQRLKNADSVLFKTLDEQYMLPRRLLLTGTPVQNNLSELWALLHFCMPLIFTNVEEF
LDAFGPAAAATHKDGDKIVDKEDSILTILRETVKVFMLRRTKAALVHSKALVLPPLTEVTIFSPMANIQKQVYVSVLKKE
SPKIIGDSPGTATSLQNIVIQLRKACSHPYLFDGIEPEPFQEGEHIVEASGKLKMLDVILKKLHASGHRVLIFAQMTRTL
DILQDYLEYRSYSYERLDGSVRAEERFNAVHSFSAGHSKCGGDAKSGSAFVFLLTTRAGGVGLNLIGADTVIFYEQDWNP
QADKQALQRAHRIGQISPVLAINLITRHTIDEVIMRRAKKKLELTCNVIGRDDADLETHGPASAVSTDLRSMIMFGINTL
HSSDATQDTEVGRSSELERIVEAALNQRGKAGPGLSNATEFGNVELVTLDAQDDGAENFYNYEGKDFAKPADKALLESWA
VHANASNGDEPSTRRTRKVKASYLSDDADKDNSRAQKRQKAEDKKHAKWKSLGYHTLAIDDPGAGLQPALSSDNVGSDVH
FVIGDCTVPVTTRPDEPCIILSFVDDSGVWGSGGMFNAVAKLSFKIPEAYEAAHEAEDLHLGDFHLVPLPGDRRFVGVAV
IQTYSRRRKVPRSDISISAFETCLRKVVTAAKARSASVHVPRIGSRGQGKNEWYAVERLLRKYATGYDVPFYVYYYRRP*
CDS seq >Pp3c10_6710V3.1
ATGGAGGCGTTGAAGCACATCCAAGAGGCTGTGGGAGTGGTGACGAGGTTTGATGGAAGAGGAGTGTGTGATGCTGGCGA
CGTGTTGGCCCAGTGCAAGAATTGGGGTATTACTGCGGAGCTGAAGCCCCATCAAGCTGAGGGGGTGTCATGGCTCATTG
GCCGTTACGTGCTTGGCGTCAATGTCATTCTGGGTGATGAGATGGGATTAGGGAAGACGCTGCAGTCCATTGTGTTGCTT
GCGTACTTGAAATTCGCACGCTCTTGCAAAGGCCCATTTTTGGTGATATGCCCATTGAGCGTAACAGAAGGATGGGCTCT
GGAGATGGCAAGATTTGCGCCAAAGTTGAGGGTATTGCGGTATGTTGGAAACAAAGGAGAGAGAGAAGAGTTACGCAGAA
GTATCAGCGAGCACGTTAATAAGCAGGCCCCGTCTGCTCGGACAAATCCAGATTTGCCGTTTGATGTACTTCTATCGACT
TATGAGCTTGCGATGGCTGATGTATCATTCTTAAGCCGTATACGCTGGAGTTATGCTATTATTGATGAAGCTCAACGCCT
TAAAAATGCTGACAGTGTGCTTTTCAAAACCCTTGACGAGCAGTATATGTTACCCCGACGCTTGCTGCTCACCGGAACCC
CAGTGCAAAACAACCTTTCAGAACTTTGGGCATTGCTACACTTTTGCATGCCTTTAATCTTTACCAATGTAGAAGAGTTT
CTTGATGCTTTCGGGCCAGCTGCCGCTGCTACACATAAAGATGGAGACAAAATTGTTGATAAGGAAGACTCAATTCTGAC
AATTCTACGGGAAACTGTCAAAGTTTTTATGCTGAGACGTACAAAAGCTGCTCTTGTTCACTCAAAGGCTCTTGTTCTTC
CCCCTCTTACTGAAGTTACCATATTTTCTCCTATGGCTAATATTCAGAAGCAAGTGTACGTCTCAGTATTGAAGAAGGAA
TCTCCAAAGATTATCGGTGACAGTCCTGGAACAGCAACGTCACTGCAGAATATTGTTATCCAGTTAAGAAAAGCTTGCAG
CCATCCCTATCTCTTTGATGGGATCGAACCAGAACCATTTCAAGAGGGTGAGCACATTGTTGAGGCTAGTGGAAAGCTGA
AGATGCTCGATGTGATCCTGAAAAAGCTGCATGCTAGTGGTCATCGTGTTCTTATTTTTGCTCAAATGACCCGCACGCTG
GACATACTTCAGGACTACCTGGAGTATCGGAGCTATTCGTATGAACGCCTTGATGGTTCAGTTCGTGCAGAGGAGCGGTT
TAACGCTGTTCACAGCTTTAGCGCAGGTCACTCGAAATGTGGTGGTGATGCCAAGTCTGGCAGCGCATTTGTGTTTCTGC
TCACTACCCGTGCAGGAGGTGTTGGCTTGAACCTCATCGGTGCAGACACTGTTATCTTTTACGAGCAAGACTGGAACCCT
CAAGCTGACAAGCAGGCGTTGCAACGAGCACATAGAATAGGGCAGATATCACCTGTATTGGCTATTAATCTTATTACACG
TCACACAATTGACGAGGTGATAATGAGGAGGGCAAAAAAGAAACTAGAGCTCACTTGCAATGTGATTGGGCGGGACGATG
CGGATTTGGAAACTCATGGTCCTGCAAGTGCCGTCTCGACAGACTTGAGATCCATGATAATGTTTGGGATTAATACGCTT
CACTCGTCCGATGCTACCCAAGATACTGAGGTTGGTCGTAGCTCAGAGCTGGAAAGAATTGTGGAAGCTGCGTTGAACCA
GCGTGGCAAAGCAGGCCCTGGCTTGTCAAATGCCACTGAATTCGGCAATGTCGAACTCGTTACATTAGATGCCCAAGACG
ATGGCGCTGAAAACTTTTACAATTATGAAGGCAAGGACTTTGCGAAGCCGGCTGATAAAGCACTGCTGGAATCTTGGGCT
GTGCATGCGAATGCAAGTAATGGAGATGAACCTAGCACTCGGCGAACACGCAAAGTAAAAGCTTCTTATTTGAGTGACGA
TGCAGACAAGGACAACTCCAGGGCTCAGAAGAGGCAGAAAGCAGAAGATAAGAAACACGCAAAATGGAAATCACTTGGAT
ATCATACATTAGCCATTGACGATCCTGGTGCAGGACTTCAGCCAGCTCTTTCAAGTGACAACGTGGGCAGCGATGTTCAT
TTTGTGATTGGCGATTGCACAGTTCCTGTGACAACGCGCCCAGACGAACCCTGCATCATATTATCATTCGTCGATGACTC
TGGAGTGTGGGGTAGTGGAGGAATGTTTAATGCTGTTGCAAAGCTTTCCTTCAAGATTCCAGAAGCTTATGAAGCAGCTC
ATGAAGCTGAGGATTTGCACCTTGGTGATTTTCATCTAGTTCCTTTACCTGGAGATAGGAGATTTGTTGGAGTAGCAGTA
ATTCAAACTTACAGCCGCAGAAGGAAGGTCCCTCGCAGTGACATCTCCATCTCAGCCTTCGAAACGTGCCTTCGCAAAGT
TGTTACCGCCGCCAAGGCGCGCTCTGCGTCTGTGCACGTGCCAAGAATTGGGTCGCGGGGGCAAGGGAAGAACGAGTGGT
ATGCGGTGGAGCGGCTGCTGCGAAAGTATGCGACGGGCTACGACGTGCCGTTCTATGTGTACTACTACAGAAGACCTTGA