Microexon ID Pp_18:3000482-3000490:-
Species Physcomitrium patens
Coordinates 18:3000482..3000490
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTACCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAACAGCAAAGTGAAGCTTGGGCTTGCGAGAAGAGGGCGACAAGTTTACCTTGGAGCCTACACTGATGAAGAACAAGCTGCAAAAGCCCACGACTTAGCAGCT
Microexon-tag Amino Acid Seq WDNSKVKLGLARRGRQVYLGAYTDEEQAAKAHDLAA
Microexon-tag spanning region3000029-3000715
Microexon-tag prediction score0.8773
Overlapped with the annotated transcript (%) 100
New Transcript ID Pp3c18_3900V3.1x
Reference Transcript ID Pp3c18_3900V3.1
Gene ID Pp3c18_3900
Gene Name NA
Transcript ID Pp3c18_3900V3.1
Protein ID Pp3c18_3900V3.1
Gene ID Pp3c18_3900
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.1e-10
Motif start 979
Motif end 1038
Protein seq >Pp3c18_3900V3.1
MGVVEERLDRFVSRTVDMEGAVQDQSETETRNGMETVLQRIRATPWERNDTKNRIELGTVPAESLNSTHAYAVEERQASD
SITVQWATCVAPNLATFEERNTVGISGQYEKTAEIIQPSLCALGSGVYLKQYRSASEKDVNDNKLELYALDNLDASDLCV
SELADHQNKFIILPEWPNCYQARGQETFIAPPTDSIINSTTRRVAHQIECEPVESDIEVTEAHRLFLADELLSAGQGVLE
EFDYPPLPKQPSMGQSSSSRCTEKRGIDIDESEGISSQALRRKYLDSEHFEAHSRVLSTSSLQNPDSGQIDINSAFRAEV
IDEDEFEEHRSLEQQSEGRIENGIRFLSHYKMVKEHENGNEVKFGTTVQDIDHQNNSKANGDPCVLVGNYDNLGELEDEV
LEKTMGWFPVEESCNEGNDPTLSVRWILDEPPTAADLPHLESLLEISPSKFWSPESESGKRTSVHFGYLCRSESSQSPHS
IQYFLNDSPASSPEVTRTIFSGVPGTINSSKEPKHHCSQKRPAMHGAIQKIYSPCMDTAIPWTWRESMETNMKSPSLGSG
RKFAQVQICVPHVSPGFSEDKRGNLTSREQNVISTPTQDHTHQETGCLSNFTKSDGAVHRLMSQHGKNLVVDFLKVGPTT
QERIEAIRNDTELRDTFVPPSSKLWVQVENPTMPSDNMRDQFSARLSDKEGTNASSSLELQIPQGNRLTGTADAPLQPLS
SPVKPEVVLQVHKNCKRRRHAELLPLADISNHPNLAARLRDSLHEKLESQLEFPNTLLELLNDERPVSEQCPQNLAPGNA
TWSSPGVPRVSRKRSSRVTTPKDMRPRVSNMLIDDKMLQSATETPIKDHYTCKRGKKITKSAFKELQTFPRNVKLRLLMP
DPASCENDMIEMTNQEGRNVKSPILTKAVKPLRLIKTSPAKGYSWTRIRTTPKSNAVVRCETNIEESVTGEERHEVKGKS
LSKGSPDIETNKIAVLRSSKYRGVTRHRHTGRFEAHLWDNSKVKLGLARRGRQVYLGAYTDEEQAAKAHDLAALKYWGPG
VHTNFPPSLYEEELKTMKNLTKEDYILLLRRKSPGFTRGISKYRGVTRHHQEGRWEARIGRHSGAKYHYLGTYDTEEEAA
VAYDRAAVLHRGPNAVTNFDISNYIKRKTKNPKTEITVCPPQM*
CDS seq >Pp3c18_3900V3.1
ATGGGAGTCGTAGAAGAACGGCTGGACAGGTTTGTCTCCAGGACTGTGGACATGGAAGGGGCTGTACAGGACCAATCAGA
AACCGAAACGAGAAACGGCATGGAAACAGTACTTCAGCGGATACGAGCTACACCATGGGAGCGCAATGATACTAAAAATA
GGATCGAGCTTGGGACCGTGCCAGCGGAAAGTCTAAACTCAACCCATGCTTACGCTGTAGAAGAACGGCAAGCTAGTGAT
TCAATAACTGTACAATGGGCAACATGTGTTGCTCCAAACCTAGCAACATTTGAAGAGCGGAACACAGTGGGAATTAGCGG
CCAATATGAGAAGACTGCGGAAATAATTCAACCATCGTTGTGCGCATTAGGATCCGGGGTATACCTCAAGCAGTATCGTT
CAGCATCGGAAAAGGATGTAAATGACAACAAATTGGAATTATATGCCTTAGATAACTTAGACGCTAGTGACTTATGTGTC
AGTGAATTAGCTGACCATCAGAACAAGTTCATCATATTGCCGGAATGGCCAAATTGTTACCAAGCTCGTGGTCAAGAGAC
GTTCATCGCACCTCCAACAGACTCGATCATCAATTCAACCACCAGACGAGTTGCCCATCAGATTGAATGCGAGCCAGTTG
AAAGTGACATTGAGGTTACGGAGGCGCACAGACTGTTTCTTGCAGACGAATTACTGTCAGCAGGCCAGGGCGTGCTTGAA
GAATTCGACTACCCTCCACTTCCAAAACAACCCTCCATGGGCCAATCTAGTAGCAGTCGATGCACCGAAAAGAGAGGTAT
CGATATTGATGAGAGTGAAGGCATTAGTAGCCAGGCATTAAGAAGGAAGTATCTTGACAGCGAGCATTTCGAAGCACACT
CTAGGGTATTATCTACCAGTTCCCTGCAGAACCCTGACAGTGGCCAAATTGACATCAACAGCGCGTTTCGCGCAGAAGTA
ATAGATGAGGACGAGTTCGAAGAACACCGAAGTCTGGAACAACAATCTGAAGGCAGAATTGAGAATGGTATTAGGTTTCT
GAGCCACTATAAAATGGTGAAAGAGCACGAAAATGGTAATGAGGTGAAATTTGGGACAACTGTGCAGGATATTGACCACC
AAAACAATAGCAAAGCTAATGGAGATCCCTGTGTATTGGTTGGAAACTATGACAACTTGGGTGAGCTGGAGGACGAGGTC
CTGGAGAAGACAATGGGTTGGTTCCCAGTGGAGGAGTCGTGCAATGAGGGCAACGATCCTACCCTTTCAGTGCGTTGGAT
ACTTGACGAACCGCCAACTGCGGCTGATTTACCTCATTTGGAAAGTTTGCTGGAAATCTCACCCTCGAAATTTTGGAGTC
CGGAATCTGAGTCAGGAAAACGAACGTCTGTCCACTTCGGATACCTTTGCCGTTCTGAGTCATCGCAATCGCCGCACTCA
ATCCAATATTTTTTAAACGATTCACCTGCTTCATCTCCGGAGGTGACAAGAACGATCTTTAGTGGCGTCCCTGGTACAAT
CAATTCATCAAAGGAACCGAAGCACCACTGTTCGCAAAAACGCCCTGCTATGCACGGCGCAATACAGAAAATTTATTCAC
CATGCATGGATACAGCGATCCCTTGGACTTGGCGAGAATCCATGGAGACAAACATGAAGTCACCTTCCTTAGGTTCTGGT
AGAAAATTTGCTCAAGTCCAAATTTGTGTTCCACATGTGTCCCCTGGGTTTTCAGAGGACAAACGTGGGAATCTCACTAG
CAGAGAGCAGAATGTGATTTCAACTCCTACTCAAGATCATACCCATCAAGAAACAGGCTGCTTATCGAATTTCACCAAAT
CAGATGGTGCCGTTCACAGATTGATGTCTCAGCATGGGAAAAACCTAGTTGTAGACTTTCTGAAAGTAGGGCCAACGACT
CAGGAAAGGATTGAAGCCATAAGGAACGACACAGAACTCAGAGATACTTTCGTGCCACCGAGTTCAAAATTGTGGGTTCA
AGTTGAGAATCCGACAATGCCATCAGATAACATGCGGGATCAATTTTCTGCAAGGCTCTCTGACAAGGAAGGAACGAACG
CTTCATCGTCTTTGGAACTCCAGATTCCACAAGGAAACCGATTGACGGGTACTGCTGATGCTCCTCTGCAGCCTCTATCT
TCACCCGTAAAGCCTGAAGTCGTACTCCAAGTTCACAAAAATTGCAAACGGAGAAGACATGCGGAATTGCTCCCACTCGC
GGATATTTCAAACCATCCGAATTTGGCTGCGAGGTTAAGAGATTCGCTACATGAGAAGCTAGAGTCACAGCTTGAATTTC
CAAATACACTCTTGGAATTACTCAATGATGAGCGTCCAGTTTCCGAGCAGTGTCCACAAAATTTGGCTCCAGGTAACGCA
ACGTGGTCAAGCCCGGGCGTCCCTCGTGTTTCGCGAAAGCGAAGCAGTCGAGTCACAACGCCGAAAGACATGAGACCGAG
AGTGTCGAACATGCTTATCGACGATAAGATGTTACAGTCTGCGACAGAAACTCCGATAAAAGACCACTACACTTGTAAAA
GAGGAAAGAAGATCACCAAAAGCGCGTTTAAGGAGTTGCAAACGTTCCCAAGAAATGTCAAATTACGACTTCTCATGCCT
GACCCAGCTAGCTGTGAGAATGATATGATAGAAATGACGAACCAAGAAGGTAGAAATGTTAAGTCTCCGATCCTAACGAA
GGCAGTTAAACCTCTGCGGCTAATAAAGACTAGTCCTGCCAAAGGTTATTCTTGGACCAGGATTAGAACGACGCCTAAAA
GCAACGCAGTTGTCAGGTGTGAAACGAACATAGAAGAAAGTGTTACAGGAGAAGAGAGACATGAAGTTAAAGGAAAGAGT
CTGTCAAAAGGATCACCAGATATTGAAACGAACAAAATTGCCGTTCTCCGGAGCTCGAAGTACAGGGGTGTAACAAGGCA
TCGACACACAGGGCGATTTGAGGCGCACTTATGGGATAACAGCAAAGTGAAGCTTGGGCTTGCGAGAAGAGGGCGACAAG
TTTACCTTGGAGCCTACACTGATGAAGAACAAGCTGCAAAAGCCCACGACTTAGCAGCTTTGAAGTATTGGGGCCCTGGT
GTGCACACCAACTTCCCGCCCAGTCTGTATGAGGAGGAGTTGAAGACAATGAAAAACCTAACTAAAGAGGACTACATTCT
TCTTTTGCGCAGGAAAAGTCCCGGGTTCACTAGGGGAATATCAAAATATCGTGGAGTTACAAGGCATCACCAAGAGGGAC
GCTGGGAAGCCCGCATTGGAAGACACTCGGGCGCAAAGTATCATTACCTAGGCACCTATGACACAGAAGAAGAAGCAGCC
GTAGCTTATGACAGAGCTGCAGTTCTTCACCGTGGACCCAATGCTGTCACGAACTTTGACATATCAAATTACATCAAAAG
AAAAACCAAGAACCCCAAAACTGAAATCACTGTATGCCCACCTCAAATGTGA
Microexon DNA seq TTTACCTTG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAACAGCAAAGTGAAGCTTGGGCTTGCGAGAAGAGGGCGACAAGTTTACCTTGGAGCCTACACTGATGAAGAACAAGCTGCAAAAGCCCACGACTTAGCAGCT
Microexon-tag Amino Acid seq WDNSKVKLGLARRGRQVYLGAYTDEEQAAKAHDLAA
Transcript ID Pp3c18_3900V3.1
Gene ID Pp.9295
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.1e-10
Motif start 979
Motif end 1038
Protein seq >Pp3c18_3900V3.1
MGVVEERLDRFVSRTVDMEGAVQDQSETETRNGMETVLQRIRATPWERNDTKNRIELGTVPAESLNSTHAYAVEERQASD
SITVQWATCVAPNLATFEERNTVGISGQYEKTAEIIQPSLCALGSGVYLKQYRSASEKDVNDNKLELYALDNLDASDLCV
SELADHQNKFIILPEWPNCYQARGQETFIAPPTDSIINSTTRRVAHQIECEPVESDIEVTEAHRLFLADELLSAGQGVLE
EFDYPPLPKQPSMGQSSSSRCTEKRGIDIDESEGISSQALRRKYLDSEHFEAHSRVLSTSSLQNPDSGQIDINSAFRAEV
IDEDEFEEHRSLEQQSEGRIENGIRFLSHYKMVKEHENGNEVKFGTTVQDIDHQNNSKANGDPCVLVGNYDNLGELEDEV
LEKTMGWFPVEESCNEGNDPTLSVRWILDEPPTAADLPHLESLLEISPSKFWSPESESGKRTSVHFGYLCRSESSQSPHS
IQYFLNDSPASSPEVTRTIFSGVPGTINSSKEPKHHCSQKRPAMHGAIQKIYSPCMDTAIPWTWRESMETNMKSPSLGSG
RKFAQVQICVPHVSPGFSEDKRGNLTSREQNVISTPTQDHTHQETGCLSNFTKSDGAVHRLMSQHGKNLVVDFLKVGPTT
QERIEAIRNDTELRDTFVPPSSKLWVQVENPTMPSDNMRDQFSARLSDKEGTNASSSLELQIPQGNRLTGTADAPLQPLS
SPVKPEVVLQVHKNCKRRRHAELLPLADISNHPNLAARLRDSLHEKLESQLEFPNTLLELLNDERPVSEQCPQNLAPGNA
TWSSPGVPRVSRKRSSRVTTPKDMRPRVSNMLIDDKMLQSATETPIKDHYTCKRGKKITKSAFKELQTFPRNVKLRLLMP
DPASCENDMIEMTNQEGRNVKSPILTKAVKPLRLIKTSPAKGYSWTRIRTTPKSNAVVRCETNIEESVTGEERHEVKGKS
LSKGSPDIETNKIAVLRSSKYRGVTRHRHTGRFEAHLWDNSKVKLGLARRGRQVYLGAYTDEEQAAKAHDLAALKYWGPG
VHTNFPPSLYEEELKTMKNLTKEDYILLLRRKSPGFTRGISKYRGVTRHHQEGRWEARIGRHSGAKYHYLGTYDTEEEAA
VAYDRAAVLHRGPNAVTNFDISNYIKRKTKNPKTEITVCPPQM*
CDS seq >Pp3c18_3900V3.1
ATGGGAGTCGTAGAAGAACGGCTGGACAGGTTTGTCTCCAGGACTGTGGACATGGAAGGGGCTGTACAGGACCAATCAGA
AACCGAAACGAGAAACGGCATGGAAACAGTACTTCAGCGGATACGAGCTACACCATGGGAGCGCAATGATACTAAAAATA
GGATCGAGCTTGGGACCGTGCCAGCGGAAAGTCTAAACTCAACCCATGCTTACGCTGTAGAAGAACGGCAAGCTAGTGAT
TCAATAACTGTACAATGGGCAACATGTGTTGCTCCAAACCTAGCAACATTTGAAGAGCGGAACACAGTGGGAATTAGCGG
CCAATATGAGAAGACTGCGGAAATAATTCAACCATCGTTGTGCGCATTAGGATCCGGGGTATACCTCAAGCAGTATCGTT
CAGCATCGGAAAAGGATGTAAATGACAACAAATTGGAATTATATGCCTTAGATAACTTAGACGCTAGTGACTTATGTGTC
AGTGAATTAGCTGACCATCAGAACAAGTTCATCATATTGCCGGAATGGCCAAATTGTTACCAAGCTCGTGGTCAAGAGAC
GTTCATCGCACCTCCAACAGACTCGATCATCAATTCAACCACCAGACGAGTTGCCCATCAGATTGAATGCGAGCCAGTTG
AAAGTGACATTGAGGTTACGGAGGCGCACAGACTGTTTCTTGCAGACGAATTACTGTCAGCAGGCCAGGGCGTGCTTGAA
GAATTCGACTACCCTCCACTTCCAAAACAACCCTCCATGGGCCAATCTAGTAGCAGTCGATGCACCGAAAAGAGAGGTAT
CGATATTGATGAGAGTGAAGGCATTAGTAGCCAGGCATTAAGAAGGAAGTATCTTGACAGCGAGCATTTCGAAGCACACT
CTAGGGTATTATCTACCAGTTCCCTGCAGAACCCTGACAGTGGCCAAATTGACATCAACAGCGCGTTTCGCGCAGAAGTA
ATAGATGAGGACGAGTTCGAAGAACACCGAAGTCTGGAACAACAATCTGAAGGCAGAATTGAGAATGGTATTAGGTTTCT
GAGCCACTATAAAATGGTGAAAGAGCACGAAAATGGTAATGAGGTGAAATTTGGGACAACTGTGCAGGATATTGACCACC
AAAACAATAGCAAAGCTAATGGAGATCCCTGTGTATTGGTTGGAAACTATGACAACTTGGGTGAGCTGGAGGACGAGGTC
CTGGAGAAGACAATGGGTTGGTTCCCAGTGGAGGAGTCGTGCAATGAGGGCAACGATCCTACCCTTTCAGTGCGTTGGAT
ACTTGACGAACCGCCAACTGCGGCTGATTTACCTCATTTGGAAAGTTTGCTGGAAATCTCACCCTCGAAATTTTGGAGTC
CGGAATCTGAGTCAGGAAAACGAACGTCTGTCCACTTCGGATACCTTTGCCGTTCTGAGTCATCGCAATCGCCGCACTCA
ATCCAATATTTTTTAAACGATTCACCTGCTTCATCTCCGGAGGTGACAAGAACGATCTTTAGTGGCGTCCCTGGTACAAT
CAATTCATCAAAGGAACCGAAGCACCACTGTTCGCAAAAACGCCCTGCTATGCACGGCGCAATACAGAAAATTTATTCAC
CATGCATGGATACAGCGATCCCTTGGACTTGGCGAGAATCCATGGAGACAAACATGAAGTCACCTTCCTTAGGTTCTGGT
AGAAAATTTGCTCAAGTCCAAATTTGTGTTCCACATGTGTCCCCTGGGTTTTCAGAGGACAAACGTGGGAATCTCACTAG
CAGAGAGCAGAATGTGATTTCAACTCCTACTCAAGATCATACCCATCAAGAAACAGGCTGCTTATCGAATTTCACCAAAT
CAGATGGTGCCGTTCACAGATTGATGTCTCAGCATGGGAAAAACCTAGTTGTAGACTTTCTGAAAGTAGGGCCAACGACT
CAGGAAAGGATTGAAGCCATAAGGAACGACACAGAACTCAGAGATACTTTCGTGCCACCGAGTTCAAAATTGTGGGTTCA
AGTTGAGAATCCGACAATGCCATCAGATAACATGCGGGATCAATTTTCTGCAAGGCTCTCTGACAAGGAAGGAACGAACG
CTTCATCGTCTTTGGAACTCCAGATTCCACAAGGAAACCGATTGACGGGTACTGCTGATGCTCCTCTGCAGCCTCTATCT
TCACCCGTAAAGCCTGAAGTCGTACTCCAAGTTCACAAAAATTGCAAACGGAGAAGACATGCGGAATTGCTCCCACTCGC
GGATATTTCAAACCATCCGAATTTGGCTGCGAGGTTAAGAGATTCGCTACATGAGAAGCTAGAGTCACAGCTTGAATTTC
CAAATACACTCTTGGAATTACTCAATGATGAGCGTCCAGTTTCCGAGCAGTGTCCACAAAATTTGGCTCCAGGTAACGCA
ACGTGGTCAAGCCCGGGCGTCCCTCGTGTTTCGCGAAAGCGAAGCAGTCGAGTCACAACGCCGAAAGACATGAGACCGAG
AGTGTCGAACATGCTTATCGACGATAAGATGTTACAGTCTGCGACAGAAACTCCGATAAAAGACCACTACACTTGTAAAA
GAGGAAAGAAGATCACCAAAAGCGCGTTTAAGGAGTTGCAAACGTTCCCAAGAAATGTCAAATTACGACTTCTCATGCCT
GACCCAGCTAGCTGTGAGAATGATATGATAGAAATGACGAACCAAGAAGGTAGAAATGTTAAGTCTCCGATCCTAACGAA
GGCAGTTAAACCTCTGCGGCTAATAAAGACTAGTCCTGCCAAAGGTTATTCTTGGACCAGGATTAGAACGACGCCTAAAA
GCAACGCAGTTGTCAGGTGTGAAACGAACATAGAAGAAAGTGTTACAGGAGAAGAGAGACATGAAGTTAAAGGAAAGAGT
CTGTCAAAAGGATCACCAGATATTGAAACGAACAAAATTGCCGTTCTCCGGAGCTCGAAGTACAGGGGTGTAACAAGGCA
TCGACACACAGGGCGATTTGAGGCGCACTTATGGGATAACAGCAAAGTGAAGCTTGGGCTTGCGAGAAGAGGGCGACAAG
TTTACCTTGGAGCCTACACTGATGAAGAACAAGCTGCAAAAGCCCACGACTTAGCAGCTTTGAAGTATTGGGGCCCTGGT
GTGCACACCAACTTCCCGCCCAGTCTGTATGAGGAGGAGTTGAAGACAATGAAAAACCTAACTAAAGAGGACTACATTCT
TCTTTTGCGCAGGAAAAGTCCCGGGTTCACTAGGGGAATATCAAAATATCGTGGAGTTACAAGGCATCACCAAGAGGGAC
GCTGGGAAGCCCGCATTGGAAGACACTCGGGCGCAAAGTATCATTACCTAGGCACCTATGACACAGAAGAAGAAGCAGCC
GTAGCTTATGACAGAGCTGCAGTTCTTCACCGTGGACCCAATGCTGTCACGAACTTTGACATATCAAATTACATCAAAAG
AAAAACCAAGAACCCCAAAACTGAAATCACTGTATGCCCACCTCAAATGTGA