Microexon ID Gm_18:25337770-25337778:+
Species Glycine max
Coordinates 18:25337770..25337778
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TGTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAACAGCTGTAGGCGAGAGGGTCAGGCCAGGAAAGGGCGTCAAGTGTACTTGGGTGGTTATGACAAGGAAGATAAGGCTGCGAGGGCTTATGATTTAGCAGCT
Microexon-tag Amino Acid Seq WDNSCRREGQARKGRQVYLGGYDKEDKAARAYDLAA
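The fields above can be cross-checked against one another. The following is a minimal sketch, not part of the database itself: it splits the microexon-tag by the recorded 49,9,50 structure, translates it with the standard genetic code, and picks out the residues whose codons overlap the microexon. The helper names (`split_tag`, `translate`, `residues_overlapping`) are hypothetical. The reading of "Phase 1" as the number of codon bases contributed by the upstream flanking exon is an assumption, though it is consistent with 49 mod 3 = 1.

```python
# Values copied from the record; all helper names are illustrative only.
TAG_DNA = "TGGGATAACAGCTGTAGGCGAGAGGGTCAGGCCAGGAAAGGGCGTCAAGTGTACTTGGGTGGTTATGACAAGGAAGATAAGGCTGCGAGGGCTTATGATTTAGCAGCT"
STRUCTURE = (49, 9, 50)            # flanking exon, microexon, flanking exon sizes
START, END = 25337770, 25337778    # microexon coordinates on chromosome 18

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def split_tag(tag, structure):
    """Cut the tag into (upstream exon, microexon, downstream exon)."""
    up_len, micro_len, down_len = structure
    if up_len + micro_len + down_len != len(tag):
        raise ValueError("structure sizes do not add up to the tag length")
    return tag[:up_len], tag[up_len:up_len + micro_len], tag[up_len + micro_len:]


def translate(dna):
    """Translate in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def residues_overlapping(structure, peptide):
    """Residues whose codons contain at least one microexon base."""
    up_len, micro_len, _ = structure
    first = up_len // 3                     # codon holding the first microexon base
    last = (up_len + micro_len - 1) // 3    # codon holding the last microexon base
    return peptide[first:last + 1]


upstream, microexon, downstream = split_tag(TAG_DNA, STRUCTURE)
tag_peptide = translate(TAG_DNA)
microexon_peptide = residues_overlapping(STRUCTURE, tag_peptide)

print(microexon, microexon_peptide, END - START + 1, STRUCTURE[0] % 3)
```

Because the microexon begins one base into a codon, its 9 nt touch four codons rather than three, which is why the microexon amino acid sequence has four residues.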
Microexon-tag spanning region 25336029-25337937
Microexon-tag prediction score 0.9677
Overlapped with the annotated transcript (%) 100
New Transcript ID KRG99477x
Reference Transcript ID KRG99477
Gene ID GLYMA_18G148000
Gene Name NA
Transcript ID KRG99477
Protein ID KRG99477
Gene ID GLYMA_18G148000
Gene Name NA
Pfam domain motif AP2
Motif E-value 5e-12
Motif start 160
Motif end 218
Protein seq >KRG99477
MAGATKNWLTFSLTPMETQFGPYDITTPSHCFNDNNFYGWANSKGVMDSQSETQQLAVPKMEDFYFGDHQQDLKAIVPGF
KALSGTNSVDDSAPNKTTTRVAPAELTGAHSGESCKGSALSLCDVAANGSDDSNDNKAIVAVGFDTRKKVAHTFGQRTSI
YRGVTRHRWTGRYEAHLWDNSCRREGQARKGRQVYLGGYDKEDKAARAYDLAALKYWGPKATTNFPISNYTKELEEMKHV
GKQEFIASLRRKSSGFSRGASAYRGVTRHHQQGRWQARIGRVAGNKDLYLGTFSTEEEAAEAYDIAAIKFRGASAVTNFE
MRRYDVDAILNNSLPVGGIAKRFKVSPETENKALVSTIQPPQNTNASSSINFSTIQPVSSIPYDSAITCYPHNLFHHFNP
TNGESSTAEFAATMTDAIALATLPPLAVPGFFVWPHQNY*
CDS seq >KRG99477
ATGGCTGGTGCCACCAAAAATTGGCTTACGTTTTCGTTGACTCCCATGGAAACGCAGTTCGGTCCATACGACATCACCAC
TCCCTCTCACTGCTTCAACGACAACAACTTTTATGGGTGGGCGAATTCAAAGGGTGTGATGGATTCACAGAGTGAGACCC
AACAACTTGCGGTGCCGAAAATGGAAGATTTTTACTTCGGTGATCACCAGCAAGATCTGAAGGCGATTGTTCCTGGGTTC
AAAGCGCTCTCTGGGACCAACTCGGTGGATGACTCGGCCCCCAACAAGACGACGACTCGGGTTGCGCCCGCTGAGTTGAC
TGGTGCTCACTCAGGCGAGTCTTGCAAAGGATCTGCTTTGTCGTTATGCGATGTTGCTGCAAATGGTTCTGATGATAGTA
ATGATAATAAGGCCATTGTTGCGGTGGGGTTTGATACTCGGAAGAAGGTTGCTCATACCTTTGGCCAGCGAACTTCAATT
TACAGAGGGGTCACAAGGCACCGATGGACGGGCAGATATGAAGCGCATCTATGGGATAACAGCTGTAGGCGAGAGGGTCA
GGCCAGGAAAGGGCGTCAAGTGTACTTGGGTGGTTATGACAAGGAAGATAAGGCTGCGAGGGCTTATGATTTAGCAGCTC
TAAAGTACTGGGGACCAAAAGCAACCACTAACTTTCCTATTTCCAATTATACAAAGGAATTGGAGGAGATGAAGCATGTA
GGAAAGCAAGAATTTATTGCATCACTGCGAAGGAAAAGCAGTGGTTTTTCGAGGGGAGCTTCTGCTTACAGGGGTGTTAC
AAGGCATCACCAACAGGGTAGATGGCAAGCGAGAATTGGCCGGGTTGCGGGAAACAAGGATCTATATTTGGGAACATTTT
CAACTGAAGAGGAGGCAGCAGAGGCATATGATATTGCAGCCATAAAGTTCAGAGGTGCAAGTGCCGTAACCAATTTCGAG
ATGAGGCGATATGACGTCGACGCTATATTGAACAACTCTCTTCCAGTTGGCGGCATAGCAAAACGCTTTAAAGTTTCCCC
AGAAACGGAGAACAAAGCTCTTGTCAGCACCATTCAACCACCTCAGAACACAAACGCCAGTAGCAGCATCAATTTCTCAA
CCATTCAGCCAGTGTCTTCTATACCCTATGATTCTGCAATCACATGTTACCCCCACAACCTCTTCCACCATTTTAATCCC
ACCAACGGTGAAAGTAGCACTGCAGAATTTGCTGCAACTATGACCGATGCAATCGCACTCGCTACCCTTCCACCACTGGC
AGTTCCTGGGTTCTTCGTATGGCCTCATCAGAACTATTGA
Transcript ID KRG99477
Gene ID Gm.25322
Gene Name NA