Microexon ID Gm_18:1123922-1123934:+
Species Glycine max
Coordinates 18:1123922..1123934
Microexon Cluster ID MEP33
Size 13
Phase 2
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 47,13,48
Microexon location in the Microexon-tag 2
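The size, coordinate and tag-structure fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the tag begins at a codon boundary. The record states neither convention, so the comparison with the listed phase is only a consistency check, and all variable names are illustrative.

```python
# Values copied from the record above.
start, end = 1123922, 1123934      # assumed 1-based, inclusive coordinates
tag_structure = (47, 13, 48)       # flanking exon, microexon, flanking exon (nt)

def inclusive_length(first: int, last: int) -> int:
    """Length of an interval whose two endpoints are both included."""
    return last - first + 1

microexon_size = inclusive_length(start, end)   # compare with the Size field
tag_length = sum(tag_structure)                 # total tag length in nt
tag_codons = tag_length // 3                    # whole codons in the tag

# Nucleotides of the upstream flank left over after whole codons,
# assuming the tag itself starts at codon position 1.
upstream_overhang = tag_structure[0] % 3

print(microexon_size, tag_length, tag_codons, upstream_overhang)
```

Under these assumptions the computed size agrees with the Size field, and the upstream overhang agrees with the listed Phase value.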
Microexon-tag DNA Seq GAVTTYYTRTCARTHTCRGARTTYCTWGTKGARRTWTCTGATGAYYTGTWTGAYTAYGAGGATGAYGTKTTRRAGAAYAAYTTCAAYATTYTGCGCATGTTTGTYRRA
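The tag sequence on the line above contains letters other than A, C, G and T (for example Y, R, W, K, H and V). These are IUPAC nucleotide ambiguity codes, which suggests a degenerate cluster-level sequence rather than one concrete sequence, although the record does not say so explicitly. A minimal sketch of comparing a concrete sequence against such a pattern follows; the function name `iupac_mismatches` is hypothetical.

```python
# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}

def iupac_mismatches(seq: str, pattern: str) -> list[int]:
    """Return 0-based positions where seq is not allowed by the IUPAC pattern."""
    if len(seq) != len(pattern):
        raise ValueError("sequence and pattern must have equal length")
    return [
        i for i, (base, code) in enumerate(zip(seq.upper(), pattern.upper()))
        if base not in IUPAC[code]
    ]

# Toy inputs: V allows A, C or G; R allows only A or G.
print(iupac_mismatches("GAA", "GAV"))
print(iupac_mismatches("CTC", "YTR"))
```

An empty list means every base is compatible with the pattern. A non-empty list is not necessarily an error, since a degenerate sequence built from several species may not cover every member exactly.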
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GTATGATTATGAG
Microexon Amino Acid seq LYDYE
Microexon-tag DNA Seq GAATTTCTCTCAATTTCGGAGTTCCTAGTGGAAGTTTCTGATGATCTGTATGATTATGAGGATGATGTTTTAGAGAACAATTTCAATATTTTGCGCATGTTTATCCGA
Microexon-tag Amino Acid Seq EFLSISEFLVEVSDDLYDYEDDVLENNFNILRMFIR
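The relationship between the tag DNA, the microexon DNA and the two amino acid sequences above can be reproduced with a short sketch. It assumes the standard genetic code and that the tag is translated from its first nucleotide; the 47,13,48 split is taken from the structure field of this record. Helper names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate whole codons starting at the first base; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

tag = ("GAATTTCTCTCAATTTCGGAGTTCCTAGTGGAAGTTTCTGATGATCTGTATGATTATGAG"
       "GATGATGTTTTAGAGAACAATTTCAATATTTTGCGCATGTTTATCCGA")
upstream, micro, downstream = 47, 13, 48   # sizes from the tag structure field

# Slice the microexon out of the tag (0-based, end-exclusive).
micro_start, micro_end = upstream, upstream + micro
micro_seq = tag[micro_start:micro_end]

tag_protein = translate(tag)

# Residues whose codons overlap the microexon, including a codon
# that is split across the upstream splice junction.
first_codon = micro_start // 3
last_codon = (micro_end - 1) // 3
micro_peptide = tag_protein[first_codon:last_codon + 1]

print(micro_seq)
print(tag_protein)
print(micro_peptide)
```

Because the upstream flank ends two bases into a codon, the first residue of the microexon peptide comes from a codon shared between the upstream exon and the microexon.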
Microexon-tag spanning region 1123211-1124065
Microexon-tag prediction score 0.9788
Overlapped with the annotated transcript (%) 100
New Transcript ID KRG97555x
Reference Transcript ID KRG97555
Gene ID GLYMA_18G015800
Gene Name NA
Transcript ID KRG97555
Protein ID KRG97555
Gene ID GLYMA_18G015800
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRG97555
MEELRKLEKVQSMLEFMESHSRGVSNSDQHSNRFLANFILFLIEPCGDLSINHKCSLLSHFIPTLSSAFLEDAYQHHLFT
TSKQQNSGGFQQSLVGNSLLSCNQMKEYSLLQRCNENAAMVGLDSMQMANSTLEDFCRSYFMFHGLDVSKPQSIFKYLPI
LSFTESYIYQLDKMNEKLLQTPCNGNCVFGEKDERETKVLVSCFSNEPVRPLVSILEHKGLLTERIREELRLGEEYWALE
RKLCSALTNKEEILVEDVMKAIHLKSFDYRVLNLLLYQLQGTKVEELHMEFLSISEFLVEVSDDLYDYEDDVLENNFNIL
RMFIRIYGPSAAPAMLAKCIGEAEDKYASLLKSLDPHLSLSYQKRCAEATKEGGKVSEHPLGTWSIPTVIQDEELYRLKL
KSDIS*
CDS seq >KRG97555
ATGGAAGAATTACGGAAACTCGAGAAAGTACAGAGCATGCTTGAATTCATGGAATCACATTCACGGGGCGTTTCAAACTC
CGATCAACACTCTAACCGCTTCCTCGCTAATTTCATTCTTTTCTTGATCGAACCCTGCGGAGACCTTTCCATCAACCACA
AGTGCTCCTTGCTCTCCCACTTTATTCCCACTCTCTCATCTGCATTCCTCGAAGACGCGTATCAACACCACCTTTTCACC
ACCTCTAAACAACAAAATTCTGGTGGTTTTCAACAAAGTCTTGTTGGAAATTCTTTGCTTAGTTGCAATCAGATGAAAGA
ATACAGTTTGTTGCAAAGATGTAATGAAAACGCTGCCATGGTCGGGCTGGATTCCATGCAAATGGCAAATTCTACACTTG
AGGATTTTTGCAGATCTTATTTTATGTTTCACGGATTGGATGTAAGCAAGCCACAATCAATCTTCAAATACTTACCTATT
CTTTCATTCACAGAGAGTTATATTTATCAGCTAGATAAAATGAATGAGAAATTGCTGCAAACACCATGCAATGGAAATTG
TGTATTTGGAGAAAAAGATGAGAGAGAAACTAAAGTATTGGTTTCTTGCTTCTCAAATGAGCCAGTTAGACCACTTGTTT
CTATTCTTGAACACAAAGGCCTTTTGACAGAAAGGATAAGAGAAGAACTTAGACTTGGAGAAGAGTACTGGGCTCTTGAA
AGAAAGCTCTGTTCTGCACTAACAAACAAAGAGGAGATTCTTGTTGAAGATGTGATGAAGGCTATTCATTTAAAGTCTTT
TGATTATCGAGTGCTGAATCTTCTTCTCTATCAACTTCAAGGGACTAAGGTGGAGGAGTTGCATATGGAATTTCTCTCAA
TTTCGGAGTTCCTAGTGGAAGTTTCTGATGATCTGTATGATTATGAGGATGATGTTTTAGAGAACAATTTCAATATTTTG
CGCATGTTTATCCGAATATATGGACCTTCAGCTGCCCCTGCTATGCTGGCGAAGTGCATTGGTGAGGCTGAAGACAAGTA
TGCAAGTTTACTGAAATCACTTGATCCACATCTCTCTCTGAGCTACCAGAAAAGATGTGCAGAAGCCACTAAAGAAGGTG
GAAAGGTATCAGAGCACCCTCTTGGAACATGGAGTATTCCAACCGTGATTCAAGATGAGGAATTGTATAGATTAAAGTTG
AAGTCAGATATTTCGTGA
Microexon DNA seq GTATGATTATGAG
Microexon Amino Acid seq LYDYE
Microexon-tag DNA Seq GAATTTCTCTCAATTTCGGAGTTCCTAGTGGAAGTTTCTGATGATCTGTATGATTATGAGGATGATGTTTTAGAGAACAATTTCAATATTTTGCGCATGTTTATCCGA
Microexon-tag Amino Acid seq EFLSISEFLVEVSDDLYDYEDDVLENNFNILRMFIR
Transcript ID Gm.24130.1
Gene ID Gm.24130
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Gm.24130.1
MEELRKLEKVQSMLEFMESHSRGVSNSDQHSNRFLANFILFLIEPCGDLSINHKCSLLSHFIPTLSSAFLEDAYQHHLFT
TSKQQNSGGFQQSLVGNSLLSCNQMKEYSLLQRCNENAAMVGLDSMQMANSTLEDFCRSYFMFHGLDVSKPQSIFKYLPI
LSFTESYIYQLDKMNEKLLQTPCNGNCVFGEKDERETKVLVSCFSNEPVRPLVSILEHKGLLTERIREELRLGEEYWALE
RKLCSALTNKEEILVEDVMKAIHLKSFDYRVLNLLLYQLQGTKVEELHMEFLSISEFLVEVSDDLYDYEDDVLENNFNIL
RMFIRIYGPSAAPAMLAKCIGEAEDKYASLLKSLDPHLSLSYQKRCAEATKEGGKVSEHPLGTWSIPTVIQDEELYRLKL
KSDIS*
CDS seq >Gm.24130.1
ATGGAAGAATTACGGAAACTCGAGAAAGTACAGAGCATGCTTGAATTCATGGAATCACATTCACGGGGCGTTTCAAACTC
CGATCAACACTCTAACCGCTTCCTCGCTAATTTCATTCTTTTCTTGATCGAACCCTGCGGAGACCTTTCCATCAACCACA
AGTGCTCCTTGCTCTCCCACTTTATTCCCACTCTCTCATCTGCATTCCTCGAAGACGCGTATCAACACCACCTTTTCACC
ACCTCTAAACAACAAAATTCTGGTGGTTTTCAACAAAGTCTTGTTGGAAATTCTTTGCTTAGTTGCAATCAGATGAAAGA
ATACAGTTTGTTGCAAAGATGTAATGAAAACGCTGCCATGGTCGGGCTGGATTCCATGCAAATGGCAAATTCTACACTTG
AGGATTTTTGCAGATCTTATTTTATGTTTCACGGATTGGATGTAAGCAAGCCACAATCAATCTTCAAATACTTACCTATT
CTTTCATTCACAGAGAGTTATATTTATCAGCTAGATAAAATGAATGAGAAATTGCTGCAAACACCATGCAATGGAAATTG
TGTATTTGGAGAAAAAGATGAGAGAGAAACTAAAGTATTGGTTTCTTGCTTCTCAAATGAGCCAGTTAGACCACTTGTTT
CTATTCTTGAACACAAAGGCCTTTTGACAGAAAGGATAAGAGAAGAACTTAGACTTGGAGAAGAGTACTGGGCTCTTGAA
AGAAAGCTCTGTTCTGCACTAACAAACAAAGAGGAGATTCTTGTTGAAGATGTGATGAAGGCTATTCATTTAAAGTCTTT
TGATTATCGAGTGCTGAATCTTCTTCTCTATCAACTTCAAGGGACTAAGGTGGAGGAGTTGCATATGGAATTTCTCTCAA
TTTCGGAGTTCCTAGTGGAAGTTTCTGATGATCTGTATGATTATGAGGATGATGTTTTAGAGAACAATTTCAATATTTTG
CGCATGTTTATCCGAATATATGGACCTTCAGCTGCCCCTGCTATGCTGGCGAAGTGCATTGGTGAGGCTGAAGACAAGTA
TGCAAGTTTACTGAAATCACTTGATCCACATCTCTCTCTGAGCTACCAGAAAAGATGTGCAGAAGCCACTAAAGAAGGTG
GAAAGGTATCAGAGCACCCTCTTGGAACATGGAGTATTCCAACCGTGATTCAAGATGAGGAATTGTATAGATTAAAGTTG
AAGTCAGATATTTCGTGA