Microexon ID Gm_12:688837-688851:-
Species Glycine max
Coordinates 12:688837..688851
Microexon Cluster ID MEP44
Size 15
Phase 1
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,15,47
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq RTTTATGRGCTTTGTGATRTTGACTTGAAAGATTTYAGYMTBCAAGCWTWTGGRCARCAAGGBTGTYTACTTCGRAGCTTRCCTRCAGATRTKGTRTTTGACAAYWCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq CTTATGGACAACAAG
Microexon Amino Acid seq AYGQQG
Microexon-tag DNA Seq ATTTATGAGCTTTGTGACGTTGATTTGAAAGATTTCAGCATACAAGCTTATGGACAACAAGGCTGTTTACTTCGAAGCTTGCCTGGTGATGTGATATTTGACAATTCA
Microexon-tag Amino Acid Seq IYELCDVDLKDFSIQAYGQQGCLLRSLPGDVIFDNS
Microexon-tag spanning region688396-689004
Microexon-tag prediction score0.9728
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH23901x
Reference Transcript ID KRH23901
Gene ID GLYMA_12G009500
Gene Name NA
Transcript ID KRH23901
Protein ID KRH23901
Gene ID GLYMA_12G009500
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRH23901
MGCSQDANFYKGSLCCQLILFFIWLPTFQDVTAREIHADHRRISSLLELAKEPPSGESALFDPIEISPAVLPKFPYPTES
WPPMYPTFPTRYEPVLTGKCPVNFSHSDISSILDKTASDCSGPLAALVGNVICCPQFSSLIHIFQGFFSMKSDHLVLPNA
VADHCFSDIISILASRGANSTIPTLCSIKSSNFTGGSCPVKDDSTFEKTVNTSKLLEACSTVDPLKECCRPVCQPAIMDA
ALQISGRQMMINNDENMAGEVNHTDYLNDCKSVVYSYLSKKLSFEAANTAFRILSACKVNKVCPLSLKEPTEVINACRNV
AAPSPSCCSSLNTYIAGIQKQMLITNKQAIICATLFGSMLRGGGVMTNIYELCDVDLKDFSIQAYGQQGCLLRSLPGDVI
FDNSSGFSFTCDLSDNIAAPWPSSSSSFASMSLCAPEMSLPALPTSSQTLKNNGCTSDGVGLLVLIFSSFIFSTLLY*
CDS seq >KRH23901
ATGGGCTGCTCTCAGGATGCTAATTTTTATAAGGGTTCTTTGTGCTGTCAGCTAATTCTATTTTTCATCTGGCTTCCTAC
TTTCCAAGATGTGACAGCACGGGAAATACATGCTGACCACAGAAGAATTTCTTCTCTGCTAGAGTTAGCTAAGGAACCTC
CTTCTGGAGAATCTGCTCTCTTTGACCCCATAGAAATATCACCTGCTGTATTACCGAAATTCCCATATCCCACTGAGTCT
TGGCCACCAATGTACCCTACTTTCCCAACTAGATATGAACCAGTTTTAACTGGAAAATGCCCTGTAAACTTTTCTCACTC
AGATATATCAAGTATCCTAGATAAAACAGCATCTGATTGCTCTGGACCTTTGGCAGCCCTTGTTGGGAATGTAATATGTT
GTCCTCAGTTTAGTAGCTTGATCCACATCTTCCAGGGTTTCTTCAGCATGAAATCTGATCATTTGGTTTTGCCAAATGCA
GTTGCTGATCATTGTTTTTCTGATATCATTAGTATTCTAGCCAGTAGAGGGGCAAATAGTACAATCCCCACACTTTGTTC
CATAAAATCATCTAATTTTACAGGCGGGTCATGTCCTGTGAAGGATGATTCTACTTTTGAAAAAACAGTTAACACAAGTA
AGTTACTTGAGGCTTGCAGCACTGTTGATCCACTTAAAGAGTGTTGTAGGCCTGTTTGCCAACCTGCGATAATGGATGCA
GCACTACAGATTTCTGGAAGACAAATGATGATTAACAATGATGAAAATATGGCTGGGGAAGTAAATCACACTGACTATCT
TAACGATTGTAAAAGTGTGGTTTATTCATATCTTTCCAAAAAACTATCATTTGAGGCTGCAAATACTGCATTCCGAATAC
TATCTGCTTGCAAAGTCAACAAAGTTTGTCCTTTGTCTTTAAAGGAGCCAACAGAAGTAATTAATGCATGTCGGAATGTA
GCTGCCCCTAGCCCTTCCTGTTGTAGTTCATTAAACACATATATTGCTGGGATACAAAAGCAAATGCTAATTACAAATAA
ACAAGCTATAATATGTGCAACACTTTTTGGATCTATGTTGCGTGGAGGTGGAGTGATGACAAATATTTATGAGCTTTGTG
ACGTTGATTTGAAAGATTTCAGCATACAAGCTTATGGACAACAAGGCTGTTTACTTCGAAGCTTGCCTGGTGATGTGATA
TTTGACAATTCATCAGGCTTTAGCTTTACATGTGATTTGAGTGATAACATTGCTGCACCCTGGCCTTCTTCATCATCTTC
ATTTGCATCTATGTCACTCTGTGCACCTGAGATGTCATTACCTGCTCTCCCAACTTCTTCTCAGACATTAAAAAATAATG
GTTGTACTTCTGATGGAGTGGGATTGCTTGTGCTTATTTTTTCATCTTTCATCTTCAGTACACTGTTGTATTGA
Microexon DNA seq CTTATGGACAACAAG
Microexon Amino Acid seq AYGQQG
Microexon-tag DNA Seq ATTTATGAGCTTTGTGACGTTGATTTGAAAGATTTCAGCATACAAGCTTATGGACAACAAGGCTGTTTACTTCGAAGCTTGCCTGGTGATGTGATATTTGACAATTCA
Microexon-tag Amino Acid seq IYELCDVDLKDFSIQAYGQQGCLLRSLPGDVIFDNS
Transcript ID Gm.7940.1
Gene ID Gm.7940
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Gm.7940.1
MGCSQDANFYKGSLCCQLILFFIWLPTFQDVTAREIHADHRRISSLLELAKEPPSGESALFDPIEISPAVLPKFPYPTES
WPPMYPTFPTRYEPVLTGKCPVNFSHSDISSILDKTASDCSGPLAALVGNVICCPQFSSLIHIFQGFFSMKSDHLVLPNA
VADHCFSDIISILASRGANSTIPTLCSIKSSNFTGGSCPVKDDSTFEKTVNTSKLLEACSTVDPLKECCRPVCQPAIMDA
ALQISGRQMMINNDENMAGEVNHTDYLNDCKSVVYSYLSKKLSFEAANTAFRILSACKVNKVCPLSLKEPTEVINACRNV
AAPSPSCCSSLNTYIAGIQKQMLITNKQAIICATLFGSMLRGGGVMTNIYELCDVDLKDFSIQAYGQQGCLLRSLPGDVI
FDNSSGFSFTCDLSDNIAAPWPSSSSSFASMSLCAPEMSLPALPTSSQTLKNNGCTSDGVGLLVLIFSSFIFSTLLY*
CDS seq >Gm.7940.1
ATGGGCTGCTCTCAGGATGCTAATTTTTATAAGGGTTCTTTGTGCTGTCAGCTAATTCTATTTTTCATCTGGCTTCCTAC
TTTCCAAGATGTGACAGCACGGGAAATACATGCTGACCACAGAAGAATTTCTTCTCTGCTAGAGTTAGCTAAGGAACCTC
CTTCTGGAGAATCTGCTCTCTTTGACCCCATAGAAATATCACCTGCTGTATTACCGAAATTCCCATATCCCACTGAGTCT
TGGCCACCAATGTACCCTACTTTCCCAACTAGATATGAACCAGTTTTAACTGGAAAATGCCCTGTAAACTTTTCTCACTC
AGATATATCAAGTATCCTAGATAAAACAGCATCTGATTGCTCTGGACCTTTGGCAGCCCTTGTTGGGAATGTAATATGTT
GTCCTCAGTTTAGTAGCTTGATCCACATCTTCCAGGGTTTCTTCAGCATGAAATCTGATCATTTGGTTTTGCCAAATGCA
GTTGCTGATCATTGTTTTTCTGATATCATTAGTATTCTAGCCAGTAGAGGGGCAAATAGTACAATCCCCACACTTTGTTC
CATAAAATCATCTAATTTTACAGGCGGGTCATGTCCTGTGAAGGATGATTCTACTTTTGAAAAAACAGTTAACACAAGTA
AGTTACTTGAGGCTTGCAGCACTGTTGATCCACTTAAAGAGTGTTGTAGGCCTGTTTGCCAACCTGCGATAATGGATGCA
GCACTACAGATTTCTGGAAGACAAATGATGATTAACAATGATGAAAATATGGCTGGGGAAGTAAATCACACTGACTATCT
TAACGATTGTAAAAGTGTGGTTTATTCATATCTTTCCAAAAAACTATCATTTGAGGCTGCAAATACTGCATTCCGAATAC
TATCTGCTTGCAAAGTCAACAAAGTTTGTCCTTTGTCTTTAAAGGAGCCAACAGAAGTAATTAATGCATGTCGGAATGTA
GCTGCCCCTAGCCCTTCCTGTTGTAGTTCATTAAACACATATATTGCTGGGATACAAAAGCAAATGCTAATTACAAATAA
ACAAGCTATAATATGTGCAACACTTTTTGGATCTATGTTGCGTGGAGGTGGAGTGATGACAAATATTTATGAGCTTTGTG
ACGTTGATTTGAAAGATTTCAGCATACAAGCTTATGGACAACAAGGCTGTTTACTTCGAAGCTTGCCTGGTGATGTGATA
TTTGACAATTCATCAGGCTTTAGCTTTACATGTGATTTGAGTGATAACATTGCTGCACCCTGGCCTTCTTCATCATCTTC
ATTTGCATCTATGTCACTCTGTGCACCTGAGATGTCATTACCTGCTCTCCCAACTTCTTCTCAGACATTAAAAAATAATG
GTTGTACTTCTGATGGAGTGGGATTGCTTGTGCTTATTTTTTCATCTTTCATCTTCAGTACACTGTTGTATTGA