Microexon ID Gm_14:8096767-8096775:+
Species Glycine max
Coordinates 14:8096767..8096775
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAACAGTTGCAAGAAGGAAGGGCAAACCAGGAAAGGACGACAAGTTTATTTGGGTGGTTATGACATGGAAGAGAAAGCCGCAAGGGCTTATGATCTTGCGGCT
Microexon-tag Amino Acid Seq WDNSCKKEGQTRKGRQVYLGGYDMEEKAARAYDLAA
Microexon-tag spanning region8096542-8096929
Microexon-tag prediction score0.9807
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH15454x
Reference Transcript ID KRH15454
Gene ID GLYMA_14G089200
Gene Name NA
Transcript ID KRH15454
Protein ID KRH15454
Gene ID GLYMA_14G089200
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.3e-13
Motif start 228
Motif end 286
Protein seq >KRH15454
MMPTSSPKLEDFLSGATMGAQDYGTHEREAMALSLDSIYYSNQNAEPETNRDHSSSLDLLSDPFRHQNHSYYSGLGIYQV
EEEETKEQPPPPPHHVAVCCSQVPQVVEGIACFKNWVPPREFSSSTQQNLEQDQVNSSRSGGLGEDNNNGASGNIGVGSS
VGCGELQSLSLSMSPGSQSSCVTVPTQISSSGTDSVTVDAKKRGSSKLGQKQPVHRKSIDTFGQRTSQYRGVTRHRWTGR
YEAHLWDNSCKKEGQTRKGRQVYLGGYDMEEKAARAYDLAALKYWGPSTHINFSLENYQTELEEMKNMSRQEYVAHLRRK
SSGFSRGASMYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDITRYDVERIMAS
NTLLAGELARRNKNSEPRTEAIEYNVVSSQRVISNREEVHEAVNNNNNNENGSSSDWKMGLYHHQQQQQQSNNCDQKTMK
CGNYNRGGGGAAAAFSVSLQDLIGIESVVGSSQGMMNESTKIGTHFSNPSSLVTSLSSSREGSPDKTGPTLLFPKPPMGS
KIVTSPTIANGVTVGSWFPSQMRPVSMSHLPFFAAWSDT*
CDS seq >KRH15454
ATGATGCCAACTTCATCTCCAAAACTTGAGGACTTCCTCAGTGGTGCAACTATGGGGGCTCAAGACTATGGAACCCATGA
GAGAGAAGCAATGGCTCTAAGCCTAGACAGCATCTACTACAGCAACCAGAATGCTGAACCTGAAACCAACAGAGACCATT
CTTCTTCTCTGGACCTTCTTTCTGACCCTTTCAGGCACCAAAACCATTCATATTACTCAGGGCTTGGGATTTACCAAGTG
GAGGAAGAAGAAACAAAGGAACAACCACCACCACCACCCCACCACGTGGCAGTTTGCTGCTCCCAAGTGCCTCAAGTGGT
GGAAGGCATTGCTTGCTTCAAAAACTGGGTGCCCCCAAGGGAATTTTCTTCTTCCACTCAGCAGAATCTGGAGCAGGATC
AAGTGAATAGTAGTCGTAGTGGTGGCCTTGGAGAGGATAATAATAATGGTGCTTCTGGGAATATTGGTGTTGGTAGTAGT
GTTGGTTGTGGGGAGTTACAGTCTCTAAGTCTGTCTATGAGCCCTGGTTCTCAATCAAGCTGTGTCACTGTTCCAACTCA
GATCTCATCTTCTGGAACTGACTCCGTGACTGTGGATGCCAAAAAGAGAGGGTCTTCTAAGCTTGGACAGAAGCAACCTG
TGCATAGGAAATCCATTGACACATTTGGGCAAAGAACTTCTCAGTATAGAGGTGTCACAAGGCATAGATGGACTGGTAGA
TATGAAGCGCATTTGTGGGATAACAGTTGCAAGAAGGAAGGGCAAACCAGGAAAGGACGACAAGTTTATTTGGGTGGTTA
TGACATGGAAGAGAAAGCCGCAAGGGCTTATGATCTTGCGGCTCTCAAGTATTGGGGACCTTCAACGCACATAAACTTCT
CGCTAGAAAATTACCAAACTGAACTTGAAGAAATGAAGAATATGAGTAGGCAGGAATATGTGGCCCACTTGAGAAGAAAG
AGTAGTGGGTTTTCAAGGGGTGCCTCAATGTACAGAGGAGTGACAAGGCACCACCAACATGGCAGGTGGCAAGCAAGGAT
AGGCAGAGTTGCAGGAAATAAGGACCTTTATCTTGGGACATTCAGCACTCAAGAGGAAGCTGCTGAAGCATATGATATAG
CTGCAATCAAATTTCGTGGGTTGAATGCTGTCACCAACTTTGACATAACAAGATACGATGTTGAGAGAATCATGGCCAGC
AACACCCTTCTAGCGGGGGAGCTAGCTAGAAGAAACAAGAATAGTGAGCCAAGAACTGAGGCCATAGAGTACAATGTTGT
GTCAAGCCAACGAGTCATAAGCAACAGGGAAGAAGTTCATGAGGCTGTGAACAACAACAATAATAATGAAAATGGTTCAT
CTTCAGATTGGAAGATGGGTTTGTATCATCATCAGCAGCAGCAACAACAGTCAAACAACTGTGACCAGAAAACCATGAAG
TGTGGAAATTATAATAGAGGTGGTGGTGGTGCTGCTGCTGCTTTCTCTGTGTCCCTACAAGATCTCATTGGGATTGAGTC
AGTAGTAGGATCTAGCCAGGGCATGATGAATGAGTCCACTAAGATAGGGACTCATTTTTCAAACCCTTCCTCGCTGGTCA
CCAGTTTAAGCAGCTCAAGGGAAGGTAGCCCTGATAAAACGGGCCCCACTTTGCTCTTTCCAAAGCCTCCAATGGGGTCA
AAGATTGTTACTAGCCCTACTATTGCTAATGGTGTCACTGTTGGCTCTTGGTTTCCCTCTCAAATGAGGCCAGTCTCAAT
GTCTCACTTGCCATTTTTTGCTGCTTGGAGTGATACCTAG
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAACAGTTGCAAGAAGGAAGGGCAAACCAGGAAAGGACGACAAGTTTATTTGGGTGGTTATGACATGGAAGAGAAAGCCGCAAGGGCTTATGATCTTGCGGCT
Microexon-tag Amino Acid seq WDNSCKKEGQTRKGRQVYLGGYDMEEKAARAYDLAA
Transcript ID KRH15454
Gene ID Gm.15182
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.3e-13
Motif start 228
Motif end 286
Protein seq >KRH15454
MMPTSSPKLEDFLSGATMGAQDYGTHEREAMALSLDSIYYSNQNAEPETNRDHSSSLDLLSDPFRHQNHSYYSGLGIYQV
EEEETKEQPPPPPHHVAVCCSQVPQVVEGIACFKNWVPPREFSSSTQQNLEQDQVNSSRSGGLGEDNNNGASGNIGVGSS
VGCGELQSLSLSMSPGSQSSCVTVPTQISSSGTDSVTVDAKKRGSSKLGQKQPVHRKSIDTFGQRTSQYRGVTRHRWTGR
YEAHLWDNSCKKEGQTRKGRQVYLGGYDMEEKAARAYDLAALKYWGPSTHINFSLENYQTELEEMKNMSRQEYVAHLRRK
SSGFSRGASMYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDITRYDVERIMAS
NTLLAGELARRNKNSEPRTEAIEYNVVSSQRVISNREEVHEAVNNNNNNENGSSSDWKMGLYHHQQQQQQSNNCDQKTMK
CGNYNRGGGGAAAAFSVSLQDLIGIESVVGSSQGMMNESTKIGTHFSNPSSLVTSLSSSREGSPDKTGPTLLFPKPPMGS
KIVTSPTIANGVTVGSWFPSQMRPVSMSHLPFFAAWSDT*
CDS seq >KRH15454
ATGATGCCAACTTCATCTCCAAAACTTGAGGACTTCCTCAGTGGTGCAACTATGGGGGCTCAAGACTATGGAACCCATGA
GAGAGAAGCAATGGCTCTAAGCCTAGACAGCATCTACTACAGCAACCAGAATGCTGAACCTGAAACCAACAGAGACCATT
CTTCTTCTCTGGACCTTCTTTCTGACCCTTTCAGGCACCAAAACCATTCATATTACTCAGGGCTTGGGATTTACCAAGTG
GAGGAAGAAGAAACAAAGGAACAACCACCACCACCACCCCACCACGTGGCAGTTTGCTGCTCCCAAGTGCCTCAAGTGGT
GGAAGGCATTGCTTGCTTCAAAAACTGGGTGCCCCCAAGGGAATTTTCTTCTTCCACTCAGCAGAATCTGGAGCAGGATC
AAGTGAATAGTAGTCGTAGTGGTGGCCTTGGAGAGGATAATAATAATGGTGCTTCTGGGAATATTGGTGTTGGTAGTAGT
GTTGGTTGTGGGGAGTTACAGTCTCTAAGTCTGTCTATGAGCCCTGGTTCTCAATCAAGCTGTGTCACTGTTCCAACTCA
GATCTCATCTTCTGGAACTGACTCCGTGACTGTGGATGCCAAAAAGAGAGGGTCTTCTAAGCTTGGACAGAAGCAACCTG
TGCATAGGAAATCCATTGACACATTTGGGCAAAGAACTTCTCAGTATAGAGGTGTCACAAGGCATAGATGGACTGGTAGA
TATGAAGCGCATTTGTGGGATAACAGTTGCAAGAAGGAAGGGCAAACCAGGAAAGGACGACAAGTTTATTTGGGTGGTTA
TGACATGGAAGAGAAAGCCGCAAGGGCTTATGATCTTGCGGCTCTCAAGTATTGGGGACCTTCAACGCACATAAACTTCT
CGCTAGAAAATTACCAAACTGAACTTGAAGAAATGAAGAATATGAGTAGGCAGGAATATGTGGCCCACTTGAGAAGAAAG
AGTAGTGGGTTTTCAAGGGGTGCCTCAATGTACAGAGGAGTGACAAGGCACCACCAACATGGCAGGTGGCAAGCAAGGAT
AGGCAGAGTTGCAGGAAATAAGGACCTTTATCTTGGGACATTCAGCACTCAAGAGGAAGCTGCTGAAGCATATGATATAG
CTGCAATCAAATTTCGTGGGTTGAATGCTGTCACCAACTTTGACATAACAAGATACGATGTTGAGAGAATCATGGCCAGC
AACACCCTTCTAGCGGGGGAGCTAGCTAGAAGAAACAAGAATAGTGAGCCAAGAACTGAGGCCATAGAGTACAATGTTGT
GTCAAGCCAACGAGTCATAAGCAACAGGGAAGAAGTTCATGAGGCTGTGAACAACAACAATAATAATGAAAATGGTTCAT
CTTCAGATTGGAAGATGGGTTTGTATCATCATCAGCAGCAGCAACAACAGTCAAACAACTGTGACCAGAAAACCATGAAG
TGTGGAAATTATAATAGAGGTGGTGGTGGTGCTGCTGCTGCTTTCTCTGTGTCCCTACAAGATCTCATTGGGATTGAGTC
AGTAGTAGGATCTAGCCAGGGCATGATGAATGAGTCCACTAAGATAGGGACTCATTTTTCAAACCCTTCCTCGCTGGTCA
CCAGTTTAAGCAGCTCAAGGGAAGGTAGCCCTGATAAAACGGGCCCCACTTTGCTCTTTCCAAAGCCTCCAATGGGGTCA
AAGATTGTTACTAGCCCTACTATTGCTAATGGTGTCACTGTTGGCTCTTGGTTTCCCTCTCAAATGAGGCCAGTCTCAAT
GTCTCACTTGCCATTTTTTGCTGCTTGGAGTGATACCTAG