Microexon ID Gm_4:3860327-3860335:+
Species Glycine max
Coordinates 4:3860327..3860335
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TGTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAAGAAGGAAGGGCAAACTAGGAAAGGACGACAAGTGTATTTGGGGGGTTATGATATGGAGGAGAAAGCTGCAAGAGCCTATGATCTCGCGGCC
Microexon-tag Amino Acid Seq WDNSCKKEGQTRKGRQVYLGGYDMEEKAARAYDLAA
Microexon-tag spanning region3860118-3860477
Microexon-tag prediction score0.9763
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH61449x
Reference Transcript ID KRH61449
Gene ID GLYMA_04G047900
Gene Name NA
Transcript ID KRH61449
Protein ID KRH61449
Gene ID GLYMA_04G047900
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.5e-13
Motif start 298
Motif end 356
Protein seq >KRH61449
MKRINESNNTDDGNNHNWLGFSLSPHMKMEATSAATVPTTFYMSPSQSHLSNFGMCYGVGENGNFHSPLTVMPLKSDGSL
CILEALKRSQTQVMVPTSSPKLEDFLGGATMGTHEYGSHERGLSLDSIYYNSQNAEAQPNRDLLSQPFRQQGHMSVQTHP
YYSGLACHGLYQAPLEEETTKETHVSDCSSLMPQMTEGLKNWVAPTREFSTHQQVLEQQMNCGMGNERNGVSLGSVGCGE
LQSLSLSMSPGSQSSCVTAPSGTDSVAVDAKKRGHAKLGQKQPVHRKSIDTFGQRTSQYRGVTRHRWTGRYEAHLWDNSC
KKEGQTRKGRQVYLGGYDMEEKAARAYDLAALKYWGPSTHINFSIENYQVQLEEMKNMSRQEYVAHLRRKSSGFSRGASI
YRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDVAAIKFRGANAVTNFDISRYDVERIMASSNLLAGELAR
RKKDNDPRNKDIDYNKSVVTSVNNEETVQVQAGNNNNENDSEWKMVLFNHPSQQQQANGNGSDQKIMNCGNYRNSAFSMA
LQDLIGIDSVGSGQHNMLDESSKIGTHFSNTSSLVTSLSSSREASPEKRGPSLLFPMPPMETKIVNPIGTSVTSWLPSPT
VQMRPSPAISLSHLPVFASWTDT*
CDS seq >KRH61449
ATGAAGCGCATAAATGAGAGTAACAACACCGATGATGGAAACAATCATAACTGGTTGGGGTTCTCTCTCTCACCCCACAT
GAAAATGGAGGCTACTTCAGCAGCCACTGTTCCGACAACCTTCTACATGTCCCCTTCTCAATCTCACTTGTCCAACTTCG
GAATGTGTTACGGTGTCGGAGAAAATGGTAACTTCCATTCTCCACTTACGGTTATGCCTCTCAAGTCTGATGGGTCACTT
TGTATCTTGGAAGCTCTCAAAAGATCACAAACGCAAGTGATGGTGCCAACTTCGTCTCCGAAATTGGAGGACTTTCTAGG
TGGTGCAACTATGGGAACTCACGAATATGGAAGCCACGAGAGAGGTTTGAGCCTAGACAGCATCTATTATAACTCCCAAA
ACGCAGAGGCTCAACCCAACAGAGACCTTCTTTCACAACCCTTCAGGCAACAAGGTCATATGAGTGTCCAAACACACCCT
TATTACTCAGGCCTTGCTTGCCATGGTTTATATCAAGCACCGTTGGAGGAAGAAACAACAAAGGAAACGCACGTGTCGGA
TTGCAGCTCCCTAATGCCTCAAATGACAGAAGGCTTGAAAAACTGGGTGGCTCCAACAAGGGAGTTTTCAACTCACCAGC
AGGTTTTGGAGCAGCAAATGAATTGTGGCATGGGGAATGAGAGAAATGGTGTGTCTTTAGGATCTGTGGGGTGTGGAGAG
TTACAGTCTCTAAGCTTATCTATGAGTCCTGGTTCTCAGTCTAGTTGTGTCACTGCTCCTTCTGGAACAGATTCTGTTGC
TGTGGATGCAAAGAAGAGAGGGCATGCTAAACTTGGTCAGAAGCAGCCTGTGCATAGAAAATCTATCGACACATTTGGGC
AAAGAACCTCGCAGTATAGAGGTGTCACAAGGCATAGATGGACTGGTAGGTATGAAGCGCATTTGTGGGATAATAGTTGC
AAGAAGGAAGGGCAAACTAGGAAAGGACGACAAGTGTATTTGGGGGGTTATGATATGGAGGAGAAAGCTGCAAGAGCCTA
TGATCTCGCGGCCCTTAAGTACTGGGGACCTTCAACGCATATAAACTTTTCGATAGAGAATTACCAAGTTCAACTTGAGG
AAATGAAGAACATGAGCAGACAGGAATACGTTGCACACTTGAGAAGAAAAAGCAGCGGGTTTTCTAGAGGTGCTTCAATA
TACAGAGGGGTCACAAGGCATCACCAACATGGAAGATGGCAAGCGAGGATAGGCAGAGTTGCTGGGAACAAAGACCTTTA
CCTTGGGACGTTCAGCACCCAAGAGGAAGCAGCAGAAGCATACGATGTAGCGGCGATCAAATTTCGCGGCGCAAATGCAG
TCACAAACTTTGACATTTCAAGATACGATGTGGAGAGAATCATGGCCAGTAGCAATCTCCTCGCTGGGGAGCTTGCAAGG
CGTAAGAAAGATAACGATCCTAGAAACAAGGACATAGACTACAACAAGAGTGTAGTAACAAGTGTGAACAATGAGGAAAC
GGTTCAAGTTCAAGCAGGAAACAACAATAATGAAAACGACTCAGAGTGGAAGATGGTTTTATTTAACCACCCTTCACAGC
AGCAACAGGCAAATGGCAATGGCAGTGACCAAAAAATAATGAACTGTGGAAATTACAGAAACAGTGCATTTTCTATGGCC
CTACAAGATCTTATTGGGATTGATTCGGTGGGTTCTGGGCAGCATAATATGCTGGACGAGTCTAGCAAAATTGGGACTCA
TTTTTCAAACACGTCATCGCTGGTGACAAGTTTAAGCAGCTCAAGAGAGGCTAGTCCTGAGAAAAGGGGTCCCTCGCTTC
TGTTCCCAATGCCTCCAATGGAAACAAAGATTGTGAACCCCATTGGTACCAGTGTTACCTCTTGGCTACCCTCACCAACG
GTTCAAATGAGGCCTTCTCCTGCTATCTCTTTGTCTCACTTGCCAGTTTTTGCTTCTTGGACTGATACTTAA
Microexon DNA seq TGTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAAGAAGGAAGGGCAAACTAGGAAAGGACGACAAGTGTATTTGGGGGGTTATGATATGGAGGAGAAAGCTGCAAGAGCCTATGATCTCGCGGCC
Microexon-tag Amino Acid seq WDNSCKKEGQTRKGRQVYLGGYDMEEKAARAYDLAA
Transcript ID KRH61449
Gene ID Gm.37876
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.5e-13
Motif start 298
Motif end 356
Protein seq >KRH61449
MKRINESNNTDDGNNHNWLGFSLSPHMKMEATSAATVPTTFYMSPSQSHLSNFGMCYGVGENGNFHSPLTVMPLKSDGSL
CILEALKRSQTQVMVPTSSPKLEDFLGGATMGTHEYGSHERGLSLDSIYYNSQNAEAQPNRDLLSQPFRQQGHMSVQTHP
YYSGLACHGLYQAPLEEETTKETHVSDCSSLMPQMTEGLKNWVAPTREFSTHQQVLEQQMNCGMGNERNGVSLGSVGCGE
LQSLSLSMSPGSQSSCVTAPSGTDSVAVDAKKRGHAKLGQKQPVHRKSIDTFGQRTSQYRGVTRHRWTGRYEAHLWDNSC
KKEGQTRKGRQVYLGGYDMEEKAARAYDLAALKYWGPSTHINFSIENYQVQLEEMKNMSRQEYVAHLRRKSSGFSRGASI
YRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDVAAIKFRGANAVTNFDISRYDVERIMASSNLLAGELAR
RKKDNDPRNKDIDYNKSVVTSVNNEETVQVQAGNNNNENDSEWKMVLFNHPSQQQQANGNGSDQKIMNCGNYRNSAFSMA
LQDLIGIDSVGSGQHNMLDESSKIGTHFSNTSSLVTSLSSSREASPEKRGPSLLFPMPPMETKIVNPIGTSVTSWLPSPT
VQMRPSPAISLSHLPVFASWTDT*
CDS seq >KRH61449
ATGAAGCGCATAAATGAGAGTAACAACACCGATGATGGAAACAATCATAACTGGTTGGGGTTCTCTCTCTCACCCCACAT
GAAAATGGAGGCTACTTCAGCAGCCACTGTTCCGACAACCTTCTACATGTCCCCTTCTCAATCTCACTTGTCCAACTTCG
GAATGTGTTACGGTGTCGGAGAAAATGGTAACTTCCATTCTCCACTTACGGTTATGCCTCTCAAGTCTGATGGGTCACTT
TGTATCTTGGAAGCTCTCAAAAGATCACAAACGCAAGTGATGGTGCCAACTTCGTCTCCGAAATTGGAGGACTTTCTAGG
TGGTGCAACTATGGGAACTCACGAATATGGAAGCCACGAGAGAGGTTTGAGCCTAGACAGCATCTATTATAACTCCCAAA
ACGCAGAGGCTCAACCCAACAGAGACCTTCTTTCACAACCCTTCAGGCAACAAGGTCATATGAGTGTCCAAACACACCCT
TATTACTCAGGCCTTGCTTGCCATGGTTTATATCAAGCACCGTTGGAGGAAGAAACAACAAAGGAAACGCACGTGTCGGA
TTGCAGCTCCCTAATGCCTCAAATGACAGAAGGCTTGAAAAACTGGGTGGCTCCAACAAGGGAGTTTTCAACTCACCAGC
AGGTTTTGGAGCAGCAAATGAATTGTGGCATGGGGAATGAGAGAAATGGTGTGTCTTTAGGATCTGTGGGGTGTGGAGAG
TTACAGTCTCTAAGCTTATCTATGAGTCCTGGTTCTCAGTCTAGTTGTGTCACTGCTCCTTCTGGAACAGATTCTGTTGC
TGTGGATGCAAAGAAGAGAGGGCATGCTAAACTTGGTCAGAAGCAGCCTGTGCATAGAAAATCTATCGACACATTTGGGC
AAAGAACCTCGCAGTATAGAGGTGTCACAAGGCATAGATGGACTGGTAGGTATGAAGCGCATTTGTGGGATAATAGTTGC
AAGAAGGAAGGGCAAACTAGGAAAGGACGACAAGTGTATTTGGGGGGTTATGATATGGAGGAGAAAGCTGCAAGAGCCTA
TGATCTCGCGGCCCTTAAGTACTGGGGACCTTCAACGCATATAAACTTTTCGATAGAGAATTACCAAGTTCAACTTGAGG
AAATGAAGAACATGAGCAGACAGGAATACGTTGCACACTTGAGAAGAAAAAGCAGCGGGTTTTCTAGAGGTGCTTCAATA
TACAGAGGGGTCACAAGGCATCACCAACATGGAAGATGGCAAGCGAGGATAGGCAGAGTTGCTGGGAACAAAGACCTTTA
CCTTGGGACGTTCAGCACCCAAGAGGAAGCAGCAGAAGCATACGATGTAGCGGCGATCAAATTTCGCGGCGCAAATGCAG
TCACAAACTTTGACATTTCAAGATACGATGTGGAGAGAATCATGGCCAGTAGCAATCTCCTCGCTGGGGAGCTTGCAAGG
CGTAAGAAAGATAACGATCCTAGAAACAAGGACATAGACTACAACAAGAGTGTAGTAACAAGTGTGAACAATGAGGAAAC
GGTTCAAGTTCAAGCAGGAAACAACAATAATGAAAACGACTCAGAGTGGAAGATGGTTTTATTTAACCACCCTTCACAGC
AGCAACAGGCAAATGGCAATGGCAGTGACCAAAAAATAATGAACTGTGGAAATTACAGAAACAGTGCATTTTCTATGGCC
CTACAAGATCTTATTGGGATTGATTCGGTGGGTTCTGGGCAGCATAATATGCTGGACGAGTCTAGCAAAATTGGGACTCA
TTTTTCAAACACGTCATCGCTGGTGACAAGTTTAAGCAGCTCAAGAGAGGCTAGTCCTGAGAAAAGGGGTCCCTCGCTTC
TGTTCCCAATGCCTCCAATGGAAACAAAGATTGTGAACCCCATTGGTACCAGTGTTACCTCTTGGCTACCCTCACCAACG
GTTCAAATGAGGCCTTCTCCTGCTATCTCTTTGTCTCACTTGCCAGTTTTTGCTTCTTGGACTGATACTTAA