Microexon ID Gm_6:3695264-3695272:+
Species Glycine max
Coordinates 6:3695264..3695272
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TATATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAAGAAGGAAGGGCAGACTAGGAAAGGAAGACAAGTATATTTGGGAGGTTATGATATGGAAGAAAAAGCTGCAAGAGCCTATGATCTCGCGGCT
Microexon-tag Amino Acid Seq WDNSCKKEGQTRKGRQVYLGGYDMEEKAARAYDLAA
Microexon-tag spanning region3695055-3695421
Microexon-tag prediction score0.9758
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH52140x
Reference Transcript ID KRH52140
Gene ID GLYMA_06G049200
Gene Name NA
Transcript ID KRH52140
Protein ID KRH52140
Gene ID GLYMA_06G049200
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.5e-13
Motif start 298
Motif end 356
Protein seq >KRH52140
MKRMNESNNTDDGNNHNWLGFSLSPHMKMEVTSAATVSDNNVPTTFYMSPSHMSNSGMCYSVGENGNFHSPLTVMPLKSD
GSLGILEALNRSQTQVMVPTSSPKLEDFLGGATMGTHEYGNHERGLSLDSIYYNSQNAEAQPNRNLLSHPFRQQGHVNVE
THPYYSVFACRGLYQAPSEEEATKETHVSVMPQMTGGGLQNWVAPTREYSTHQQILEQQMNCGIWNERSGVSVGTVGCGE
LQSLSLSMSPGSQSSCVTAPSGTDSVAVDAKKRGHAKLGQKQPVHRKSIDTFGQRTSQYRGVTRHRWTGRYEAHLWDNSC
KKEGQTRKGRQVYLGGYDMEEKAARAYDLAALKYWGPSTHINFSIENYQVQLEEMKNMSRQEYVAHLRRKSSGFSRGASI
YRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGANAVTNFDISRYDVERIMASSNLLAGELAR
RNKDNDPRNEAIDYNKSVVTSVNNGETVQVQARNNNENDSEWKMVLFNHPLQQQANGSDHKIMNCGNSRNSAFSMALQDL
IGVDSVGSEQHNMLDDSSKIGTHFSNPSSLVTSLSSSREASPEKMGPSLLFPKPPPMETKIVNPIGTSVTSWLPSPTVQM
RPSPAISLSHLPVFAAWTDT*
CDS seq >KRH52140
ATGAAGCGCATGAATGAGAGTAACAACACCGATGATGGAAACAATCATAACTGGTTGGGGTTCTCTCTCTCGCCCCACAT
GAAAATGGAGGTTACTTCGGCAGCCACTGTTTCCGACAACAATGTTCCGACAACCTTCTACATGTCCCCTTCTCACATGT
CCAACTCCGGAATGTGTTACAGTGTCGGGGAAAATGGTAACTTCCATTCTCCTCTTACCGTTATGCCTCTCAAGTCTGAT
GGGTCACTTGGTATCTTGGAAGCTCTCAATAGATCACAAACGCAAGTGATGGTGCCAACTTCGTCTCCGAAATTGGAGGA
CTTCTTAGGTGGTGCAACTATGGGAACTCACGAATATGGGAACCACGAGAGAGGTTTGAGCCTAGACAGCATCTATTATA
ACTCACAAAACGCAGAGGCTCAACCCAACAGAAACCTTCTTTCACATCCCTTCAGGCAACAAGGGCATGTGAATGTCGAA
ACACACCCTTATTACTCTGTATTTGCTTGTCGAGGTTTGTATCAGGCACCGTCGGAGGAAGAAGCAACAAAGGAAACGCA
CGTTTCAGTAATGCCTCAAATGACAGGAGGAGGTTTGCAAAACTGGGTAGCTCCAACAAGGGAGTATTCAACTCATCAGC
AGATTCTGGAGCAGCAAATGAACTGTGGCATTTGGAATGAGAGAAGTGGGGTATCTGTTGGAACTGTGGGGTGTGGAGAG
TTGCAATCTCTAAGCTTATCTATGAGTCCTGGTTCTCAGTCTAGTTGTGTCACTGCTCCTTCTGGAACAGATTCTGTGGC
TGTGGATGCAAAGAAGAGAGGGCATGCTAAACTTGGTCAGAAGCAGCCTGTGCATAGAAAATCTATTGACACATTTGGGC
AAAGAACGTCGCAGTATAGAGGCGTCACAAGGCATAGATGGACTGGTAGGTATGAAGCGCATTTGTGGGATAATAGTTGC
AAGAAGGAAGGGCAGACTAGGAAAGGAAGACAAGTATATTTGGGAGGTTATGATATGGAAGAAAAAGCTGCAAGAGCCTA
TGATCTCGCGGCTCTTAAGTACTGGGGACCTTCAACGCACATAAACTTTTCGATAGAAAATTACCAAGTTCAACTTGAGG
AAATGAAGAACATGAGCAGACAGGAATACGTTGCACACTTGAGAAGAAAAAGCAGCGGATTTTCTAGAGGTGCTTCAATA
TACAGAGGGGTCACAAGGCACCACCAACATGGAAGATGGCAAGCGAGGATAGGCAGAGTTGCTGGGAACAAAGACCTTTA
CCTTGGGACATTCAGCACTCAAGAGGAAGCAGCAGAAGCATACGATATAGCCGCAATAAAATTCCGCGGCGCGAATGCAG
TCACAAACTTTGACATTTCAAGATACGATGTCGAGAGAATCATGGCCAGTAGTAATCTCCTCGCTGGGGAGCTTGCAAGG
CGAAATAAAGATAACGATCCAAGAAACGAGGCCATTGACTACAACAAGAGTGTAGTAACAAGTGTGAACAATGGGGAAAC
GGTTCAAGTTCAAGCAAGAAACAACAATGAAAATGATTCGGAGTGGAAGATGGTTTTATTTAATCACCCTTTACAGCAAC
AGGCAAATGGCAGTGACCATAAAATAATGAACTGCGGAAATTCCAGGAACAGTGCATTTTCTATGGCCCTACAAGATCTC
ATTGGGGTTGATTCGGTGGGTTCTGAGCAGCATAATATGCTGGATGATTCTAGCAAAATTGGGACTCATTTTTCAAACCC
GTCCTCGCTGGTGACAAGTTTAAGCAGCTCAAGAGAGGCTAGTCCTGAGAAAATGGGTCCCTCGCTTCTGTTCCCAAAGC
CTCCTCCAATGGAAACAAAGATTGTGAACCCCATTGGTACGAGTGTTACCTCTTGGTTACCCTCACCAACGGTTCAAATG
AGGCCATCTCCTGCTATCTCTTTGTCTCACTTGCCAGTTTTTGCTGCTTGGACTGATACTTAA
Microexon DNA seq TATATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGCAAGAAGGAAGGGCAGACTAGGAAAGGAAGACAAGTATATTTGGGAGGTTATGATATGGAAGAAAAAGCTGCAAGAGCCTATGATCTCGCGGCT
Microexon-tag Amino Acid seq WDNSCKKEGQTRKGRQVYLGGYDMEEKAARAYDLAA
Transcript ID KRH52140
Gene ID Gm.42822
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.5e-13
Motif start 298
Motif end 356
Protein seq >KRH52140
MKRMNESNNTDDGNNHNWLGFSLSPHMKMEVTSAATVSDNNVPTTFYMSPSHMSNSGMCYSVGENGNFHSPLTVMPLKSD
GSLGILEALNRSQTQVMVPTSSPKLEDFLGGATMGTHEYGNHERGLSLDSIYYNSQNAEAQPNRNLLSHPFRQQGHVNVE
THPYYSVFACRGLYQAPSEEEATKETHVSVMPQMTGGGLQNWVAPTREYSTHQQILEQQMNCGIWNERSGVSVGTVGCGE
LQSLSLSMSPGSQSSCVTAPSGTDSVAVDAKKRGHAKLGQKQPVHRKSIDTFGQRTSQYRGVTRHRWTGRYEAHLWDNSC
KKEGQTRKGRQVYLGGYDMEEKAARAYDLAALKYWGPSTHINFSIENYQVQLEEMKNMSRQEYVAHLRRKSSGFSRGASI
YRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTQEEAAEAYDIAAIKFRGANAVTNFDISRYDVERIMASSNLLAGELAR
RNKDNDPRNEAIDYNKSVVTSVNNGETVQVQARNNNENDSEWKMVLFNHPLQQQANGSDHKIMNCGNSRNSAFSMALQDL
IGVDSVGSEQHNMLDDSSKIGTHFSNPSSLVTSLSSSREASPEKMGPSLLFPKPPPMETKIVNPIGTSVTSWLPSPTVQM
RPSPAISLSHLPVFAAWTDT*
CDS seq >KRH52140
ATGAAGCGCATGAATGAGAGTAACAACACCGATGATGGAAACAATCATAACTGGTTGGGGTTCTCTCTCTCGCCCCACAT
GAAAATGGAGGTTACTTCGGCAGCCACTGTTTCCGACAACAATGTTCCGACAACCTTCTACATGTCCCCTTCTCACATGT
CCAACTCCGGAATGTGTTACAGTGTCGGGGAAAATGGTAACTTCCATTCTCCTCTTACCGTTATGCCTCTCAAGTCTGAT
GGGTCACTTGGTATCTTGGAAGCTCTCAATAGATCACAAACGCAAGTGATGGTGCCAACTTCGTCTCCGAAATTGGAGGA
CTTCTTAGGTGGTGCAACTATGGGAACTCACGAATATGGGAACCACGAGAGAGGTTTGAGCCTAGACAGCATCTATTATA
ACTCACAAAACGCAGAGGCTCAACCCAACAGAAACCTTCTTTCACATCCCTTCAGGCAACAAGGGCATGTGAATGTCGAA
ACACACCCTTATTACTCTGTATTTGCTTGTCGAGGTTTGTATCAGGCACCGTCGGAGGAAGAAGCAACAAAGGAAACGCA
CGTTTCAGTAATGCCTCAAATGACAGGAGGAGGTTTGCAAAACTGGGTAGCTCCAACAAGGGAGTATTCAACTCATCAGC
AGATTCTGGAGCAGCAAATGAACTGTGGCATTTGGAATGAGAGAAGTGGGGTATCTGTTGGAACTGTGGGGTGTGGAGAG
TTGCAATCTCTAAGCTTATCTATGAGTCCTGGTTCTCAGTCTAGTTGTGTCACTGCTCCTTCTGGAACAGATTCTGTGGC
TGTGGATGCAAAGAAGAGAGGGCATGCTAAACTTGGTCAGAAGCAGCCTGTGCATAGAAAATCTATTGACACATTTGGGC
AAAGAACGTCGCAGTATAGAGGCGTCACAAGGCATAGATGGACTGGTAGGTATGAAGCGCATTTGTGGGATAATAGTTGC
AAGAAGGAAGGGCAGACTAGGAAAGGAAGACAAGTATATTTGGGAGGTTATGATATGGAAGAAAAAGCTGCAAGAGCCTA
TGATCTCGCGGCTCTTAAGTACTGGGGACCTTCAACGCACATAAACTTTTCGATAGAAAATTACCAAGTTCAACTTGAGG
AAATGAAGAACATGAGCAGACAGGAATACGTTGCACACTTGAGAAGAAAAAGCAGCGGATTTTCTAGAGGTGCTTCAATA
TACAGAGGGGTCACAAGGCACCACCAACATGGAAGATGGCAAGCGAGGATAGGCAGAGTTGCTGGGAACAAAGACCTTTA
CCTTGGGACATTCAGCACTCAAGAGGAAGCAGCAGAAGCATACGATATAGCCGCAATAAAATTCCGCGGCGCGAATGCAG
TCACAAACTTTGACATTTCAAGATACGATGTCGAGAGAATCATGGCCAGTAGTAATCTCCTCGCTGGGGAGCTTGCAAGG
CGAAATAAAGATAACGATCCAAGAAACGAGGCCATTGACTACAACAAGAGTGTAGTAACAAGTGTGAACAATGGGGAAAC
GGTTCAAGTTCAAGCAAGAAACAACAATGAAAATGATTCGGAGTGGAAGATGGTTTTATTTAATCACCCTTTACAGCAAC
AGGCAAATGGCAGTGACCATAAAATAATGAACTGCGGAAATTCCAGGAACAGTGCATTTTCTATGGCCCTACAAGATCTC
ATTGGGGTTGATTCGGTGGGTTCTGAGCAGCATAATATGCTGGATGATTCTAGCAAAATTGGGACTCATTTTTCAAACCC
GTCCTCGCTGGTGACAAGTTTAAGCAGCTCAAGAGAGGCTAGTCCTGAGAAAATGGGTCCCTCGCTTCTGTTCCCAAAGC
CTCCTCCAATGGAAACAAAGATTGTGAACCCCATTGGTACGAGTGTTACCTCTTGGTTACCCTCACCAACGGTTCAAATG
AGGCCATCTCCTGCTATCTCTTTGTCTCACTTGCCAGTTTTTGCTGCTTGGACTGATACTTAA