Microexon ID Gm_9:46953089-46953097:+
Species Glycine max
Coordinates 9:46953089..46953097
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq (cluster consensus, IUPAC ambiguity codes) TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGTAGAAGAGAGGGACAGACTCGCAAAGGAAGGCAAGTTTACTTGGGTGGTTATGATAAAGAAGAAAAGGCAGCTAGAGCCTACGATTTGGCAGCA
Microexon-tag Amino Acid Seq WDNSCRREGQTRKGRQVYLGGYDKEEKAARAYDLAA
Microexon-tag spanning region 46952852-46953273
Microexon-tag prediction score 0.9714
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH40267x
Reference Transcript ID KRH40267
Gene ID GLYMA_09G248200
Gene Name NA
Transcript ID KRH40267
Protein ID KRH40267
Gene ID GLYMA_09G248200
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.6e-12
Motif start 266
Motif end 324
Protein seq >KRH40267
MGSMNLLGFSLSPHEEHPSSQDHSQTTPSRFSFNPDGSISSTDVAGGCFDLTSDSTPHLLNLPSYGIYEAFHRNNSINTT
QDWKENYNSQNLLLGTSCNKQNMNQNQQQQPKLENFLGGHSFGEHEQTYGGNSASTDYMFPAQPVSAGGGGSGGGSNNNN
NSNSIGLSMIKTWLRNQPPNSENINNNNESGGNIRSSVQQTLSLSMSTGSQSSTSLPLLTASVDNGESSSDNKQPNTSAA
LDSTQTGAIETAPRKSIDTFGQRTSIYRGVTRHRWTGRYEAHLWDNSCRREGQTRKGRQVYLGGYDKEEKAARAYDLAAL
KYWGTTTTTNFPISHYEKELEEMKHMTRQEYVASLRRKSSGFSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFS
TQEEAAEAYDVAAIKFRGLSAVTNFDMSRYDVKSILESTTLPIGGAAKRLKDMEQVELSVDNGHRADQVDHSIIMSSHLT
QGINNNYAGGGTATHHNWHNAHAFHQPQPCTTMHYPYGQRINWCKQEQQDNSDAPHSLSYSDIHQLQLGNNGTHNFFHTN
SGLHPMLSMDSASIDNSSSSNSVVYDGYGGGGGYNVMPMGTTTAVVASDGDQNPRSNHGFGDNEIKALGYESVYGSATDS
YHAHARNLYYLTQQQSSSVDTVKASAYDQGSACNTWVPTAIPTHAPRSTTSMALCHGATTPFSLLHE*
CDS seq >KRH40267
ATGGGGTCTATGAATTTGTTAGGTTTTTCTCTCTCTCCTCACGAAGAACACCCTTCTAGTCAAGATCACTCTCAAACGAC
ACCTTCTCGTTTTAGCTTCAACCCTGATGGATCAATCTCAAGCACTGATGTAGCAGGAGGCTGCTTTGATCTCACTTCTG
ACTCAACTCCTCATTTACTTAACCTTCCTTCTTATGGCATATACGAAGCATTTCACAGAAACAATAGTATTAACACCACT
CAAGATTGGAAGGAGAACTACAACAGCCAAAATTTGCTATTGGGAACTTCGTGCAATAAACAAAACATGAACCAAAACCA
ACAGCAACAGCCAAAGCTTGAAAACTTCCTCGGTGGACACTCATTTGGCGAACATGAGCAAACCTACGGTGGTAACTCAG
CCTCTACAGATTACATGTTTCCTGCTCAGCCAGTATCGGCTGGTGGTGGTGGTAGTGGTGGTGGCAGTAACAATAACAAC
AACAGTAACTCCATAGGGTTATCCATGATAAAGACATGGTTGAGGAACCAACCACCGAACTCAGAAAACATCAACAACAA
CAATGAAAGTGGTGGCAATATTAGAAGCAGTGTGCAGCAAACTCTATCACTTTCCATGAGTACTGGTTCACAATCAAGCA
CATCACTGCCCCTTCTCACTGCTAGTGTGGATAATGGAGAGAGTTCTTCTGATAACAAACAACCAAACACCTCGGCTGCA
CTTGATTCCACCCAAACCGGAGCCATTGAAACTGCACCCAGAAAGTCCATTGACACTTTTGGACAGAGAACTTCTATCTA
CCGTGGTGTAACAAGGCATAGGTGGACGGGGAGGTACGAGGCTCACCTGTGGGATAATAGTTGTAGAAGAGAGGGACAGA
CTCGCAAAGGAAGGCAAGTTTACTTGGGTGGTTATGATAAAGAAGAAAAGGCAGCTAGAGCCTACGATTTGGCAGCACTA
AAATACTGGGGAACAACCACAACAACAAATTTTCCAATTAGCCACTATGAGAAAGAGTTGGAAGAAATGAAGCACATGAC
TAGGCAAGAGTACGTTGCGTCATTGAGAAGGAAGAGTAGTGGGTTTTCTCGCGGTGCATCCATTTATCGAGGAGTGACGA
GACACCACCAACATGGAAGGTGGCAAGCGAGGATTGGAAGAGTTGCTGGCAACAAGGATCTTTACTTGGGAACTTTTAGC
ACCCAAGAAGAGGCAGCGGAAGCATATGATGTAGCAGCAATCAAATTCCGAGGACTAAGTGCTGTTACAAACTTTGACAT
GAGCAGATATGACGTGAAAAGCATACTTGAGAGCACCACTTTGCCAATAGGTGGTGCTGCAAAGCGTTTGAAGGATATGG
AGCAGGTTGAACTGAGTGTGGATAATGGTCATAGAGCAGATCAAGTAGATCATAGTATCATCATGAGTTCTCACCTAACT
CAAGGAATCAATAACAACTATGCAGGAGGGGGAACAGCAACTCATCATAACTGGCACAATGCTCATGCATTCCACCAACC
TCAACCTTGCACCACCATGCACTACCCTTATGGACAAAGAATTAATTGGTGCAAGCAAGAACAACAAGACAACTCTGATG
CCCCTCACTCTTTGTCTTATTCAGATATTCATCAACTTCAGCTAGGGAACAATGGAACACATAACTTCTTTCACACAAAT
TCAGGGTTGCACCCTATGTTGAGCATGGATTCTGCTTCCATTGACAATAGCTCTTCTTCTAACTCGGTTGTTTATGATGG
TTATGGAGGTGGTGGGGGCTACAATGTGATGCCTATGGGAACTACTACTGCTGTTGTTGCAAGTGATGGTGATCAAAATC
CAAGAAGCAATCATGGTTTTGGTGATAATGAGATAAAAGCACTTGGTTATGAAAGTGTGTATGGCTCTGCAACTGATTCT
TATCATGCACATGCAAGGAACTTGTATTATCTTACTCAACAGCAATCATCTTCTGTTGATACAGTGAAGGCTAGTGCATA
TGATCAAGGGTCTGCATGCAATACTTGGGTTCCAACTGCTATTCCAACTCATGCACCCAGATCAACTACTAGTATGGCTC
TCTGCCATGGGGCTACTACACCCTTCTCTTTATTGCATGAATAG
Microexon DNA seq TTTACTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGATAATAGTTGTAGAAGAGAGGGACAGACTCGCAAAGGAAGGCAAGTTTACTTGGGTGGTTATGATAAAGAAGAAAAGGCAGCTAGAGCCTACGATTTGGCAGCA
Microexon-tag Amino Acid seq WDNSCRREGQTRKGRQVYLGGYDKEEKAARAYDLAA
Transcript ID KRH40267
Gene ID Gm.54197
Gene Name NA
Pfam domain motif AP2
Motif E-value 1.6e-12
Motif start 266
Motif end 324
Protein seq >KRH40267
MGSMNLLGFSLSPHEEHPSSQDHSQTTPSRFSFNPDGSISSTDVAGGCFDLTSDSTPHLLNLPSYGIYEAFHRNNSINTT
QDWKENYNSQNLLLGTSCNKQNMNQNQQQQPKLENFLGGHSFGEHEQTYGGNSASTDYMFPAQPVSAGGGGSGGGSNNNN
NSNSIGLSMIKTWLRNQPPNSENINNNNESGGNIRSSVQQTLSLSMSTGSQSSTSLPLLTASVDNGESSSDNKQPNTSAA
LDSTQTGAIETAPRKSIDTFGQRTSIYRGVTRHRWTGRYEAHLWDNSCRREGQTRKGRQVYLGGYDKEEKAARAYDLAAL
KYWGTTTTTNFPISHYEKELEEMKHMTRQEYVASLRRKSSGFSRGASIYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFS
TQEEAAEAYDVAAIKFRGLSAVTNFDMSRYDVKSILESTTLPIGGAAKRLKDMEQVELSVDNGHRADQVDHSIIMSSHLT
QGINNNYAGGGTATHHNWHNAHAFHQPQPCTTMHYPYGQRINWCKQEQQDNSDAPHSLSYSDIHQLQLGNNGTHNFFHTN
SGLHPMLSMDSASIDNSSSSNSVVYDGYGGGGGYNVMPMGTTTAVVASDGDQNPRSNHGFGDNEIKALGYESVYGSATDS
YHAHARNLYYLTQQQSSSVDTVKASAYDQGSACNTWVPTAIPTHAPRSTTSMALCHGATTPFSLLHE*
CDS seq >KRH40267
ATGGGGTCTATGAATTTGTTAGGTTTTTCTCTCTCTCCTCACGAAGAACACCCTTCTAGTCAAGATCACTCTCAAACGAC
ACCTTCTCGTTTTAGCTTCAACCCTGATGGATCAATCTCAAGCACTGATGTAGCAGGAGGCTGCTTTGATCTCACTTCTG
ACTCAACTCCTCATTTACTTAACCTTCCTTCTTATGGCATATACGAAGCATTTCACAGAAACAATAGTATTAACACCACT
CAAGATTGGAAGGAGAACTACAACAGCCAAAATTTGCTATTGGGAACTTCGTGCAATAAACAAAACATGAACCAAAACCA
ACAGCAACAGCCAAAGCTTGAAAACTTCCTCGGTGGACACTCATTTGGCGAACATGAGCAAACCTACGGTGGTAACTCAG
CCTCTACAGATTACATGTTTCCTGCTCAGCCAGTATCGGCTGGTGGTGGTGGTAGTGGTGGTGGCAGTAACAATAACAAC
AACAGTAACTCCATAGGGTTATCCATGATAAAGACATGGTTGAGGAACCAACCACCGAACTCAGAAAACATCAACAACAA
CAATGAAAGTGGTGGCAATATTAGAAGCAGTGTGCAGCAAACTCTATCACTTTCCATGAGTACTGGTTCACAATCAAGCA
CATCACTGCCCCTTCTCACTGCTAGTGTGGATAATGGAGAGAGTTCTTCTGATAACAAACAACCAAACACCTCGGCTGCA
CTTGATTCCACCCAAACCGGAGCCATTGAAACTGCACCCAGAAAGTCCATTGACACTTTTGGACAGAGAACTTCTATCTA
CCGTGGTGTAACAAGGCATAGGTGGACGGGGAGGTACGAGGCTCACCTGTGGGATAATAGTTGTAGAAGAGAGGGACAGA
CTCGCAAAGGAAGGCAAGTTTACTTGGGTGGTTATGATAAAGAAGAAAAGGCAGCTAGAGCCTACGATTTGGCAGCACTA
AAATACTGGGGAACAACCACAACAACAAATTTTCCAATTAGCCACTATGAGAAAGAGTTGGAAGAAATGAAGCACATGAC
TAGGCAAGAGTACGTTGCGTCATTGAGAAGGAAGAGTAGTGGGTTTTCTCGCGGTGCATCCATTTATCGAGGAGTGACGA
GACACCACCAACATGGAAGGTGGCAAGCGAGGATTGGAAGAGTTGCTGGCAACAAGGATCTTTACTTGGGAACTTTTAGC
ACCCAAGAAGAGGCAGCGGAAGCATATGATGTAGCAGCAATCAAATTCCGAGGACTAAGTGCTGTTACAAACTTTGACAT
GAGCAGATATGACGTGAAAAGCATACTTGAGAGCACCACTTTGCCAATAGGTGGTGCTGCAAAGCGTTTGAAGGATATGG
AGCAGGTTGAACTGAGTGTGGATAATGGTCATAGAGCAGATCAAGTAGATCATAGTATCATCATGAGTTCTCACCTAACT
CAAGGAATCAATAACAACTATGCAGGAGGGGGAACAGCAACTCATCATAACTGGCACAATGCTCATGCATTCCACCAACC
TCAACCTTGCACCACCATGCACTACCCTTATGGACAAAGAATTAATTGGTGCAAGCAAGAACAACAAGACAACTCTGATG
CCCCTCACTCTTTGTCTTATTCAGATATTCATCAACTTCAGCTAGGGAACAATGGAACACATAACTTCTTTCACACAAAT
TCAGGGTTGCACCCTATGTTGAGCATGGATTCTGCTTCCATTGACAATAGCTCTTCTTCTAACTCGGTTGTTTATGATGG
TTATGGAGGTGGTGGGGGCTACAATGTGATGCCTATGGGAACTACTACTGCTGTTGTTGCAAGTGATGGTGATCAAAATC
CAAGAAGCAATCATGGTTTTGGTGATAATGAGATAAAAGCACTTGGTTATGAAAGTGTGTATGGCTCTGCAACTGATTCT
TATCATGCACATGCAAGGAACTTGTATTATCTTACTCAACAGCAATCATCTTCTGTTGATACAGTGAAGGCTAGTGCATA
TGATCAAGGGTCTGCATGCAATACTTGGGTTCCAACTGCTATTCCAACTCATGCACCCAGATCAACTACTAGTATGGCTC
TCTGCCATGGGGCTACTACACCCTTCTCTTTATTGCATGAATAG
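
The microexon fields in this record (Coordinates, Size 9, Phase 1, Structure 49,9,50, and the microexon-tag DNA and amino acid sequences) can be cross-checked against each other. The following is a minimal sketch, not part of the database itself: the helper names `translate`, `split_tag`, and `microexon_residues` are hypothetical, translation assumes the standard genetic code, and "Phase 1" is assumed to mean that the upstream flanking exon contributes one base to the codon the microexon interrupts.

```python
from itertools import product

# Values copied from the record's fields.
TAG_DNA = "TGGGATAATAGTTGTAGAAGAGAGGGACAGACTCGCAAAGGAAGGCAAGTTTACTTGGGTGGTTATGATAAAGAAGAAAAGGCAGCTAGAGCCTACGATTTGGCAGCA"
STRUCTURE = (49, 9, 50)          # flanking exon, microexon, flanking exon sizes
START, END = 46953089, 46953097  # Coordinates field (assumed 1-based, inclusive)

# Standard genetic code, built in TCAG order (assumption: no alternative table applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, structure):
    """Split the tag into (upstream exon part, microexon, downstream exon part)."""
    up, micro, down = structure
    if up + micro + down != len(tag):
        raise ValueError("structure sizes do not add up to the tag length")
    return tag[:up], tag[up:up + micro], tag[up + micro:]


def microexon_residues(tag, structure):
    """Return the residues whose codons overlap the microexon."""
    up, micro, _ = structure
    first_codon = up // 3                # codon holding the first microexon base
    last_codon = (up + micro - 1) // 3   # codon holding the last microexon base
    return translate(tag)[first_codon:last_codon + 1]


if __name__ == "__main__":
    print("size from coordinates:", END - START + 1)
    print("microexon:", split_tag(TAG_DNA, STRUCTURE)[1])
    print("tag protein:", translate(TAG_DNA))
    print("microexon residues:", microexon_residues(TAG_DNA, STRUCTURE))
    print("bases of split codon in upstream exon:", STRUCTURE[0] % 3)
```

Under these assumptions the script reproduces the Size, Microexon DNA seq, Microexon-tag Amino Acid Seq, and Microexon Amino Acid seq fields. The 9-nt microexon overlaps four codons because it starts one base into a codon, which is why it is reported with the four residues VYLG rather than three.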