Microexon ID Gm_1:2248409-2248417:-
Species Glycine max
Coordinates 1:2248409..2248417
Microexon Cluster ID MEP21
Size 9
Phase 1
Pfam Domain Motif AP2
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,9,50
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW
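The sequence above contains IUPAC ambiguity codes (Y, R, W, S, K, M, V, D), so it appears to be a degenerate, cluster-level form of the microexon-tag rather than the sequence of one gene. A minimal Python sketch of how a concrete tag could be compared with it position by position follows. The function name `mismatches` is hypothetical, and the assumption that the two sequences align one-to-one without gaps is ours, not the database's.

```python
# IUPAC nucleotide codes mapped to the bases each one permits.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Degenerate tag from this record (cluster MEP21).
CONSENSUS = ("TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAG"
             "TTTAYYTRG"
             "GKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW")

# Concrete tag reported for KRH74476 in this record.
TAG = ("TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCCAGAAAAGGGCGTCAAG"
       "TTTATTTGG"
       "GTGGATATGATAAGGAAGAAAAGGCCGCGAGAGCTTATGATTTGGCAGCT")


def mismatches(consensus, seq):
    """Return (1-based position, consensus code, observed base) for every
    position where the observed base is not permitted by the code.
    Assumes an ungapped, equal-length comparison."""
    if len(consensus) != len(seq):
        raise ValueError("sequences must have the same length")
    return [(i + 1, code, base)
            for i, (code, base) in enumerate(zip(consensus, seq))
            if base not in IUPAC[code]]


if __name__ == "__main__":
    for pos, code, base in mismatches(CONSENSUS, TAG):
        print(f"position {pos}: consensus {code}, observed {base}")
```

A concrete tag need not satisfy the degenerate sequence at every position. Position 21, for instance, is W (A or T) in the degenerate form but G in the KRH74476 tag.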
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCCAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGAAAAGGCCGCGAGAGCTTATGATTTGGCAGCT
Microexon-tag Amino Acid Seq WDNSCRREGQARKGRQVYLGGYDKEEKAARAYDLAA
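The relationship between the tag structure (49,9,50), the phase, and the amino acid sequences above can be checked with a short Python sketch. It assumes the standard genetic code and that the tag is read in frame from its first nucleotide. Because the upstream flanking exon is 49 nt (16 codons plus one base), the microexon then begins at the second position of a codon, which is consistent with the reported phase of 1. The helper names `translate`, `split_tag`, and `microexon_residues` are hypothetical.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Microexon-tag DNA sequence and exon sizes as listed in this record.
TAG = ("TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCCAGAAAAGGGCGTCAAG"
       "TTTATTTGG"
       "GTGGATATGATAAGGAAGAAAAGGCCGCGAGAGCTTATGATTTGGCAGCT")
SIZES = (49, 9, 50)  # flanking exon, microexon, flanking exon


def translate(dna):
    """Translate from the first base, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def split_tag(tag, sizes):
    """Cut the tag into consecutive pieces of the given sizes."""
    parts, pos = [], 0
    for size in sizes:
        parts.append(tag[pos:pos + size])
        pos += size
    return parts


def microexon_residues(tag, sizes):
    """Residues whose codons overlap the microexon (the middle piece)."""
    start = sizes[0]               # 0-based index of first microexon base
    last = start + sizes[1] - 1    # 0-based index of last microexon base
    return translate(tag)[start // 3:last // 3 + 1]


if __name__ == "__main__":
    print(split_tag(TAG, SIZES)[1])        # microexon DNA
    print(microexon_residues(TAG, SIZES))  # residues spanning the microexon
    print(translate(TAG))                  # full tag translation
```

Under these assumptions the 9-nt microexon overlaps four codons rather than three, because its first two bases complete a codon started in the upstream exon and its last base starts a codon finished in the downstream exon. That is why the microexon amino acid sequence has four residues.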
Microexon-tag spanning region 2248244-2249787
Microexon-tag prediction score 0.9705
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH74476x
Reference Transcript ID KRH74476
Gene ID GLYMA_01G022500
Gene Name NA
Transcript ID KRH74476
Protein ID KRH74476
Gene ID GLYMA_01G022500
Gene Name NA
Pfam domain motif AP2
Motif E-value 9.6e-13
Motif start 224
Motif end 282
Protein seq >KRH74476
MARATNWLSFSLSPMEMLRTSEPQFLQYDAASATSSHHYYLDNLYTNGWGNGSLKFEQNLNHSDVSFVESSSQSVGHVPP
PPPKLEDFLGDSSAVMRYSDSQTETQDSSLTHIYDHHHHHHHHHGSTSYFGGDQQDLKAITGFQAFSTNSGSEVDDSASI
GKAQASEFGTHSIESSGNEFAAFSGGTTGTLSLAVALSSEKAVVAAESNSSKKIVDTFGQRTSIYRGVTRHRWTGRYEAH
LWDNSCRREGQARKGRQVYLGGYDKEEKAARAYDLAALKYWGPTATTNFPVSNYSKEVEEMKHVTKQEFIASLRRKSSGF
SRGASIYRGVTRHHQQGRWQARIGRVAGNKDLYLGTFATEEEAAEAYDIAAIKFRGANAVTNFEMNRYDVEAIMKSSLPV
GGAAKRLRLSLESEQKAPPVNSSSQQQNPQCGNVSGSINFSAIHQPIASIPCGIPFDSTTAYYPHNLFQHFHPTNAGAAA
SAVTSANATALTALPASAATEFFIWPHQSY*
CDS seq >KRH74476
ATGGCTCGTGCTACTAACTGGCTTTCGTTCTCTCTCTCCCCAATGGAAATGCTCCGAACCTCCGAACCTCAGTTCCTTCA
ATACGACGCCGCTTCCGCTACTTCCTCACATCACTACTACCTCGACAACTTGTACACCAACGGGTGGGGCAACGGGAGCC
TCAAGTTTGAGCAGAATCTCAACCACAGTGACGTGAGTTTTGTTGAATCTTCGTCGCAGAGCGTCGGCCACGTGCCGCCG
CCGCCGCCGAAGCTGGAGGATTTTCTCGGCGACTCCTCCGCCGTGATGCGTTACTCCGACAGCCAGACGGAGACGCAGGA
CTCGTCGCTGACGCACATCTACGACCACCACCACCACCACCACCACCACCACGGTTCTACTTCGTACTTCGGCGGTGACC
AGCAGGATCTCAAGGCCATTACTGGATTTCAAGCTTTTTCGACCAACTCCGGTTCCGAGGTTGATGATTCTGCATCCATC
GGAAAAGCGCAGGCCAGCGAGTTCGGGACTCACTCTATTGAGTCCTCCGGCAACGAGTTCGCCGCGTTCTCCGGTGGCAC
AACCGGAACCTTGTCGCTCGCCGTTGCACTGAGCTCCGAGAAGGCCGTTGTCGCGGCGGAGTCCAATAGCTCGAAGAAGA
TCGTGGATACCTTCGGCCAGCGGACTTCTATTTACAGAGGTGTTACTAGGCACCGATGGACAGGAAGATATGAAGCGCAT
CTATGGGACAATAGTTGCAGAAGGGAGGGTCAAGCCAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGA
AAAGGCCGCGAGAGCTTATGATTTGGCAGCTCTAAAGTACTGGGGTCCCACTGCTACCACCAACTTCCCTGTTTCCAATT
ATTCGAAGGAAGTGGAGGAGATGAAACATGTAACAAAGCAAGAATTTATTGCATCATTGCGGAGGAAAAGTAGTGGTTTC
TCCAGGGGAGCTTCCATATACAGAGGTGTTACAAGGCATCATCAACAGGGTAGGTGGCAAGCAAGAATTGGCCGTGTAGC
TGGAAACAAAGATTTATACTTGGGAACATTCGCAACCGAGGAGGAAGCAGCAGAGGCATATGATATTGCAGCCATAAAGT
TCAGAGGTGCAAACGCGGTAACCAACTTTGAGATGAATAGATATGATGTGGAAGCTATAATGAAGAGTTCTCTTCCAGTG
GGTGGGGCAGCAAAACGCTTGAGGCTTTCCCTTGAATCAGAGCAGAAAGCTCCTCCTGTGAACAGCAGCAGTCAGCAGCA
GAATCCACAGTGTGGTAACGTGAGTGGTAGCATCAATTTCTCAGCCATTCATCAGCCAATTGCTTCAATCCCTTGTGGAA
TTCCGTTTGATTCAACAACAGCATATTATCCTCACAACCTTTTCCAACATTTTCACCCTACCAACGCTGGTGCAGCAGCG
TCTGCTGTTACTTCTGCCAATGCAACCGCACTAACTGCACTGCCAGCATCAGCAGCAACTGAGTTCTTTATTTGGCCTCA
TCAGTCTTATTGA
Microexon DNA seq TTTATTTGG
Microexon Amino Acid seq VYLG
Microexon-tag DNA Seq TGGGACAATAGTTGCAGAAGGGAGGGTCAAGCCAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGAAAAGGCCGCGAGAGCTTATGATTTGGCAGCT
Microexon-tag Amino Acid seq WDNSCRREGQARKGRQVYLGGYDKEEKAARAYDLAA
Transcript ID Gm.240.1
Gene ID Gm.240
Gene Name NA
Pfam domain motif AP2
Motif E-value 9.6e-13
Motif start 224
Motif end 282
Protein seq >Gm.240.1
MARATNWLSFSLSPMEMLRTSEPQFLQYDAASATSSHHYYLDNLYTNGWGNGSLKFEQNLNHSDVSFVESSSQSVGHVPP
PPPKLEDFLGDSSAVMRYSDSQTETQDSSLTHIYDHHHHHHHHHGSTSYFGGDQQDLKAITGFQAFSTNSGSEVDDSASI
GKAQASEFGTHSIESSGNEFAAFSGGTTGTLSLAVALSSEKAVVAAESNSSKKIVDTFGQRTSIYRGVTRHRWTGRYEAH
LWDNSCRREGQARKGRQVYLGGYDKEEKAARAYDLAALKYWGPTATTNFPVSNYSKEVEEMKHVTKQEFIASLRRKSSGF
SRGASIYRGVTRHHQQGRWQARIGRVAGNKDLYLGTFATEEEAAEAYDIAAIKFRGANAVTNFEMNRYDVEAIMKSSLPV
GGAAKRLRLSLESEQKAPPVNSSSQQQNPQCGNVSGSINFSAIHQPIASIPCGIPFDSTTAYYPHNLFQHFHPTNAGAAA
SAVTSANATALTALPASAATEFFIWPHQSY*
CDS seq >Gm.240.1
ATGGCTCGTGCTACTAACTGGCTTTCGTTCTCTCTCTCCCCAATGGAAATGCTCCGAACCTCCGAACCTCAGTTCCTTCA
ATACGACGCCGCTTCCGCTACTTCCTCACATCACTACTACCTCGACAACTTGTACACCAACGGGTGGGGCAACGGGAGCC
TCAAGTTTGAGCAGAATCTCAACCACAGTGACGTGAGTTTTGTTGAATCTTCGTCGCAGAGCGTCGGCCACGTGCCGCCG
CCGCCGCCGAAGCTGGAGGATTTTCTCGGCGACTCCTCCGCCGTGATGCGTTACTCCGACAGCCAGACGGAGACGCAGGA
CTCGTCGCTGACGCACATCTACGACCACCACCACCACCACCACCACCACCACGGTTCTACTTCGTACTTCGGCGGTGACC
AGCAGGATCTCAAGGCCATTACTGGATTTCAAGCTTTTTCGACCAACTCCGGTTCCGAGGTTGATGATTCTGCATCCATC
GGAAAAGCGCAGGCCAGCGAGTTCGGGACTCACTCTATTGAGTCCTCCGGCAACGAGTTCGCCGCGTTCTCCGGTGGCAC
AACCGGAACCTTGTCGCTCGCCGTTGCACTGAGCTCCGAGAAGGCCGTTGTCGCGGCGGAGTCCAATAGCTCGAAGAAGA
TCGTGGATACCTTCGGCCAGCGGACTTCTATTTACAGAGGTGTTACTAGGCACCGATGGACAGGAAGATATGAAGCGCAT
CTATGGGACAATAGTTGCAGAAGGGAGGGTCAAGCCAGAAAAGGGCGTCAAGTTTATTTGGGTGGATATGATAAGGAAGA
AAAGGCCGCGAGAGCTTATGATTTGGCAGCTCTAAAGTACTGGGGTCCCACTGCTACCACCAACTTCCCTGTTTCCAATT
ATTCGAAGGAAGTGGAGGAGATGAAACATGTAACAAAGCAAGAATTTATTGCATCATTGCGGAGGAAAAGTAGTGGTTTC
TCCAGGGGAGCTTCCATATACAGAGGTGTTACAAGGCATCATCAACAGGGTAGGTGGCAAGCAAGAATTGGCCGTGTAGC
TGGAAACAAAGATTTATACTTGGGAACATTCGCAACCGAGGAGGAAGCAGCAGAGGCATATGATATTGCAGCCATAAAGT
TCAGAGGTGCAAACGCGGTAACCAACTTTGAGATGAATAGATATGATGTGGAAGCTATAATGAAGAGTTCTCTTCCAGTG
GGTGGGGCAGCAAAACGCTTGAGGCTTTCCCTTGAATCAGAGCAGAAAGCTCCTCCTGTGAACAGCAGCAGTCAGCAGCA
GAATCCACAGTGTGGTAACGTGAGTGGTAGCATCAATTTCTCAGCCATTCATCAGCCAATTGCTTCAATCCCTTGTGGAA
TTCCGTTTGATTCAACAACAGCATATTATCCTCACAACCTTTTCCAACATTTTCACCCTACCAACGCTGGTGCAGCAGCG
TCTGCTGTTACTTCTGCCAATGCAACCGCACTAACTGCACTGCCAGCATCAGCAGCAACTGAGTTCTTTATTTGGCCTCA
TCAGTCTTATTGA