Microexon ID Gm_20:40614918-40614931:+
Species Glycine max
Coordinates 20:40614918..40614931
Microexon Cluster ID MEP39
Size 14
Phase 1
Pfam Domain Motif Unknown
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 24,22,14,48
Microexon location in the Microexon-tag 3
Microexon-tag consensus DNA Seq (IUPAC ambiguity codes) GRWGMWGGAGRYATGTAYKSYBTYCAASSTTCTGGAGCYMGKGCAGKTGGATTTCCWCAGATGGSMAATGCTGCAGCMATTGCAGCTGCCTTTGSKGGWGGTTTGCCT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTGGATTTCCACAG
Microexon Amino Acid seq VGFPQ
Microexon-tag DNA Seq GATGCAGGAAATATGTATGCTGCTCAAGGTTCTGGAGCCAGGGCAGTTGGATTTCCACAGATGGCAAATGCTGCAGCCATTGCAGCTGCCTTTGGGGGAGGTCTACCT
Microexon-tag Amino Acid Seq DAGNMYAAQGSGARAVGFPQMANAAAIAAAFGGGLP
Microexon-tag spanning region 40614775-40615443
Microexon-tag prediction score 0.981
Overlapped with the annotated transcript (%) 100
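The fields above (tag structure 24,22,14,48, microexon location 3, size 14, phase 1) can be cross-checked against the listed sequences. The following is a minimal sketch, not the database's own procedure. It assumes that the microexon-tag starts in frame (which its listed amino acid sequence suggests) and that "Phase" means the number of codon bases lying upstream of the microexon's first base. The names `TAG_DNA`, `EXON_SIZES`, `MICROEXON_POSITION`, and `translate` are illustrative only.

```python
# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Values copied from the record above.
TAG_DNA = "GATGCAGGAAATATGTATGCTGCTCAAGGTTCTGGAGCCAGGGCAGTTGGATTTCCACAGATGGCAAATGCTGCAGCCATTGCAGCTGCCTTTGGGGGAGGTCTACCT"
EXON_SIZES = [24, 22, 14, 48]   # "Structure of Microexon-tag"
MICROEXON_POSITION = 3          # 1-based "Microexon location in the Microexon-tag"


def translate(dna):
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# 0-based half-open interval of the microexon inside the tag.
start = sum(EXON_SIZES[:MICROEXON_POSITION - 1])
end = start + EXON_SIZES[MICROEXON_POSITION - 1]
microexon_dna = TAG_DNA[start:end]

# Assumed phase convention: codon bases contributed by the upstream exon.
phase = start % 3

# Widen the interval to whole codons, so that a codon split across the
# upstream exon boundary is translated together with the microexon.
codon_start = start - phase
codon_end = end + (-end) % 3
microexon_aa = translate(TAG_DNA[codon_start:codon_end])

print("microexon DNA:", microexon_dna)
print("phase:", phase)
print("microexon amino acids:", microexon_aa)
print("tag amino acids:", translate(TAG_DNA))
```

Under these assumptions the sketch reproduces the record's values: the 14 nt microexon TTGGATTTCCACAG, phase 1, and the peptide VGFPQ, whose first codon (GTT) borrows its leading G from the upstream flanking exon.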
New Transcript ID KRG91679x
Reference Transcript ID KRG91679
Gene ID GLYMA_20G168400
Gene Name NA
Transcript ID KRG91679
Protein ID KRG91679
Gene ID GLYMA_20G168400
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRG91679
MTEPSKVIHVRNVGHEISENDLLQLFQPFGVITKLVMLRAKNQALIQMQDVPSAVNALQFYANVQPSIRGRNVYVQFSSH
QELTTMEQSQGRGDEPNRILLVTVHHMLYPMTVDVLYQVFSPHGSVEKIVTFQKSAGFQALIQYQSRQSAVAARSTLQGR
NIYDGCCQLDIQFSNLDELQVNYNNDRSRDFTNPNLPTEQKGRPSQPGYGDAGNMYAAQGSGARAVGFPQMANAAAIAAA
FGGGLPPGITGTNDRCTVLVSNLNPDRIDEDKLFNLFSIYGNIVRIKLLRNKPDHALIQMGDGFQAELAVHFLKGAMLFE
KRLEVNFSKHPNITQGADTHEYINSNLNRFNRNAAKNYRYCCSPTKMIHLSTLPQDITEEEIVSLVEEHGTIVNSKVFEM
NGKKQALVQFGNEEQATEALVCKHASTLSGSVIRISFSQLQNI*
CDS seq >KRG91679
ATGACTGAACCTTCCAAGGTCATTCACGTTCGAAATGTGGGGCACGAAATATCTGAAAATGATTTACTTCAGCTGTTTCA
GCCTTTTGGAGTCATAACAAAGCTTGTCATGCTGCGTGCAAAAAATCAGGCTCTTATCCAAATGCAAGATGTTCCTTCTG
CAGTTAATGCCTTACAATTTTATGCAAATGTTCAGCCAAGCATAAGGGGGAGGAATGTTTATGTTCAATTTTCCTCACAT
CAGGAATTAACAACAATGGAGCAAAGTCAAGGACGAGGAGATGAGCCAAACCGAATTCTCTTAGTCACAGTTCATCACAT
GCTGTATCCTATGACAGTGGATGTGCTGTATCAAGTATTTTCTCCCCATGGATCTGTGGAAAAGATTGTAACATTTCAGA
AGTCAGCTGGCTTTCAGGCTCTCATCCAGTATCAATCACGTCAGAGTGCTGTTGCGGCAAGAAGTACACTTCAGGGACGC
AATATTTATGATGGTTGCTGTCAGCTGGACATTCAGTTCTCAAACCTTGATGAATTACAAGTGAACTACAATAATGACCG
TTCAAGGGACTTCACAAACCCAAATCTCCCTACAGAGCAGAAAGGAAGACCTTCACAACCTGGATATGGTGATGCAGGAA
ATATGTATGCTGCTCAAGGTTCTGGAGCCAGGGCAGTTGGATTTCCACAGATGGCAAATGCTGCAGCCATTGCAGCTGCC
TTTGGGGGAGGTCTACCTCCTGGAATAACTGGGACAAATGACAGGTGTACAGTTCTTGTATCAAATCTTAATCCTGATAG
AATTGATGAGGATAAACTATTCAACCTGTTCTCTATTTATGGGAATATTGTGAGAATTAAACTTCTCCGAAATAAACCAG
ATCATGCACTTATTCAGATGGGAGATGGTTTTCAAGCAGAATTGGCTGTACACTTTCTGAAGGGAGCCATGTTGTTTGAG
AAACGATTAGAGGTCAACTTCTCCAAGCATCCGAACATAACCCAAGGTGCTGACACACATGAATACATCAATTCAAATCT
CAATCGTTTCAACCGTAACGCAGCCAAAAACTACCGGTACTGCTGCTCCCCAACAAAGATGATCCATTTGTCCACACTCC
CACAAGACATCACTGAAGAGGAGATTGTGAGCCTTGTAGAGGAACATGGAACCATTGTCAACAGCAAAGTCTTTGAGATG
AACGGGAAAAAACAGGCTCTGGTTCAGTTTGGGAATGAGGAGCAGGCTACTGAAGCCCTTGTGTGCAAGCATGCAAGCAC
ACTTTCTGGCTCAGTAATCCGTATCTCCTTTTCACAGTTGCAGAATATATGA
Microexon DNA seq TTGGATTTCCACAG
Microexon Amino Acid seq VGFPQ
Microexon-tag DNA Seq GATGCAGGAAATATGTATGCTGCTCAAGGTTCTGGAGCCAGGGCAGTTGGATTTCCACAGATGGCAAATGCTGCAGCCATTGCAGCTGCCTTTGGGGGAGGTCTACCT
Microexon-tag Amino Acid seq DAGNMYAAQGSGARAVGFPQMANAAAIAAAFGGGLP
Transcript ID Gm.34011.1
Gene ID Gm.34011
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Gm.34011.1
MTEPSKVIHVRNVGHEISENDLLQLFQPFGVITKLVMLRAKNQALIQMQDVPSAVNALQFYANVQPSIRGRNVYVQFSSH
QELTTMEQSQGRGDEPNRILLVTVHHMLYPMTVDVLYQVFSPHGSVEKIVTFQKSAGFQALIQYQSRQSAVAARSTLQGR
NIYDGCCQLDIQFSNLDELQVNYNNDRSRDFTNPNLPTEQKGRPSQPGYGDAGNMYAAQGSGARAVGFPQMANAAAIAAA
FGGGLPPGITGTNDRCTVLVSNLNPDRIDEDKLFNLFSIYGNIVRIKLLRNKPDHALIQMGDGFQAELAVHFLKGAMLFE
KRLEVNFSKHPNITQGADTHEYINSNLNRFNRNAAKNYRYCCSPTKMIHLSTLPQDITEEEIVSLVEEHGTIVNSKVFEM
NGKKQALVQFGNEEQATEALVCKHASTLSGSVIRISFSQLQNI*
CDS seq >Gm.34011.1
ATGACTGAACCTTCCAAGGTCATTCACGTTCGAAATGTGGGGCACGAAATATCTGAAAATGATTTACTTCAGCTGTTTCA
GCCTTTTGGAGTCATAACAAAGCTTGTCATGCTGCGTGCAAAAAATCAGGCTCTTATCCAAATGCAAGATGTTCCTTCTG
CAGTTAATGCCTTACAATTTTATGCAAATGTTCAGCCAAGCATAAGGGGGAGGAATGTTTATGTTCAATTTTCCTCACAT
CAGGAATTAACAACAATGGAGCAAAGTCAAGGACGAGGAGATGAGCCAAACCGAATTCTCTTAGTCACAGTTCATCACAT
GCTGTATCCTATGACAGTGGATGTGCTGTATCAAGTATTTTCTCCCCATGGATCTGTGGAAAAGATTGTAACATTTCAGA
AGTCAGCTGGCTTTCAGGCTCTCATCCAGTATCAATCACGTCAGAGTGCTGTTGCGGCAAGAAGTACACTTCAGGGACGC
AATATTTATGATGGTTGCTGTCAGCTGGACATTCAGTTCTCAAACCTTGATGAATTACAAGTGAACTACAATAATGACCG
TTCAAGGGACTTCACAAACCCAAATCTCCCTACAGAGCAGAAAGGAAGACCTTCACAACCTGGATATGGTGATGCAGGAA
ATATGTATGCTGCTCAAGGTTCTGGAGCCAGGGCAGTTGGATTTCCACAGATGGCAAATGCTGCAGCCATTGCAGCTGCC
TTTGGGGGAGGTCTACCTCCTGGAATAACTGGGACAAATGACAGGTGTACAGTTCTTGTATCAAATCTTAATCCTGATAG
AATTGATGAGGATAAACTATTCAACCTGTTCTCTATTTATGGGAATATTGTGAGAATTAAACTTCTCCGAAATAAACCAG
ATCATGCACTTATTCAGATGGGAGATGGTTTTCAAGCAGAATTGGCTGTACACTTTCTGAAGGGAGCCATGTTGTTTGAG
AAACGATTAGAGGTCAACTTCTCCAAGCATCCGAACATAACCCAAGGTGCTGACACACATGAATACATCAATTCAAATCT
CAATCGTTTCAACCGTAACGCAGCCAAAAACTACCGGTACTGCTGCTCCCCAACAAAGATGATCCATTTGTCCACACTCC
CACAAGACATCACTGAAGAGGAGATTGTGAGCCTTGTAGAGGAACATGGAACCATTGTCAACAGCAAAGTCTTTGAGATG
AACGGGAAAAAACAGGCTCTGGTTCAGTTTGGGAATGAGGAGCAGGCTACTGAAGCCCTTGTGTGCAAGCATGCAAGCAC
ACTTTCTGGCTCAGTAATCCGTATCTCCTTTTCACAGTTGCAGAATATATGA