Microexon ID Gm_20:12622223-12622230:+
Species Glycine max
Coordinates 20:12622223..12622230
Microexon Cluster ID MEP20
Size 8
Phase 2
Pfam Domain Motif VSP
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 16,34,8,50
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq TWTRTTCCKGGMARAGATSAAAATGGAARYTWYSYRCCHYTRMAGWYRAGHHMARRWGGWWTAKCWRGTGGWGYYATWGCTGGMATATCTRTWGSWGGAGTWRCHGGK
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTCAGGAG
Microexon Amino Acid seq SSGG
Microexon-tag DNA Seq TATGTTCCCGGAAAAGATCAAAATGGTAGCTTTGTGCGTTTGCCGTCTAGTTCAGGAGGTCTTACTGGTAGAGCTATTGCAGGAATAGCTGTCGGAATTGTGGCAGCT
Microexon-tag Amino Acid Seq YVPGKDQNGSFVRLPSSSGGLTGRAIAGIAVGIVAA
Microexon-tag spanning region12621010-12622415
Microexon-tag prediction score0.9022
Overlapped with the annotated transcript (%) 100
New Transcript ID KRG89893x
Reference Transcript ID KRG89893
Gene ID GLYMA_20G054500
Gene Name NA
Transcript ID KRG89893
Protein ID KRG89893
Gene ID GLYMA_20G054500
Gene Name NA
Pfam domain motif SKG6
Motif E-value 0.38
Motif start 218
Motif end 254
Protein seq >KRG89893
MEALRLAYLLLPWWLVFSTAESACKEGCGVALGSYYLWRGSNLTYISSIMASSLLTTPDDIVNYNKDTVPSKDIIIADQR
VNVPFPCDCIDGQFLGHTFRYDVQSQDTYETVARSWFANLTDVAWLRRFNTYPPDNIPDTGTLNVTVNCSCGNTDVANYG
LFVTYPLRIGDTLGSVAANLSLDSALLQRYNPDVNFNQGTGLVYVPGKDQNGSFVRLPSSSGGLTGRAIAGIAVGIVAAL
LLLGVCIYVGYFRKKIQKDEFLPRDSTALFAQDGKDETSRSSANETSGPGGPAIITDITVNKSVEFSYEELATATDNFSL
ANKIGQGGFGSVYYAELRGEKAAIKKMDMQASKEFLAELNVLTRVHHLNLVRLIGYSIEGSLFLVYEYIENGNLSQHLRG
SGSREPLPWATRVQIALDSARGLEYIHEHTVPVYIHRDIKSANILIDKNFRGKVADFGLTKLTEVGSSSLPTGRLVGTFG
YMPPEYAQYGDVSPKVDVYAFGVVLYELISAKEAIVKTNDSVADSKGLVALFDGVLSQPDPTEELCKLVDPRLGDNYPID
SVRKMAQLAKACTQDNPQLRPSMRSIVVALMTLSSTTDDWDVGSFYENQNLVNLMSGR*
CDS seq >KRG89893
ATGGAAGCCTTGAGGTTGGCGTATCTCCTCCTTCCCTGGTGGTTGGTGTTTTCGACAGCAGAATCGGCGTGCAAGGAGGG
CTGTGGAGTGGCACTAGGCTCCTACTACTTATGGCGGGGCTCCAACCTCACCTACATCTCCTCCATCATGGCCTCCAGCC
TTCTGACAACTCCGGACGACATAGTCAACTACAACAAGGACACCGTGCCCAGCAAGGACATCATCATCGCCGACCAAAGG
GTCAACGTCCCCTTCCCCTGCGACTGCATCGACGGCCAGTTTCTTGGCCACACCTTCCGCTATGACGTTCAGTCACAGGA
CACGTACGAGACTGTAGCCAGGAGCTGGTTTGCCAACCTCACCGACGTGGCCTGGCTGAGGAGGTTCAATACCTATCCTC
CCGACAACATTCCCGACACGGGAACGCTGAATGTTACGGTTAATTGCTCTTGTGGGAACACTGATGTTGCCAATTATGGC
TTGTTCGTTACCTACCCTCTTAGGATTGGGGACACTCTGGGGTCGGTGGCTGCTAATTTGAGCCTTGACTCGGCCTTGCT
GCAGAGGTACAATCCCGATGTCAATTTCAACCAGGGGACTGGTCTGGTTTATGTTCCCGGAAAAGATCAAAATGGTAGCT
TTGTGCGTTTGCCGTCTAGTTCAGGAGGTCTTACTGGTAGAGCTATTGCAGGAATAGCTGTCGGAATTGTGGCAGCTCTT
CTGTTATTGGGAGTTTGTATATATGTTGGATATTTCCGTAAGAAGATACAGAAGGATGAATTTCTTCCACGAGATTCCAC
TGCACTCTTTGCTCAAGATGGGAAGGATGAAACTTCTCGTAGTAGTGCAAATGAAACTTCAGGACCTGGTGGACCTGCCA
TCATCACAGACATAACAGTGAACAAATCAGTGGAATTTTCATATGAGGAACTGGCAACCGCCACGGATAACTTCAGTTTG
GCTAATAAAATAGGTCAAGGTGGTTTTGGGTCAGTCTATTATGCAGAACTGAGGGGAGAGAAAGCCGCAATCAAGAAGAT
GGATATGCAAGCATCAAAAGAATTTCTTGCTGAATTGAACGTCTTGACACGTGTTCACCATCTTAATCTGGTGCGGTTGA
TTGGATATAGTATTGAGGGCTCTCTTTTCCTTGTCTATGAATACATTGAGAATGGAAACTTAAGTCAACATCTGCGTGGT
TCAGGTAGTAGAGAACCTCTGCCATGGGCTACTCGGGTGCAGATTGCTTTGGATTCTGCCAGAGGTCTTGAATATATTCA
CGAGCACACAGTGCCTGTATATATTCATCGTGATATAAAGTCAGCAAATATATTAATAGACAAAAACTTCCGGGGCAAGG
TTGCTGATTTTGGTTTGACCAAACTGACAGAAGTTGGAAGCTCATCGCTTCCCACTGGTCGTCTTGTTGGAACATTTGGA
TACATGCCACCGGAGTATGCTCAATATGGAGATGTTTCTCCCAAAGTAGACGTGTATGCTTTTGGAGTTGTTCTTTATGA
ACTTATTTCGGCTAAGGAAGCAATTGTCAAGACGAACGATTCTGTTGCCGACTCAAAGGGCCTCGTAGCTTTGTTTGATG
GAGTTCTTAGTCAGCCAGATCCTACGGAAGAGCTCTGCAAACTAGTTGATCCAAGGCTTGGGGATAACTACCCAATTGAT
TCAGTTCGCAAGATGGCACAACTAGCCAAAGCATGTACACAAGACAATCCCCAACTCCGTCCAAGTATGAGATCTATTGT
GGTTGCTCTTATGACACTTTCTTCAACTACCGACGATTGGGATGTTGGTTCCTTCTACGAAAATCAAAATCTTGTGAATC
TTATGTCCGGAAGATAG
Microexon DNA seq TTCAGGAG
Microexon Amino Acid seq SSGG
Microexon-tag DNA Seq TATGTTCCCGGAAAAGATCAAAATGGTAGCTTTGTGCGTTTGCCGTCTAGTTCAGGAGGTCTTACTGGTAGAGCTATTGCAGGAATAGCTGTCGGAATTGTGGCAGCT
Microexon-tag Amino Acid seq YVPGKDQNGSFVRLPSSSGGLTGRAIAGIAVGIVAA
Transcript ID KRG89894
Gene ID Gm.32897
Gene Name NA
Pfam domain motif SKG6
Motif E-value 0.33
Motif start 218
Motif end 254
Protein seq >KRG89894
MEALRLAYLLLPWWLVFSTAESACKEGCGVALGSYYLWRGSNLTYISSIMASSLLTTPDDIVNYNKDTVPSKDIIIADQR
VNVPFPCDCIDGQFLGHTFRYDVQSQDTYETVARSWFANLTDVAWLRRFNTYPPDNIPDTGTLNVTVNCSCGNTDVANYG
LFVTYPLRIGDTLGSVAANLSLDSALLQRYNPDVNFNQGTGLVYVPGKDQNGSFVRLPSSSGGLTGRAIAGIAVGIVAAL
LLLGVCIYVGYFRKKIQKDEFLPRDSTALFAQDGKDETSRSSANETSGPGGPAIITDITVNKSVEFSYEELATATDNFSL
ANKIGQGGFGSVYYAELRGEKAAIKKMDMQASKEFLAELNVLTRVHHLNLVRLIGYSIEGSLFLVYEYIENGNLSQHLRG
SGSREPLPWATRVQIALDSARGLEYIHEHTVPVYIHRDIKSANILIDKNFRGKVADFGLTKLTEVGSSSLPTGRLVGTFG
YMPPEYAQYGDVSPKVDVYAFGVVLYELISAKEAIVKTNDSVADSKGLVALVDLDFSTQEFLSFSQC*
CDS seq >KRG89894
ATGGAAGCCTTGAGGTTGGCGTATCTCCTCCTTCCCTGGTGGTTGGTGTTTTCGACAGCAGAATCGGCGTGCAAGGAGGG
CTGTGGAGTGGCACTAGGCTCCTACTACTTATGGCGGGGCTCCAACCTCACCTACATCTCCTCCATCATGGCCTCCAGCC
TTCTGACAACTCCGGACGACATAGTCAACTACAACAAGGACACCGTGCCCAGCAAGGACATCATCATCGCCGACCAAAGG
GTCAACGTCCCCTTCCCCTGCGACTGCATCGACGGCCAGTTTCTTGGCCACACCTTCCGCTATGACGTTCAGTCACAGGA
CACGTACGAGACTGTAGCCAGGAGCTGGTTTGCCAACCTCACCGACGTGGCCTGGCTGAGGAGGTTCAATACCTATCCTC
CCGACAACATTCCCGACACGGGAACGCTGAATGTTACGGTTAATTGCTCTTGTGGGAACACTGATGTTGCCAATTATGGC
TTGTTCGTTACCTACCCTCTTAGGATTGGGGACACTCTGGGGTCGGTGGCTGCTAATTTGAGCCTTGACTCGGCCTTGCT
GCAGAGGTACAATCCCGATGTCAATTTCAACCAGGGGACTGGTCTGGTTTATGTTCCCGGAAAAGATCAAAATGGTAGCT
TTGTGCGTTTGCCGTCTAGTTCAGGAGGTCTTACTGGTAGAGCTATTGCAGGAATAGCTGTCGGAATTGTGGCAGCTCTT
CTGTTATTGGGAGTTTGTATATATGTTGGATATTTCCGTAAGAAGATACAGAAGGATGAATTTCTTCCACGAGATTCCAC
TGCACTCTTTGCTCAAGATGGGAAGGATGAAACTTCTCGTAGTAGTGCAAATGAAACTTCAGGACCTGGTGGACCTGCCA
TCATCACAGACATAACAGTGAACAAATCAGTGGAATTTTCATATGAGGAACTGGCAACCGCCACGGATAACTTCAGTTTG
GCTAATAAAATAGGTCAAGGTGGTTTTGGGTCAGTCTATTATGCAGAACTGAGGGGAGAGAAAGCCGCAATCAAGAAGAT
GGATATGCAAGCATCAAAAGAATTTCTTGCTGAATTGAACGTCTTGACACGTGTTCACCATCTTAATCTGGTGCGGTTGA
TTGGATATAGTATTGAGGGCTCTCTTTTCCTTGTCTATGAATACATTGAGAATGGAAACTTAAGTCAACATCTGCGTGGT
TCAGGTAGTAGAGAACCTCTGCCATGGGCTACTCGGGTGCAGATTGCTTTGGATTCTGCCAGAGGTCTTGAATATATTCA
CGAGCACACAGTGCCTGTATATATTCATCGTGATATAAAGTCAGCAAATATATTAATAGACAAAAACTTCCGGGGCAAGG
TTGCTGATTTTGGTTTGACCAAACTGACAGAAGTTGGAAGCTCATCGCTTCCCACTGGTCGTCTTGTTGGAACATTTGGA
TACATGCCACCGGAGTATGCTCAATATGGAGATGTTTCTCCCAAAGTAGACGTGTATGCTTTTGGAGTTGTTCTTTATGA
ACTTATTTCGGCTAAGGAAGCAATTGTCAAGACGAACGATTCTGTTGCCGACTCAAAGGGCCTCGTAGCTTTGGTCGATC
TTGATTTCTCTACTCAAGAATTTTTATCATTTAGTCAATGTTAA