Microexon ID Gm_2:45470554-45470561:+
Species Glycine max
Coordinates 2:45470554..45470561
Microexon Cluster ID MEP20
Size 8
Phase 2
Pfam Domain Motif VSP
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 16,34,8,50
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq TWTRTTCCKGGMARAGATSAAAATGGAARYTWYSYRCCHYTRMAGWYRAGHHMARRWGGWWTAKCWRGTGGWGYYATWGCTGGMATATCTRTWGSWGGAGTWRCHGGK
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq CTCTGGAG
Microexon Amino Acid seq SSGG
Microexon-tag DNA Seq TATATTCCAGGAAAAGACCAAAATGCTATCTATGTGCCCCTTCATCTAAGCTCTGGAGGTCTTGCGGGTGGTGTTATTGCTGGAATATCTATTGGAGTAGTAACAGGA
Microexon-tag Amino Acid Seq YIPGKDQNAIYVPLHLSSGGLAGGVIAGISIGVVTG
Microexon-tag spanning region45469483-45470757
Microexon-tag prediction score0.9172
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH73391x
Reference Transcript ID KRH73391
Gene ID GLYMA_02G270700
Gene Name NA
Transcript ID KRH73391
Protein ID KRH73391
Gene ID GLYMA_02G270700
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRH73391
MEHSFRLPVFFLLCASIAFSAESKCSRGCDLALASYYLSQGDLTYVSKLMESEVVSKPEDILSYNTDTITNKDLLPASIR
VNVPFPCDCIDEEFLGHTFQYNLTTGDTYLSIATQNYSNLTTAEWLRSFNRYLPANIPDSGTLNVTINCSCGNSEVSKDY
GLFITYPLRPEDSLQSIANETGVDRDLLVKYNPGVNFSQGSGLVYIPGKDQNAIYVPLHLSSGGLAGGVIAGISIGVVTG
LLLLAFCVYVTYYRRKKVWKKDLLSEESRKNSARVKNDEASGDSAAEGGTNTIGIRVNKSAEFSYEELANATNNFSLANK
IGQGGFGVVYYAELNGEKAAIKKMDIQATREFLAELKVLTHVHHLNLVRLIGYCVEGSLFLVYEYIENGNLGQHLRKSGF
NPLPWSTRVQIALDSARGLQYIHEHTVPVYIHRDIKSENILIDKNFGAKVADFGLTKLIDVGSSSLPTVNMKGTFGYMPP
EYAYGNVSPKIDVYAFGVVLYELISGKEALSRGGVSGAELKGLVSLFDEVFDQQDTTEGLKKLVDPRLGDNYPIDSVCKM
AQLARACTESDPQQRPNMSSVVVTLTALTSTTEDWDIASIIENPTLANLMSGK*
CDS seq >KRH73391
ATGGAACACAGTTTCAGATTACCAGTTTTCTTCTTGTTATGTGCCTCTATAGCGTTCAGTGCAGAATCCAAGTGTAGCAG
GGGTTGTGATTTAGCTCTAGCTTCCTACTATCTATCACAAGGTGACTTGACATATGTATCAAAGCTTATGGAATCTGAGG
TTGTTTCAAAACCTGAAGATATTCTCAGCTACAACACTGACACCATAACAAACAAAGACCTGTTGCCTGCCTCTATCAGA
GTGAACGTTCCATTCCCTTGTGACTGCATTGATGAAGAGTTTCTTGGCCATACTTTTCAATACAACCTTACAACAGGAGA
CACTTATTTGTCCATTGCCACTCAGAACTACTCTAATTTGACCACTGCTGAGTGGTTGCGGAGCTTCAACAGATATTTAC
CAGCTAATATTCCTGATAGTGGGACTCTTAATGTCACCATTAACTGTTCCTGTGGGAATAGTGAAGTTTCCAAGGATTAT
GGATTGTTCATCACGTACCCTCTTAGACCTGAGGATTCTTTGCAGTCGATTGCCAACGAGACTGGCGTTGATCGTGACTT
GCTGGTTAAGTACAACCCGGGTGTAAATTTTAGCCAAGGGAGTGGTCTGGTTTATATTCCAGGAAAAGACCAAAATGCTA
TCTATGTGCCCCTTCATCTAAGCTCTGGAGGTCTTGCGGGTGGTGTTATTGCTGGAATATCTATTGGAGTAGTAACAGGA
CTTCTGCTATTGGCATTTTGTGTGTATGTTACATATTACCGAAGAAAGAAGGTATGGAAGAAGGATTTGCTCTCAGAAGA
ATCCAGGAAGAACTCTGCTAGAGTTAAGAATGATGAAGCCTCTGGTGATTCGGCTGCAGAAGGTGGTACTAACACCATTG
GCATTAGGGTGAACAAATCAGCAGAGTTTTCATATGAGGAACTAGCCAATGCCACAAATAACTTCAGTTTGGCTAATAAA
ATTGGTCAAGGTGGTTTTGGGGTAGTCTATTATGCAGAGCTGAATGGAGAGAAAGCTGCAATAAAAAAGATGGACATACA
AGCAACAAGAGAATTTCTTGCGGAATTGAAAGTGTTGACACATGTTCATCACTTGAACCTGGTGCGCTTGATTGGATATT
GTGTTGAGGGCTCCCTTTTTCTTGTCTATGAGTACATTGAGAATGGCAACTTAGGACAACATCTACGTAAATCAGGTTTC
AATCCTTTGCCATGGTCTACCCGAGTTCAAATTGCTCTGGATTCAGCCAGAGGTCTTCAATACATTCATGAGCATACGGT
ACCTGTATATATCCATCGTGACATAAAGTCGGAAAACATTTTAATAGACAAAAACTTCGGTGCAAAGGTTGCAGACTTTG
GATTAACCAAGTTGATTGATGTTGGAAGTTCATCACTTCCCACTGTTAATATGAAGGGCACATTTGGTTACATGCCACCA
GAATATGCATATGGCAATGTTTCTCCCAAAATAGATGTCTATGCTTTTGGAGTTGTTCTTTATGAACTAATTTCTGGTAA
AGAAGCATTGAGCAGAGGTGGTGTCTCTGGTGCTGAACTAAAGGGCCTTGTATCTTTGTTTGATGAAGTATTTGATCAGC
AAGATACCACAGAAGGTCTTAAAAAACTGGTGGATCCTAGGCTTGGAGATAACTACCCAATTGATTCAGTTTGCAAGATG
GCACAACTTGCTAGAGCATGCACAGAGAGCGATCCACAACAACGTCCAAATATGAGTTCTGTTGTGGTTACTCTCACAGC
ACTTACTTCAACTACTGAGGATTGGGATATTGCTTCCATCATTGAAAATCCAACTCTTGCAAATCTAATGTCTGGTAAAT
AA
Microexon DNA seq CTCTGGAG
Microexon Amino Acid seq SSGG
Microexon-tag DNA Seq TATATTCCAGGAAAAGACCAAAATGCTATCTATGTGCCCCTTCATCTAAGCTCTGGAGGTCTTGCGGGTGGTGTTATTGCTGGAATATCTATTGGAGTAGTAACAGGA
Microexon-tag Amino Acid seq YIPGKDQNAIYVPLHLSSGGLAGGVIAGISIGVVTG
Transcript ID KRH73391
Gene ID Gm.32033
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRH73391
MEHSFRLPVFFLLCASIAFSAESKCSRGCDLALASYYLSQGDLTYVSKLMESEVVSKPEDILSYNTDTITNKDLLPASIR
VNVPFPCDCIDEEFLGHTFQYNLTTGDTYLSIATQNYSNLTTAEWLRSFNRYLPANIPDSGTLNVTINCSCGNSEVSKDY
GLFITYPLRPEDSLQSIANETGVDRDLLVKYNPGVNFSQGSGLVYIPGKDQNAIYVPLHLSSGGLAGGVIAGISIGVVTG
LLLLAFCVYVTYYRRKKVWKKDLLSEESRKNSARVKNDEASGDSAAEGGTNTIGIRVNKSAEFSYEELANATNNFSLANK
IGQGGFGVVYYAELNGEKAAIKKMDIQATREFLAELKVLTHVHHLNLVRLIGYCVEGSLFLVYEYIENGNLGQHLRKSGF
NPLPWSTRVQIALDSARGLQYIHEHTVPVYIHRDIKSENILIDKNFGAKVADFGLTKLIDVGSSSLPTVNMKGTFGYMPP
EYAYGNVSPKIDVYAFGVVLYELISGKEALSRGGVSGAELKGLVSLFDEVFDQQDTTEGLKKLVDPRLGDNYPIDSVCKM
AQLARACTESDPQQRPNMSSVVVTLTALTSTTEDWDIASIIENPTLANLMSGK*
CDS seq >KRH73391
ATGGAACACAGTTTCAGATTACCAGTTTTCTTCTTGTTATGTGCCTCTATAGCGTTCAGTGCAGAATCCAAGTGTAGCAG
GGGTTGTGATTTAGCTCTAGCTTCCTACTATCTATCACAAGGTGACTTGACATATGTATCAAAGCTTATGGAATCTGAGG
TTGTTTCAAAACCTGAAGATATTCTCAGCTACAACACTGACACCATAACAAACAAAGACCTGTTGCCTGCCTCTATCAGA
GTGAACGTTCCATTCCCTTGTGACTGCATTGATGAAGAGTTTCTTGGCCATACTTTTCAATACAACCTTACAACAGGAGA
CACTTATTTGTCCATTGCCACTCAGAACTACTCTAATTTGACCACTGCTGAGTGGTTGCGGAGCTTCAACAGATATTTAC
CAGCTAATATTCCTGATAGTGGGACTCTTAATGTCACCATTAACTGTTCCTGTGGGAATAGTGAAGTTTCCAAGGATTAT
GGATTGTTCATCACGTACCCTCTTAGACCTGAGGATTCTTTGCAGTCGATTGCCAACGAGACTGGCGTTGATCGTGACTT
GCTGGTTAAGTACAACCCGGGTGTAAATTTTAGCCAAGGGAGTGGTCTGGTTTATATTCCAGGAAAAGACCAAAATGCTA
TCTATGTGCCCCTTCATCTAAGCTCTGGAGGTCTTGCGGGTGGTGTTATTGCTGGAATATCTATTGGAGTAGTAACAGGA
CTTCTGCTATTGGCATTTTGTGTGTATGTTACATATTACCGAAGAAAGAAGGTATGGAAGAAGGATTTGCTCTCAGAAGA
ATCCAGGAAGAACTCTGCTAGAGTTAAGAATGATGAAGCCTCTGGTGATTCGGCTGCAGAAGGTGGTACTAACACCATTG
GCATTAGGGTGAACAAATCAGCAGAGTTTTCATATGAGGAACTAGCCAATGCCACAAATAACTTCAGTTTGGCTAATAAA
ATTGGTCAAGGTGGTTTTGGGGTAGTCTATTATGCAGAGCTGAATGGAGAGAAAGCTGCAATAAAAAAGATGGACATACA
AGCAACAAGAGAATTTCTTGCGGAATTGAAAGTGTTGACACATGTTCATCACTTGAACCTGGTGCGCTTGATTGGATATT
GTGTTGAGGGCTCCCTTTTTCTTGTCTATGAGTACATTGAGAATGGCAACTTAGGACAACATCTACGTAAATCAGGTTTC
AATCCTTTGCCATGGTCTACCCGAGTTCAAATTGCTCTGGATTCAGCCAGAGGTCTTCAATACATTCATGAGCATACGGT
ACCTGTATATATCCATCGTGACATAAAGTCGGAAAACATTTTAATAGACAAAAACTTCGGTGCAAAGGTTGCAGACTTTG
GATTAACCAAGTTGATTGATGTTGGAAGTTCATCACTTCCCACTGTTAATATGAAGGGCACATTTGGTTACATGCCACCA
GAATATGCATATGGCAATGTTTCTCCCAAAATAGATGTCTATGCTTTTGGAGTTGTTCTTTATGAACTAATTTCTGGTAA
AGAAGCATTGAGCAGAGGTGGTGTCTCTGGTGCTGAACTAAAGGGCCTTGTATCTTTGTTTGATGAAGTATTTGATCAGC
AAGATACCACAGAAGGTCTTAAAAAACTGGTGGATCCTAGGCTTGGAGATAACTACCCAATTGATTCAGTTTGCAAGATG
GCACAACTTGCTAGAGCATGCACAGAGAGCGATCCACAACAACGTCCAAATATGAGTTCTGTTGTGGTTACTCTCACAGC
ACTTACTTCAACTACTGAGGATTGGGATATTGCTTCCATCATTGAAAATCCAACTCTTGCAAATCTAATGTCTGGTAAAT
AA