Microexon ID Gm_8:45138203-45138213:+
Species Glycine max
Coordinates 8:45138203..45138213
Microexon Cluster ID MEP27
Size 11
Phase 1
Pfam Domain Motif Gelsolin
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 49,11,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq MRRGADRYTGWMAGWGATCCYCAYYTGTTYDCWTKYWCWTTYWMTAAAGGRAADYTKRAGGTKRMRGARRTHTACAAYTTYWCYCARGATGAYYTGWTGACWGARGAT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq GAGACTTCAAG
Microexon Amino Acid seq GDFK
Microexon-tag DNA Seq CAAGGATTCATAGATGATCCACATTTGTTTGCATTAAAAATAGCGAGAGGAGACTTCAAGGTGAAAGAGATATACAATTATACACAGGATGACTTAATAACTGAAGAC
Microexon-tag Amino Acid Seq QGFIDDPHLFALKIARGDFKVKEIYNYTQDDLITED
Microexon-tag spanning region 45138019-45138567
Microexon-tag prediction score 0.9049
Overlapped with the annotated transcript (%) 100
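
The summary fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record: it copies the sequences and numbers from this entry and checks that the 49,11,48 structure, the microexon sequence, the phase, and the amino acid translations agree. It assumes that "Phase 1" means one base of a split codon lies upstream of the microexon, and it uses the standard genetic code; the names `TAG_DNA`, `translate`, and so on are illustrative only.

```python
from itertools import product

# Values copied from the record above.
TAG_DNA = (
    "CAAGGATTCATAGATGATCCACATTTGTTTGCATTAAAAATAGCGAGAG"
    "GAGACTTCAAG"
    "GTGAAAGAGATATACAATTATACACAGGATGACTTAATAACTGAAGAC"
)
MICROEXON_DNA = "GAGACTTCAAG"
TAG_PROTEIN = "QGFIDDPHLFALKIARGDFKVKEIYNYTQDDLITED"
MICROEXON_PROTEIN = "GDFK"
STRUCTURE = (49, 11, 48)          # flanking exon, microexon, flanking exon
PHASE = 1                         # assumed: codon bases upstream of the exon
START, END = 45138203, 45138213   # microexon coordinates (inclusive)

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate complete codons of `dna` in frame 0."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


upstream, size, downstream = STRUCTURE

# Slice the microexon out of the tag using the stated structure.
microexon = TAG_DNA[upstream:upstream + size]

# Codons that overlap the microexon, as indices into the tag protein.
first_codon = upstream // 3
last_codon = (upstream + size - 1) // 3
microexon_residues = translate(TAG_DNA)[first_codon:last_codon + 1]

print("tag length matches structure:", len(TAG_DNA) == sum(STRUCTURE))
print("microexon slice matches:", microexon == MICROEXON_DNA)
print("size matches coordinates:", END - START + 1 == size)
print("phase matches offset:", upstream % 3 == PHASE)
print("tag translation matches:", translate(TAG_DNA) == TAG_PROTEIN)
print("microexon residues:", microexon_residues)
```

Under these assumptions every check agrees with the record: the 11-nt microexon starts one base into a codon, so its bases fall in four consecutive codons, which is why an 11-nt exon is listed with a four-residue amino acid sequence.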
New Transcript ID KRH46439x
Reference Transcript ID KRH46439
Gene ID GLYMA_08G333900
Gene Name NA
Transcript ID KRH46439
Protein ID KRH46439
Gene ID GLYMA_08G333900
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >KRH46439
MPIVTKDMDSAFQTAGANPGLEVWCIENQRLVSVSNSSHGKFYTGSAYLVLNAVFPKIGPPQYDIHYWLGNEAKKVDSSL
ASDKALDLDAALGSCSVQYREIQGQESQKFLSYFRPCLIPIEGVFTSKQGNLNGEYQVSMYTCKGDYVVHVKEVPFLRSS
LNHEDVFILDTALKIFLFSGCNSTIQERAKALEVVQYIKENKHGGKCEVATIEDGKFVGDSDVGEFWSLFGGYAPIPRDS
PCVQESETPPVKLFWINLQGKLCETGSNAFSKEMLETEKCYMLDCDGEIFVWMGRQTFLTERRTAIRAVEEFVRNEGRSN
KTHLTFLSEGLESTIFRSYFTNWPKTVEPRLYEEGKEKVAAIFKHQGYEVKELPEEDNEPSIDCTGTIKVWRVDGDELSL
LSVTELTKLYSGDCYIVQYTFPGNGRDETLFYAWLGSKCVTEDKAAAISHMSTMADSIRTSPAMAQIHEGKEPAQFFSIL
QRVIIFKGGTSSGYRKFIEEKGIVDETYNKNLVTLFRVQGTSPDNMQAIQVDQVSTSLNSSYCYILQNKASIYTWIGSLS
SARDHNLLDRMVELLNPTWLPVSVREGNEPDIFWDALGGKAEYPKGKEIQGFIDDPHLFALKIARGDFKVKEIYNYTQDD
LITEDILLLDCQREIYVWVGLHSAIKSKQEVLHLGLKFLEMDVLVEGLSMNIPIYIVTEGHEPPFFTRFFSWDHSNENIV
GNSFERKLAILKGKPKTLEGHNRTPLKANSRPSTPNGHRNISVFSNGRGRSSSPILSSAGSDLRQSGDRLLSSSTPVVKK
LLEGSPSHGSAEKTMPQSGSPATELSSSDETVSFPQKDRNVDGENMATYPYERLRVVSANPVTGIDLTKREVYLSNEEFR
EKFGMPKSAFYKLPRWKQNKLKMSLDLF*
CDS seq >KRH46439
ATGCCTATTGTCACTAAAGATATGGATTCTGCATTCCAAACTGCTGGAGCAAACCCAGGCTTAGAAGTTTGGTGTATTGA
GAACCAGCGGCTGGTTTCAGTGTCAAATTCAAGCCATGGAAAATTCTATACTGGAAGTGCATACTTAGTCTTGAATGCAG
TCTTTCCAAAAATTGGCCCTCCTCAGTATGACATACATTATTGGTTGGGAAATGAAGCAAAGAAGGTAGACTCAAGCTTG
GCATCAGACAAGGCACTTGATCTGGATGCAGCCTTAGGATCGTGTAGTGTTCAATACAGGGAAATTCAAGGCCAAGAATC
GCAGAAGTTTCTGTCATACTTCAGACCTTGTCTAATACCCATTGAAGGGGTGTTTACTTCAAAGCAGGGGAACTTGAATG
GTGAATACCAAGTTAGCATGTATACTTGCAAGGGAGACTATGTTGTTCACGTGAAAGAAGTGCCATTTCTAAGGTCATCA
TTGAATCATGAAGATGTATTCATTCTTGACACTGCCTTAAAAATCTTCCTCTTCAGTGGGTGCAACTCTACCATTCAAGA
AAGAGCCAAAGCTTTGGAGGTTGTTCAGTATATCAAGGAGAATAAGCATGGAGGAAAATGCGAAGTGGCAACAATAGAGG
ATGGAAAATTTGTTGGTGATTCTGATGTGGGTGAATTTTGGAGTTTATTTGGTGGTTACGCTCCCATTCCTCGAGATTCG
CCTTGTGTTCAGGAATCTGAGACTCCTCCTGTAAAGCTATTTTGGATAAATTTACAGGGAAAACTTTGTGAGACTGGAAG
CAATGCATTCAGCAAAGAAATGCTTGAGACAGAAAAGTGTTATATGTTGGACTGTGATGGTGAGATTTTTGTCTGGATGG
GAAGGCAGACTTTTTTGACAGAAAGAAGAACAGCAATCAGAGCTGTAGAAGAATTTGTCAGAAATGAAGGCAGATCAAAC
AAGACTCATTTGACATTTTTATCAGAAGGATTGGAAAGTACCATCTTTCGGTCATACTTTACTAATTGGCCTAAAACTGT
GGAGCCTAGGCTTTATGAGGAAGGCAAAGAAAAAGTGGCAGCCATATTCAAGCACCAGGGTTATGAGGTGAAAGAGCTTC
CTGAAGAAGACAATGAGCCATCTATAGATTGCACTGGCACAATAAAAGTTTGGCGGGTGGATGGTGATGAATTGTCCCTT
CTTTCAGTTACAGAACTGACAAAGCTTTACAGTGGAGATTGCTATATAGTACAGTATACATTTCCAGGAAATGGAAGGGA
TGAGACACTATTTTATGCTTGGCTTGGCTCCAAGTGTGTAACGGAGGATAAAGCAGCTGCCATTTCCCACATGAGTACTA
TGGCTGATTCAATCAGAACTAGTCCGGCTATGGCTCAAATCCATGAGGGTAAGGAACCAGCTCAATTTTTCTCAATACTT
CAGAGAGTAATCATATTCAAGGGGGGAACCAGTTCAGGATATAGGAAGTTTATAGAAGAAAAAGGTATAGTGGATGAAAC
TTACAATAAAAACCTGGTTACTTTGTTTCGGGTACAAGGTACAAGTCCAGATAATATGCAGGCCATCCAAGTTGATCAAG
TTTCAACCTCCTTGAATTCATCGTATTGCTACATTCTGCAAAATAAAGCATCTATCTATACTTGGATTGGGAGCCTATCT
TCAGCTAGAGACCATAATCTTCTTGATAGAATGGTGGAGCTACTTAATCCAACATGGCTGCCTGTTTCTGTGAGGGAAGG
GAATGAGCCTGATATTTTCTGGGATGCTCTCGGTGGAAAGGCAGAGTATCCAAAGGGCAAAGAAATTCAAGGATTCATAG
ATGATCCACATTTGTTTGCATTAAAAATAGCGAGAGGAGACTTCAAGGTGAAAGAGATATACAATTATACACAGGATGAC
TTAATAACTGAAGACATCTTATTGCTTGATTGCCAAAGAGAGATTTATGTTTGGGTTGGCTTACATTCGGCTATCAAATC
AAAACAAGAAGTTCTTCATCTTGGCCTGAAATTTCTGGAAATGGATGTCCTTGTTGAAGGCCTATCAATGAACATCCCTA
TTTATATTGTAACGGAAGGTCATGAGCCACCATTTTTCACTCGTTTCTTTTCATGGGATCACTCAAATGAAAATATTGTT
GGTAACTCATTTGAGAGAAAGCTTGCAATTCTGAAAGGAAAACCAAAAACTTTAGAGGGACATAATAGAACCCCGTTGAA
AGCAAACTCCAGGCCTTCTACTCCTAATGGACACAGAAACATTTCTGTTTTCTCCAATGGCCGTGGAAGAAGTAGTTCAC
CTATACTGAGTAGTGCAGGCTCAGATCTTAGGCAATCAGGTGATAGGCTTCTTTCTAGCTCTACTCCAGTTGTCAAGAAG
CTCCTTGAAGGATCTCCTTCCCATGGTAGTGCCGAAAAAACAATGCCACAGTCTGGTTCTCCAGCAACAGAACTGAGCTC
ATCTGACGAGACTGTGAGTTTCCCTCAGAAGGATAGAAATGTGGATGGTGAGAATATGGCAACATACCCCTATGAGCGCC
TGAGAGTGGTTTCTGCTAATCCAGTAACTGGCATTGATTTGACCAAAAGAGAGGTATATTTATCTAATGAAGAGTTCCGC
GAGAAGTTTGGAATGCCAAAATCTGCTTTCTATAAGCTTCCTAGATGGAAACAAAACAAACTGAAGATGTCACTGGATCT
ATTTTAG
Microexon DNA seq GAGACTTCAAG
Microexon Amino Acid seq GDFK
Microexon-tag DNA Seq CAAGGATTCATAGATGATCCACATTTGTTTGCATTAAAAATAGCGAGAGGAGACTTCAAGGTGAAAGAGATATACAATTATACACAGGATGACTTAATAACTGAAGAC
Microexon-tag Amino Acid seq QGFIDDPHLFALKIARGDFKVKEIYNYTQDDLITED
Transcript ID Gm.51422.3
Gene ID Gm.51422
Gene Name NA
Pfam domain motif Unknown
Motif E-value NA
Motif start NA
Motif end NA
Protein seq >Gm.51422.3
MPIVTKDMDSAFQTAGANPGLEVWCIENQRLVSVSNSSHGKFYTGSAYLVLNAVFPKIGPPQYDIHYWLGNEAKKVDSSL
ASDKALDLDAALGSCSVQYREIQGQESQKFLSYFRPCLIPIEGVFTSKQGNLNGEYQVSMYTCKGDYVVHVKEVPFLRSS
LNHEDVFILDTALKIFLFSGCNSTIQERAKALEVVQYIKENKHGGKCEVATIEDGKFVGDSDVGEFWSLFGGYAPIPRDS
PCVQESETPPVKLFWINLQGKLCETGSNAFSKEMLETEKCYMLDCDGEIFVWMGRQTFLTERRTAIRAVEEFVRNEGRSN
KTHLTFLSEGLESTIFRSYFTNWPKTVEPRLYEEGKEKVAAIFKHQGYEVKELPEEDNEPSIDCTGTIKVWRVDGDELSL
LSVTELTKLYSGDCYIVQYTFPGNGRDETLFYAWLGSKCVTEDKAAAISHMSTMADSIRTSPAMAQIHEGKEPAQFFSIL
QRVIIFKGGTSSGYRKFIEEKGIVDETYNKNLVTLFRVQGTSPDNMQAIQVDQVSTSLNSSYCYILQNKASIYTWIGSLS
SARDHNLLDRMVELLNPTWLPVSVREGNEPDIFWDALGGKAEYPKGKEIQGFIDDPHLFALKIARGDFKVKEIYNYTQDD
LITEDILLLDCQREIYVWVGLHSAIKSKQEVLHLGLKFLEMDVLVEGLSMNIPIYIVTEGHEPPFFTRFFSWDHSNENIV
GNSFERKLAILKGKPKTLEGHNRTPLKANSRPSTPNGHRNISVFSNGRGRSSSPILSSAGSDLRQSGDRLLSSSTPVVKK
LLEGSPSHGSAEKTMPQSGSPATELSSSDETVSFPQKDRNVDGENMATYPYERLRVVSANPVTGIDLTKREVYLSNEEFR
EKFGMPKSAFYKLPRWKQNKLKMSLDLF*
CDS seq >Gm.51422.3
ATGCCTATTGTCACTAAAGATATGGATTCTGCATTCCAAACTGCTGGAGCAAACCCAGGCTTAGAAGTTTGGTGTATTGA
GAACCAGCGGCTGGTTTCAGTGTCAAATTCAAGCCATGGAAAATTCTATACTGGAAGTGCATACTTAGTCTTGAATGCAG
TCTTTCCAAAAATTGGCCCTCCTCAGTATGACATACATTATTGGTTGGGAAATGAAGCAAAGAAGGTAGACTCAAGCTTG
GCATCAGACAAGGCACTTGATCTGGATGCAGCCTTAGGATCGTGTAGTGTTCAATACAGGGAAATTCAAGGCCAAGAATC
GCAGAAGTTTCTGTCATACTTCAGACCTTGTCTAATACCCATTGAAGGGGTGTTTACTTCAAAGCAGGGGAACTTGAATG
GTGAATACCAAGTTAGCATGTATACTTGCAAGGGAGACTATGTTGTTCACGTGAAAGAAGTGCCATTTCTAAGGTCATCA
TTGAATCATGAAGATGTATTCATTCTTGACACTGCCTTAAAAATCTTCCTCTTCAGTGGGTGCAACTCTACCATTCAAGA
AAGAGCCAAAGCTTTGGAGGTTGTTCAGTATATCAAGGAGAATAAGCATGGAGGAAAATGCGAAGTGGCAACAATAGAGG
ATGGAAAATTTGTTGGTGATTCTGATGTGGGTGAATTTTGGAGTTTATTTGGTGGTTACGCTCCCATTCCTCGAGATTCG
CCTTGTGTTCAGGAATCTGAGACTCCTCCTGTAAAGCTATTTTGGATAAATTTACAGGGAAAACTTTGTGAGACTGGAAG
CAATGCATTCAGCAAAGAAATGCTTGAGACAGAAAAGTGTTATATGTTGGACTGTGATGGTGAGATTTTTGTCTGGATGG
GAAGGCAGACTTTTTTGACAGAAAGAAGAACAGCAATCAGAGCTGTAGAAGAATTTGTCAGAAATGAAGGCAGATCAAAC
AAGACTCATTTGACATTTTTATCAGAAGGATTGGAAAGTACCATCTTTCGGTCATACTTTACTAATTGGCCTAAAACTGT
GGAGCCTAGGCTTTATGAGGAAGGCAAAGAAAAAGTGGCAGCCATATTCAAGCACCAGGGTTATGAGGTGAAAGAGCTTC
CTGAAGAAGACAATGAGCCATCTATAGATTGCACTGGCACAATAAAAGTTTGGCGGGTGGATGGTGATGAATTGTCCCTT
CTTTCAGTTACAGAACTGACAAAGCTTTACAGTGGAGATTGCTATATAGTACAGTATACATTTCCAGGAAATGGAAGGGA
TGAGACACTATTTTATGCTTGGCTTGGCTCCAAGTGTGTAACGGAGGATAAAGCAGCTGCCATTTCCCACATGAGTACTA
TGGCTGATTCAATCAGAACTAGTCCGGCTATGGCTCAAATCCATGAGGGTAAGGAACCAGCTCAATTTTTCTCAATACTT
CAGAGAGTAATCATATTCAAGGGGGGAACCAGTTCAGGATATAGGAAGTTTATAGAAGAAAAAGGTATAGTGGATGAAAC
TTACAATAAAAACCTGGTTACTTTGTTTCGGGTACAAGGTACAAGTCCAGATAATATGCAGGCCATCCAAGTTGATCAAG
TTTCAACCTCCTTGAATTCATCGTATTGCTACATTCTGCAAAATAAAGCATCTATCTATACTTGGATTGGGAGCCTATCT
TCAGCTAGAGACCATAATCTTCTTGATAGAATGGTGGAGCTACTTAATCCAACATGGCTGCCTGTTTCTGTGAGGGAAGG
GAATGAGCCTGATATTTTCTGGGATGCTCTCGGTGGAAAGGCAGAGTATCCAAAGGGCAAAGAAATTCAAGGATTCATAG
ATGATCCACATTTGTTTGCATTAAAAATAGCGAGAGGAGACTTCAAGGTGAAAGAGATATACAATTATACACAGGATGAC
TTAATAACTGAAGACATCTTATTGCTTGATTGCCAAAGAGAGATTTATGTTTGGGTTGGCTTACATTCGGCTATCAAATC
AAAACAAGAAGTTCTTCATCTTGGCCTGAAATTTCTGGAAATGGATGTCCTTGTTGAAGGCCTATCAATGAACATCCCTA
TTTATATTGTAACGGAAGGTCATGAGCCACCATTTTTCACTCGTTTCTTTTCATGGGATCACTCAAATGAAAATATTGTT
GGTAACTCATTTGAGAGAAAGCTTGCAATTCTGAAAGGAAAACCAAAAACTTTAGAGGGACATAATAGAACCCCGTTGAA
AGCAAACTCCAGGCCTTCTACTCCTAATGGACACAGAAACATTTCTGTTTTCTCCAATGGCCGTGGAAGAAGTAGTTCAC
CTATACTGAGTAGTGCAGGCTCAGATCTTAGGCAATCAGGTGATAGGCTTCTTTCTAGCTCTACTCCAGTTGTCAAGAAG
CTCCTTGAAGGATCTCCTTCCCATGGTAGTGCCGAAAAAACAATGCCACAGTCTGGTTCTCCAGCAACAGAACTGAGCTC
ATCTGACGAGACTGTGAGTTTCCCTCAGAAGGATAGAAATGTGGATGGTGAGAATATGGCAACATACCCCTATGAGCGCC
TGAGAGTGGTTTCTGCTAATCCAGTAACTGGCATTGATTTGACCAAAAGAGAGGTATATTTATCTAATGAAGAGTTCCGC
GAGAAGTTTGGAATGCCAAAATCTGCTTTCTATAAGCTTCCTAGATGGAAACAAAACAAACTGAAGATGTCACTGGATCT
ATTTTAG
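
Because the CDS is stored as fixed-width wrapped lines, the microexon-tag straddles line breaks and a line-by-line search does not find it. The sketch below is illustrative only: it copies three consecutive CDS lines from this record (the excerpt that contains the tag) and shows that the lines must be joined before searching. The variable names are hypothetical.

```python
# Three consecutive wrapped CDS lines copied from the record above.
CDS_LINES = [
    "GAATGAGCCTGATATTTTCTGGGATGCTCTCGGTGGAAAGGCAGAGTATCCAAAGGGCAAAGAAATTCAAGGATTCATAG",
    "ATGATCCACATTTGTTTGCATTAAAAATAGCGAGAGGAGACTTCAAGGTGAAAGAGATATACAATTATACACAGGATGAC",
    "TTAATAACTGAAGACATCTTATTGCTTGATTGCCAAAGAGAGATTTATGTTTGGGTTGGCTTACATTCGGCTATCAAATC",
]

# Microexon-tag DNA sequence copied from the record above.
TAG_DNA = (
    "CAAGGATTCATAGATGATCCACATTTGTTTGCATTAAAAATAGCGAGAG"
    "GAGACTTCAAG"
    "GTGAAAGAGATATACAATTATACACAGGATGACTTAATAACTGAAGAC"
)

# A naive search of each wrapped line on its own misses the tag.
found_in_single_line = any(TAG_DNA in line for line in CDS_LINES)

# Joining the lines restores the contiguous sequence.
joined = "".join(line.strip() for line in CDS_LINES)
offset = joined.find(TAG_DNA)

print("found within a single line:", found_in_single_line)
print("found after joining lines:", offset != -1)
```

The same joining step applies to the wrapped protein sequence, where the tag peptide also runs across a line break.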