Microexon ID Gm_12:20458824-20458837:-
Species Glycine max
Coordinates 12:20458824..20458837
Microexon Cluster ID MEP38
Size 14
Phase 1
Pfam Domain Motif Myosin_head
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,14,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CAYTTYARTRMAACTGGRAARATATSTGGTGCYAADATTCAAACWTTTYTRCTTGARAAGTCWAGAGTWGTYCARYKTGCWGAWGGWGARAGRTCATAYCATATWTTT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TATTGTTGGAAAAG
Microexon Amino Acid seq LLLEK
Microexon-tag DNA Seq CATTTCAGCTCCATGGGTAAAATATGTGGGGCTAAAATTAAAACCTTATTGTTGGAAAAGTCTAGAGTTGTTCAACTGGCCAATGGTGAGAGATCATATCACATATTT
Microexon-tag Amino Acid Seq HFSSMGKICGAKIKTLLLEKSRVVQLANGERSYHIF
Microexon-tag spanning region20458686-20459016
Microexon-tag prediction score0.9422
Overlapped with the annotated transcript (%) 100
New Transcript ID KRH26030x
Reference Transcript ID KRH26030
Gene ID GLYMA_12G147000
Gene Name NA
Transcript ID KRH26030
Protein ID KRH26030
Gene ID GLYMA_12G147000
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 2.1e-218
Motif start 189
Motif end 831
Protein seq >KRH26030
MVLSASPCSLARSSLEEMLDSLRRRDEEEEKKDSPPALPARPASRARLPPARRSLPNNFRVGGSQRVIAPENGVGTNGES
DLKENDLGQKRRRNCFERKRMNKDVESPYVALSSSDSSGMVWELDDDDNISYFIKKLRVWCRQPRGQWELGTIQSTSGEE
ASISLSNGNVIKVVRSEILPANPGVLEGVDDLIKLGYLNEPSVLHNLKLRYSQGMIYNKAGPILIALNPFKDLQTNGNDY
VSAYRQRIIDSLHVYAVADVAYNKMIRDEVNQSIIISGESGSGKTETAKIALQHLAALGGGGSCAIENEFLQINRILEAF
GNAKTSRNNNSSRFGKLIEVHFSSMGKICGAKIKTLLLEKSRVVQLANGERSYHIFYQLCAGSSSDLKERLNLRAVCEYK
YLVQSDCTSIDDADDAKNFPQLKKALDTVQICKEDQEMIFKMLAAILWLGNISFQVDSENHIEVVDDEAVTSTAQLMGCS
SQELMTALCSHKIQSDEDTIAKNLTLRQAIERRDAIAKFIYASLFDWLVEQVNKSLEVGKQYTGKSISILDIYGFQTFQK
NSFEQFYINYANERIQQHFNRHLFKLEQEDYELDGVDWTKVDFEDNEVCLDLFEKKPHGLLSLLDEESNLAKASDLTFAN
KLKHHLNANPCFKGEKGRAFRVRHYAGEVLYDTNGFLEKNRDMLSSDSIQFLSSCNCELLQLFSKMFNQSQMQSVATKFK
VQLFMLMHQLESTTPHFIRCIKPNTKQLPGIFDEVLVLQQLRCCEVLEVVRVSRAGYPTRMAHQEFSRRYGFLLSEANVL
QDPLSISVAVLQKFNIPSEMYHVGYTKLYLRAGQIDSLENKRKQVLQGILGIQKCFRGHRARVYFCELKNGVTTLQSFIR
GENTRRKYGVTVKSSVTIYSRILEEIHAIILLQSVIRGWLVRRGDASHINRSKRYPENAKPRWKSFMKIIPEVKPDLSKE
PVQNLLSALADLQRRVDKADAIVKQKEDENTELREQLKQSERKRIEYETKMKSMEEAWQKQMASLQMSLVAARKSLAPEN
ASVQPVRRDFVLPRGYDSEDATSMGSRTPGGSTPMLSGSLSASDAGRQVNGTLTTVGNLMKEFEQERQNFDDEVKALNDV
KPEQSANTNSFEELRKLKQRFEGWKNQYKVRLRETKTRLYKSETEKSRRTWWGKLSSKA*
CDS seq >KRH26030
ATGGTGTTATCGGCTTCGCCATGTTCCTTGGCGAGAAGCTCGCTAGAGGAGATGCTCGATTCCCTTCGCCGGAGGGACGA
GGAGGAGGAGAAGAAGGACTCGCCACCGGCGTTGCCCGCGAGGCCGGCCTCGCGGGCACGGCTTCCTCCGGCACGGCGGT
CGCTGCCGAACAACTTCAGGGTTGGCGGCAGCCAGCGCGTCATCGCGCCCGAAAATGGAGTCGGTACTAATGGGGAGAGT
GACTTGAAGGAGAATGACTTGGGGCAGAAAAGGAGAAGAAATTGTTTTGAGAGAAAGAGGATGAACAAGGATGTGGAGTC
TCCTTATGTGGCACTCTCTTCCAGTGATTCATCAGGGATGGTTTGGGAATTGGATGATGATGATAACATTTCTTATTTCA
TCAAAAAGCTTCGTGTTTGGTGTAGGCAACCAAGAGGGCAGTGGGAACTAGGAACAATACAGTCAACTTCAGGAGAGGAA
GCATCCATTTCACTCTCAAATGGAAATGTTATTAAAGTGGTCAGATCAGAGATTCTACCAGCGAATCCTGGTGTTTTAGA
GGGTGTGGATGATCTTATTAAACTTGGTTATTTGAACGAGCCATCGGTTCTTCACAATCTGAAGTTGAGATATTCTCAAG
GAATGATTTATAATAAAGCAGGGCCAATTTTAATTGCCCTCAATCCTTTCAAAGATCTTCAGACAAATGGAAATGATTAT
GTCTCAGCTTATAGGCAGAGAATTATTGATAGTCTTCATGTTTATGCTGTGGCAGATGTGGCTTATAACAAGATGATAAG
AGATGAAGTAAATCAGTCCATTATCATAAGTGGTGAGAGTGGATCTGGGAAGACAGAAACAGCTAAAATTGCATTGCAAC
ACTTAGCTGCTCTTGGTGGTGGTGGCAGTTGTGCTATAGAAAATGAATTTCTTCAGATAAATCGTATACTAGAAGCTTTT
GGGAATGCAAAAACATCTAGGAATAACAACTCTAGCAGATTTGGAAAGTTGATTGAAGTTCATTTCAGCTCCATGGGTAA
AATATGTGGGGCTAAAATTAAAACCTTATTGTTGGAAAAGTCTAGAGTTGTTCAACTGGCCAATGGTGAGAGATCATATC
ACATATTTTATCAACTTTGTGCTGGATCTTCTTCTGATCTTAAAGAGAGACTGAATCTTAGAGCAGTCTGTGAATATAAA
TATCTAGTTCAGAGTGACTGCACATCAATTGATGATGCCGATGATGCTAAAAACTTTCCTCAGCTGAAGAAAGCCCTGGA
TACTGTTCAAATTTGTAAAGAGGATCAAGAGATGATCTTTAAGATGCTCGCTGCAATACTATGGCTGGGAAATATATCAT
TCCAAGTAGACAGTGAAAATCACATTGAGGTTGTTGATGATGAAGCTGTAACCAGTACTGCCCAGCTGATGGGTTGCAGT
TCCCAGGAATTAATGACAGCATTATGTAGCCATAAAATTCAATCTGACGAGGATACTATTGCCAAAAATCTGACATTGAG
GCAGGCAATCGAAAGAAGAGATGCAATTGCAAAATTCATCTATGCAAGCTTGTTTGACTGGCTTGTAGAACAAGTTAACA
AGTCACTTGAAGTGGGTAAACAATATACTGGGAAATCCATAAGTATCCTAGATATTTATGGGTTTCAGACTTTCCAGAAA
AACAGCTTTGAACAGTTTTATATAAATTATGCCAATGAGAGGATTCAACAACATTTTAATCGGCATCTGTTTAAACTTGA
GCAGGAGGATTATGAATTGGATGGCGTTGATTGGACTAAGGTAGATTTTGAGGATAATGAAGTGTGCTTGGATCTATTTG
AGAAGAAACCTCACGGTCTACTCTCTTTATTGGATGAGGAGTCAAATTTAGCCAAGGCTTCTGATTTAACATTTGCCAAC
AAACTTAAGCACCACCTGAATGCTAATCCTTGCTTCAAAGGAGAAAAAGGCAGAGCTTTCCGTGTTCGTCACTATGCAGG
GGAGGTTCTGTATGATACAAATGGCTTTCTAGAAAAGAACAGAGACATGTTGTCTTCTGATTCCATTCAATTCCTGTCAT
CATGTAATTGTGAACTGCTGCAGTTGTTCTCCAAAATGTTTAACCAGTCTCAAATGCAGAGTGTTGCAACAAAGTTCAAG
GTTCAATTGTTCATGTTGATGCATCAGTTGGAGAGTACCACACCTCACTTTATTCGCTGTATAAAGCCAAATACTAAGCA
GCTTCCTGGCATTTTTGATGAAGTCCTTGTCCTACAACAGCTCAGATGTTGTGAAGTTCTAGAAGTTGTTAGAGTTTCAA
GGGCTGGATATCCTACTCGAATGGCCCATCAAGAGTTTTCCAGAAGGTATGGGTTTCTGCTTTCTGAGGCCAATGTATTG
CAGGATCCATTGAGCATCTCGGTTGCTGTTTTGCAAAAATTTAATATCCCTTCTGAAATGTACCATGTTGGCTACACCAA
ATTGTATCTTCGAGCAGGGCAGATTGATTCACTGGAGAATAAGAGAAAGCAGGTTTTACAGGGAATACTTGGGATTCAAA
AATGCTTCCGTGGTCATCGAGCTCGTGTTTATTTCTGTGAACTTAAGAATGGAGTGACAACATTGCAATCATTTATTCGT
GGAGAAAATACAAGAAGGAAATATGGTGTTACAGTGAAGTCTTCAGTAACAATTTATTCTAGAATACTGGAGGAGATCCA
TGCAATCATACTATTACAATCTGTAATTCGTGGTTGGCTGGTTAGAAGGGGGGATGCTAGTCACATAAATAGGTCAAAGA
GATATCCTGAAAATGCTAAACCTAGGTGGAAGTCCTTTATGAAGATAATACCGGAAGTAAAGCCGGACTTGTCCAAAGAG
CCGGTTCAGAATCTGCTTTCAGCTTTAGCAGATCTCCAAAGGCGAGTTGACAAGGCTGATGCAATTGTGAAGCAAAAGGA
AGACGAAAATACCGAATTGAGGGAACAGCTAAAACAATCTGAGAGGAAGAGGATCGAATATGAGACAAAAATGAAATCAA
TGGAGGAGGCTTGGCAAAAGCAGATGGCATCTTTGCAAATGAGTCTTGTTGCTGCTAGAAAGAGCCTTGCTCCCGAGAAT
GCTTCAGTTCAGCCTGTAAGACGTGATTTTGTGCTACCTCGTGGTTATGATTCTGAAGATGCTACATCCATGGGATCTCG
AACACCTGGTGGGAGCACACCAATGCTTTCTGGTAGTCTATCTGCCTCTGATGCAGGGAGACAGGTCAATGGTACATTGA
CCACAGTTGGCAATCTGATGAAGGAATTCGAGCAGGAAAGACAGAACTTTGATGATGAAGTGAAAGCTTTGAACGATGTT
AAACCGGAGCAGTCTGCCAATACAAATTCCTTTGAAGAGCTTCGGAAACTGAAACAGAGATTTGAGGGATGGAAGAATCA
ATACAAGGTTAGATTACGAGAGACTAAAACAAGGCTTTATAAATCAGAAACGGAAAAAAGTCGGCGAACATGGTGGGGGA
AGTTAAGCTCAAAAGCATAA
Microexon DNA seq TATTGTTGGAAAAG
Microexon Amino Acid seq LLLEK
Microexon-tag DNA Seq CATTTCAGCTCCATGGGTAAAATATGTGGGGCTAAAATTAAAACCTTATTGTTGGAAAAGTCTAGAGTTGTTCAACTGGCCAATGGTGAGAGATCATATCACATATTT
Microexon-tag Amino Acid seq HFSSMGKICGAKIKTLLLEKSRVVQLANGERSYHIF
Transcript ID Gm.9195.1
Gene ID Gm.9195
Gene Name NA
Pfam domain motif Myosin_head
Motif E-value 2.1e-218
Motif start 190
Motif end 832
Protein seq >Gm.9195.1
MVLSASPCSLARSSLEEMLDSLRRRDEEEEKKDSPPALPARPASRARLPPARRSLPNNFRVGGSQRVIAPENGVGTNGES
DLKENDLGQKRRRNCFERKRMNKDVESPYVALSSSDSSGMVWELDDDDNISYFIKKKLRVWCRQPRGQWELGTIQSTSGE
EASISLSNGNVIKVVRSEILPANPGVLEGVDDLIKLGYLNEPSVLHNLKLRYSQGMIYNKAGPILIALNPFKDLQTNGND
YVSAYRQRIIDSLHVYAVADVAYNKMIRDEVNQSIIISGESGSGKTETAKIALQHLAALGGGGSCAIENEFLQINRILEA
FGNAKTSRNNNSSRFGKLIEVHFSSMGKICGAKIKTLLLEKSRVVQLANGERSYHIFYQLCAGSSSDLKERLNLRAVCEY
KYLVQSDCTSIDDADDAKNFPQLKKALDTVQICKEDQEMIFKMLAAILWLGNISFQVDSENHIEVVDDEAVTSTAQLMGC
SSQELMTALCSHKIQSDEDTIAKNLTLRQAIERRDAIAKFIYASLFDWLVEQVNKSLEVGKQYTGKSISILDIYGFQTFQ
KNSFEQFYINYANERIQQHFNRHLFKLEQEDYELDGVDWTKVDFEDNEVCLDLFEKKPHGLLSLLDEESNLAKASDLTFA
NKLKHHLNANPCFKGEKGRAFRVRHYAGEVLYDTNGFLEKNRDMLSSDSIQFLSSCNCELLQLFSKMFNQSQMQSVATKF
KVQLFMLMHQLESTTPHFIRCIKPNTKQLPGIFDEVLVLQQLRCCEVLEVVRVSRAGYPTRMAHQEFSRRYGFLLSEANV
LQDPLSISVAVLQKFNIPSEMYHVGYTKLYLRAGQIDSLENKRKQVLQGILGIQKCFRGHRARVYFCELKNGVTTLQSFI
RGENTRRKYGVTVKSSVTIYSRILEEIHAIILLQSVIRGWLVRRGDASHINRSKRYPENAKPRWKSFMKIIPEVKPDLSK
EPVQNLLSALADLQRRVDKADAIVKQKEDENTELREQLKQSERKRIEYETKMKSMEEAWQKQMASLQMSLVAARKSLAPE
NASVQPVRRDFVLPRGYDSEDATSMGSRTPGGSTPMLSGSLSASDAGRQVNGTLTTVGNLMKEFEQERQNFDDEVKALND
VKPEQSANTNSFEELRKLKQRFEGWKNQYKVRLRETKTRLYKSETEKSRRTWWGKLSSKA*
CDS seq >Gm.9195.1
ATGGTGTTATCGGCTTCGCCATGTTCCTTGGCGAGAAGCTCGCTAGAGGAGATGCTCGATTCCCTTCGCCGGAGGGACGA
GGAGGAGGAGAAGAAGGACTCGCCACCGGCGTTGCCCGCGAGGCCGGCCTCGCGGGCACGGCTTCCTCCGGCACGGCGGT
CGCTGCCGAACAACTTCAGGGTTGGCGGCAGCCAGCGCGTCATCGCGCCCGAAAATGGAGTCGGTACTAATGGGGAGAGT
GACTTGAAGGAGAATGACTTGGGGCAGAAAAGGAGAAGAAATTGTTTTGAGAGAAAGAGGATGAACAAGGATGTGGAGTC
TCCTTATGTGGCACTCTCTTCCAGTGATTCATCAGGGATGGTTTGGGAATTGGATGATGATGATAACATTTCTTATTTCA
TCAAAAAGAAGCTTCGTGTTTGGTGTAGGCAACCAAGAGGGCAGTGGGAACTAGGAACAATACAGTCAACTTCAGGAGAG
GAAGCATCCATTTCACTCTCAAATGGAAATGTTATTAAAGTGGTCAGATCAGAGATTCTACCAGCGAATCCTGGTGTTTT
AGAGGGTGTGGATGATCTTATTAAACTTGGTTATTTGAACGAGCCATCGGTTCTTCACAATCTGAAGTTGAGATATTCTC
AAGGAATGATTTATAATAAAGCAGGGCCAATTTTAATTGCCCTCAATCCTTTCAAAGATCTTCAGACAAATGGAAATGAT
TATGTCTCAGCTTATAGGCAGAGAATTATTGATAGTCTTCATGTTTATGCTGTGGCAGATGTGGCTTATAACAAGATGAT
AAGAGATGAAGTAAATCAGTCCATTATCATAAGTGGTGAGAGTGGATCTGGGAAGACAGAAACAGCTAAAATTGCATTGC
AACACTTAGCTGCTCTTGGTGGTGGTGGCAGTTGTGCTATAGAAAATGAATTTCTTCAGATAAATCGTATACTAGAAGCT
TTTGGGAATGCAAAAACATCTAGGAATAACAACTCTAGCAGATTTGGAAAGTTGATTGAAGTTCATTTCAGCTCCATGGG
TAAAATATGTGGGGCTAAAATTAAAACCTTATTGTTGGAAAAGTCTAGAGTTGTTCAACTGGCCAATGGTGAGAGATCAT
ATCACATATTTTATCAACTTTGTGCTGGATCTTCTTCTGATCTTAAAGAGAGACTGAATCTTAGAGCAGTCTGTGAATAT
AAATATCTAGTTCAGAGTGACTGCACATCAATTGATGATGCCGATGATGCTAAAAACTTTCCTCAGCTGAAGAAAGCCCT
GGATACTGTTCAAATTTGTAAAGAGGATCAAGAGATGATCTTTAAGATGCTCGCTGCAATACTATGGCTGGGAAATATAT
CATTCCAAGTAGACAGTGAAAATCACATTGAGGTTGTTGATGATGAAGCTGTAACCAGTACTGCCCAGCTGATGGGTTGC
AGTTCCCAGGAATTAATGACAGCATTATGTAGCCATAAAATTCAATCTGACGAGGATACTATTGCCAAAAATCTGACATT
GAGGCAGGCAATCGAAAGAAGAGATGCAATTGCAAAATTCATCTATGCAAGCTTGTTTGACTGGCTTGTAGAACAAGTTA
ACAAGTCACTTGAAGTGGGTAAACAATATACTGGGAAATCCATAAGTATCCTAGATATTTATGGGTTTCAGACTTTCCAG
AAAAACAGCTTTGAACAGTTTTATATAAATTATGCCAATGAGAGGATTCAACAACATTTTAATCGGCATCTGTTTAAACT
TGAGCAGGAGGATTATGAATTGGATGGCGTTGATTGGACTAAGGTAGATTTTGAGGATAATGAAGTGTGCTTGGATCTAT
TTGAGAAGAAACCTCACGGTCTACTCTCTTTATTGGATGAGGAGTCAAATTTAGCCAAGGCTTCTGATTTAACATTTGCC
AACAAACTTAAGCACCACCTGAATGCTAATCCTTGCTTCAAAGGAGAAAAAGGCAGAGCTTTCCGTGTTCGTCACTATGC
AGGGGAGGTTCTGTATGATACAAATGGCTTTCTAGAAAAGAACAGAGACATGTTGTCTTCTGATTCCATTCAATTCCTGT
CATCATGTAATTGTGAACTGCTGCAGTTGTTCTCCAAAATGTTTAACCAGTCTCAAATGCAGAGTGTTGCAACAAAGTTC
AAGGTTCAATTGTTCATGTTGATGCATCAGTTGGAGAGTACCACACCTCACTTTATTCGCTGTATAAAGCCAAATACTAA
GCAGCTTCCTGGCATTTTTGATGAAGTCCTTGTCCTACAACAGCTCAGATGTTGTGAAGTTCTAGAAGTTGTTAGAGTTT
CAAGGGCTGGATATCCTACTCGAATGGCCCATCAAGAGTTTTCCAGAAGGTATGGGTTTCTGCTTTCTGAGGCCAATGTA
TTGCAGGATCCATTGAGCATCTCGGTTGCTGTTTTGCAAAAATTTAATATCCCTTCTGAAATGTACCATGTTGGCTACAC
CAAATTGTATCTTCGAGCAGGGCAGATTGATTCACTGGAGAATAAGAGAAAGCAGGTTTTACAGGGAATACTTGGGATTC
AAAAATGCTTCCGTGGTCATCGAGCTCGTGTTTATTTCTGTGAACTTAAGAATGGAGTGACAACATTGCAATCATTTATT
CGTGGAGAAAATACAAGAAGGAAATATGGTGTTACAGTGAAGTCTTCAGTAACAATTTATTCTAGAATACTGGAGGAGAT
CCATGCAATCATACTATTACAATCTGTAATTCGTGGTTGGCTGGTTAGAAGGGGGGATGCTAGTCACATAAATAGGTCAA
AGAGATATCCTGAAAATGCTAAACCTAGGTGGAAGTCCTTTATGAAGATAATACCGGAAGTAAAGCCGGACTTGTCCAAA
GAGCCGGTTCAGAATCTGCTTTCAGCTTTAGCAGATCTCCAAAGGCGAGTTGACAAGGCTGATGCAATTGTGAAGCAAAA
GGAAGACGAAAATACCGAATTGAGGGAACAGCTAAAACAATCTGAGAGGAAGAGGATCGAATATGAGACAAAAATGAAAT
CAATGGAGGAGGCTTGGCAAAAGCAGATGGCATCTTTGCAAATGAGTCTTGTTGCTGCTAGAAAGAGCCTTGCTCCCGAG
AATGCTTCAGTTCAGCCTGTAAGACGTGATTTTGTGCTACCTCGTGGTTATGATTCTGAAGATGCTACATCCATGGGATC
TCGAACACCTGGTGGGAGCACACCAATGCTTTCTGGTAGTCTATCTGCCTCTGATGCAGGGAGACAGGTCAATGGTACAT
TGACCACAGTTGGCAATCTGATGAAGGAATTCGAGCAGGAAAGACAGAACTTTGATGATGAAGTGAAAGCTTTGAACGAT
GTTAAACCGGAGCAGTCTGCCAATACAAATTCCTTTGAAGAGCTTCGGAAACTGAAACAGAGATTTGAGGGATGGAAGAA
TCAATACAAGGTTAGATTACGAGAGACTAAAACAAGGCTTTATAAATCAGAAACGGAAAAAAGTCGGCGAACATGGTGGG
GGAAGTTAAGCTCAAAAGCATAA