Microexon ID At_4:13698954-13698967:-
Species Arabidopsis thaliana
Coordinates 4:13698954..13698967
Microexon Cluster ID MEP38
Size 14
Phase 1
Pfam Domain Motif Myosin_head
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 46,14,48
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq CAYTTYARTRMAACTGGRAARATATSTGGTGCYAADATTCAAACWTTTYTRCTTGARAAGTCWAGAGTWGTYCARYKTGCWGAWGGWGARAGRTCATAYCATATWTTT
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTCTCCTAGAAAAG
Microexon Amino Acid seq FLLEK
Microexon-tag DNA Seq CATTTTAGTGCAAAGGGAAAGATATGCGGAGCTAAACTTGAAACTTTTCTCCTAGAAAAGTCAAGAGTGGCTCAGCTGTGCAATGGCGAGAGATGCTACCATATCTTT
Microexon-tag Amino Acid Seq HFSAKGKICGAKLETFLLEKSRVAQLCNGERCYHIF
Microexon-tag spanning region13698825-13699126
Microexon-tag prediction score0.9302
Overlapped with the annotated transcript (%) 100
New Transcript ID AT4G27370.2x
Reference Transcript ID AT4G27370.2
Gene ID AT4G27370
Gene Name VIIIB
Transcript ID AT4G27370.2
Protein ID AT4G27370.2
Gene ID AT4G27370
Gene Name VIIIB
Pfam domain motif Myosin_head
Motif E-value 4.5e-233
Motif start 168
Motif end 820
Protein seq >AT4G27370.2
MMKSSVKEILESLRLLDSSERSSSLPSPSTFRAPMPLIRQSLPAKFRNAISLESKTIEKEDKDWSTEQITQSAEKEKTGN
EVVKISTAQMSRAKNSHDPEWINSAEYFVREKLCVWCRVAANGQWHLGKIHSTSSSDDVCVMLSANDDVVKVAMEEIFPA
NPEILEGVEDLTQLSYLNEPSLLYNLRVRYSQDLIYSKAGPVLIAVNPFKNVQIYGEEFLSAYQKNALDAPHVYAVADAA
YDDMMREEKNQSIIISGESGAGKTETAKYAMQYLEALGGGSFGVENEILKTNCILEAFGNAKTSRNDNSSRFGKLMEIHF
SAKGKICGAKLETFLLEKSRVAQLCNGERCYHIFYQLCAGASPILKERLKIKAASEYNYLNQSNCLTIDRTDDAQKFHKL
MEAFNIVQIPQEYQERTFALLAAVLWLGNVSFEVIDNENHVEVVADEAVTNVAMLMGCNSKKLMVVLSTCKLQAGRDCIA
KRLTLRQATDMRDSLAKIIYASLFNWLVEQINISLEVGNSRTGRSISILDIYGFESFKDNSFEQFCINYANERLQQHFNR
HLFKLEQEEYEGDGIDWTKVEFIDNQECLNLIEKKPIGLVSLLNEESNFPKATDTTFANKLKQHLNANSCFKGERGRGFR
IKHYAGEVLYNTNGFLEKNRDPLHVDLIQLLSLCKCQLLNLFSTKMHHDFLKPATFSDSMNQSVIAKFKGQLFKLMNKLE
DTTPHFIRCIKPNSNQLPGLYEENHVLQQLRCCGVLEIVRISRSGYPTRLTHQELAVRYGCLLLDTRISQDPLSTSKAIL
KQCNLPPEMYQVGYTKIYLRTGVISVLEERKKYVLRGILGLQKQFRGYQTREYFHNMRNAAVILQSYIRGENARRNYIVV
GESAIVSTAITKELDAAIHLQYMVRKWLARKLLNSTQQKNKPRNEKKKTRRKSTKRVSEDKELLSEQFEVQPCVLADLQS
RVLKVEAAIMQKEDENTALQEELQRFEERWLENETRMKSMEDTWQKHMSSMQMSLAAACKVLAPDKTASHGTDSEDTMSF
GTPTKELKGSLSDVNNLSTEFDQRSVIIHEDPKSLVEVKSDSISNRKQHAEELRRLKSRFEKWKKDYKTRLRETKARVRL
NGDEGRHRNWWCKKSY*
CDS seq >AT4G27370.2
ATGATGAAAAGTTCAGTGAAGGAAATTTTGGAGTCTCTTCGCCTGCTTGATTCAAGCGAGAGGTCTTCGTCGCTTCCCTC
TCCGTCTACTTTCAGGGCTCCAATGCCGTTGATTCGTCAATCTCTTCCGGCGAAGTTTCGTAATGCAATTTCGCTGGAAA
GCAAAACCATAGAGAAAGAAGACAAGGATTGGAGCACTGAGCAGATCACACAATCTGCAGAGAAGGAGAAAACTGGAAAT
GAAGTTGTGAAGATTAGTACTGCTCAGATGAGTCGAGCGAAGAACTCGCATGATCCTGAATGGATCAATAGCGCTGAGTA
TTTCGTCAGGGAGAAACTTTGTGTTTGGTGTCGAGTTGCTGCTAATGGGCAGTGGCACTTGGGGAAGATACATTCTACTT
CTTCTAGTGATGATGTGTGTGTTATGCTGTCCGCTAACGACGATGTTGTGAAAGTAGCAATGGAGGAGATTTTCCCTGCT
AACCCAGAGATTTTAGAAGGAGTTGAAGACCTTACTCAGCTTAGTTATTTGAATGAGCCATCTCTTCTCTACAACCTCCG
GGTCAGATACTCACAAGACTTGATTTATAGTAAAGCAGGGCCAGTTTTGATCGCAGTAAATCCTTTTAAGAATGTCCAGA
TCTATGGAGAAGAGTTTCTATCAGCTTATCAGAAGAATGCTTTGGATGCTCCTCATGTTTATGCAGTGGCAGATGCAGCT
TATGATGATATGATGAGAGAGGAGAAGAACCAATCTATTATCATAAGTGGAGAAAGTGGAGCTGGGAAAACTGAGACAGC
AAAATATGCAATGCAGTACTTGGAAGCTCTTGGTGGAGGTAGCTTTGGGGTGGAAAATGAAATCCTAAAGACAAATTGTA
TACTTGAAGCTTTTGGGAATGCTAAAACATCAAGAAATGATAACTCTAGCCGATTCGGTAAACTGATGGAGATTCATTTT
AGTGCAAAGGGAAAGATATGCGGAGCTAAACTTGAAACTTTTCTCCTAGAAAAGTCAAGAGTGGCTCAGCTGTGCAATGG
CGAGAGATGCTACCATATCTTTTATCAATTGTGTGCAGGAGCTTCACCAATTCTTAAAGAGAGATTGAAGATTAAAGCAG
CAAGTGAATACAATTATTTGAACCAGAGCAATTGCTTAACCATTGATCGCACTGATGATGCTCAGAAATTTCACAAACTG
ATGGAAGCTTTCAACATTGTTCAAATTCCTCAAGAATACCAAGAACGTACATTTGCACTGCTTGCAGCTGTGTTATGGCT
AGGAAATGTATCTTTTGAAGTGATTGACAATGAAAACCATGTCGAAGTGGTAGCAGATGAAGCTGTCACTAATGTAGCTA
TGTTAATGGGCTGCAACTCAAAAAAGCTTATGGTGGTTTTGTCTACATGCAAGCTCCAAGCTGGTAGAGATTGCATTGCT
AAAAGACTGACATTGCGACAGGCAACTGATATGAGGGATTCTCTGGCGAAAATCATCTATGCGAGCTTATTCAACTGGCT
TGTTGAGCAAATAAACATTTCACTGGAAGTTGGTAACTCACGTACAGGAAGATCTATTAGTATCCTTGATATATATGGGT
TTGAGTCATTTAAGGATAATAGTTTCGAACAATTTTGCATAAACTATGCAAATGAAAGACTGCAGCAACATTTCAACCGA
CATCTATTTAAACTTGAACAAGAGGAATATGAAGGGGATGGAATTGATTGGACTAAGGTCGAATTCATAGACAATCAAGA
GTGTCTAAATCTGATTGAGAAGAAACCCATTGGTTTGGTATCGCTATTAAATGAGGAATCAAATTTTCCAAAGGCAACTG
ATACGACGTTTGCCAACAAGCTCAAGCAGCATTTGAATGCTAACTCTTGTTTCAAGGGAGAGAGAGGTCGGGGTTTCAGA
ATTAAGCATTATGCCGGAGAGGTTCTCTATAATACAAATGGCTTCCTGGAAAAGAACAGAGATCCGCTGCATGTTGATCT
TATCCAACTTTTATCATTGTGCAAATGTCAATTATTGAACCTGTTTTCTACTAAGATGCATCACGATTTTCTAAAGCCGG
CAACCTTTTCGGACTCCATGAATCAAAGTGTGATTGCCAAGTTTAAGGGCCAACTTTTCAAGTTGATGAACAAGTTAGAA
GACACTACTCCTCATTTTATTCGATGCATAAAACCAAATTCAAATCAGCTTCCTGGACTTTACGAAGAAAATCATGTTCT
ACAACAGCTTAGATGTTGTGGCGTTTTGGAGATTGTTAGAATATCAAGATCAGGATATCCTACGCGGTTGACACACCAGG
AGCTTGCAGTAAGATATGGATGCCTACTGCTGGACACAAGAATCTCCCAGGATCCACTTAGCACATCAAAAGCTATTTTG
AAACAATGCAATCTTCCTCCTGAAATGTATCAAGTTGGTTATACGAAAATATACCTCCGAACTGGAGTAATTAGCGTCCT
TGAGGAAAGAAAAAAGTATGTTTTGCGTGGCATACTTGGATTACAGAAGCAATTTCGTGGTTATCAGACTCGTGAATACT
TTCACAATATGAGGAATGCTGCAGTGATACTTCAGTCATATATCCGTGGAGAAAATGCCAGGAGAAACTACATTGTGGTA
GGAGAATCAGCTATTGTTTCCACAGCAATAACTAAAGAGCTTGATGCAGCTATACACTTGCAATATATGGTACGCAAATG
GCTGGCTCGTAAACTCTTAAATAGTACGCAACAGAAGAACAAACCTCGCAATGAGAAGAAAAAAACGAGAAGAAAGTCAA
CAAAGAGGGTTTCCGAAGACAAGGAACTTCTTTCAGAGCAGTTTGAGGTCCAACCTTGTGTTCTTGCTGATCTCCAGAGC
CGGGTTCTGAAAGTCGAAGCAGCTATAATGCAGAAGGAAGATGAAAACACTGCATTGCAGGAGGAGTTGCAACGATTTGA
GGAAAGATGGTTAGAGAACGAAACAAGAATGAAGTCAATGGAGGACACATGGCAAAAGCATATGTCGTCAATGCAGATGA
GTCTCGCAGCTGCCTGTAAGGTTTTGGCTCCAGACAAGACTGCAAGTCACGGTACTGACTCAGAAGACACAATGTCCTTT
GGAACCCCCACGAAAGAGCTTAAGGGCAGCTTGAGTGACGTTAATAATCTTTCTACAGAATTTGATCAACGAAGCGTCAT
AATTCACGAAGATCCGAAAAGTCTTGTCGAGGTGAAATCAGACTCGATCTCAAACAGGAAGCAGCACGCAGAAGAACTCA
GGAGACTAAAGTCAAGATTTGAGAAATGGAAGAAAGATTACAAAACAAGACTAAGAGAGACAAAGGCAAGAGTCCGGTTA
AATGGTGATGAAGGTCGTCATCGGAATTGGTGGTGCAAGAAAAGTTATTGA
Microexon DNA seq TTCTCCTAGAAAAG
Microexon Amino Acid seq FLLEK
Microexon-tag DNA Seq CATTTTAGTGCAAAGGGAAAGATATGCGGAGCTAAACTTGAAACTTTTCTCCTAGAAAAGTCAAGAGTGGCTCAGCTGTGCAATGGCGAGAGATGCTACCATATCTTT
Microexon-tag Amino Acid seq HFSAKGKICGAKLETFLLEKSRVAQLCNGERCYHIF
Transcript ID AT4G27370.2
Gene ID At.20200
Gene Name VIIIB
Pfam domain motif Myosin_head
Motif E-value 4.5e-233
Motif start 168
Motif end 820
Protein seq >AT4G27370.2
MMKSSVKEILESLRLLDSSERSSSLPSPSTFRAPMPLIRQSLPAKFRNAISLESKTIEKEDKDWSTEQITQSAEKEKTGN
EVVKISTAQMSRAKNSHDPEWINSAEYFVREKLCVWCRVAANGQWHLGKIHSTSSSDDVCVMLSANDDVVKVAMEEIFPA
NPEILEGVEDLTQLSYLNEPSLLYNLRVRYSQDLIYSKAGPVLIAVNPFKNVQIYGEEFLSAYQKNALDAPHVYAVADAA
YDDMMREEKNQSIIISGESGAGKTETAKYAMQYLEALGGGSFGVENEILKTNCILEAFGNAKTSRNDNSSRFGKLMEIHF
SAKGKICGAKLETFLLEKSRVAQLCNGERCYHIFYQLCAGASPILKERLKIKAASEYNYLNQSNCLTIDRTDDAQKFHKL
MEAFNIVQIPQEYQERTFALLAAVLWLGNVSFEVIDNENHVEVVADEAVTNVAMLMGCNSKKLMVVLSTCKLQAGRDCIA
KRLTLRQATDMRDSLAKIIYASLFNWLVEQINISLEVGNSRTGRSISILDIYGFESFKDNSFEQFCINYANERLQQHFNR
HLFKLEQEEYEGDGIDWTKVEFIDNQECLNLIEKKPIGLVSLLNEESNFPKATDTTFANKLKQHLNANSCFKGERGRGFR
IKHYAGEVLYNTNGFLEKNRDPLHVDLIQLLSLCKCQLLNLFSTKMHHDFLKPATFSDSMNQSVIAKFKGQLFKLMNKLE
DTTPHFIRCIKPNSNQLPGLYEENHVLQQLRCCGVLEIVRISRSGYPTRLTHQELAVRYGCLLLDTRISQDPLSTSKAIL
KQCNLPPEMYQVGYTKIYLRTGVISVLEERKKYVLRGILGLQKQFRGYQTREYFHNMRNAAVILQSYIRGENARRNYIVV
GESAIVSTAITKELDAAIHLQYMVRKWLARKLLNSTQQKNKPRNEKKKTRRKSTKRVSEDKELLSEQFEVQPCVLADLQS
RVLKVEAAIMQKEDENTALQEELQRFEERWLENETRMKSMEDTWQKHMSSMQMSLAAACKVLAPDKTASHGTDSEDTMSF
GTPTKELKGSLSDVNNLSTEFDQRSVIIHEDPKSLVEVKSDSISNRKQHAEELRRLKSRFEKWKKDYKTRLRETKARVRL
NGDEGRHRNWWCKKSY*
CDS seq >AT4G27370.2
ATGATGAAAAGTTCAGTGAAGGAAATTTTGGAGTCTCTTCGCCTGCTTGATTCAAGCGAGAGGTCTTCGTCGCTTCCCTC
TCCGTCTACTTTCAGGGCTCCAATGCCGTTGATTCGTCAATCTCTTCCGGCGAAGTTTCGTAATGCAATTTCGCTGGAAA
GCAAAACCATAGAGAAAGAAGACAAGGATTGGAGCACTGAGCAGATCACACAATCTGCAGAGAAGGAGAAAACTGGAAAT
GAAGTTGTGAAGATTAGTACTGCTCAGATGAGTCGAGCGAAGAACTCGCATGATCCTGAATGGATCAATAGCGCTGAGTA
TTTCGTCAGGGAGAAACTTTGTGTTTGGTGTCGAGTTGCTGCTAATGGGCAGTGGCACTTGGGGAAGATACATTCTACTT
CTTCTAGTGATGATGTGTGTGTTATGCTGTCCGCTAACGACGATGTTGTGAAAGTAGCAATGGAGGAGATTTTCCCTGCT
AACCCAGAGATTTTAGAAGGAGTTGAAGACCTTACTCAGCTTAGTTATTTGAATGAGCCATCTCTTCTCTACAACCTCCG
GGTCAGATACTCACAAGACTTGATTTATAGTAAAGCAGGGCCAGTTTTGATCGCAGTAAATCCTTTTAAGAATGTCCAGA
TCTATGGAGAAGAGTTTCTATCAGCTTATCAGAAGAATGCTTTGGATGCTCCTCATGTTTATGCAGTGGCAGATGCAGCT
TATGATGATATGATGAGAGAGGAGAAGAACCAATCTATTATCATAAGTGGAGAAAGTGGAGCTGGGAAAACTGAGACAGC
AAAATATGCAATGCAGTACTTGGAAGCTCTTGGTGGAGGTAGCTTTGGGGTGGAAAATGAAATCCTAAAGACAAATTGTA
TACTTGAAGCTTTTGGGAATGCTAAAACATCAAGAAATGATAACTCTAGCCGATTCGGTAAACTGATGGAGATTCATTTT
AGTGCAAAGGGAAAGATATGCGGAGCTAAACTTGAAACTTTTCTCCTAGAAAAGTCAAGAGTGGCTCAGCTGTGCAATGG
CGAGAGATGCTACCATATCTTTTATCAATTGTGTGCAGGAGCTTCACCAATTCTTAAAGAGAGATTGAAGATTAAAGCAG
CAAGTGAATACAATTATTTGAACCAGAGCAATTGCTTAACCATTGATCGCACTGATGATGCTCAGAAATTTCACAAACTG
ATGGAAGCTTTCAACATTGTTCAAATTCCTCAAGAATACCAAGAACGTACATTTGCACTGCTTGCAGCTGTGTTATGGCT
AGGAAATGTATCTTTTGAAGTGATTGACAATGAAAACCATGTCGAAGTGGTAGCAGATGAAGCTGTCACTAATGTAGCTA
TGTTAATGGGCTGCAACTCAAAAAAGCTTATGGTGGTTTTGTCTACATGCAAGCTCCAAGCTGGTAGAGATTGCATTGCT
AAAAGACTGACATTGCGACAGGCAACTGATATGAGGGATTCTCTGGCGAAAATCATCTATGCGAGCTTATTCAACTGGCT
TGTTGAGCAAATAAACATTTCACTGGAAGTTGGTAACTCACGTACAGGAAGATCTATTAGTATCCTTGATATATATGGGT
TTGAGTCATTTAAGGATAATAGTTTCGAACAATTTTGCATAAACTATGCAAATGAAAGACTGCAGCAACATTTCAACCGA
CATCTATTTAAACTTGAACAAGAGGAATATGAAGGGGATGGAATTGATTGGACTAAGGTCGAATTCATAGACAATCAAGA
GTGTCTAAATCTGATTGAGAAGAAACCCATTGGTTTGGTATCGCTATTAAATGAGGAATCAAATTTTCCAAAGGCAACTG
ATACGACGTTTGCCAACAAGCTCAAGCAGCATTTGAATGCTAACTCTTGTTTCAAGGGAGAGAGAGGTCGGGGTTTCAGA
ATTAAGCATTATGCCGGAGAGGTTCTCTATAATACAAATGGCTTCCTGGAAAAGAACAGAGATCCGCTGCATGTTGATCT
TATCCAACTTTTATCATTGTGCAAATGTCAATTATTGAACCTGTTTTCTACTAAGATGCATCACGATTTTCTAAAGCCGG
CAACCTTTTCGGACTCCATGAATCAAAGTGTGATTGCCAAGTTTAAGGGCCAACTTTTCAAGTTGATGAACAAGTTAGAA
GACACTACTCCTCATTTTATTCGATGCATAAAACCAAATTCAAATCAGCTTCCTGGACTTTACGAAGAAAATCATGTTCT
ACAACAGCTTAGATGTTGTGGCGTTTTGGAGATTGTTAGAATATCAAGATCAGGATATCCTACGCGGTTGACACACCAGG
AGCTTGCAGTAAGATATGGATGCCTACTGCTGGACACAAGAATCTCCCAGGATCCACTTAGCACATCAAAAGCTATTTTG
AAACAATGCAATCTTCCTCCTGAAATGTATCAAGTTGGTTATACGAAAATATACCTCCGAACTGGAGTAATTAGCGTCCT
TGAGGAAAGAAAAAAGTATGTTTTGCGTGGCATACTTGGATTACAGAAGCAATTTCGTGGTTATCAGACTCGTGAATACT
TTCACAATATGAGGAATGCTGCAGTGATACTTCAGTCATATATCCGTGGAGAAAATGCCAGGAGAAACTACATTGTGGTA
GGAGAATCAGCTATTGTTTCCACAGCAATAACTAAAGAGCTTGATGCAGCTATACACTTGCAATATATGGTACGCAAATG
GCTGGCTCGTAAACTCTTAAATAGTACGCAACAGAAGAACAAACCTCGCAATGAGAAGAAAAAAACGAGAAGAAAGTCAA
CAAAGAGGGTTTCCGAAGACAAGGAACTTCTTTCAGAGCAGTTTGAGGTCCAACCTTGTGTTCTTGCTGATCTCCAGAGC
CGGGTTCTGAAAGTCGAAGCAGCTATAATGCAGAAGGAAGATGAAAACACTGCATTGCAGGAGGAGTTGCAACGATTTGA
GGAAAGATGGTTAGAGAACGAAACAAGAATGAAGTCAATGGAGGACACATGGCAAAAGCATATGTCGTCAATGCAGATGA
GTCTCGCAGCTGCCTGTAAGGTTTTGGCTCCAGACAAGACTGCAAGTCACGGTACTGACTCAGAAGACACAATGTCCTTT
GGAACCCCCACGAAAGAGCTTAAGGGCAGCTTGAGTGACGTTAATAATCTTTCTACAGAATTTGATCAACGAAGCGTCAT
AATTCACGAAGATCCGAAAAGTCTTGTCGAGGTGAAATCAGACTCGATCTCAAACAGGAAGCAGCACGCAGAAGAACTCA
GGAGACTAAAGTCAAGATTTGAGAAATGGAAGAAAGATTACAAAACAAGACTAAGAGAGACAAAGGCAAGAGTCCGGTTA
AATGGTGATGAAGGTCGTCATCGGAATTGGTGGTGCAAGAAAAGTTATTGA