Microexon ID At_5:23159719-23159725:-
Species Arabidopsis thaliana
Coordinates 5:23159719..23159725
Microexon Cluster ID MEP13
Size 7
Phase 0
Pfam Domain Motif DNA_ligase_A_M
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 13,38,7,50
Microexon location in the Microexon-tag 3
Microexon-tag DNA Seq AAYCAGGAAATWGCMAAGGCWGCAARRGAKGGRYTKGMSASTGATMGACAGTTRTGYTATGTTGCWTTTGAYRTTCTKTATGYTGGAGAYACYAGYGTYATYCAYCAR
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq TTGTGTT
Microexon Amino Acid seq LCY
Microexon-tag DNA Seq AATCAGGAAATAGCCAAGGCAGCAAGGGAGGGTCTTGACAGCCATAAACAGTTGTGTTATGTTGCCTTTGATGTTCTTTATGTTGGCGATACTAGCGTCATCCACCAG
Microexon-tag Amino Acid Seq NQEIAKAAREGLDSHKQLCYVAFDVLYVGDTSVIHQ
Microexon-tag spanning region23159579-23159943
Microexon-tag prediction score0.9618
Overlapped with the annotated transcript (%) 100
New Transcript ID AT5G57160.1x
Reference Transcript ID AT5G57160.1
Gene ID AT5G57160
Gene Name LIG4
Transcript ID AT5G57160.1
Protein ID AT5G57160.1
Gene ID AT5G57160
Gene Name LIG4
Pfam domain motif DNA_ligase_A_M
Motif E-value 9.6e-42
Motif start 227
Motif end 442
Protein seq >AT5G57160.1
MTEEIKFSVLVSLFNWIQKSKTSSQKRSKFRKFLDTYCKPSDYFVAVRLIIPSLDRERGSYGLKESVLATCLIDALGISR
DAPDAVRLLNWRKGGTAKAGANAGNFSLIAAEVLQRRQGMASGGLTIKELNDLLDRLASSENRAEKTLVLSTLIQKTNAQ
EMKWVIRIILKDLKLGMSEKSIFQEFHPDAEDLFNVTCDLKLVCEKLRDRHQRHKRQDIEVGKAVRPQLAMRIGDVNAAW
KKLHGKDVVAECKFDGDRIQIHKNGTDIHYFSRNFLDHSEYAHAMSDLIVQNILVDKCILDGEMLVWDTSLNRFAEFGSN
QEIAKAAREGLDSHKQLCYVAFDVLYVGDTSVIHQSLKERHELLKKVVKPLKGRLEVLVPEGGLNVHRPSGEPSWSIVVH
AAADVERFFKETVENRDEGIVLKDLESKWEPGDRSGKWMKLKPEYIRAGADLDVLIIGGYYGSGRRGGEVAQFLVALADR
AEANVYPRRFMSFCRVGTGLSDDELNTVVSKLKPYFRKNEHPKKAPPSFYQVTNHSKERPDVWIDSPEKSIILSITSDIR
TIRSEVFVAPYSLRFPRIDKVRYDKPWHECLDVQAFVELVNSSNGTTQKQKESESTQDNPKVNKSSKRGEKKNVSLVPSQ
FIQTDVSDIKGKTSIFSNMIFYFVNVPRSHSLETFHKMVVENGGKFSMNLNNSVTHCIAAESSGIKYQAAKRQRDVIHFS
WVLDCCSRNKMLPLLPKYFLHLTDASRTKLQDDIDEFSDSYYWDLDLEGLKQVLSNAKQSEDSKSIDYYKKKLCPEKRWS
CLLSCCVYFYPYSQTLSTEEEALLGIMAKRLMLEVLMAGGKVSNNLAHASHLVVLAMAEEPLDFTLVSKSFSEMEKRLLL
KKRLHVVSSHWLEESLQREEKLCEDVYTLRPKYMEESDTEESDKSEHDTTEVASQGSAQTKEPASSKIAITSSRGRSNTR
AVKRGRSSTNSLQRVQRRRGKQPSKISGDETEESDASEEKVSTRLSDIAEETDSFGEAQRNSSRGKCAKRGKSRVGQTQR
VQRSRRGKKAAKIGGDESDENDELDGNNNVSADAEEGNAAGRSVENEETREPDIAKYTESQQRDNTVAVEEALQDSRNAK
TEMDMKEKLQIHEDPLQAMLMKMFPIPSQKTTETSNRTTGEYRKANVSGECESSEKRKLDAETDNTSVNAGAESDVVPPL
VKKKKVSYRDVAGELLKDW*
CDS seq >AT5G57160.1
ATGACGGAGGAGATCAAATTCAGCGTACTGGTATCTCTCTTTAACTGGATTCAAAAAAGCAAAACCTCATCTCAGAAACG
ATCCAAATTCCGCAAGTTTCTCGACACTTACTGCAAACCCTCCGACTACTTCGTCGCCGTCCGTTTAATCATTCCGTCGC
TTGATCGAGAACGAGGTAGCTACGGCCTTAAAGAGTCAGTGCTCGCCACGTGTCTGATCGACGCACTTGGTATCTCGCGT
GATGCTCCTGATGCTGTTCGTCTTCTCAATTGGCGTAAAGGAGGAACTGCTAAAGCTGGAGCCAATGCTGGAAACTTCTC
CTTAATTGCTGCTGAGGTATTGCAACGTAGACAAGGAATGGCTTCTGGCGGTTTGACTATTAAGGAATTGAATGATTTGC
TTGATCGTTTGGCTTCAAGTGAGAACAGAGCGGAGAAAACTTTGGTTCTTTCTACATTGATTCAGAAGACAAATGCTCAG
GAGATGAAGTGGGTCATTAGGATTATTCTAAAAGATTTGAAACTGGGAATGAGTGAGAAGAGCATTTTTCAGGAGTTCCA
TCCAGATGCTGAGGACTTGTTTAATGTCACATGTGATTTGAAACTAGTCTGTGAAAAGTTGAGGGATCGACACCAACGGC
ACAAGCGCCAGGATATAGAGGTTGGAAAAGCCGTACGACCCCAGTTAGCTATGCGTATTGGTGACGTAAATGCTGCTTGG
AAGAAGCTGCATGGGAAAGATGTAGTTGCTGAATGCAAATTTGATGGGGATCGCATACAAATACACAAAAATGGGACTGA
CATACATTACTTCTCTAGGAACTTCCTTGACCATTCTGAATACGCACATGCCATGTCGGATCTTATTGTACAAAATATAT
TGGTGGACAAGTGTATCCTTGATGGTGAAATGTTGGTCTGGGATACATCTCTGAATCGGTTTGCTGAGTTTGGTTCGAAT
CAGGAAATAGCCAAGGCAGCAAGGGAGGGTCTTGACAGCCATAAACAGTTGTGTTATGTTGCCTTTGATGTTCTTTATGT
TGGCGATACTAGCGTCATCCACCAGAGTTTGAAGGAACGGCATGAACTGCTGAAGAAAGTGGTCAAACCTTTAAAAGGTC
GGCTCGAAGTTTTAGTACCTGAAGGTGGTCTCAATGTTCACCGCCCTTCAGGGGAACCTAGTTGGTCTATTGTTGTTCAT
GCTGCTGCTGATGTTGAGCGATTTTTCAAAGAAACAGTTGAAAACAGAGATGAAGGAATTGTGCTTAAAGACCTGGAATC
AAAATGGGAACCTGGAGATCGTAGTGGCAAGTGGATGAAATTGAAGCCTGAATATATCCGGGCTGGTGCTGACCTGGATG
TTCTTATAATAGGAGGATACTATGGCTCCGGACGTCGTGGAGGAGAGGTAGCACAGTTTCTTGTAGCCTTGGCTGACCGG
GCAGAGGCAAATGTATATCCCAGGCGATTTATGTCCTTTTGTAGAGTTGGCACTGGGCTTTCTGATGATGAGCTTAACAC
TGTTGTAAGCAAACTGAAGCCTTATTTCAGAAAAAATGAGCACCCAAAGAAGGCTCCACCAAGCTTCTATCAGGTCACTA
ATCACTCCAAAGAGAGACCGGATGTTTGGATTGACAGCCCGGAAAAATCAATAATACTTTCAATCACTAGTGATATCAGG
ACGATAAGGTCTGAGGTGTTTGTTGCACCTTACAGTCTGAGGTTTCCTCGTATTGATAAAGTAAGATATGACAAGCCTTG
GCATGAATGTCTTGATGTGCAGGCTTTTGTGGAATTGGTGAATTCGAGTAACGGCACCACACAGAAACAGAAGGAGTCTG
AAAGCACACAGGACAATCCGAAAGTCAATAAATCCTCCAAGAGAGGAGAGAAAAAGAATGTCTCTCTTGTTCCCTCTCAG
TTTATTCAAACTGATGTATCGGATATCAAGGGCAAGACCTCAATCTTCTCAAATATGATATTTTATTTTGTTAACGTGCC
TCGGTCCCATTCTCTTGAGACATTCCACAAAATGGTGGTTGAAAACGGAGGGAAGTTTTCAATGAACTTAAACAACTCAG
TGACTCATTGCATTGCAGCAGAAAGCAGTGGAATAAAGTATCAGGCAGCAAAGCGTCAGCGAGATGTCATCCACTTTTCA
TGGGTCTTAGATTGTTGTTCACGAAATAAGATGCTTCCTTTGCTGCCAAAGTACTTCCTCCACCTCACTGATGCTTCAAG
GACAAAATTACAGGATGATATTGATGAATTTTCTGATTCTTATTACTGGGATTTAGACCTTGAAGGTCTCAAACAGGTCT
TAAGCAATGCCAAGCAATCTGAAGATTCAAAATCTATTGACTACTACAAGAAAAAGTTATGTCCTGAAAAAAGATGGTCC
TGCCTCTTAAGCTGCTGTGTTTACTTTTACCCGTATAGTCAAACATTGAGCACTGAAGAGGAAGCCCTATTGGGAATTAT
GGCTAAGAGATTAATGCTTGAAGTCTTAATGGCTGGTGGTAAAGTTAGCAATAATCTTGCTCATGCGTCGCACCTTGTAG
TTCTTGCTATGGCAGAAGAACCTTTGGATTTTACTTTAGTTTCGAAAAGTTTCAGCGAAATGGAGAAACGTCTTTTGCTG
AAGAAAAGGCTTCATGTTGTTAGCTCGCACTGGTTAGAGGAAAGCTTGCAAAGAGAGGAAAAACTGTGCGAGGATGTCTA
CACTCTGAGGCCTAAGTATATGGAAGAGTCTGATACTGAAGAATCAGATAAATCCGAACATGATACAACGGAAGTAGCTT
CCCAAGGTAGTGCTCAAACCAAAGAGCCAGCTTCCTCTAAAATTGCAATTACGAGTTCAAGAGGACGGTCAAATACTCGA
GCTGTTAAAAGGGGAAGGTCTTCTACAAACTCACTGCAACGAGTACAGAGACGCAGAGGCAAGCAGCCGTCTAAGATAAG
CGGAGATGAAACAGAGGAAAGTGATGCTTCTGAAGAAAAGGTTTCAACAAGACTCAGTGATATAGCAGAAGAGACGGATT
CGTTTGGTGAGGCTCAAAGAAACTCTAGCAGAGGAAAATGTGCGAAGAGAGGAAAGTCTCGTGTGGGACAAACCCAAAGA
GTACAGAGATCACGCAGAGGCAAGAAAGCTGCGAAGATAGGAGGAGATGAGTCTGATGAAAATGATGAATTAGATGGCAA
TAACAATGTGTCAGCAGATGCGGAAGAGGGTAATGCAGCTGGTAGATCTGTGGAAAACGAAGAAACCCGAGAGCCTGACA
TAGCGAAGTACACAGAATCACAGCAAAGAGACAACACAGTAGCAGTTGAAGAGGCCTTACAAGATTCTAGAAATGCAAAG
ACGGAGATGGATATGAAAGAGAAGTTACAAATCCATGAGGATCCACTACAAGCTATGTTAATGAAAATGTTCCCTATTCC
CAGTCAAAAAACCACAGAGACATCAAACCGAACCACTGGGGAATATAGAAAGGCTAATGTTTCTGGTGAATGTGAGTCTT
CAGAGAAAAGAAAACTCGACGCTGAGACTGATAATACATCTGTGAACGCAGGCGCAGAATCAGACGTAGTTCCTCCTCTA
GTGAAGAAAAAGAAAGTCAGCTACCGGGATGTTGCCGGAGAACTGCTCAAAGACTGGTGA
Microexon DNA seq TTGTGTT
Microexon Amino Acid seq LCY
Microexon-tag DNA Seq AATCAGGAAATAGCCAAGGCAGCAAGGGAGGGTCTTGACAGCCATAAACAGTTGTGTTATGTTGCCTTTGATGTTCTTTATGTTGGCGATACTAGCGTCATCCACCAG
Microexon-tag Amino Acid seq NQEIAKAAREGLDSHKQLCYVAFDVLYVGDTSVIHQ
Transcript ID AT5G57160.1
Gene ID At.26859
Gene Name LIG4
Pfam domain motif DNA_ligase_A_M
Motif E-value 9.6e-42
Motif start 227
Motif end 442
Protein seq >AT5G57160.1
MTEEIKFSVLVSLFNWIQKSKTSSQKRSKFRKFLDTYCKPSDYFVAVRLIIPSLDRERGSYGLKESVLATCLIDALGISR
DAPDAVRLLNWRKGGTAKAGANAGNFSLIAAEVLQRRQGMASGGLTIKELNDLLDRLASSENRAEKTLVLSTLIQKTNAQ
EMKWVIRIILKDLKLGMSEKSIFQEFHPDAEDLFNVTCDLKLVCEKLRDRHQRHKRQDIEVGKAVRPQLAMRIGDVNAAW
KKLHGKDVVAECKFDGDRIQIHKNGTDIHYFSRNFLDHSEYAHAMSDLIVQNILVDKCILDGEMLVWDTSLNRFAEFGSN
QEIAKAAREGLDSHKQLCYVAFDVLYVGDTSVIHQSLKERHELLKKVVKPLKGRLEVLVPEGGLNVHRPSGEPSWSIVVH
AAADVERFFKETVENRDEGIVLKDLESKWEPGDRSGKWMKLKPEYIRAGADLDVLIIGGYYGSGRRGGEVAQFLVALADR
AEANVYPRRFMSFCRVGTGLSDDELNTVVSKLKPYFRKNEHPKKAPPSFYQVTNHSKERPDVWIDSPEKSIILSITSDIR
TIRSEVFVAPYSLRFPRIDKVRYDKPWHECLDVQAFVELVNSSNGTTQKQKESESTQDNPKVNKSSKRGEKKNVSLVPSQ
FIQTDVSDIKGKTSIFSNMIFYFVNVPRSHSLETFHKMVVENGGKFSMNLNNSVTHCIAAESSGIKYQAAKRQRDVIHFS
WVLDCCSRNKMLPLLPKYFLHLTDASRTKLQDDIDEFSDSYYWDLDLEGLKQVLSNAKQSEDSKSIDYYKKKLCPEKRWS
CLLSCCVYFYPYSQTLSTEEEALLGIMAKRLMLEVLMAGGKVSNNLAHASHLVVLAMAEEPLDFTLVSKSFSEMEKRLLL
KKRLHVVSSHWLEESLQREEKLCEDVYTLRPKYMEESDTEESDKSEHDTTEVASQGSAQTKEPASSKIAITSSRGRSNTR
AVKRGRSSTNSLQRVQRRRGKQPSKISGDETEESDASEEKVSTRLSDIAEETDSFGEAQRNSSRGKCAKRGKSRVGQTQR
VQRSRRGKKAAKIGGDESDENDELDGNNNVSADAEEGNAAGRSVENEETREPDIAKYTESQQRDNTVAVEEALQDSRNAK
TEMDMKEKLQIHEDPLQAMLMKMFPIPSQKTTETSNRTTGEYRKANVSGECESSEKRKLDAETDNTSVNAGAESDVVPPL
VKKKKVSYRDVAGELLKDW*
CDS seq >AT5G57160.1
ATGACGGAGGAGATCAAATTCAGCGTACTGGTATCTCTCTTTAACTGGATTCAAAAAAGCAAAACCTCATCTCAGAAACG
ATCCAAATTCCGCAAGTTTCTCGACACTTACTGCAAACCCTCCGACTACTTCGTCGCCGTCCGTTTAATCATTCCGTCGC
TTGATCGAGAACGAGGTAGCTACGGCCTTAAAGAGTCAGTGCTCGCCACGTGTCTGATCGACGCACTTGGTATCTCGCGT
GATGCTCCTGATGCTGTTCGTCTTCTCAATTGGCGTAAAGGAGGAACTGCTAAAGCTGGAGCCAATGCTGGAAACTTCTC
CTTAATTGCTGCTGAGGTATTGCAACGTAGACAAGGAATGGCTTCTGGCGGTTTGACTATTAAGGAATTGAATGATTTGC
TTGATCGTTTGGCTTCAAGTGAGAACAGAGCGGAGAAAACTTTGGTTCTTTCTACATTGATTCAGAAGACAAATGCTCAG
GAGATGAAGTGGGTCATTAGGATTATTCTAAAAGATTTGAAACTGGGAATGAGTGAGAAGAGCATTTTTCAGGAGTTCCA
TCCAGATGCTGAGGACTTGTTTAATGTCACATGTGATTTGAAACTAGTCTGTGAAAAGTTGAGGGATCGACACCAACGGC
ACAAGCGCCAGGATATAGAGGTTGGAAAAGCCGTACGACCCCAGTTAGCTATGCGTATTGGTGACGTAAATGCTGCTTGG
AAGAAGCTGCATGGGAAAGATGTAGTTGCTGAATGCAAATTTGATGGGGATCGCATACAAATACACAAAAATGGGACTGA
CATACATTACTTCTCTAGGAACTTCCTTGACCATTCTGAATACGCACATGCCATGTCGGATCTTATTGTACAAAATATAT
TGGTGGACAAGTGTATCCTTGATGGTGAAATGTTGGTCTGGGATACATCTCTGAATCGGTTTGCTGAGTTTGGTTCGAAT
CAGGAAATAGCCAAGGCAGCAAGGGAGGGTCTTGACAGCCATAAACAGTTGTGTTATGTTGCCTTTGATGTTCTTTATGT
TGGCGATACTAGCGTCATCCACCAGAGTTTGAAGGAACGGCATGAACTGCTGAAGAAAGTGGTCAAACCTTTAAAAGGTC
GGCTCGAAGTTTTAGTACCTGAAGGTGGTCTCAATGTTCACCGCCCTTCAGGGGAACCTAGTTGGTCTATTGTTGTTCAT
GCTGCTGCTGATGTTGAGCGATTTTTCAAAGAAACAGTTGAAAACAGAGATGAAGGAATTGTGCTTAAAGACCTGGAATC
AAAATGGGAACCTGGAGATCGTAGTGGCAAGTGGATGAAATTGAAGCCTGAATATATCCGGGCTGGTGCTGACCTGGATG
TTCTTATAATAGGAGGATACTATGGCTCCGGACGTCGTGGAGGAGAGGTAGCACAGTTTCTTGTAGCCTTGGCTGACCGG
GCAGAGGCAAATGTATATCCCAGGCGATTTATGTCCTTTTGTAGAGTTGGCACTGGGCTTTCTGATGATGAGCTTAACAC
TGTTGTAAGCAAACTGAAGCCTTATTTCAGAAAAAATGAGCACCCAAAGAAGGCTCCACCAAGCTTCTATCAGGTCACTA
ATCACTCCAAAGAGAGACCGGATGTTTGGATTGACAGCCCGGAAAAATCAATAATACTTTCAATCACTAGTGATATCAGG
ACGATAAGGTCTGAGGTGTTTGTTGCACCTTACAGTCTGAGGTTTCCTCGTATTGATAAAGTAAGATATGACAAGCCTTG
GCATGAATGTCTTGATGTGCAGGCTTTTGTGGAATTGGTGAATTCGAGTAACGGCACCACACAGAAACAGAAGGAGTCTG
AAAGCACACAGGACAATCCGAAAGTCAATAAATCCTCCAAGAGAGGAGAGAAAAAGAATGTCTCTCTTGTTCCCTCTCAG
TTTATTCAAACTGATGTATCGGATATCAAGGGCAAGACCTCAATCTTCTCAAATATGATATTTTATTTTGTTAACGTGCC
TCGGTCCCATTCTCTTGAGACATTCCACAAAATGGTGGTTGAAAACGGAGGGAAGTTTTCAATGAACTTAAACAACTCAG
TGACTCATTGCATTGCAGCAGAAAGCAGTGGAATAAAGTATCAGGCAGCAAAGCGTCAGCGAGATGTCATCCACTTTTCA
TGGGTCTTAGATTGTTGTTCACGAAATAAGATGCTTCCTTTGCTGCCAAAGTACTTCCTCCACCTCACTGATGCTTCAAG
GACAAAATTACAGGATGATATTGATGAATTTTCTGATTCTTATTACTGGGATTTAGACCTTGAAGGTCTCAAACAGGTCT
TAAGCAATGCCAAGCAATCTGAAGATTCAAAATCTATTGACTACTACAAGAAAAAGTTATGTCCTGAAAAAAGATGGTCC
TGCCTCTTAAGCTGCTGTGTTTACTTTTACCCGTATAGTCAAACATTGAGCACTGAAGAGGAAGCCCTATTGGGAATTAT
GGCTAAGAGATTAATGCTTGAAGTCTTAATGGCTGGTGGTAAAGTTAGCAATAATCTTGCTCATGCGTCGCACCTTGTAG
TTCTTGCTATGGCAGAAGAACCTTTGGATTTTACTTTAGTTTCGAAAAGTTTCAGCGAAATGGAGAAACGTCTTTTGCTG
AAGAAAAGGCTTCATGTTGTTAGCTCGCACTGGTTAGAGGAAAGCTTGCAAAGAGAGGAAAAACTGTGCGAGGATGTCTA
CACTCTGAGGCCTAAGTATATGGAAGAGTCTGATACTGAAGAATCAGATAAATCCGAACATGATACAACGGAAGTAGCTT
CCCAAGGTAGTGCTCAAACCAAAGAGCCAGCTTCCTCTAAAATTGCAATTACGAGTTCAAGAGGACGGTCAAATACTCGA
GCTGTTAAAAGGGGAAGGTCTTCTACAAACTCACTGCAACGAGTACAGAGACGCAGAGGCAAGCAGCCGTCTAAGATAAG
CGGAGATGAAACAGAGGAAAGTGATGCTTCTGAAGAAAAGGTTTCAACAAGACTCAGTGATATAGCAGAAGAGACGGATT
CGTTTGGTGAGGCTCAAAGAAACTCTAGCAGAGGAAAATGTGCGAAGAGAGGAAAGTCTCGTGTGGGACAAACCCAAAGA
GTACAGAGATCACGCAGAGGCAAGAAAGCTGCGAAGATAGGAGGAGATGAGTCTGATGAAAATGATGAATTAGATGGCAA
TAACAATGTGTCAGCAGATGCGGAAGAGGGTAATGCAGCTGGTAGATCTGTGGAAAACGAAGAAACCCGAGAGCCTGACA
TAGCGAAGTACACAGAATCACAGCAAAGAGACAACACAGTAGCAGTTGAAGAGGCCTTACAAGATTCTAGAAATGCAAAG
ACGGAGATGGATATGAAAGAGAAGTTACAAATCCATGAGGATCCACTACAAGCTATGTTAATGAAAATGTTCCCTATTCC
CAGTCAAAAAACCACAGAGACATCAAACCGAACCACTGGGGAATATAGAAAGGCTAATGTTTCTGGTGAATGTGAGTCTT
CAGAGAAAAGAAAACTCGACGCTGAGACTGATAATACATCTGTGAACGCAGGCGCAGAATCAGACGTAGTTCCTCCTCTA
GTGAAGAAAAAGAAAGTCAGCTACCGGGATGTTGCCGGAGAACTGCTCAAAGACTGGTGA