Microexon ID At_4:218545-218559:-
Species Arabidopsis thaliana
Coordinates 4:218545..218559
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AGCGCATCAATTCAG
Microexon Amino Acid seq SASIQ
Microexon-tag DNA Seq GCTCAATATGCGGAGAACAAGCTCTTCTCTCGTTCTTTGTTAGCAAGAAGCGCATCAATTCAGACTGTTGTGTGTTTCCCTTACTTGGGCGGAGTCATTGAGCTGGGC
Microexon-tag Amino Acid Seq AQYAENKLFSRSLLARSASIQTVVCFPYLGGVIELG
Microexon-tag spanning region218356-218732
Microexon-tag prediction score0.9339
Overlapped with the annotated transcript (%) 100
New Transcript ID AT4G00480.1x
Reference Transcript ID AT4G00480.1
Gene ID AT4G00480
Gene Name ATMYC1
Transcript ID AT4G00480.1
Protein ID AT4G00480.1
Gene ID AT4G00480
Gene Name ATMYC1
Pfam domain motif bHLH-MYC_N
Motif E-value 9.9e-50
Motif start 23
Motif end 216
Protein seq >AT4G00480.1
MSLTMADGVEAAAGRSKRQNSLLRKQLALAVRSVQWSYAIFWSSSLTQPGVLEWGEGCYNGDMKKRKKSYESHYKYGLQK
SKELRKLYLSMLEGDSGTTVSTTHDNLNDDDDNCHSTSMMLSPDDLSDEEWYYLVSMSYVFSPSQCLPGRASATGETIWL
CNAQYAENKLFSRSLLARSASIQTVVCFPYLGGVIELGVTELISEDHNLLRNIKSCLMEISAHQDNDDEKKMEIKISEEK
HQLPLGISDEDLHYKRTISTVLNYSADRSGKNDKNIRHRQPNIVTSEPGSSFLRWKQCEQQVSGFVQKKKSQNVLRKILH
DVPLMHTKRMFPSQNSGLNQDDPSDRRKENEKFSVLRTMVPTVNEVDKESILNNTIKYLQELEARVEELESCMGSVNFVE
RQRKTTENLNDSVLIEETSGNYDDSTKIDDNSGETEQVTVFRDKTHLRVKLKETEVVIEVRCSYRDYIVADIMETLSNLH
MDAFSVRSHTLNKFLTLNLKAKFRGAAVASVGMIKRELRRVIGDLF*
CDS seq >AT4G00480.1
ATGTCTTTGACAATGGCTGATGGTGTAGAAGCTGCAGCAGGAAGAAGTAAAAGACAAAACAGCTTATTAAGAAAACAACT
TGCTTTAGCTGTAAGAAGTGTTCAATGGAGCTACGCAATCTTCTGGTCGTCTTCACTTACTCAACCTGGGGTTTTGGAGT
GGGGAGAAGGATGTTACAATGGAGATATGAAGAAGAGGAAGAAGAGTTATGAATCTCATTATAAATATGGGTTGCAAAAA
AGCAAGGAGCTTCGGAAACTTTATTTGTCTATGCTTGAAGGAGACAGTGGTACTACTGTTAGTACTACTCATGATAATCT
CAATGATGATGATGATAATTGTCACAGTACAAGTATGATGCTGTCACCAGATGACCTCTCTGATGAAGAGTGGTACTATT
TAGTCTCCATGTCCTATGTCTTCTCTCCTTCACAATGTTTGCCTGGAAGAGCTTCAGCGACGGGTGAGACCATATGGCTC
TGCAACGCTCAATATGCGGAGAACAAGCTCTTCTCTCGTTCTTTGTTAGCAAGAAGCGCATCAATTCAGACTGTTGTGTG
TTTCCCTTACTTGGGCGGAGTCATTGAGCTGGGCGTCACTGAATTGATTTCAGAAGACCATAACCTGCTTCGAAACATCA
AATCTTGCTTGATGGAAATATCTGCACACCAAGACAACGATGACGAGAAGAAGATGGAGATTAAGATCAGTGAAGAGAAG
CATCAGCTTCCATTAGGTATTTCTGATGAAGACTTGCATTACAAAAGAACCATTTCAACAGTACTCAACTACTCCGCAGA
TAGATCAGGTAAGAACGATAAGAACATTCGTCATCGTCAGCCAAATATTGTTACTTCTGAACCTGGCTCAAGTTTCTTGC
GGTGGAAGCAATGTGAGCAGCAAGTCTCGGGTTTTGTTCAGAAAAAAAAGTCACAGAATGTGTTGCGGAAGATATTGCAT
GATGTCCCTTTGATGCACACAAAGAGAATGTTCCCAAGTCAGAACTCTGGTCTGAATCAAGATGATCCTTCAGATAGAAG
AAAAGAGAACGAAAAGTTCAGTGTCCTTAGAACTATGGTTCCCACTGTCAACGAGGTTGATAAAGAATCGATACTAAACA
ACACAATCAAGTACCTGCAAGAACTGGAGGCAAGAGTAGAAGAGCTAGAATCTTGTATGGGATCAGTTAATTTTGTAGAA
AGACAAAGAAAGACGACAGAGAACCTTAACGACTCTGTGTTGATCGAAGAGACATCAGGGAACTACGATGATAGCACGAA
GATCGATGACAATTCAGGAGAAACCGAACAAGTCACTGTTTTCAGAGATAAGACACATTTGAGAGTTAAACTCAAAGAAA
CAGAAGTTGTGATCGAAGTAAGATGTTCTTACAGAGACTACATAGTTGCGGACATCATGGAAACTCTGAGCAATCTTCAC
ATGGATGCTTTCTCTGTTAGATCTCACACGCTCAATAAGTTCCTCACATTGAATCTCAAGGCCAAGTTTCGCGGGGCTGC
AGTTGCGTCCGTAGGAATGATTAAGCGAGAGCTGAGAAGAGTCATTGGTGATTTGTTTTAA
Microexon DNA seq AGCGCATCAATTCAG
Microexon Amino Acid seq SASIQ
Microexon-tag DNA Seq GCTCAATATGCGGAGAACAAGCTCTTCTCTCGTTCTTTGTTAGCAAGAAGCGCATCAATTCAGACTGTTGTGTGTTTCCCTTACTTGGGCGGAGTCATTGAGCTGGGC
Microexon-tag Amino Acid seq AQYAENKLFSRSLLARSASIQTVVCFPYLGGVIELG
Transcript ID AT4G00480.2
Gene ID At.17503
Gene Name ATMYC1
Pfam domain motif bHLH-MYC_N
Motif E-value 1.2e-49
Motif start 23
Motif end 216
Protein seq >AT4G00480.2
MSLTMADGVEAAAGRSKRQNSLLRKQLALAVRSVQWSYAIFWSSSLTQPGVLEWGEGCYNGDMKKRKKSYESHYKYGLQK
SKELRKLYLSMLEGDSGTTVSTTHDNLNDDDDNCHSTSMMLSPDDLSDEEWYYLVSMSYVFSPSQCLPGRASATGETIWL
CNAQYAENKLFSRSLLARSASIQTVVCFPYLGGVIELGVTELISEDHNLLRNIKSCLMEISAHQDNDDEKKMEIKISEEK
HQLPLGISDEDLHYKRTISTVLNYSADRSGKNDKNIRHRQPNIVTSEPGSSFLRWKQCEQQVSGFVQKKKSQNVLRKILH
DVPLMHTKRMFPSQNSGLNQDDPSDRRKENEKFSVLRTMVPTVNEVDKESILNNTIKYLQELEARVEELESCMGSVNFVE
RQRKTTENLNDSVLIEETSGNYDDSTKIDDNSGETEQVTVFRDKTHLRVKLKETEVVIEVRCSYRDYIVADIMETLSNLH
MDAFSVRSHTLNKFLTLNLKAKFRGAAVASVGMIKRELRRVIDFREPICDVPLSLHQVFRVFVCKVCQSLVGIFDNVVSS
SSTKPRSILIHNSWAICIFH*
CDS seq >AT4G00480.2
ATGTCTTTGACAATGGCTGATGGTGTAGAAGCTGCAGCAGGAAGAAGTAAAAGACAAAACAGCTTATTAAGAAAACAACT
TGCTTTAGCTGTAAGAAGTGTTCAATGGAGCTACGCAATCTTCTGGTCGTCTTCACTTACTCAACCTGGGGTTTTGGAGT
GGGGAGAAGGATGTTACAATGGAGATATGAAGAAGAGGAAGAAGAGTTATGAATCTCATTATAAATATGGGTTGCAAAAA
AGCAAGGAGCTTCGGAAACTTTATTTGTCTATGCTTGAAGGAGACAGTGGTACTACTGTTAGTACTACTCATGATAATCT
CAATGATGATGATGATAATTGTCACAGTACAAGTATGATGCTGTCACCAGATGACCTCTCTGATGAAGAGTGGTACTATT
TAGTCTCCATGTCCTATGTCTTCTCTCCTTCACAATGTTTGCCTGGAAGAGCTTCAGCGACGGGTGAGACCATATGGCTC
TGCAACGCTCAATATGCGGAGAACAAGCTCTTCTCTCGTTCTTTGTTAGCAAGAAGCGCATCAATTCAGACTGTTGTGTG
TTTCCCTTACTTGGGCGGAGTCATTGAGCTGGGCGTCACTGAATTGATTTCAGAAGACCATAACCTGCTTCGAAACATCA
AATCTTGCTTGATGGAAATATCTGCACACCAAGACAACGATGACGAGAAGAAGATGGAGATTAAGATCAGTGAAGAGAAG
CATCAGCTTCCATTAGGTATTTCTGATGAAGACTTGCATTACAAAAGAACCATTTCAACAGTACTCAACTACTCCGCAGA
TAGATCAGGTAAGAACGATAAGAACATTCGTCATCGTCAGCCAAATATTGTTACTTCTGAACCTGGCTCAAGTTTCTTGC
GGTGGAAGCAATGTGAGCAGCAAGTCTCGGGTTTTGTTCAGAAAAAAAAGTCACAGAATGTGTTGCGGAAGATATTGCAT
GATGTCCCTTTGATGCACACAAAGAGAATGTTCCCAAGTCAGAACTCTGGTCTGAATCAAGATGATCCTTCAGATAGAAG
AAAAGAGAACGAAAAGTTCAGTGTCCTTAGAACTATGGTTCCCACTGTCAACGAGGTTGATAAAGAATCGATACTAAACA
ACACAATCAAGTACCTGCAAGAACTGGAGGCAAGAGTAGAAGAGCTAGAATCTTGTATGGGATCAGTTAATTTTGTAGAA
AGACAAAGAAAGACGACAGAGAACCTTAACGACTCTGTGTTGATCGAAGAGACATCAGGGAACTACGATGATAGCACGAA
GATCGATGACAATTCAGGAGAAACCGAACAAGTCACTGTTTTCAGAGATAAGACACATTTGAGAGTTAAACTCAAAGAAA
CAGAAGTTGTGATCGAAGTAAGATGTTCTTACAGAGACTACATAGTTGCGGACATCATGGAAACTCTGAGCAATCTTCAC
ATGGATGCTTTCTCTGTTAGATCTCACACGCTCAATAAGTTCCTCACATTGAATCTCAAGGCCAAGTTTCGCGGGGCTGC
AGTTGCGTCCGTAGGAATGATTAAGCGAGAGCTGAGAAGAGTCATTGACTTTCGTGAACCGATATGCGATGTGCCATTAT
CTTTACATCAAGTTTTCAGGGTTTTTGTATGTAAAGTTTGCCAAAGTTTGGTTGGAATTTTCGACAACGTTGTCTCTTCC
TCTTCTACAAAACCAAGATCTATACTTATTCATAATTCATGGGCGATTTGTATCTTTCATTAA