Microexon ID At_4:6183341-6183355:+
Species Arabidopsis thaliana
Coordinates 4:6183341..6183355
Microexon Cluster ID MEP42
Size 15
Phase 0
Pfam Domain Motif bHLH-MYC_N
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) 48,15,45
Microexon location in the Microexon-tag 2
Microexon-tag DNA Seq GCWMAYKATGCMGAYAGYAAARTYTTYTCTCGMKCTHTKCTWGCMAAGAGTGCWTSWATTCAGACWGTGGTRTGYWTTCCYYWYMTRGRYGGYGTBRTTGARCTWGGY
Logo of Microexon-tag DNA Seq NT60 Logo
Alignment of exons MSA
Microexon DNA seq AGTGCTAAAATTCAG
Microexon Amino Acid seq SAKIQ
Microexon-tag DNA Seq GCAAATGAAGTTGACAGTAAAACTTTTTCTAGAGCTATTCTCGCTAAGAGTGCTAAAATTCAGACAGTGGTTTGCATTCCAATGCTTGATGGTGTTGTGGAACTAGGC
Microexon-tag Amino Acid Seq ANEVDSKTFSRAILAKSAKIQTVVCIPMLDGVVELG
Microexon-tag spanning region6183105-6183482
Microexon-tag prediction score0.9292
Overlapped with the annotated transcript (%) 100
New Transcript ID AT4G09820.1x
Reference Transcript ID AT4G09820.1
Gene ID AT4G09820
Gene Name TT8
Transcript ID AT4G09820.1
Protein ID AT4G09820.1
Gene ID AT4G09820
Gene Name TT8
Pfam domain motif bHLH-MYC_N
Motif E-value 5.9e-51
Motif start 20
Motif end 201
Protein seq >AT4G09820.1
MDESSIIPAEKVAGAEKKELQGLLKTAVQSVDWTYSVFWQFCPQQRVLVWGNGYYNGAIKTRKTTQPAEVTAEEAALERS
QQLRELYETLLAGESTSEARACTALSPEDLTETEWFYLMCVSFSFPPPSGMPGKAYARRKHVWLSGANEVDSKTFSRAIL
AKSAKIQTVVCIPMLDGVVELGTTKKVREDVEFVELTKSFFYDHCKTNPKPALSEHSTYEVHEEAEDEEEVEEEMTMSEE
MRLGSPDDEDVSNQNLHSDLHIESTHTLDTHMDMMNLMEEGGNYSQTVTTLLMSHPTSLLSDSVSTSSYIQSSFATWRVE
NGKEHQQVKTAPSSQWVLKQMIFRVPFLHDNTKDKRLPREDLSHVVAERRRREKLNEKFITLRSMVPFVTKMDKVSILGD
TIAYVNHLRKRVHELENTHHEQQHKRTRTCKRKTSEEVEVSIIENDVLLEMRCEYRDGLLLDILQVLHELGIETTAVHTS
VNDHDFEAEIRAKVRGKKASIAEVKRAIHQVIIHDTNL*
CDS seq >AT4G09820.1
ATGGATGAATCAAGTATTATTCCGGCAGAGAAAGTGGCCGGAGCTGAGAAAAAAGAGCTTCAAGGGCTGCTTAAGACGGC
GGTTCAATCTGTGGACTGGACTTATAGTGTCTTCTGGCAATTTTGTCCTCAACAACGGGTCTTGGTGTGGGGGAATGGAT
ACTACAACGGTGCAATAAAGACGAGGAAGACAACTCAACCAGCGGAGGTGACGGCAGAAGAGGCGGCGTTAGAGAGGAGC
CAACAGCTCAGGGAGCTTTATGAGACACTTTTAGCCGGAGAGTCAACGTCAGAAGCAAGAGCATGCACCGCATTGTCACC
GGAGGATTTGACGGAGACAGAATGGTTTTATCTAATGTGCGTCTCTTTCTCTTTTCCTCCTCCATCTGGGATGCCAGGAA
AAGCGTATGCAAGGAGGAAGCACGTATGGCTAAGTGGTGCAAATGAAGTTGACAGTAAAACTTTTTCTAGAGCTATTCTC
GCTAAGAGTGCTAAAATTCAGACAGTGGTTTGCATTCCAATGCTTGATGGTGTTGTGGAACTAGGCACAACGAAAAAGGT
AAGAGAAGATGTAGAGTTTGTTGAGCTCACAAAGAGTTTCTTCTATGACCACTGCAAGACGAACCCAAAGCCGGCTCTTT
CTGAACACTCCACCTACGAAGTGCATGAAGAAGCCGAAGACGAAGAAGAAGTAGAAGAAGAGATGACAATGTCAGAGGAA
ATGAGGCTTGGCTCTCCTGATGATGAAGATGTTTCCAATCAAAATCTACACTCTGATCTTCATATTGAATCAACCCATAC
GTTAGACACACATATGGACATGATGAATCTAATGGAGGAAGGTGGAAACTATTCTCAGACAGTAACAACACTTCTCATGT
CACACCCCACAAGTCTTCTTTCAGATTCAGTTTCCACATCTTCTTACATCCAATCATCGTTTGCCACGTGGAGGGTTGAG
AATGGCAAAGAGCATCAGCAAGTGAAAACGGCGCCGTCGTCACAATGGGTGCTCAAACAAATGATCTTCAGAGTTCCTTT
CCTCCATGACAACACTAAAGATAAGAGGCTACCGCGGGAAGATCTGAGCCACGTAGTAGCAGAGCGACGCAGGAGGGAGA
AGCTGAACGAGAAATTCATAACGTTGAGATCAATGGTTCCATTTGTGACCAAGATGGATAAAGTCTCAATCCTTGGAGAC
ACCATTGCGTACGTAAATCATCTTCGAAAGAGGGTCCATGAGCTTGAGAATACTCATCATGAGCAACAGCATAAGCGGAC
GCGTACTTGTAAGAGAAAAACATCGGAGGAGGTGGAGGTTTCCATCATAGAGAATGATGTTTTGTTAGAGATGAGATGTG
AGTACCGAGATGGTTTGTTGCTTGACATTCTTCAGGTTCTTCATGAGCTTGGTATAGAGACTACGGCAGTTCATACCTCG
GTGAACGACCATGATTTCGAGGCGGAGATAAGGGCGAAAGTAAGAGGGAAGAAAGCAAGCATCGCTGAGGTCAAAAGAGC
CATCCACCAAGTCATAATACATGATACTAATCTATAG
Microexon DNA seq AGTGCTAAAATTCAG
Microexon Amino Acid seq SAKIQ
Microexon-tag DNA Seq GCAAATGAAGTTGACAGTAAAACTTTTTCTAGAGCTATTCTCGCTAAGAGTGCTAAAATTCAGACAGTGGTTTGCATTCCAATGCTTGATGGTGTTGTGGAACTAGGC
Microexon-tag Amino Acid seq ANEVDSKTFSRAILAKSAKIQTVVCIPMLDGVVELG
Transcript ID AT4G09820.1
Gene ID At.18240
Gene Name TT8
Pfam domain motif bHLH-MYC_N
Motif E-value 5.9e-51
Motif start 20
Motif end 201
Protein seq >AT4G09820.1
MDESSIIPAEKVAGAEKKELQGLLKTAVQSVDWTYSVFWQFCPQQRVLVWGNGYYNGAIKTRKTTQPAEVTAEEAALERS
QQLRELYETLLAGESTSEARACTALSPEDLTETEWFYLMCVSFSFPPPSGMPGKAYARRKHVWLSGANEVDSKTFSRAIL
AKSAKIQTVVCIPMLDGVVELGTTKKVREDVEFVELTKSFFYDHCKTNPKPALSEHSTYEVHEEAEDEEEVEEEMTMSEE
MRLGSPDDEDVSNQNLHSDLHIESTHTLDTHMDMMNLMEEGGNYSQTVTTLLMSHPTSLLSDSVSTSSYIQSSFATWRVE
NGKEHQQVKTAPSSQWVLKQMIFRVPFLHDNTKDKRLPREDLSHVVAERRRREKLNEKFITLRSMVPFVTKMDKVSILGD
TIAYVNHLRKRVHELENTHHEQQHKRTRTCKRKTSEEVEVSIIENDVLLEMRCEYRDGLLLDILQVLHELGIETTAVHTS
VNDHDFEAEIRAKVRGKKASIAEVKRAIHQVIIHDTNL*
CDS seq >AT4G09820.1
ATGGATGAATCAAGTATTATTCCGGCAGAGAAAGTGGCCGGAGCTGAGAAAAAAGAGCTTCAAGGGCTGCTTAAGACGGC
GGTTCAATCTGTGGACTGGACTTATAGTGTCTTCTGGCAATTTTGTCCTCAACAACGGGTCTTGGTGTGGGGGAATGGAT
ACTACAACGGTGCAATAAAGACGAGGAAGACAACTCAACCAGCGGAGGTGACGGCAGAAGAGGCGGCGTTAGAGAGGAGC
CAACAGCTCAGGGAGCTTTATGAGACACTTTTAGCCGGAGAGTCAACGTCAGAAGCAAGAGCATGCACCGCATTGTCACC
GGAGGATTTGACGGAGACAGAATGGTTTTATCTAATGTGCGTCTCTTTCTCTTTTCCTCCTCCATCTGGGATGCCAGGAA
AAGCGTATGCAAGGAGGAAGCACGTATGGCTAAGTGGTGCAAATGAAGTTGACAGTAAAACTTTTTCTAGAGCTATTCTC
GCTAAGAGTGCTAAAATTCAGACAGTGGTTTGCATTCCAATGCTTGATGGTGTTGTGGAACTAGGCACAACGAAAAAGGT
AAGAGAAGATGTAGAGTTTGTTGAGCTCACAAAGAGTTTCTTCTATGACCACTGCAAGACGAACCCAAAGCCGGCTCTTT
CTGAACACTCCACCTACGAAGTGCATGAAGAAGCCGAAGACGAAGAAGAAGTAGAAGAAGAGATGACAATGTCAGAGGAA
ATGAGGCTTGGCTCTCCTGATGATGAAGATGTTTCCAATCAAAATCTACACTCTGATCTTCATATTGAATCAACCCATAC
GTTAGACACACATATGGACATGATGAATCTAATGGAGGAAGGTGGAAACTATTCTCAGACAGTAACAACACTTCTCATGT
CACACCCCACAAGTCTTCTTTCAGATTCAGTTTCCACATCTTCTTACATCCAATCATCGTTTGCCACGTGGAGGGTTGAG
AATGGCAAAGAGCATCAGCAAGTGAAAACGGCGCCGTCGTCACAATGGGTGCTCAAACAAATGATCTTCAGAGTTCCTTT
CCTCCATGACAACACTAAAGATAAGAGGCTACCGCGGGAAGATCTGAGCCACGTAGTAGCAGAGCGACGCAGGAGGGAGA
AGCTGAACGAGAAATTCATAACGTTGAGATCAATGGTTCCATTTGTGACCAAGATGGATAAAGTCTCAATCCTTGGAGAC
ACCATTGCGTACGTAAATCATCTTCGAAAGAGGGTCCATGAGCTTGAGAATACTCATCATGAGCAACAGCATAAGCGGAC
GCGTACTTGTAAGAGAAAAACATCGGAGGAGGTGGAGGTTTCCATCATAGAGAATGATGTTTTGTTAGAGATGAGATGTG
AGTACCGAGATGGTTTGTTGCTTGACATTCTTCAGGTTCTTCATGAGCTTGGTATAGAGACTACGGCAGTTCATACCTCG
GTGAACGACCATGATTTCGAGGCGGAGATAAGGGCGAAAGTAAGAGGGAAGAAAGCAAGCATCGCTGAGGTCAAAAGAGC
CATCCACCAAGTCATAATACATGATACTAATCTATAG