Microexon ID At_4:16543522-16543535:+
Species Arabidopsis thaliana
Coordinates 4:16543522..16543535
Microexon Cluster ID Unclassified
Size 14
At_4:16543522-16543535:+ does not have available information here.
At_4:16543522-16543535:+ does not have available information here.
Microexon DNA seq GTTTATTTCTGCAG
Microexon Amino Acid seq GLFLQ
Microexon-tag DNA Seq TCAGAGGTTTTGACACCAGATTGGGAGGCGATTTCCAATTCAATGGGTTTATTTCTGCAGAAAACAAACATTATCAAAGATTATCTTGAAGACATTAATGAGAGACCA
Microexon-tag Amino Acid seq SEVLTPDWEAISNSMGLFLQKTNIIKDYLEDINERP
Transcript ID At.21038.1
Gene ID At.21038
Gene Name SQS2
Pfam domain motif SQS_PSY
Motif E-value 1.7e-42
Motif start 45
Motif end 316
Protein seq >At.21038.1
MGSLGTMLRYPDDIYPLLKMKRAIEKAEKQIPPEPHWGFCYSMLHKVSRSFSLVIQQLNTELRNAVCVFYLVLRALDTVE
DDTSIPTDEKVPILIAFHRHIYDTDWHYSCGTKEYKILMDQFHHVSAAFLELEKGYQEAIEEITRRMGAGMAKFICQEVE
TVDDYDEYCHYVAGLVGLGLSKLFLAAGSEVLTPDWEAISNSMGLFLQKTNIIKDYLEDINERPKSRMFWPREIWGKYVD
KLEDFKNEEKATKAVQCLNEMVTNALNHVEDCLKSLASLRDPAIFQSCAIPQIVAIGTLALCYNNVQVFRGVVRLRRGLI
AKVIDRTKTMDDVYGAFYDFSCMLQTKVDNNDPNAMKTLNRLETIKKFCKENGGLHKRKSYVNDETQSKAIFVVMFVLLL
AIVVVYLKANQCK*
CDS seq >At.21038.1
ATGGGGAGCTTGGGGACGATGCTGAGATATCCGGATGACATATATCCGCTCCTGAAGATGAAACGAGCGATTGAGAAAGC
GGAGAAGCAGATCCCTCCTGAGCCACACTGGGGTTTCTGCTATTCGATGCTCCACAAGGTTTCTCGAAGCTTTTCTCTCG
TTATTCAGCAACTCAACACCGAGCTCCGTAACGCCGTGTGTGTGTTCTACTTGGTTCTCCGAGCTCTTGATACTGTTGAG
GATGATACTAGCATACCAACTGATGAAAAGGTTCCCATCCTGATAGCTTTTCACCGGCACATATACGATACTGATTGGCA
TTATTCATGTGGTACGAAGGAGTACAAGATTCTAATGGACCAATTTCACCATGTTTCTGCAGCTTTTTTGGAACTTGAAA
AAGGGTATCAAGAGGCTATCGAGGAAATTACTAGAAGAATGGGTGCAGGGATGGCCAAGTTTATCTGCCAAGAGGTAGAA
ACTGTTGATGACTACGATGAATACTGCCACTATGTTGCTGGGCTTGTTGGTTTAGGTTTGTCGAAACTCTTCCTCGCTGC
AGGATCAGAGGTTTTGACACCAGATTGGGAGGCGATTTCCAATTCAATGGGTTTATTTCTGCAGAAAACAAACATTATCA
AAGATTATCTTGAAGACATTAATGAGAGACCAAAGTCGCGCATGTTTTGGCCTCGTGAGATTTGGGGAAAATATGTTGAC
AAACTTGAGGACTTCAAAAATGAGGAGAAAGCTACAAAAGCAGTGCAGTGTTTGAATGAAATGGTCACTAATGCATTGAA
TCATGTTGAAGATTGTTTGAAATCCTTGGCTTCACTGCGTGATCCTGCAATATTTCAGTCTTGCGCCATCCCTCAGATCG
TGGCGATTGGAACACTTGCGTTATGCTATAACAATGTACAAGTGTTTAGAGGTGTCGTGAGATTGAGACGAGGTCTAATA
GCTAAAGTCATTGATCGCACAAAGACAATGGATGATGTCTACGGTGCGTTCTATGATTTTTCTTGCATGCTACAAACAAA
GGTTGACAATAACGATCCAAATGCTATGAAAACATTAAACCGACTCGAAACCATCAAGAAATTTTGCAAAGAAAATGGAG
GACTTCACAAAAGAAAATCTTATGTTAACGATGAAACACAATCCAAGGCTATCTTTGTTGTAATGTTTGTGCTTCTACTG
GCCATAGTCGTTGTATATCTCAAAGCAAACCAATGTAAGTGA