
Microexon ID | At_4:16543522-16543535:+ |
Species | Arabidopsis thaliana | Coordinates | 4:16543522..16543535 |
Microexon Cluster ID | Unclassified |
Size | 14 |
At_4:16543522-16543535:+ does not have available information here.
At_4:16543522-16543535:+ does not have available information here.
Microexon DNA seq | GTTTATTTCTGCAG |
Microexon Amino Acid seq | GLFLQ |
Microexon-tag DNA Seq | TCAGAGGTTTTGACACCAGATTGGGAGGCGATTTCCAATTCAATGGGTTTATTTCTGCAGAAAACAAACATTATCAAAGATTATCTTGAAGACATTAATGAGAGACCA |
Microexon-tag Amino Acid seq | SEVLTPDWEAISNSMGLFLQKTNIIKDYLEDINERP |
Transcript ID | At.21038.1 |
Gene ID | At.21038 |
Gene Name | SQS2 |
Pfam domain motif | SQS_PSY |
Motif E-value | 1.7e-42 |
Motif start | 45 |
Motif end | 316 |
Protein seq | >At.21038.1 MGSLGTMLRYPDDIYPLLKMKRAIEKAEKQIPPEPHWGFCYSMLHKVSRSFSLVIQQLNTELRNAVCVFYLVLRALDTVE DDTSIPTDEKVPILIAFHRHIYDTDWHYSCGTKEYKILMDQFHHVSAAFLELEKGYQEAIEEITRRMGAGMAKFICQEVE TVDDYDEYCHYVAGLVGLGLSKLFLAAGSEVLTPDWEAISNSMGLFLQKTNIIKDYLEDINERPKSRMFWPREIWGKYVD KLEDFKNEEKATKAVQCLNEMVTNALNHVEDCLKSLASLRDPAIFQSCAIPQIVAIGTLALCYNNVQVFRGVVRLRRGLI AKVIDRTKTMDDVYGAFYDFSCMLQTKVDNNDPNAMKTLNRLETIKKFCKENGGLHKRKSYVNDETQSKAIFVVMFVLLL AIVVVYLKANQCK* |
CDS seq | >At.21038.1 ATGGGGAGCTTGGGGACGATGCTGAGATATCCGGATGACATATATCCGCTCCTGAAGATGAAACGAGCGATTGAGAAAGC GGAGAAGCAGATCCCTCCTGAGCCACACTGGGGTTTCTGCTATTCGATGCTCCACAAGGTTTCTCGAAGCTTTTCTCTCG TTATTCAGCAACTCAACACCGAGCTCCGTAACGCCGTGTGTGTGTTCTACTTGGTTCTCCGAGCTCTTGATACTGTTGAG GATGATACTAGCATACCAACTGATGAAAAGGTTCCCATCCTGATAGCTTTTCACCGGCACATATACGATACTGATTGGCA TTATTCATGTGGTACGAAGGAGTACAAGATTCTAATGGACCAATTTCACCATGTTTCTGCAGCTTTTTTGGAACTTGAAA AAGGGTATCAAGAGGCTATCGAGGAAATTACTAGAAGAATGGGTGCAGGGATGGCCAAGTTTATCTGCCAAGAGGTAGAA ACTGTTGATGACTACGATGAATACTGCCACTATGTTGCTGGGCTTGTTGGTTTAGGTTTGTCGAAACTCTTCCTCGCTGC AGGATCAGAGGTTTTGACACCAGATTGGGAGGCGATTTCCAATTCAATGGGTTTATTTCTGCAGAAAACAAACATTATCA AAGATTATCTTGAAGACATTAATGAGAGACCAAAGTCGCGCATGTTTTGGCCTCGTGAGATTTGGGGAAAATATGTTGAC AAACTTGAGGACTTCAAAAATGAGGAGAAAGCTACAAAAGCAGTGCAGTGTTTGAATGAAATGGTCACTAATGCATTGAA TCATGTTGAAGATTGTTTGAAATCCTTGGCTTCACTGCGTGATCCTGCAATATTTCAGTCTTGCGCCATCCCTCAGATCG TGGCGATTGGAACACTTGCGTTATGCTATAACAATGTACAAGTGTTTAGAGGTGTCGTGAGATTGAGACGAGGTCTAATA GCTAAAGTCATTGATCGCACAAAGACAATGGATGATGTCTACGGTGCGTTCTATGATTTTTCTTGCATGCTACAAACAAA GGTTGACAATAACGATCCAAATGCTATGAAAACATTAAACCGACTCGAAACCATCAAGAAATTTTGCAAAGAAAATGGAG GACTTCACAAAAGAAAATCTTATGTTAACGATGAAACACAATCCAAGGCTATCTTTGTTGTAATGTTTGTGCTTCTACTG GCCATAGTCGTTGTATATCTCAAAGCAAACCAATGTAAGTGA |