
Microexon ID | Sb_3:70154883-70154891:- |
Species | Sorghum Bicolor | Coordinates | 3:70154883..70154891 |
Microexon Cluster ID | MEP21 |
Size | 9 |
Phase | 1 |
Pfam Domain Motif | AP2 |
Structure of Microexon-tag (flanking exon, microexon, flanking exon sizes) | 49,9,50 |
Microexon location in the Microexon-tag | 2 |
Microexon-tag DNA Seq | TGGGAYAAYAGYWSYWGRARWGARRGYCARAVYARGAARGGAARRCAAGTTTAYYTRGGKGSWTATGAYRAKGARGARRMWGCWGCWMGRGCWTATGAYYTDGCWGCW |
Logo of Microexon-tag DNA Seq | ![]() |
Alignment of exons | ![]() |
Microexon DNA seq | TCTATTTAG |
Microexon Amino Acid seq | VYLG |
Microexon-tag DNA Seq | TGGGACAACAGTTGCAGAAGGGAAGGACAAACTCGCAAGGGTCGTCAAGTCTATTTAGGTGGCTATGATAAAGAGGAGAAAGCTGCTAGGGCTTATGATCTGGCTGCT |
Microexon-tag Amino Acid Seq | WDNSCRREGQTRKGRQVYLGGYDKEEKAARAYDLAA |
Microexon-tag spanning region | 70154708-70155092 |
Microexon-tag prediction score | 0.9694 |
Overlapped with the annotated transcript (%) | 100 |
New Transcript ID | KXG33917x |
Reference Transcript ID | KXG33917 |
Gene ID | SORBI_3003G390600 |
Gene Name | NA |
Transcript ID | KXG33917 |
Protein ID | |
Gene ID | SORBI_3003G390600 |
Gene Name | |
Pfam domain motif | AP2 |
Motif E-value | 1.80E-11 |
Motif start | 278 |
Motif end | 335 |
Protein seq | >KXG33917 MATVNNWLAFSLSPQELPPTQTDSTLISAATTDDVSGDVCFNIPQDWSMRGSELSALVAEPKLEDFLGGISFSEQHHKAN CNMIPSTSSTACYASSGATAGYHHQLYHQPTSSALHFADSVMVASSAGGVHDGGAMLSAASANGSAGAGAASANGSGSIG LSMIKNWLRSQPAPMQPRVAAAESVQGLSLSMNMAGATQGAAGMPLLAGERGRAPESVSTSAQGGAVVTAPKEDSGGSGV AATGALVAVSTDTGGSGASADNTARKTVDTFGQRTSIYRGVTRHRWTGRYEAHLWDNSCRREGQTRKGRQVYLGGYDKEE KAARAYDLAALKYWGPTTTTNFPVNNYEKELEDMKHMTRQEFVASLRRKSSGFSRGASIYRGVTRHHQHGRWQARIGRVA GNKDLYLGTFSTQEEAAEAYDIAAIKFRGLNAVTNFDMSRYDVKSILDSSALPIGSAAKRLKEAEAAASAQHHAGVVSYD VGRIASQLGDGGALAAAYGAHYHGAWPTIAFQPSAATGLYHPYAQPMRGWCKQEQDHAVIAAAHSLQELHHLNLGAAAGA HDFFSAGQQAAMHGLGSMDNASLEHSTGSNSVVYNGVGDSNGSTVVGSGGYMMPMSAATATATTAMVSHEQVHARAQGDH HDEAKQAAQMGYESYLVNAENYGGGRMSAAWATVSAPPAASSNDNMADVGHGGAQLFSVWNDT* |
CDS seq | >KXG33917 ATGGCTACTGTGAACAACTGGCTCGCTTTCTCCCTCTCCCCGCAGGAGCTGCCGCCCACCCAGACGGACTCCACCCTCAT CTCTGCCGCCACCACCGACGATGTCTCCGGCGATGTCTGCTTCAACATCCCCCAAGATTGGAGCATGAGGGGATCCGAGC TTTCGGCGCTCGTCGCCGAGCCGAAGCTGGAGGACTTCCTCGGCGGAATCTCCTTCTCCGAGCAGCACCACAAGGCCAAC TGCAACATGATCCCCAGCACTAGCAGCACAGCTTGCTACGCGAGCTCGGGTGCTACCGCCGGCTACCATCACCAGCTGTA CCACCAGCCCACCAGCTCCGCGCTCCACTTCGCTGACTCCGTCATGGTGGCCTCCTCGGCCGGCGGCGTCCACGACGGAG GTGCCATGCTCAGCGCGGCCAGCGCTAATGGTAGCGCTGGCGCTGGCGCTGCCAGTGCCAATGGCAGCGGCAGCATCGGG CTGTCCATGATCAAGAACTGGCTGCGGAGCCAACCAGCTCCCATGCAGCCGAGGGTGGCGGCGGCTGAGAGCGTGCAGGG GCTCTCTTTGTCCATGAACATGGCGGGGGCGACGCAAGGCGCCGCTGGCATGCCACTTCTTGCTGGAGAGCGCGGCCGGG CGCCCGAGAGTGTCTCGACGTCGGCACAGGGTGGAGCCGTCGTCACGGCTCCAAAGGAGGATAGCGGTGGCAGCGGTGTT GCCGCCACCGGCGCCCTAGTAGCCGTGAGCACGGACACGGGTGGCAGCGGCGCGTCGGCTGACAACACGGCAAGGAAGAC GGTGGACACGTTCGGGCAGCGCACGTCGATTTACCGTGGCGTGACAAGGCATAGATGGACTGGGAGATATGAAGCACATC TGTGGGACAACAGTTGCAGAAGGGAAGGACAAACTCGCAAGGGTCGTCAAGTCTATTTAGGTGGCTATGATAAAGAGGAG AAAGCTGCTAGGGCTTATGATCTGGCTGCTCTTAAGTACTGGGGTCCCACGACAACAACAAATTTTCCAGTGAATAACTA CGAAAAGGAGCTGGAGGATATGAAGCACATGACAAGGCAGGAGTTTGTAGCGTCTCTGAGAAGGAAGAGCAGTGGTTTCT CCAGAGGTGCATCCATTTACAGGGGAGTGACTAGGCATCACCAGCATGGAAGATGGCAAGCACGGATTGGACGAGTTGCA GGGAACAAGGATCTCTACTTGGGCACCTTCAGCACGCAGGAGGAGGCAGCGGAGGCATACGACATTGCGGCGATCAAGTT CCGCGGCCTCAACGCCGTCACAAACTTCGACATGAGCCGCTACGACGTCAAGAGCATCCTGGACAGCAGTGCGCTCCCCA TCGGCAGCGCCGCCAAGCGTCTCAAGGAGGCCGAGGCCGCCGCGTCCGCACAGCACCATGCCGGCGTGGTGAGCTACGAC GTCGGCCGCATAGCCTCACAGCTCGGCGACGGCGGCGCCCTGGCGGCGGCGTACGGCGCGCACTACCATGGCGCCTGGCC GACCATCGCGTTCCAGCCGAGCGCGGCCACGGGCCTGTACCACCCGTACGCGCAGCCGATGCGCGGGTGGTGCAAGCAGG AGCAGGACCACGCGGTGATCGCGGCCGCGCACAGCCTGCAGGAGCTCCACCACCTGAACCTGGGTGCTGCCGCCGGCGCG CACGACTTCTTCTCGGCGGGGCAGCAGGCGGCGATGCACGGCCTGGGTAGCATGGACAATGCATCACTCGAGCACAGCAC CGGCTCCAACTCCGTCGTGTACAACGGTGTTGGTGATAGCAACGGCAGCACCGTCGTCGGCAGTGGTGGCTACATGATGC CTATGAGCGCTGCCACGGCGACGGCTACCACGGCAATGGTGAGCCACGAGCAGGTGCATGCACGGGCACAGGGTGATCAC CACGACGAAGCCAAGCAGGCTGCTCAGATGGGGTACGAGAGCTACCTGGTGAACGCAGAGAACTATGGCGGCGGGAGGAT GTCTGCGGCCTGGGCGACTGTCTCAGCGCCACCGGCGGCAAGCAGCAACGATAACATGGCGGACGTCGGCCATGGCGGCG CACAGCTCTTCAGTGTCTGGAACGATACTTAA |
Sb_3:70154883-70154891:- does not have available information here.