
Microexon ID | At_1:17118337-17118350:+ |
Species | Arabidopsis thaliana | Coordinates | 1:17118337..17118350 |
Microexon Cluster ID | Unclassified |
Size | 14 |
At_1:17118337-17118350:+ does not have available information here.
Transcript ID | AT1G45191.4 |
Protein ID | AT1G45191.4 |
Gene ID | AT1G45191 |
Gene Name | BGLU1 |
Pfam domain motif | Glyco_hydro_1 |
Motif E-value | 2e-140 |
Motif start | 32 |
Motif end | 484 |
Protein seq | >AT1G45191.4 MEDVLTLITMIVLLLLAFHGFGKCSSDLYSRSDFPEGFVFGAGISAYQWEGAVDEDGRKPSVWDTFLHCRKMDNGDIACD GYHKYKEDVQLMAETGLHTFRFSISWSRLISNGRGSINPKGLQFYKNFIQELVKHGIEPHVTLHHYDFPQYLEDDYGGWT NRKIIKDFTAYADVCFREFGNHVKFWTTINEANIFTIGGYNDGNSPPGRCSFPGRNCTLGNSSTETYIVGHNLLLAHASV SRLYKQKYKDIQGGSVGFSLFAMNFTPSTNSKDDEIATKRANDFYLGWMLEPLIYGDYPDVMKRTIGSRLPVFSKEESEQ VKGSSDFIGVIHYLTALVTNIDINPSLSGIPDFNSDMGESINILSMRSVVSPWAMEGILEYIKQSYGNPPVYILENGKTM NQDLELQQKDTPRIEYLDAYIGAVLKAVRNGSDTRGYFVWSFMDLYELLNGYKSSFGLYSVNFSDPHRKRSPKLSAHWYS GFLKGKPTFLGSQGITQLHSNFSSSR* |
CDS seq | >AT1G45191.4 ATGGAAGATGTTTTGACTCTCATTACCATGATTGTGTTGCTTCTTCTAGCTTTCCATGGATTTGGAAAATGCAGTAGCGA TCTTTACAGCAGGAGCGATTTCCCGGAAGGCTTCGTTTTCGGAGCCGGGATATCTGCTTATCAGTGGGAAGGAGCTGTTG ATGAAGATGGGAGGAAACCTAGCGTCTGGGATACTTTCCTTCACTGTCGTAAAATGGATAATGGAGACATAGCTTGTGAT GGATATCACAAGTATAAGGAAGATGTGCAGCTCATGGCCGAAACTGGCTTACATACATTCAGATTCTCCATCTCTTGGTC TAGACTCATCTCTAATGGAAGAGGTTCCATTAACCCGAAAGGTCTACAGTTCTACAAGAATTTCATTCAAGAACTTGTCA AACATGGAATTGAGCCACATGTTACACTACATCACTACGATTTTCCTCAATATCTCGAGGATGACTATGGAGGCTGGACC AACCGCAAAATCATCAAAGACTTTACCGCTTATGCAGATGTTTGCTTTAGAGAGTTTGGGAACCACGTCAAATTCTGGAC CACGATCAACGAGGCTAATATATTTACTATTGGAGGTTACAACGATGGGAATTCACCGCCTGGTCGTTGCTCCTTTCCGG GCAGAAACTGCACGTTAGGGAACTCTTCCACTGAAACATATATCGTAGGCCATAACTTGCTGCTTGCTCACGCCTCTGTT TCAAGACTATATAAGCAAAAGTACAAGGATATACAAGGAGGTTCTGTTGGATTTAGTTTATTTGCAATGAATTTTACTCC TTCTACAAACTCCAAGGATGATGAAATCGCAACGAAAAGAGCCAACGATTTCTACCTCGGATGGATGCTTGAGCCTCTTA TATATGGAGACTATCCTGATGTGATGAAAAGAACCATTGGATCAAGACTGCCAGTTTTCTCGAAGGAAGAATCAGAACAA GTGAAAGGCTCATCTGACTTCATAGGAGTCATTCACTATCTCACGGCTTTGGTCACAAACATCGATATCAACCCTTCACT TTCAGGAATTCCAGATTTTAACTCAGACATGGGTGAATCTATTAATATTTTATCTATGAGGTCTGTTGTTTCTCCATGGG CTATGGAAGGCATCCTAGAGTATATAAAGCAGAGCTATGGCAATCCTCCAGTCTACATTCTTGAGAATGGTAAAACAATG AACCAAGATTTGGAGCTGCAACAAAAGGACACACCAAGGATTGAGTACTTAGATGCTTACATTGGTGCGGTGCTCAAAGC TGTTAGGAATGGATCAGACACGAGAGGCTACTTCGTATGGTCATTTATGGATTTGTACGAATTACTAAACGGATACAAGA GTAGTTTTGGATTGTACTCTGTCAATTTCAGTGATCCCCATCGCAAGAGATCTCCCAAACTCTCTGCTCACTGGTACTCT GGTTTTCTCAAGGGCAAACCCACATTTCTTGGTTCCCAAGGCATCACACAATTGCATAGCAACTTCTCTTCTTCCAGATA G |
Microexon DNA seq | TTTTATCTATGAGG |
Microexon Amino Acid seq | ILSMR |
Microexon-tag DNA Seq | TCAGGAATTCCAGATTTTAACTCAGACATGGGTGAATCTATTAATATTTTATCTATGAGGTCTGTTGTTTCTCCATGGGCTATGGAAGGCATCCTAGAGTATATAAAG |
Microexon-tag Amino Acid seq | SGIPDFNSDMGESINILSMRSVVSPWAMEGILEYIK |
Transcript ID | AT1G45191.4 |
Gene ID | At.3874 |
Gene Name | BGLU1 |
Pfam domain motif | Glyco_hydro_1 |
Motif E-value | 2.1e-140 |
Motif start | 32 |
Motif end | 484 |
Protein seq | >AT1G45191.4 MEDVLTLITMIVLLLLAFHGFGKCSSDLYSRSDFPEGFVFGAGISAYQWEGAVDEDGRKPSVWDTFLHCRKMDNGDIACD GYHKYKEDVQLMAETGLHTFRFSISWSRLISNGRGSINPKGLQFYKNFIQELVKHGIEPHVTLHHYDFPQYLEDDYGGWT NRKIIKDFTAYADVCFREFGNHVKFWTTINEANIFTIGGYNDGNSPPGRCSFPGRNCTLGNSSTETYIVGHNLLLAHASV SRLYKQKYKDIQGGSVGFSLFAMNFTPSTNSKDDEIATKRANDFYLGWMLEPLIYGDYPDVMKRTIGSRLPVFSKEESEQ VKGSSDFIGVIHYLTALVTNIDINPSLSGIPDFNSDMGESINILSMRSVVSPWAMEGILEYIKQSYGNPPVYILENGKTM NQDLELQQKDTPRIEYLDAYIGAVLKAVRNGSDTRGYFVWSFMDLYELLNGYKSSFGLYSVNFSDPHRKRSPKLSAHWYS GFLKGKPTFLGSQGITQLHSNFSSSR* |
CDS seq | >AT1G45191.4 ATGGAAGATGTTTTGACTCTCATTACCATGATTGTGTTGCTTCTTCTAGCTTTCCATGGATTTGGAAAATGCAGTAGCGA TCTTTACAGCAGGAGCGATTTCCCGGAAGGCTTCGTTTTCGGAGCCGGGATATCTGCTTATCAGTGGGAAGGAGCTGTTG ATGAAGATGGGAGGAAACCTAGCGTCTGGGATACTTTCCTTCACTGTCGTAAAATGGATAATGGAGACATAGCTTGTGAT GGATATCACAAGTATAAGGAAGATGTGCAGCTCATGGCCGAAACTGGCTTACATACATTCAGATTCTCCATCTCTTGGTC TAGACTCATCTCTAATGGAAGAGGTTCCATTAACCCGAAAGGTCTACAGTTCTACAAGAATTTCATTCAAGAACTTGTCA AACATGGAATTGAGCCACATGTTACACTACATCACTACGATTTTCCTCAATATCTCGAGGATGACTATGGAGGCTGGACC AACCGCAAAATCATCAAAGACTTTACCGCTTATGCAGATGTTTGCTTTAGAGAGTTTGGGAACCACGTCAAATTCTGGAC CACGATCAACGAGGCTAATATATTTACTATTGGAGGTTACAACGATGGGAATTCACCGCCTGGTCGTTGCTCCTTTCCGG GCAGAAACTGCACGTTAGGGAACTCTTCCACTGAAACATATATCGTAGGCCATAACTTGCTGCTTGCTCACGCCTCTGTT TCAAGACTATATAAGCAAAAGTACAAGGATATACAAGGAGGTTCTGTTGGATTTAGTTTATTTGCAATGAATTTTACTCC TTCTACAAACTCCAAGGATGATGAAATCGCAACGAAAAGAGCCAACGATTTCTACCTCGGATGGATGCTTGAGCCTCTTA TATATGGAGACTATCCTGATGTGATGAAAAGAACCATTGGATCAAGACTGCCAGTTTTCTCGAAGGAAGAATCAGAACAA GTGAAAGGCTCATCTGACTTCATAGGAGTCATTCACTATCTCACGGCTTTGGTCACAAACATCGATATCAACCCTTCACT TTCAGGAATTCCAGATTTTAACTCAGACATGGGTGAATCTATTAATATTTTATCTATGAGGTCTGTTGTTTCTCCATGGG CTATGGAAGGCATCCTAGAGTATATAAAGCAGAGCTATGGCAATCCTCCAGTCTACATTCTTGAGAATGGTAAAACAATG AACCAAGATTTGGAGCTGCAACAAAAGGACACACCAAGGATTGAGTACTTAGATGCTTACATTGGTGCGGTGCTCAAAGC TGTTAGGAATGGATCAGACACGAGAGGCTACTTCGTATGGTCATTTATGGATTTGTACGAATTACTAAACGGATACAAGA GTAGTTTTGGATTGTACTCTGTCAATTTCAGTGATCCCCATCGCAAGAGATCTCCCAAACTCTCTGCTCACTGGTACTCT GGTTTTCTCAAGGGCAAACCCACATTTCTTGGTTCCCAAGGCATCACACAATTGCATAGCAACTTCTCTTCTTCCAGATA G |