Microexon ID At_1:17118337-17118350:+
Species Arabidopsis thaliana
Coordinates 1:17118337..17118350
Microexon Cluster ID Unclassified
Size 14
At_1:17118337-17118350:+ does not have available information here.
Transcript ID AT1G45191.4
Protein ID AT1G45191.4
Gene ID AT1G45191
Gene Name BGLU1
Pfam domain motif Glyco_hydro_1
Motif E-value 2e-140
Motif start 32
Motif end 484
Protein seq >AT1G45191.4
MEDVLTLITMIVLLLLAFHGFGKCSSDLYSRSDFPEGFVFGAGISAYQWEGAVDEDGRKPSVWDTFLHCRKMDNGDIACD
GYHKYKEDVQLMAETGLHTFRFSISWSRLISNGRGSINPKGLQFYKNFIQELVKHGIEPHVTLHHYDFPQYLEDDYGGWT
NRKIIKDFTAYADVCFREFGNHVKFWTTINEANIFTIGGYNDGNSPPGRCSFPGRNCTLGNSSTETYIVGHNLLLAHASV
SRLYKQKYKDIQGGSVGFSLFAMNFTPSTNSKDDEIATKRANDFYLGWMLEPLIYGDYPDVMKRTIGSRLPVFSKEESEQ
VKGSSDFIGVIHYLTALVTNIDINPSLSGIPDFNSDMGESINILSMRSVVSPWAMEGILEYIKQSYGNPPVYILENGKTM
NQDLELQQKDTPRIEYLDAYIGAVLKAVRNGSDTRGYFVWSFMDLYELLNGYKSSFGLYSVNFSDPHRKRSPKLSAHWYS
GFLKGKPTFLGSQGITQLHSNFSSSR*
CDS seq >AT1G45191.4
ATGGAAGATGTTTTGACTCTCATTACCATGATTGTGTTGCTTCTTCTAGCTTTCCATGGATTTGGAAAATGCAGTAGCGA
TCTTTACAGCAGGAGCGATTTCCCGGAAGGCTTCGTTTTCGGAGCCGGGATATCTGCTTATCAGTGGGAAGGAGCTGTTG
ATGAAGATGGGAGGAAACCTAGCGTCTGGGATACTTTCCTTCACTGTCGTAAAATGGATAATGGAGACATAGCTTGTGAT
GGATATCACAAGTATAAGGAAGATGTGCAGCTCATGGCCGAAACTGGCTTACATACATTCAGATTCTCCATCTCTTGGTC
TAGACTCATCTCTAATGGAAGAGGTTCCATTAACCCGAAAGGTCTACAGTTCTACAAGAATTTCATTCAAGAACTTGTCA
AACATGGAATTGAGCCACATGTTACACTACATCACTACGATTTTCCTCAATATCTCGAGGATGACTATGGAGGCTGGACC
AACCGCAAAATCATCAAAGACTTTACCGCTTATGCAGATGTTTGCTTTAGAGAGTTTGGGAACCACGTCAAATTCTGGAC
CACGATCAACGAGGCTAATATATTTACTATTGGAGGTTACAACGATGGGAATTCACCGCCTGGTCGTTGCTCCTTTCCGG
GCAGAAACTGCACGTTAGGGAACTCTTCCACTGAAACATATATCGTAGGCCATAACTTGCTGCTTGCTCACGCCTCTGTT
TCAAGACTATATAAGCAAAAGTACAAGGATATACAAGGAGGTTCTGTTGGATTTAGTTTATTTGCAATGAATTTTACTCC
TTCTACAAACTCCAAGGATGATGAAATCGCAACGAAAAGAGCCAACGATTTCTACCTCGGATGGATGCTTGAGCCTCTTA
TATATGGAGACTATCCTGATGTGATGAAAAGAACCATTGGATCAAGACTGCCAGTTTTCTCGAAGGAAGAATCAGAACAA
GTGAAAGGCTCATCTGACTTCATAGGAGTCATTCACTATCTCACGGCTTTGGTCACAAACATCGATATCAACCCTTCACT
TTCAGGAATTCCAGATTTTAACTCAGACATGGGTGAATCTATTAATATTTTATCTATGAGGTCTGTTGTTTCTCCATGGG
CTATGGAAGGCATCCTAGAGTATATAAAGCAGAGCTATGGCAATCCTCCAGTCTACATTCTTGAGAATGGTAAAACAATG
AACCAAGATTTGGAGCTGCAACAAAAGGACACACCAAGGATTGAGTACTTAGATGCTTACATTGGTGCGGTGCTCAAAGC
TGTTAGGAATGGATCAGACACGAGAGGCTACTTCGTATGGTCATTTATGGATTTGTACGAATTACTAAACGGATACAAGA
GTAGTTTTGGATTGTACTCTGTCAATTTCAGTGATCCCCATCGCAAGAGATCTCCCAAACTCTCTGCTCACTGGTACTCT
GGTTTTCTCAAGGGCAAACCCACATTTCTTGGTTCCCAAGGCATCACACAATTGCATAGCAACTTCTCTTCTTCCAGATA
G
Microexon DNA seq TTTTATCTATGAGG
Microexon Amino Acid seq ILSMR
Microexon-tag DNA Seq TCAGGAATTCCAGATTTTAACTCAGACATGGGTGAATCTATTAATATTTTATCTATGAGGTCTGTTGTTTCTCCATGGGCTATGGAAGGCATCCTAGAGTATATAAAG
Microexon-tag Amino Acid seq SGIPDFNSDMGESINILSMRSVVSPWAMEGILEYIK
Transcript ID AT1G45191.4
Gene ID At.3874
Gene Name BGLU1
Pfam domain motif Glyco_hydro_1
Motif E-value 2.1e-140
Motif start 32
Motif end 484
Protein seq >AT1G45191.4
MEDVLTLITMIVLLLLAFHGFGKCSSDLYSRSDFPEGFVFGAGISAYQWEGAVDEDGRKPSVWDTFLHCRKMDNGDIACD
GYHKYKEDVQLMAETGLHTFRFSISWSRLISNGRGSINPKGLQFYKNFIQELVKHGIEPHVTLHHYDFPQYLEDDYGGWT
NRKIIKDFTAYADVCFREFGNHVKFWTTINEANIFTIGGYNDGNSPPGRCSFPGRNCTLGNSSTETYIVGHNLLLAHASV
SRLYKQKYKDIQGGSVGFSLFAMNFTPSTNSKDDEIATKRANDFYLGWMLEPLIYGDYPDVMKRTIGSRLPVFSKEESEQ
VKGSSDFIGVIHYLTALVTNIDINPSLSGIPDFNSDMGESINILSMRSVVSPWAMEGILEYIKQSYGNPPVYILENGKTM
NQDLELQQKDTPRIEYLDAYIGAVLKAVRNGSDTRGYFVWSFMDLYELLNGYKSSFGLYSVNFSDPHRKRSPKLSAHWYS
GFLKGKPTFLGSQGITQLHSNFSSSR*
CDS seq >AT1G45191.4
ATGGAAGATGTTTTGACTCTCATTACCATGATTGTGTTGCTTCTTCTAGCTTTCCATGGATTTGGAAAATGCAGTAGCGA
TCTTTACAGCAGGAGCGATTTCCCGGAAGGCTTCGTTTTCGGAGCCGGGATATCTGCTTATCAGTGGGAAGGAGCTGTTG
ATGAAGATGGGAGGAAACCTAGCGTCTGGGATACTTTCCTTCACTGTCGTAAAATGGATAATGGAGACATAGCTTGTGAT
GGATATCACAAGTATAAGGAAGATGTGCAGCTCATGGCCGAAACTGGCTTACATACATTCAGATTCTCCATCTCTTGGTC
TAGACTCATCTCTAATGGAAGAGGTTCCATTAACCCGAAAGGTCTACAGTTCTACAAGAATTTCATTCAAGAACTTGTCA
AACATGGAATTGAGCCACATGTTACACTACATCACTACGATTTTCCTCAATATCTCGAGGATGACTATGGAGGCTGGACC
AACCGCAAAATCATCAAAGACTTTACCGCTTATGCAGATGTTTGCTTTAGAGAGTTTGGGAACCACGTCAAATTCTGGAC
CACGATCAACGAGGCTAATATATTTACTATTGGAGGTTACAACGATGGGAATTCACCGCCTGGTCGTTGCTCCTTTCCGG
GCAGAAACTGCACGTTAGGGAACTCTTCCACTGAAACATATATCGTAGGCCATAACTTGCTGCTTGCTCACGCCTCTGTT
TCAAGACTATATAAGCAAAAGTACAAGGATATACAAGGAGGTTCTGTTGGATTTAGTTTATTTGCAATGAATTTTACTCC
TTCTACAAACTCCAAGGATGATGAAATCGCAACGAAAAGAGCCAACGATTTCTACCTCGGATGGATGCTTGAGCCTCTTA
TATATGGAGACTATCCTGATGTGATGAAAAGAACCATTGGATCAAGACTGCCAGTTTTCTCGAAGGAAGAATCAGAACAA
GTGAAAGGCTCATCTGACTTCATAGGAGTCATTCACTATCTCACGGCTTTGGTCACAAACATCGATATCAACCCTTCACT
TTCAGGAATTCCAGATTTTAACTCAGACATGGGTGAATCTATTAATATTTTATCTATGAGGTCTGTTGTTTCTCCATGGG
CTATGGAAGGCATCCTAGAGTATATAAAGCAGAGCTATGGCAATCCTCCAGTCTACATTCTTGAGAATGGTAAAACAATG
AACCAAGATTTGGAGCTGCAACAAAAGGACACACCAAGGATTGAGTACTTAGATGCTTACATTGGTGCGGTGCTCAAAGC
TGTTAGGAATGGATCAGACACGAGAGGCTACTTCGTATGGTCATTTATGGATTTGTACGAATTACTAAACGGATACAAGA
GTAGTTTTGGATTGTACTCTGTCAATTTCAGTGATCCCCATCGCAAGAGATCTCCCAAACTCTCTGCTCACTGGTACTCT
GGTTTTCTCAAGGGCAAACCCACATTTCTTGGTTCCCAAGGCATCACACAATTGCATAGCAACTTCTCTTCTTCCAGATA
G