Microexon ID At_5:6026595-6026600:-
Species Arabidopsis thaliana
Coordinates 5:6026595..6026600
Microexon Cluster ID Unclassified
Size 6
No additional information is available for At_5:6026595-6026600:-.
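The microexon ID appears to pack a species prefix, chromosome, coordinates, and strand into one string. A minimal sketch of unpacking it and checking the listed size, assuming the coordinates are 1-based and inclusive (this is consistent with Size 6 but is not stated in the record). `parse_microexon_id` and the ID layout in `ID_PATTERN` are illustrative assumptions, not part of any database API:

```python
import re

# Assumed ID layout: <species>_<chromosome>:<start>-<end>:<strand>
ID_PATTERN = re.compile(r"([A-Za-z]+)_([^:]+):(\d+)-(\d+):([+-])")

def parse_microexon_id(microexon_id):
    """Split a microexon ID into its fields (hypothetical helper)."""
    match = ID_PATTERN.fullmatch(microexon_id)
    if match is None:
        raise ValueError(f"unrecognised microexon ID: {microexon_id!r}")
    species, chrom, start, end, strand = match.groups()
    return {
        "species": species,
        "chromosome": chrom,
        "start": int(start),
        "end": int(end),
        "strand": strand,
    }

record = parse_microexon_id("At_5:6026595-6026600:-")
# Inclusive coordinates: both endpoints count towards the length.
size = record["end"] - record["start"] + 1
print(record["chromosome"], record["strand"], size)
```

Under the inclusive-coordinate assumption the computed size agrees with the Size field.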
Transcript ID AT5G18230.2
Protein ID AT5G18230.2
Gene ID AT5G18230
Gene Name NA
Pfam domain motif Not3
Motif E-value 4.1e-80
Motif start 4
Motif end 238
Protein seq >AT5G18230.2
MGASRKLQGEIDRVLKKVQEGVDVFDSIWNKWNVYDTDNVNQKEKFEADLKKEIKKLQRYRDQIKTWIQSSEIKDKKVSA
SYEQSLVDARKLIEKEMERFKICEKETKTKAFSKEGLGQQPKTDPKEKAKSETRDWLNNVVSELESQIDSFEAELEGLSV
KKGKTRPPRLTHLETSITRHKDHIIKLELILRLLDNDELSPEQVNDVKDFLDDYVERNQDDFDEFSDVDELYSTLPLDEV
EGLEDLVTAGPLVKGTPLSMKSSLAASASQVRSISLPTHHQEKTEDTSLPDSSAEMVPKTPPPKNGAGLHSAPSTPAGGR
PSLNVPAGNVSNTSVTLSTSIPTQTSIESMGSLSPVAAKEEDATTLPSRKPPSSVADTPLRGIGRVGIPNQPQPSQPPSP
IPANGSRISATSAAEVAKRNIMGVESNVQPLTSPLSKMVLPPTAKGNDGTASDSNPGDVAASIGRAFSPSIVSGSQWRPG
SPFQSQNETVRGRTEIAPDQREKFLQRLQQVQQGHGNLLGIPSLSGGNEKQFSSQQQNPLLQQSSSISPHGSLGIGVQAP
GFNVMSSASLQQQSNAMSQQLGQQPSVADVDHVRNDDQSQQNLPDDSASIAASKAIQSEDDSKVLFDTPSGMPSYMLDPV
QVSSGPDFSPGQPIQPGQSSSSLGVIGRRSNSELGAIGDPSAVGPMHDQMHNLQMLEAAFYKRPQPSDSERPRPYSPRNP
AITPQTFPQTQAPIINNPLLWERLGSDAYGTDTLFFAFYYQQNSYQQYLAAKELKKQSWRYHRKFNTWFQRHKEPKIATD
EYEQGAYVYFDFQTPKDENQEGGWCQRIKNEFTFEYSYLEDELVV*
CDS seq >AT5G18230.2
ATGGGTGCGAGCCGGAAATTACAAGGCGAGATAGATCGGGTGCTGAAGAAGGTTCAAGAAGGTGTTGATGTTTTCGACAG
CATCTGGAACAAGTGGAATGTATATGATACAGACAATGTTAATCAAAAGGAAAAGTTTGAGGCGGACTTGAAGAAGGAAA
TCAAGAAGCTGCAGCGGTATAGAGACCAGATCAAGACATGGATTCAGTCTAGTGAGATCAAAGATAAGAAAGTCAGTGCA
TCTTATGAGCAATCCCTGGTGGATGCTCGGAAGCTTATTGAGAAAGAGATGGAGAGGTTTAAGATATGTGAAAAAGAGAC
CAAGACAAAAGCCTTCTCCAAGGAAGGACTGGGTCAGCAACCTAAAACTGACCCAAAAGAGAAAGCAAAGTCAGAGACAA
GGGATTGGTTGAACAATGTGGTGAGTGAACTGGAGTCGCAGATTGATAGCTTTGAAGCTGAGTTGGAAGGACTGTCTGTC
AAAAAAGGAAAGACAAGACCGCCCAGATTGACTCATCTTGAGACATCTATTACAAGACACAAGGATCACATAATAAAGTT
GGAACTGATCTTGAGGCTTCTGGACAATGATGAATTAAGTCCAGAACAAGTAAATGACGTCAAAGATTTTCTGGATGATT
ATGTTGAACGAAATCAGGATGATTTTGATGAATTCAGTGATGTCGATGAGCTCTATAGCACGTTGCCACTAGATGAGGTG
GAGGGTCTTGAAGATCTAGTTACCGCTGGCCCACTTGTCAAGGGTACTCCTTTAAGCATGAAGAGTTCTTTGGCAGCGTC
AGCATCTCAAGTTCGGAGCATAAGTTTGCCAACTCACCATCAAGAGAAAACAGAGGATACATCTTTACCGGATAGCAGTG
CTGAGATGGTTCCAAAAACCCCTCCGCCAAAGAATGGTGCAGGCCTTCACTCAGCACCATCAACACCTGCCGGAGGACGT
CCAAGTTTGAACGTGCCTGCCGGTAATGTTTCAAATACATCAGTTACCTTATCAACTTCTATTCCTACTCAAACTTCCAT
AGAAAGCATGGGGAGTTTGTCTCCCGTGGCTGCCAAGGAAGAAGACGCAACAACCTTGCCTTCTCGTAAACCACCCTCAT
CTGTTGCGGATACTCCATTGAGGGGCATTGGTAGAGTTGGTATCCCCAACCAACCCCAACCAAGCCAGCCTCCGTCTCCT
ATTCCAGCTAACGGGTCTCGCATTAGTGCAACTTCAGCTGCTGAAGTTGCAAAGAGAAATATAATGGGAGTTGAGAGCAA
CGTCCAACCTCTTACTTCTCCACTGAGCAAAATGGTGTTGCCACCAACTGCAAAGGGTAATGATGGAACTGCCTCTGATA
GCAACCCTGGTGATGTTGCGGCTAGTATTGGTAGAGCTTTTTCACCATCTATTGTATCTGGTTCGCAGTGGAGGCCTGGT
AGTCCCTTTCAGAGTCAGAATGAAACGGTTCGTGGGAGAACTGAAATAGCACCAGACCAAAGAGAGAAATTCTTACAGAG
ATTACAGCAAGTACAGCAAGGCCATGGTAACCTCTTAGGCATACCTTCTTTATCTGGAGGAAACGAGAAGCAGTTTTCTT
CACAACAGCAAAATCCTCTTTTACAGCAGAGCTCTTCCATCTCTCCTCATGGAAGCTTGGGAATCGGAGTTCAGGCACCA
GGTTTTAATGTCATGAGTTCTGCCTCCTTACAGCAGCAATCAAATGCCATGAGTCAACAATTGGGTCAGCAACCTTCTGT
TGCAGATGTAGACCATGTCAGAAATGATGATCAATCCCAGCAAAACTTACCTGATGATTCAGCTTCCATAGCAGCTTCAA
AAGCTATTCAAAGTGAGGATGACTCTAAAGTTCTATTTGATACTCCGTCGGGAATGCCCAGCTACATGTTGGATCCAGTA
CAAGTATCTAGCGGTCCTGATTTCTCTCCTGGACAACCTATACAACCGGGTCAATCTTCAAGTAGCCTCGGGGTCATTGG
ACGGAGAAGTAACTCTGAGTTAGGAGCCATTGGTGACCCTTCAGCTGTAGGGCCAATGCATGATCAAATGCACAATCTCC
AGATGCTTGAAGCTGCTTTTTACAAACGTCCTCAACCCTCAGATTCAGAACGTCCTAGACCCTATTCTCCGAGGAACCCA
GCAATCACACCTCAAACATTTCCCCAAACACAAGCACCAATCATAAACAACCCTTTGCTCTGGGAACGGTTAGGCAGCGA
TGCTTATGGAACCGATACTTTGTTCTTTGCGTTCTACTATCAGCAGAACTCATACCAGCAATATCTTGCTGCAAAAGAGC
TGAAGAAACAGTCATGGAGATACCACAGGAAGTTCAACACTTGGTTTCAGAGACATAAAGAGCCAAAGATTGCAACCGAT
GAATATGAACAAGGAGCCTACGTTTACTTCGATTTCCAAACCCCGAAAGACGAGAATCAAGAAGGAGGATGGTGCCAAAG
GATCAAAAACGAGTTCACATTTGAATACAGTTATCTTGAAGATGAACTCGTCGTATAG
Microexon DNA seq TGGAAT
Microexon Amino Acid seq WN
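The amino acid sequence follows from the DNA sequence by codon translation. A small sketch using the standard genetic code, assuming the 6-nt sequence is given in coding orientation and in frame (the listed WN is consistent with that). `translate` is an illustrative helper:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string codon by codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

microexon_dna = "TGGAAT"
print(translate(microexon_dna))  # WN
```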
Microexon-tag DNA Seq CTGAAGAAGGTTCAAGAAGGTGTTGATGTTTTCGACAGCATCTGGAACAAGTGGAATGTATATGATACAGACAATGTTAATCAAAAGGAAAAGTTTGAGGCGGACTTG
Microexon-tag Amino Acid seq MKKVQEGVDVFDSIWNKWNVYDTDNVNQKEKFEADL
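The tag appears to be the microexon plus flanking coding sequence on each side. The sketch below locates the microexon within the tag and the tag within the CDS, then maps the microexon to residue numbers for comparison with the Not3 motif range. It assumes that Motif start and Motif end are 1-based residue positions in the protein; `cds_prefix` and `protein_prefix` hold only the first two CDS lines and the first protein line from the record, which is enough to contain the tag. All variable names are illustrative.

```python
# Values copied from the record.
microexon_dna = "TGGAAT"
tag_dna = "CTGAAGAAGGTTCAAGAAGGTGTTGATGTTTTCGACAGCATCTGGAACAAGTGGAATGTATATGATACAGACAATGTTAATCAAAAGGAAAAGTTTGAGGCGGACTTG"
tag_peptide = "MKKVQEGVDVFDSIWNKWNVYDTDNVNQKEKFEADL"
cds_prefix = (
    "ATGGGTGCGAGCCGGAAATTACAAGGCGAGATAGATCGGGTGCTGAAGAAGGTTCAAGAAGGTGTTGATGTTTTCGACAG"
    "CATCTGGAACAAGTGGAATGTATATGATACAGACAATGTTAATCAAAAGGAAAAGTTTGAGGCGGACTTGAAGAAGGAAA"
)
protein_prefix = "MGASRKLQGEIDRVLKKVQEGVDVFDSIWNKWNVYDTDNVNQKEKFEADLKKEIKKLQRYRDQIKTWIQSSEIKDKKVSA"
motif_start, motif_end = 4, 238  # assumed 1-based residue positions

# Position of the microexon inside the tag (0-based) and the flank lengths.
offset_in_tag = tag_dna.find(microexon_dna)
left_flank = offset_in_tag
right_flank = len(tag_dna) - offset_in_tag - len(microexon_dna)

# Position of the tag, and hence of the microexon, inside the CDS (0-based).
tag_offset_in_cds = cds_prefix.find(tag_dna)
microexon_offset_in_cds = tag_offset_in_cds + offset_in_tag
in_frame = microexon_offset_in_cds % 3 == 0

# Convert nucleotide offsets to 1-based residue numbers.
first_residue = microexon_offset_in_cds // 3 + 1
last_residue = (microexon_offset_in_cds + len(microexon_dna) - 1) // 3 + 1
residues = protein_prefix[first_residue - 1:last_residue]
within_motif = motif_start <= first_residue and last_residue <= motif_end

# Compare the listed tag peptide with the matching stretch of the protein.
tag_first_index = tag_offset_in_cds // 3
protein_slice = protein_prefix[tag_first_index:tag_first_index + len(tag_peptide)]
mismatches = [i for i, (a, b) in enumerate(zip(tag_peptide, protein_slice)) if a != b]

print(left_flank, right_flank, in_frame)
print(first_residue, last_residue, residues, within_motif)
print(mismatches)
```

With these inputs the microexon sits in the middle of the tag with equal flanks, in frame with the CDS, and its two residues fall inside the motif range. The listed tag peptide differs from the protein only at its first residue (M in the tag, L in the protein, where the codon is CTG). One possible explanation, not confirmed by the record, is that the first codon of the tag was rendered as an initiator methionine.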
Transcript ID AT5G18230.2
Gene ID At.23440