Genome sequence of the anaerobic bacterium Bacillus sp. strain ZYK, a selenite and nitrate reducer from paddy soil

Bacillus sp. strain ZYK, a member of the phylum Firmicutes, is of interest for its ability to reduce nitrate and selenite and for its resistance to arsenic under anaerobic conditions. Here we describe some key features of this organism, together with the complete genome sequence and annotation. The 3,575,797 bp long chromosome with its 3,454 protein-coding and 70 RNA genes, and the information gained from its sequence will be relevant to the elucidation of microbially-mediated transformations of nitrogen, selenium and arsenic in paddy soil.


Introduction
Bacillus sp. ZYK (=DSM 26460 =CGMCC 1.5179) was isolated from a paddy soil in Dehong, Yunnan, China and is an anaerobic nitrate-reducing, Grampositive bacterium [1]. Strain ZYK belongs to the genus Bacillus, and based on 16S rRNA phylogeny, is most closely related to Bacillus azotoformans isolated from garden soil, which is capable of reducing nitrate, nitrite, nitrous oxide, and nitric oxide under anaerobic conditions [2][3][4]. Strain ZYK is capable of nitrate-reduction under anaerobic conditions and, in addition, demonstrated selenite-reducing ability and arsenic resistance (unpublished data). Bacillus spp. are commonly found in paddy soil and may play important roles in elemental cycling during periodically changing redox conditions [5][6][7][8]. Therefore, strain ZYK is a suitable model for studying the properties of genes involved in denitrification, selenitereduction and arsenic resistance pathways of paddy soil bacteria. Here we summarize the features of Bacillus sp. strain ZYK and provide a description of its sequenced genome, now available for detailed analysis.

Classification and features
Based on 16S rRNA gene phylogeny and genome information, strain ZYK was a member of the genus Bacillus, most closely related to Bacillus azotoformans (AB363732), with a sequence similarity of 96.3% based on a Blast analysis [9] of the most recent release of the Greengenes database [10]. A phylogenetic tree ( Figure 1) was constructed using the Maximum likelihood method under the default settings for complete sequences of genes encoding 16S rRNA derived from sequenced genomes of Bacillus spp., along with the sequences of representative members of the genus. Strain ZYK is an anaerobic, Gram-positive, sporeforming, motile, rod-shaped (0.2-0.3 µm wide and 1.5-2.0 µm long) ( Figure 2). The strain grew optimally at pH 7.0-7.2 (range 6.0-7.6), 30-40°C (range 21-45°C) and at low salinity (NaCl range 0-1.1%) ( Table 1) in freshwater anaerobic medium [24]. On anaerobic LB agar, strain ZYK forms small, white colonies with entire edges (data not shown). Carbon substrates utilized for growth by strain ZYK included D-glucose, maltose, lactose, and sucrose. Strain ZYK reduces nitrate and selenite under anaerobic conditions in freshwater medium.  , not directly observed for the living, isolated sample, but based on a g enerally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [23]. If the evidence code is IDA, the property was directly observed by one of the authors. Standards in Genomic Sciences

Genome project history
Bacillus sp. ZYK was selected for sequencing because of its phylogenetic affiliation with a lineage of paddy soil bacteria that may influence elemental cycling in paddy fields. The genome project is deposited in the Genomes OnLine Database (GOLD) as project Gi22906, and the complete genome sequence is in GenBank under accession number ANOK00000000 (Table 2). A summary of the main project information is shown in Table 2.

Growth conditions and DNA isolation
For the preparation of genomic DNA, one colony was picked from an anaerobic LB agar plate, and grown in anaerobic freshwater medium at 30°C [24]. A culture (1.0 ml) at 0.6 OD600nm was inoculated into 100 ml of anaerobic freshwater media. Cells were collected by centrifugation after growing to 0.6 OD600nm. Cells were suspended in TE buffer (10 mM NaCl, 20 mM Tris-HCl, 1.0 mM EDTA, pH 8.0), and treated with lysozyme to lyse the cell wall. SDS and proteinaseK were added to denature and degrade proteins. Cell lysates were extracted with phenol-chloroform and the nucleic acids were precipitated by addition of isoamylol.
The nucleic acid pellet was washed with 100% ethanol, dissolved in double distilled water and then treated with RNase A [25].

Genome annotation
Genes were identified using Glimmer, version 3.0 [29]. The predicted CDSs were translated and used to search KEGG, COG, SwissPort, TrEMBL, NR and GO databases. These data sources were combined to assert a product description for each predicted protein. Transposons were identified using RepeatMaster (with Repbase) and RepeatProteinMasker (with its own database) software. Tandem repeat sequences were predicted by TRF (Tandem Repeat Finder) software. The rRNA, tRNA and sRNA were predicted by using rRNAmmer [30], tRNAscan [31] and Rfam [32] software, respectively.

Genome properties
The genome consists of a circular chromosome of 3,575,797 bp in size with a GC content of 36.1% ( Figure 3, Table 3). Of the 3,454 predicted genes, 70 are RNA genes, 136 are secreted protein coding genes, and 3,318 are non-secreted protein coding genes. Of the total predicted genes, 2,030 represent COG functional categories. The distribution of genes into COG functional categories is presented in Figure 3 and Table 4.      , chain B), respectively. The discovery of an arsenate reductase coding gene (arsC) and DMSO reductase sequences suggests that the reduction capabilities of strain ZYK are broader than expected, and that other substrates be tested. Particularly, we are interested in determining whether selenite reduction activity in ZYK is mediated by a hydrogenase [33], a nitrite reductase [34] or a DMSO reductase. While the reduction of selenite to elemental selenium is a common feature of diverse microorganisms, the genes responsible for this process remain largely uncharacterized and virtually nothing is known about their regulation [33][34][35], or their interac-tions with other respiratory pathways. In addition to Bacillus sp. ZYK, the genomes of two bacteria capable of selenite reduction, Bacillus selenitireducens (NC_014219.1) [36] and Desulfirispirillum indicum S5 [37,38], have been sequenced. The investigation of the functional genes of strain ZYK will consequently enhance the understanding of the electron acceptor utilization pathways in microorganisms, and how nitrogen, selenium and arsenic cycling is mediated by microorganisms active in paddy soil. Further study of these reductase gene-coding sequences may reveal the importance of the Bacillus genus in elemental cycling in paddy soils.