Complete genome sequence of Serratia plymuthica strain AS12

A plant-associated member of the family Enterobacteriaceae, Serratia plymuthica strain AS12 was isolated from rapeseed roots. It is of scientific interest because it promotes plant growth and inhibits plant pathogens. The genome of S. plymuthica AS12 comprises a 5,443,009 bp long circular chromosome, which consists of 4,952 protein-coding genes, 87 tRNA genes and 7 rRNA operons. This genome was sequenced within the 2010 DOE-JGI Community Sequencing Program (CSP2010) as part of the project entitled “Genomics of four rapeseed plant growth promoting bacteria with antagonistic effect on plant pathogens”.


Introduction
Plant associated Serratia species are commonly found as free--living bacteria in rhizosphere soil and as endophytes within plant roots. They in-clude strains with the ability to stimulate plant growth and to inhibit the growth of soil borne pathogens of economically important agricultural plants [1--3]. One Serratia strain, S. plymuthica HRO--C48, is successfully used as an alternative to chemical agents for control of soil--borne fungal diseases in different crops such as strawberry and rapeseed [3,4]. Its ability to degrade chitin, a fun-gal cell wall component, may be responsible for antifungal activity, whereas the production of the plant hormone indole--3--acetic acid (IAA) could be involved in plant growth promotion [3]. S. plymuthica AS12 has chitinolytic activity and was isolated from rapeseed roots from Uppsala, Swe-den in 1998 [5]. The reason for our interest in S. plymuthica AS12 is its ability to inhibit Verticillium longisporum (earlier V. dahliae), a soil borne fun-gal pathogen of rapeseed, thus promoting the rapeseed growth both directly and indirectly [5].
Here we present a description of the complete ge-nome of S. plymuthica AS12 and its annotation.

Classification and features
A representative 16S rRNA gene sequence of the strain AS12 genome was used for comparison us-ing NCBI BLAST [6] under default settings with the most recent databases. The relative frequen-cies of taxa and BLAST scores were determined. The most frequently occurring genus is Serratia where some of the 'hits' share a 100% identity. When considering high--scoring segment pairs (HSPs) from the best 250 hits, the most frequent matches were Serratia sp. (17.2%) with a maxi-mum identity of 97--100%, while S. plymuthica (5.2%) had a maximum identity of 97--100%, S. proteomaculans (4.8%) with a maximum identity of 97--99%, S. marcescens (4.8%) with a maximum identity of 96--97% and different strains of Rahnella (7%) with a maximum identity of 97--98%.
A phylogenetic tree (Figure  1) was constructed us-ing 16S rRNA sequences of S. plymuthica AS12 with other genera within the family Enterobacteriaceae including two species within the genus Serratia. The tree shows the position of S. plymuthica AS12 within the genus Serratia and its distinct clustering with S. plymuthica, which was confirmed by digital DNA--DNA hybridization values [11] above 70% with the (unpublished) draft genome sequence of the S. plymuthica type strain Breed K--7 T from a DSM 4540 culture as well as with the complete ge-nome sequence of S. plymuthica AS9 [12] using the GGDC web server [13]. The cells of strain AS12 stain Gram--negative and are rod shaped, 1--2 µm long, 0.5--0.7 µm wide (Fig-ure 2 and Table 1) and motile. The culture forms red to pink colored colonies of 1--2 mm diameter on tryptic soy agar and potato dextrose agar, but the colony color depends on different factors such as the growth substrate, pH of the medium and growth temperature. The organism is a facultative anaerobe and grows at temperatures ranging from 4 °C -- 40 °C and within a pH range of 4 -- 10. It has the ability to utilize a wide range of carbon sources such as glucose, sucrose, succinate, mannitol and arabinose. It also has cellulolytic, phospholytic, chitinolytic and proteolytic activity [5]. The strain is deposited in the Culture Collection, University of Göteborg, Sweden (CCUG) as Serratia sp. AS12 (= CCUG 61397).

Chemotaxonomy
The cells of S. plymuthica AS12 contain a mixture of saturated and unsaturated fatty acids. The domi-nant fatty acids in strain AS12 are C 16:0 (22.94%), C (17.08%), C (19.65%), C 14:0 (5.11%), along with other minor fatty acid components. No information is available for other compounds. Pre-viously it has been shown that Serratia spp. contain a mixture of C 14:0 , C 16:0 , C 16:1 and C 18:1+2 fatty acids in which 50--80% of the total fatty acid in the cell is C 14:0 and others each less than 3% [28]. This is con-sistent with the observation that C 14:0 is a charac-teristic fatty acid of the family Enterobacteriaceae.  [7]. The tree was inferred under the maximum likelihood criterion using MEGA5 software [8] and rooted with Pseudomonas trivialis (a member of the Pseudomonadaceae family). The branches are mapped by the expected number of substitutions per site. The numbers above the branches are support values from 1,000 bootstrap replicates if larger than 60% [9]. Lineages with genome sequences registered in GOLD [10] are shown in blue.  , not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [27]. If the evidence code is IDA, then the property should have been directly observed, for the purpose of this specific publication, for a live isolate by one of the authors, or an expert or reputable institution mentioned in the acknowledgements.

Genome sequencing information
S. plymuthica AS12 was selected for sequencing on the basis of its ability to promote rapeseed plant growth as well as to inhibit fungal pathogens of rapeseed [5]. The genome sequence is deposited in the Genomes On Line Database [10] (GOLD ID = Gc01771) and in GenBank (INSDC ID = CP002774). Sequencing, finishing and annotation were performed by the DOE Joint Genome Insti-tute (JGI). A summary of the project information and its association with MIGS identifiers is shown in Table  2.

Growth conditions and DNA isolation
The cells of S. plymuthica AS12 were grown in Lu-ria Broth (LB) medium at 28°C with constant shaking at 200 rpm. The cells were harvested after 12 hours when the cells were in the early station-ary phase. The cells were pelleted and resuspend-ed in TE buffer (Sigma Aldrich). The DNA was ex-tracted from the resuspended cells by following the standard CTAB protocol for bacterial genomic DNA isolation, which is available at JGI [29].

Genome sequencing and assembly
The genome of S. plymuthica AS12 was sequenced using a combination of Illumina [30] and 454 se-quencing platforms [31]. The detailed information on library construction and sequencing can be found at the JGI website [29]. The sequence data from Illumina GAii (1,800 Mb) were assembled with Velvet [32] and the consensus sequence was computationally shredded into 1.5 kb overlapping fake reads. The sequencing data from 454 pyrosequencing (81.6 Mb) were assembled with Newbler. The initial draft assembly contained 61 contigs in one scaffold and consensus sequences were computationally shredded into 2 kb over-lapping fake reads. The 454 Newbler consensus reads, the Illumina Velvet consensus reads and the read pairs in the 454 paired end library were inte-grated using a software parallel Phrap [33]. Possi-ble mis--assemblies were corrected with gapResolution [29], Dupfinisher [34], or by se-quencing cloned bridging PCR fragments with subcloning or transposon bombing (Epicentre Bi-otechnologies, Madison, WI). The gaps between contigs were closed by editing in the software Consed [35--37], by PCR and by Bubble PCR (J.--F. Chang, unpublished) primer walks. A total of 160 additional reactions was necessary to close gaps and to raise the quality of the finished sequence. The sequence reads from Illumina were used to correct potential base errors and increase consen-sus quality using the software Polisher developed at JGI [38]. The final assembly is based on 47.4 Mb of 454 draft data which provides an average 8.8 × coverage of the genome and 315 Mb of Illumina draft data which provides an average 59 × cover-age of the genome.

Genome annotation
The S. plymuthica AS12 genes were identified us-ing Prodigal [39] as part of the genome annotation pipeline at Oak Ridge National Laboratory (ORNL), Oak Ridge, TN, USA, followed by a round of manual curation using the JGI GenePRIMP pipe-line [40]. The predicted CDS were translated and used to search the National Center for Biotechnol-ogy Information (NCBI) nonredundant database, Uniport, TIGR--Fam, Pfam, PRIAM, KEGG, COG and InterPro databases. The miscellaneous functions were predicted using tRNAscan--SE [41], RNAmmer [42], TMHMM [43], and signalP [44]. Additional gene prediction analysis and functional annotation was performed within the Integrated Microbial Genomes -Expert Review (IMG--ER) platform developed by the Joint Genome Institute, Walnut Creek, CA, USA [45].

Genome properties
The genome of S. plymuthica AS12 comprises a single circular chromosome of 5,443,009 bp with 55.96% GC content ( Figure 3 and Table 3) and 5,140 predicted genes. Among those predicted genes, 4,952 were assigned as protein--coding genes and 88.71% of protein coding genes were assigned for putative function and the remaining ones were annotated as hypothetical proteins. There were 76 pseudogenes and 113 RNA genes with seven rRNA operons. The distribution of genes into the COG functional categories is pre-sented in Table  4.  a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.