Genome sequence of the Listia angolensis microsymbiont Microvirga lotononidis strain WSM3557T

Microvirga lotononidis is a recently described species of root-nodule bacteria that is an effective nitrogen- (N2) fixing microsymbiont of the symbiotically specific African legume Listia angolensis (Welw. ex Bak.) B.-E. van Wyk & Boatwr. M. lotononidis possesses several properties that are unusual in root-nodule bacteria, including pigmentation and the ability to grow at temperatures of up to 45°C. Strain WSM3557T is an aerobic, motile, Gram-negative, non-spore-forming rod isolated from a L. angolensis root nodule collected in Chipata, Zambia in 1963. This is the first report of a complete genome sequence for the genus Microvirga. Here we describe the features of Microvirga lotononidis strain WSM3557T, together with genome sequence information and annotation. The 7,082,538 high-quality-draft genome is arranged in 18 scaffolds of 104 contigs, contains 6,956 protein-coding genes and 84 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.


Introduction
Legume-rhizobia symbioses are important components of southern Australian agricultural systems, in which symbiotic N2-fixation provides a significant amount of the nitrogen input that is required to boost food and animal production [1,2]. Traditionally, pasture legumes have been Mediterranean annuals such as medics and subterranean clover [3]. However, recent changes to the rainfall patterns in south-western Western Australia, resulting in a 10-20% decrease in annual rainfall [4], have adversely affected production from these annual legumes. Researchers are therefore seeking to introduce alternative perennial legume species and associated rhizobia that are better adapted to the arid climate and acid, infertile soils found in these systems [2]. Among the perennial, herbaceous forage legumes selected for further study are several species within the papilionoid legume clade Lotononis sensu lato.
Lotononis s. l. is grouped within tribe Crotalarieae, has a centre of origin in South Africa and consists of some 150 species, divided into 15 sections [5]. The taxonomy has recently been revised and the three distinct clades within Lotononis s. l. are now recognized at the generic level as Listia, Leobordea and Lotononis s. s. [6]. Species within the genus Listia are of agronomic interest, as they have potential as perennial pasture legumes that are able to reduce groundwater recharge and assist in preventing dry land salinity in southern Australian agricultural systems [7]. Listia spp. produce stoloniferous roots on their lower branches (a characteristic thought to be associated with the seasonally wet habitats where these species are found) [5] and form lupinoid, rather than indeterminate nodules, in response to infection by rhizobia [7,8]. The symbioses between Listia species and their associated root-nodule bacteria are highly specific. All studied host species are nodulated by strains of pigmented methylobacteria [7,9,10], except for Listia angolensis, which is effectively nodulated only by newly described species of Microvirga [11]. Microvirga lotononidis strain WSM3557 T is the type strain for this species. Here we present a set of preliminary classification and general features for M. lotononidis strain WSM3557 T together with the description of the genome sequence and annotation.

Classification and general features
M. lotononidis strain WSM3557 T is a motile, Gramnegative, non-spore-forming rod with one to several flagella (Figure 1, left and center panel). It is a member of the family Methylobacteriaceae in the class Alphaproteobacteria ( Figure 2). WSM3557 T is fast growing, forming 0.5-1.5 mm diameter colonies within 2-3 days. It is moderately thermophilic and has a mean generation time of 1.6 h when grown in broth at the optimum growth temperature of 41°C [15]. WSM3557 T is pigmented, an unusual property for rhizobia. Colonies on half Lupin Agar (½LA) [7] are pale pink, opaque, slightly domed, moderately mucoid with smooth margins (Figure 1, right panel). The color develops after several days. WSM3557 T is able to tolerate a pH range between 6.0 and 9.5 [11]. Carbon source utilization, cellular fatty acid profiles, polar lipid analysis and respiratory lipoquinone analysis have been described previously [11]. Minimum Information about the Genome Sequence (MIGS) is provided in Table 1.

Genome sequencing and annotation Genome project history
This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Community Sequencing Program at the U.S. Department of Energy, Joint Genome Institute (JGI) for projects of relevance to agency missions. The genome project is deposited in the Genomes OnLine Database [14] and an improved-high-quality-draft genome sequence in IMG. Sequencing, finishing and annotation were performed by the JGI. A summary of the project information is shown in Table 2.

Growth conditions and DNA isolation
Microvirga lotononidis WSM3557 T was grown to mid-logarithmic phase in TY rich medium [23] on a gyratory shaker at 28°C. DNA was isolated from 60 mL of cells using a CTAB (Cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method [24].  Phylum Proteobacteria TAS [18] Class Alphaproteobacteria TAS [18] Order Rhizob iales TAS [19] Family Methylobacteriaceae TAS [20] Genus M icrov irga TAS [21] Species Microvirga lotononidis TAS [15] Gram stain Negative TAS [15] Cell shape Rod TAS [15] Motility Motile TAS [15] Sporulation Non-sporulating TAS [15] Temperature range Mesophile TAS [15] Optimum temperature 41°C TAS [15] Salinity the living , isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [22]. . All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [12]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis [13] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [14] are in bold print and the GOLD ID is mentioned after the accession number. Published genomes are designated with an asterisk.

Genome sequencing and assembly
The improved high quality draft genome of Microvirga lotononidis WSM3557 T was generated at the DOE Joint Genome Institute (JGI) using a combination of Illumina [25] and 454 technologies [26].

Genome annotation
Genes were identified using Prodigal [32] as part of the DOE-JGI Annotation pipeline [33], followed by a round of manual curation using the JGI GenePRIMP pipeline [34]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) non-redundant database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro databases. These data sources were combined to assert a product description for each predicted protein. Non-coding genes and miscellaneous features were predicted using tRNAscan-SE [35], RNAMMer [36], Rfam [37], TMHMM [38], and SignalP [39]. Additional gene prediction analyses and functional annotation were performed within the Integrated Microbial Genomes (IMG-ER) platform [40].

Genome properties
The genome is 7,082,538 nucleotides with 63.00% GC content ( Table 3) and comprised of 18 scaffolds (Figures 3a,3b and Figure 3c) of 104 contigs. From a total of 7,040 genes, 6,956 were protein encoding and 84 RNA only encoding genes. The majority of genes (67.64%) were assigned a putative function while the remaining genes were annotated as hypothetical. The distribution of genes into COGs functional categories is presented in Table 4.