Open Access

Draft genome sequence of Arthrospira platensis C1 (PCC9438)

  • Supapon Cheevadhanarak
  • , Kalyanee Paithoonrangsarid
  • , Peerada Prommeenate
  • , Warunee Kaewngam
  • , Apiluck Musigkain
  • , Somvong Tragoonrung
  • , Satoshi Tabata
  • , Takakazu Kaneko
  • , Jeerayut Chaijaruwanich
  • , Duangjai Sangsrakru
  • , Sithichoke Tangphatsornruang
  • , Juntima Chanprasert
  • , Sissades Tongsima
  • , Kanthida Kusonmano
  • , Wattana Jeamton
  • , Sudarat Dulsawat
  • , Amornpan Klanchui
  • , Tayvich Vorapreeda
  • , Vasunun Chumchua
  • , Chiraphan Khannapho
  • , Chinae Thammarongtham
  • , Vethachai Plengvidhya
  • , Sanjukta Subudhi
  • , Apiradee Hongsthong
  • , Marasri Ruengjitchatchawalya
  • , Asawin Meechai
  • , Jittisak Senachak
  • and Morakot Tanticharoen
Corresponding author

DOI: 10.4056/sigs.2525955

Received: 05 March 2012

Published: 19 March 2012


Arthrospira platensis is a cyanobacterium that is extensively cultivated outdoors on a large commercial scale for consumption as a food for humans and animals. It can be grown in monoculture under highly alkaline conditions, making it attractive for industrial production. Here we describe the complete genome sequence of A. platensis C1 strain and its annotation. The A. platensis C1 genome contains 6,089,210 bp including 6,108 protein-coding genes and 45 RNA genes, and no plasmids. The genome information has been used for further comparative analysis, particularly of metabolic pathways, photosynthetic efficiency and barriers to gene transfer.


Arthrospira platensis is a cyanobacterium that contains large amounts of proteins, vitamins, lipids and pigments [1]. It is widely used as a human food and an animal feed. In addition, its extracts can enhance the immune system and promote health [1,2]. As the natural habitat is soda lakes, Arthrospira spp are cultivated under highly alkaline conditions in open ponds on a large commercial scale. This condition can minimize and sometimes prevent the culture from contamination [3]. Unlike many plant food products whose nutritional value rapidly deteriorates at high temperatures, the nutritional value of Arthrospira products is maintained even when the cells are processed at high temperatures [4]. In contrast to many cyanobacteria, there is no report of toxicity of Arthrospira for humans, animals or environments [4].

The genome sequences of Arthrospira spp. have been the subject of immense interest due to the beneficial properties of these organisms in the biotechnology and environmental fields [5,6]. A. platensis C1 is the fifth complete genome report for a member of the genus Arthrospira. A. platensis C1 has long been used as a laboratory strain for physiological and molecular studies due to its non-gliding property, which enables single colonies formation. This property facilitates studies at the molecular level and strain improvement, particularly, the development of a transformation system. Currently, a successful transformation system for Arthrospira has not yet been established. Thus the genome sequences may help to identify barriers responsible for the instability of the transformants. Here, we present a summary classification and a set of features of A. platensis C1 together with the complete genomic sequence and its annotation.

Classification and features

Historically, the classification of the Arthrospira and Spirulina genera [Figure 1] was a subject of controversy. For the commercial strain, Arthrospira or Spirulina was used interchangeably. Both Arthrospira and Spirulina are similar in morphological characters; cylindrical, multicellular, filamentous cyanobacteria with an open, left-handed helical shape [Table 1]. They both belong to the Phylum Cyanobacteria, Order Oscillatoriales and Family Oscillatoriaceae [13]. However, they can be differentiated by the presence of cell septa: Arthrospira possess septa, whereas Spirulina do not [14].

Figure 1

The phylogenetic tree of 51 cyanobacterial concatenated ribosomal proteins. The main topology is in agreement with earlier inferences of the phylogeny of this taxon with the 16s rRNA based on the GTR+G+I substitution model [7]. The tree is built using the Neighbor-Joining method and 1,000 re-samplings to calculate bootstrap values. A. platensis C1 was clustered together with other strains in the order Oscillatoriales and was clearly separated from related species in the order Nostocales. The conserved, concatenated ribosomal protein phylogenetic tree indicated the monophyly of this Arthrospira genus.

Table 1

Classification and general features of A. platensis C1 according to the MIGS recommendations [8]




     Evidence Code

      Domain Bacteria

     TAS [9]

      Phylum Cyanobacteria

     TAS [10,11]

      Class Cyanobacteria

     TAS [11,12]

     Current Classification

      Order Oscillatoriales

     TAS [13]

      Family Oscillatoriaceae

     TAS [13]

      Genus Arthrospira

     TAS [11]

      Species Arthrospira platensis C1

     TAS [11]

     Gram stain


     TAS [14]

     Cell shape


     TAS [14]





     Temperature range

      20 – 40

     TAS [14]

     Optimum temperature


     TAS [14]


     Relationship to Oxygen


     TAS [3]

     Carbon source

      Phototroph, Mixotroph

     TAS [3]

     Energy source


     TAS [3]


     Habitat (EnvO)

      Fresh water

     TAS [3]



      20 - 40

     TAS [3]



      8.0 -10.0

     TAS [3]




     TAS [3]


     Extrachromosomal elements


     TAS [6]


     Estimated Size

      6.08 Mb



     Known Pathogenicity




     Biotic Relationship

      Free living



     Geographic Location

      Not reported




      Not reported




      Not reported




      Not reported




      Not reported


Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [15]. If the evidence code is IDA, then the property was directly observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements.


Arthrospira platensis C1 or Arthrospira sp. PCC 9438, as designated based on its morphology by Pasteur Institute, Paris, France, was originally classified as Spirulina platensis C1 [1,3]. This reclassification was in agreement with the presence of γ-linolenic acid (GLA) in the fatty acid profile, a chemotaxonomic marker of Arthrospira, while GLA is absence in Spirulina [16-18].

The phylogenetic tree of cyanobacteria was reconstructed with evolutionary information embedded in conserved, concatenated ribosomal proteins. A. platensis C1 was positioned into the genus Arthrospira with 100% of the bootstrapping value. The closest strain is Arthrospira sp. PCC 8005 with 97.43% sequence identity, whereas the other strains of this genus share 94.93-96.58% sequence identity. The nearest related order is Nostocales with approximately 70% sequence identity.

Genome sequencing and annotation

Genome project history

Arthrospira genome sequencing projects have been carried out in 5 research centers: Genoscope, France; the DOE Joint Genome Institute, USA; the National Institute of Technology and Evaluation (NITE), Japan; the University of Applied Sciences, Switzerland, and King Mongkut’s University of Technology Thonburi (KMUTT), Thailand, using various Arthrospira strains (A. platensis PCC8005, A. maxima CS-328, A. platensis NIES- 39, A. platensis Paraca and A. platensis C1 (this study), respectively). In this study, the genome of A. platensis C1 has been sequenced, and the results provide data that can be used for the further study of its biological functions.

The genome project is deposited in the GenBank Database (NCBI ID 67617 and accession number AFXD00000000). DNA sequencing and finishing were performed in collaboration between the Genome Institute, BIOTEC-NSTDA, Thailand and Kazusa DNA Research Institute, Japan. The genome assembly and annotation steps were performed in collaboration between KMUTT, the Genome Institute, BIOTEC-NSTDA, Chiang Mai University, Thailand and Kazusa DNA Research Institute. The summary of the project information is shown in Table 2.

Table 2

Genome sequencing project information





     Finishing quality



     Libraries used

     Genomic libraries: Sanger (one each of 2 and 5 kb library), standard 454 pyrosequence library and BAC library


     Sequencing platforms

     454 and Sanger Technology


     Sequence coverage




     Newbler version 2.3, Phrap





     Gene calling method

     Glimmer 3.0

     GOLD ID


     NCBI project ID


     Database: IMG-GEBA


     Source material identifier


     Project relevance


Growth conditions and DNA isolation

A. platensis strain C1 was obtained from Prof. Avigad Vonshak (Algal Biotechnology, Ben-Gurion University of the Negev, Israel). The cells were routinely grown at 35 °C with constant shaking at 150 rpm in Zarrouk’s medium [19], under aeration of 1% (v/v) CO2 and illumination of 100 μE m-2 s-1 from fluorescent lamps. For stock cultures, cells were transferred into new Zarrouk’s medium [19] every two weeks.

For the shotgun genome library preparation, a total DNA sample was sheared by nebulization with nitrogen at 45 psi for 1 min and purified using a QIAGEN Purification kit (QIAGEN, Valencia, CA). DNA fragments with sizes greater than 300 bp were selected by Ampure Bead purification (Beckman Coulter, Brea, CA) and blunt-ended with T4 DNA polymerase and T4 polynucleotide kinase (454 Life Sciences, Roche, Branford, CT). After DNA purification, 454-library adaptors were added to both ends of the fragments, which were immobilized using Library Immobilization Beads for further DNA sequencing.

For paired-end library preparation, a total DNA sample was sheared by nebulization with nitrogen at 10 psi for 2 min to obtain 2.2 kb fragments with hairpin adaptors added to both ends of the DNA fragments. The DNA fragments with hairpin adaptors were immobilized using Library Immobilization Beads and ligated to paired-end adaptors. DNA libraries were linked to the capture beads and amplified by emulsion PCR (Roche Applied Science protocol).

Genome sequencing and assembly

The genome of A. platensis C1 was sequenced using a hybrid method between the 454 Life Sciences technology on the Genome Sequencer (GS) FLX System and BigDye Terminator v3.1 Cycle sequencing. Pyrosequencing reads were assembled using the Newbler de novo sequence assembly software version 2.0.0 (Roche). The Phred/Phrap/Consed software package [20] was used for sequence assembly and quality assessment in the finishing process. The remaining gaps between contigs were closed by custom primer walk or PCR amplification and then editing in Consed. The final assembly contains 739,684 reads from pyrosequencing and 45,959 reads from Sanger sequencing, resulting in 28× coverage of the genome. Employing A. maxima CS-328 contigs [21] as a reference, the A. platensis C1 circular genome of 6.08 Mb total size with 1 scaffold and 63 gaps has been constructed.

Genome annotation

In agreement with the result from the Integrated Microbial Genomes Expert Review (IMG-ER) platform [22], all the genes in the A. platensis C1 genome were identified using the GLIMMER 3.0 program in our Microbial Inhouse Annotation Pipeline. Initial criteria for automated functional assignment required a minimum of 50% residue identity with over 80% length match for BLASTP alignments to the NCBI nonredundant database, InterPro, SwissProt, SignalP, COG, and KEGG databases. The tRNAscan-SE tool [23] was used to find tRNA genes, whereas ribosomal RNAs were found by using the tool RNAmmer [24]. Additional gene prediction analysis and functional annotation were performed within IMG-ER platform [22] and a round of manual curation, including confirmation with proteomic data [25,26].

Genome properties

The genome of A. platensis C1, with a total of 6.08 Mbp (6,089,210 base pairs), contains 44.68% G+C (Table 3) and, in agreement with the findings of Fujisawa et al. (2010) [6], no plasmid DNA. Our results confirm the presence of a single genome in A. platensis C1. The A. platensis C1 circular genome of 6.08 Mbp was compared with the A. maxima CS-328 and A. platensis NIES-39 genomes [6]. A total of 6,153 open reading frames (orfs) were predicted. Of these, 3,757 were annotated as coding for known protein functions and 45 for RNA genes (6 for rRNA and 39 for tRNA). The distribution of genes into COGs is presented in Table 4.

Table 3

Genome statistics



     % of Total

Genome size (bp)



DNA coding region (bp)



G+C content (bp)



DNA scaffolds



Total genes



Protein coding genes



RNA genes



Genes with protein function prediction



Genes with enzymes



Genes with Transporter Classification



Genes assigns to KEGG pathways



Genes assigned to KEGG Orthology (KO)



Genes assigned to COGs



Genes assigned to Pfam



Genes assigned to TIGRfam



Genes assigned to InterPro



Genes assigned in paralog clusters



Genes assigned with signal peptides



Genes assigned with transmembrane proteins



Table 4

Number of genes associated with the general COG functional categories



     % age





     Translation, ribosomal structure and biogenesis




     RNA processing and modification








     Replication, recombination and repair




     Chromatin structure and dynamics




     Cell cycle control, cell division, chromosome partitioning




     Nuclear structure




     Defense mechanisms




     Signal transduction mechanisms




     Cell wall/membrane/envelope biogenesis




     Cell motility








     Extracellular structures




     Intracellular trafficking, secretion, and vesicular transport




     Posttranslational modification, protein turnover, chaperones




     Energy production and conversion




     Carbohydrate transport and metabolism




     Amino acid transport and metabolism




     Nucleotide transport and metabolism




     Coenzyme transport and metabolism




     Lipid transport and metabolism




     Inorganic ion transport and metabolism




     Secondary metabolite biosynthesis, transport and catabolism




     General function prediction only




     Function unknown




     Not in COG

The genome atlas of A. platensis C1 is shown in Figure 2. The atlas was created based on one FASTA file containing the nucleotide sequences in one piece. Together with the annotation file, positions of genes were used for the map.

Figure 2

Graphical circular map of the chromosome of Arthrospira platensis C1. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.

In the genome atlas, the gene annotation, base content, AT and GC skew, percent AT and some structural properties of the DNA are shown. The structural properties are Position Preference, Stacking Energy and Intrinsic Curvature, all of which are related to the flexibility and strength of the DNA molecule [27].

Genomic comparison

The genome sequences from all Arthrospira strains (A. maxima CS-328, A. platensis NITE-39, A. platensis PCC8005, A. platensis Paraca, and A. platensis C1) provided the data for a comparative genome analysis of these strains. Based on the genome statistics comparison (Table 5), all of the Arthrospira spp. genomes revealed highly conserved sequences. Interestingly, the number of signal peptides in A. platensis C1 has been reported to be the lowest among the Arthrospira spp. Further study of these primary sequences may reveal the importance of signal peptides in protein targeting in this cyanobacterial species compared with other laboratory and commercial strains.

Table 5

Genome statistics comparison among Arthrospira spp.

Genome Name

    A. platensis C1

    A. platensis NIES-39

     A. maxima CS-328

    A. platensis Paraca

    A. platensis PCC8005

Genome size (bp)






Total genes






Protein coding genes






Protein with function prediction






RNA genes












% Enzymes






Transporter Classification






%Transporter Classification






KEGG pathways






% KEGG pathways






KEGG Orthology (KO)






% KEGG Orthology (KO)












% COGs












% Pfam












% TIGRfam












% InterPro






signal peptides






% signal peptides






Transmembrane proteins






% Transmembrane proteins






COG clusters






KOG clusters






Pfam clusters






TIGRfam clusters






NA: Information not available.

Insights from the genome sequence

Analysis of the genome of Arthrospira spp. compared to other cyanobacteria confirmed that the Arthrospira are non-nitrogen fixing, filamentous cyanobacteria. They are nontoxic, which might be due to the absence of polyketide and non-ribosomal peptide-producing genes.

Genes involved in gliding motility in A. platensis NIES-39 (a vigorously motile strain utilizing type IV pili as the major mechanism for gliding) [6] have been compared with those in A. platensis C1 (a nonmotile strain). Interestingly, all the genes involved in type IV pili are present in the A. platensis C1 genome, however, the lack of gliding ability in A. platensis C1 is due to an unknown mechanism. Further studies are needed to elucidate this mechanism of cell motility.

Cellular defense mechanism

Like other genomes of Arthrospira spp., the genome of A. platensis C1 contains highly interspersed repetitive sequences that account for 9% of its genome. Genome comparisons among cyanobacteria revealed unusual genes involved in defense mechanisms, including restriction and modification enzymes, group II introns, insertion elements and CRISPR. These genes are considered to be major barriers for stable transformation. Therefore, these genes have been targeted for the development of a stable gene transformation system for Arthrospira spp. Because of the nonmotile property of A. platensis C1, single colonies can be selected and used for further strain improvement and genetic manipulation experiments.

By combining a stable transformation system with the advantage of colony-forming ability, we should be able to harness A. platensis C1 for many biotechnological applications using gene manipulation and systems biology.

Transporter Characteristic of A. platensis C1

Because of the ability of cyanobacteria to adapt to extremely different habitats, the relationship between their membrane transporter proteins and their habitats has been a focus of interest. Membrane transporters are proteins that allow cell membranes to deliver essential nutrients, eject waste products, and help the cell sense environmental conditions around it [28].

Interestingly, all Arthrospira species contain genes for Na+/H+ antiporters. The NapA-type Na+/H+ antiporter homolog, which is reported to be involved in salt tolerance at alkaline pH in some cyanobacteria, is found in all Arthrospira genomes. Arthrospira species live in high-alkalinity environments, and there are many alkali transporters in their genomes. These results revealed the relationship between the transporters and the lifestyles and niche adaptations of cyanobacteria.



This work was supported by the National Center for Genetic Engineering and Biotechnology (BIOTEC), Thailand under the grant numbers P-00-20061 and P-09-00186. The authors would like to thank Prof. Avigad Vonshak (Algal Biotechnology, Ben-Gurion University of the Negev, Israel) for the A. platensis C1 strain used in our experiments and Prof. Norio Murata for all his valuable suggestions during this project.


  1. Habib MAB, Pariv M, Huntington TC, Hasan MR. A review on culture, production and use of spirulina as food for humans and feeds for domestic animals and fish. Rome: Food and Agriculture Organisation (FAO) of the United Nations. 2008; 1-33.
  2. Richmond A. Spirulina, Microalgal Biotechnology. In: Borowitzka M, Borowitzka L, editors. Microalgal Biotechnology. New York: Cambridge University Press; 1987. p. 85-121.
  3. Vonshak A. Spirulina: Growth, physiology and biochemistry. In: Vonshak A, editor. Spirulina platensis (Arthrospira). London: Taylor & Francis; 1997, p. 43-65.
  4. Henrikson R. Spirulina World Food: How this micro algae can transform your health and our planet. Ronore Enterprises, Inc. PO Box 909, Hana, Maui, Hawaii 96718; 2010.
  5. Janssen PJ, Morin N, Mergeay M, Leroy B, Wattiez R, Vallaeys T, Waleron K, Waleron M, Wilmotte A and Quillardet P. Genome sequence of the edible cyanobacterium Arthrospira sp. PCC 8005. J Bacteriol. 2010; 192:2465-2466 View ArticlePubMed
  6. Fujisawa T, Narikawa R, Okamoto S, Ehira S, Yoshimura H, Suzuki I, Masuda T, Mochimaru M, Takaichi S and Awai K. Genomic structure of an economically important cyanobacterium, Arthrospira (Spirulina) platensis NIES-39. DNA Res. 2010; 17:85-103 View ArticlePubMed
  7. Schirrmeister BE, Antonelli A and Bagheri HC. The origin of multicellularity in cyanobacteria. BMC Evol Biol. 2011; 11:45 View ArticlePubMed
  8. Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ and Angiuoli SV. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008; 26:541-547 View ArticlePubMed
  9. Woese CR, Kandler O and Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci USA. 1990; 87:4576-4579 View ArticlePubMed
  10. Woese CR, Stackebrandt E, Macke TJ and Fox GE. A phylogenetic definition of the major eubacterial taxa. Syst Appl Microbiol. 1985; 6:143-151 View ArticlePubMed
  11. McNeill J, Barrie FR, Burdet HM, Demoulin V, Hawksworth DL, Marhold K, Nicolson DH, Prado J, Silva PC, Skog JE, et al International Code of Botanical Nomenclature, A.R.G. Ganter, Königstein, 2006, p. 1.
  12. Woese CR, Stackebrandt E, Macke TJ and Fox GE. A phylogenetic definition of the major eubacterial taxa. Syst Appl Microbiol. 1985; 6:143-151 View ArticlePubMed
  13. Castenholz, R.W. 2001. Oxygenic photosynthetic bacteria. In: Boone D.R. and Castenholz R.W. (eds), Bergey’s Manual of Systematic Bacteriology (2nd ed.), Volume 1, Springer-Verlag, New York, pp. 473-600.View ArticlePubMed
  14. Tomaselli L, Maria Rosa P and Mario RT. On the correct use of the Spirulina designation. Algol Stud. 1996; 83:539-548
  15. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS and Eppig JT. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000; 25:25-29 View ArticlePubMed
  16. Romano I, Bellitti MR, Nicolaus B, Lama L, Manca MC, Pagnotta E and Gambacorta A. Lipid profile: a useful chemotaxonomic marker for classification of a new cyanobacterium in Spirulina genus. Phytochemistry. 2000; 54:289-294 View ArticlePubMed
  17. Cohen Z, Margheri MC and Tomaselli L. Chemotaxonomy of cyanobacteria. Phytochemistry. 1995; 40:1155-1158 View Article
  18. Cohen Z and Vonshak A. Fatty acid composition of Spirulina and Spirulina-like cyanobacteria in relation to their chemotaxonomy. Phytochemistry. 1991; 30:205-206 View Article
  19. Zarrouk C. Contribution a` l’e’tude d’une cyanophyce’e. Influence de divers facteurs physiques et chimiques sur la croissance et la photosynthe`se de Spirulina maxima Ph. D. Thesis, Universite´ de Paris. 1966.
  20. Phrap and Phred for Windows, MacOS, Linux, and Unix.
  21. Arthrospira maxima CS-328 analysis files. Web Site
  22. Markowitz VM, Mavromatis K, Ivanova NN, Chen IM, Chu K and Kyrpides NC. IMG ER: a system for microbial genome annotation expert review and curation. Bioinformatics. 2009; 25:2271-2278 View ArticlePubMed
  23. Lowe TM and Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997; 25:955-964 View ArticlePubMed
  24. Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T and Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007; 35:3100-3108 View ArticlePubMed
  25. Hongsthong A, Sirijuntarut M, Prommeenate P, Lertladaluck K, Porkaew K, Cheevadhanarak S and Tanticharoen M. Proteome analysis at the subcellular level of the cyanobacterium Spirulina platensis in response to low-temperature stress conditions. FEMS Microbiol Lett. 2008; 288:92-101 View ArticlePubMed
  26. Hongsthong A, Sirijuntarut M, Yutthanasirikul R, Senachak J, Kurdrid P, Cheevadhanarak S and Tanticharoen M. Subcellular proteomic characterization of the high-temperature stress response of the cyanobacterium Spirulina platensis. Proteome Sci. 2009; 7:33 View ArticlePubMed
  27. Hallin PF, Binnewies TT and Ussery DW. The genome BLASTatlas-a GeneWiz extension for visualization of whole-genome homology. Mol Biosyst. 2008; 4:363-371 View ArticlePubMed
  28. Ren Q and Paulsen IT. Comparative analyses of fundamental differences in membrane transport capabilities in prokaryotes and eukaryotes. PLOS Comput Biol. 2005; 1:e27 View ArticlePubMed