AbstractsBiology & Animal Science

Genome-based natural product biosynthetic gene cluster discovery : from sequencing to mining

by Hao Wang




Institution: University of Helsinki
Department: Department of Food and Environmental Sciences, Division of Microbiology and Biotechnology
Year: 2014
Keywords: microbiology
Record ID: 1145417
Full text PDF: http://hdl.handle.net/10138/42874


Abstract

Natural products are small molecules produced by a range of living organisms. They may be toxic or have pharmaceutical applications as antibiotics, anticancer, antiparasitic and anti-fungal agents. Natural products are commonly synthesized by nonribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs), such as microcystins. Ribosomal pathways in cyanobacteria are also known for the synthesis of bacteriocins, lantibiotics, cyanobactins and microviridins. Genes encoding biosynthetic enzymes of these systems are often found together and form gene clusters. The filamentous cyanobacterium Anabaena sp. strain 90, a hepatotoxin producer isolated from a bloom of a Finnish lake, was selected for genome sequencing, in order to explore its full capacity of bioactive compound production. The 5.3-Mb Anabaena sp. 90 genome displays a multi-chromosomal composition with five circular replicons: two chromosomes and three plasmids. A total of four non-ribosomal biosynthetic gene clusters, which are responsible for the production of anabaenopeptilides, anabaenopeptins, microcystins and the novel glycolipopeptides hassallidins, were identified in chromosome I. Genome annotation revealed that Anabaena sp. 90 genome also harbors an anacyclamide-encoding cyanobactin gene cluster and seven putative bacteriocin gene clusters, which belong to the ribosomal pathways. These biosynthetic gene clusters amount to a total of ~250 kb, and 5% of the genome. Analysis of the Anabaena sp. 90 genome suggested that cyanobacteria might produce bacteriocins. A thorough genome mining at the phylum level was conducted targeting the discovery of cyanobacterial bacteriocin biosynthetic pathways. The results demonstrated the common presence of bacteriocin gene clusters in cyanobacteria. A total of 145 bacteriocin gene clusters were discovered, the majority of them were previously unknown. Based on their gene organization and domain composition, these gene clusters were classified into seven groups. This classification is supported by the phylogenetic analysis, which also indicates independent evolutionary trajectories of the gene clusters in different groups. By scrutinizing the surrounding regions of these gene clusters, a total of 290 putative precursors were located. They showed diverse structures and very little sequence conservation of the core peptide. To explore the distribution of NRPSs and PKSs, a comprehensive genome-mining study was carried out and demonstrated their widespread occurrence across the three domains of life, with the discovery of 3,339 gene clusters from 991 organisms, by examining a total of 2,699 genomes. The majority of these gene clusters were found in bacteria, in which high correlation between bacterial genome size and the capacity of NRPS and PKS biosynthetic pathways was observed. Currently, PKSs are classified into three types. Type I PKSs and NRPSs are known to share a modular scheme with a multidomain structure. Surprisingly, a large number (8,906) of enzymes encoding a single NRPS or type I PKS functional…