First bacterial genome created entirely with a computer
Caulobacter crescentus is a harmless bacterium living in fresh water throughout the world (electron microscope image).
All the genome sequences of organisms known throughout the world are stored in a database belonging to the National Center for Biotechnology Information in the United States. As of today, the database has an additional entry: Caulobacter ethensis-2.0. It is the world's first fully computer-generated genome of a living organism, developed by scientists at ETH Zurich. However, it must be emphasised that although the genome for C. ethensis-2.0 was physically produced in the form of a very large DNA molecule, a corresponding organism does not yet exist.
C. ethensis-2.0 is based on the genome of a well-studied and harmless freshwater bacterium, Caulobacter crescentus, which is a naturally occurring bacterium found in spring water, rivers and lakes around the globe. It does not cause any diseases. C. crescentus is also a model organism commonly used in research laboratories to study the life of bacteria. The genome of this bacterium contains 4,000 genes. Scientists previously demonstrated that only about 680 of these genes are crucial to the survival of the species in the lab. Bacteria with this minimal genome are viable under laboratory conditions.
Beat Christen, Professor of Experimental Systems Biology at ETH Zurich, and his brother, Matthias Christen, a chemist at ETH Zurich, took the minimal genome of C. crescentus as a starting point. They set out to chemically synthesise this genome from scratch as a continuous ring-shaped chromosome. The task is considered to be a true tour de force: The chemically synthesised bacterial genome presented 11 years ago by the American genetics pioneer Craig Venter was the result of 10 years of work by 20 scientists, according to media reports. The cost of the project is said to have totaled $40 million.
Rationalising the production process
While Venter's team made an exact copy of a natural genome, the researchers at ETH Zurich radically altered their genome using a computer algorithm. Their motivation was twofold: one, to make it much easier to produce genomes, and two, to address fundamental questions of biology.
To create a DNA molecule as large as a bacterial genome, scientists must proceed step by step. In the case of the Caulobacter genome, the scientists at ETH Zurich synthesised 236 genome segments, which they subsequently pieced together. "The synthesis of these segments is not always easy," explains Matthias Christen. "DNA molecules not only possess the ability to stick to other DNA molecules, but depending on the sequence, they can also twist themselves into loops and knots, which can hamper the production process or render manufacturing impossible," explains Matthias Christen.
Simplified DNA sequences
To synthesise the genome segments in the simplest possible way, and then piece together all segments in the most streamlined manner, the scientists radically simplified the genome sequence without modifying the actual genetic information (at the protein level). There is ample latitude for the simplification of genomes, because biology has built-in redundancies for storing genetic information. For example, for many protein components (amino acids), there are two, four or even more possibilities to write their information into DNA.
The algorithm developed by the scientists at ETH Zurich makes optimal use of this redundancy of the genetic code. Using this algorithm, the researchers computed the ideal DNA sequence for the synthesis and construction of the genome, which they ultimately utilised for their work.
As a result, the scientists seeded many small modifications into the minimal genome, which in their entirety are, however, impressive: more than a sixth of all of the 800,000 DNA letters in the artificial genome were replaced, compared to the "natural" minimal genome. "Through our algorithm, we have completely rewritten our genome into a new sequence of DNA letters that no longer resembles the original sequence. However, the biological function at the protein level remains the same," says Beat Christen.
Litmus test for genetics
The rewritten genome is also interesting from a biological perspective. "Our method is a litmus test to see whether we biologists have correctly understood genetics, and it allows us to highlight possible gaps in our knowledge," explains Beat Christen. Naturally, the rewritten genome can contain only information that the researchers have actually understood. Possible "hidden" additional information that is located in the DNA sequence, and has not yet been understood by scientists, would have been lost in the process of creating the new code.
For research purposes, the scientists produced strains of bacteria that contained both the naturally occurring Caulobacter genome and also segments of the new artificial genome. By turning off certain natural genes in these bacteria, the researchers were able to test the functions of the artificial genes. They tested each one of the artificial genes in a multistep process.
In these experiments, the researchers found out that only about 580 of the 680 artificial genes were functional. "With the knowledge we have gained, it will, however, be possible for us to improve our algorithm and develop a fully functional genome version 3.0," says Beat Christen.
Enormous potential for biotechnology
"Even though the current version of the genome is not yet perfect, our work nevertheless shows that biological systems are constructed in such a simple manner that in the future, we will be able to work out the design specifications on the computer according to our goals, and then build them," says Matthias Christen. And this can be accomplished in a comparatively straightforward way, as Beat Christen emphasises: "What took ten years with Craig Venter's approach, our small group achieved with our new technology within the time frame of one year with manufacturing costs of 120,000 Swiss francs."
"We believe that it will also soon be possible to produce functional bacterial cells with such a genome," says Beat Christen. Such a development would hold great potential. Among the possible future applications are synthetic microorganisms that could be utilised in biotechnology for the production of complex pharmaceutically active molecules or vitamins, for example. The technology can be employed universally for all microorganisms, not just Caulobacter. Another possibility would be the production of DNA vaccines.
"As promising as the research results and possible applications may be, they demand a profound discussion in society about the purposes for which this technology can be used and, at the same time, about how abuses can be prevented," says Beat Christen. It is still not clear when the first bacterium with an artificial genome will be produced—but it is now clear that it can and will be developed. "We must use the time we have for intensive discussions among scientists, and also in society as a whole. We stand ready to contribute to that discussion, with all of the know-how we possess."