Designing secure biosystems to protect environmental microorganisms.
Genetically engineered microorganisms (GEMs) play an important role in building and maintaining a sustainable bioeconomy. To reduce the risk of unintended ecological consequences from environmentally deployed GEMs, the Secure Biosystems Design Scientific Focus Area (SFA) at LLNL is developing built-in security mechanisms that ensure GEMs function where and when needed.
Our security mechanisms will help safeguard the deployment of engineered microbes in the rhizosphere (the narrow region of soil directly influenced by root secretions and associated soil microorganisms known as the root microbiome). Additionally, layered containment strategies at the sequence, cellular, and population levels are expected to increase the overall system robustness to environmental fluctuations.
By stabilizing GEMs and preventing the transfer of potentially invasive traits to native microbiomes, our ultimate objective is to control the niche-specific function of GEMs for safer and more effective environmental applications.
Explore this page to learn more about our research and capabilities.
Research
Multilayered containment strategies can safeguard the deployment of engineered microbes in the rhizosphere. Image courtesy of Dan Park (LLNL).
With vast potential for use in large-scale applications, GEMs are critical in building and maintaining a sustainable bioeconomy. The need for biocontainment strategies is particularly relevant for the sustainable development of bioenergy crops and carbon sequestration. The microbiota colonizing the rhizosphere of plant roots, especially plant-benefiting microorganisms (PBMs), contribute to plant growth and modulate soil carbon input, release, and storage. Genetic engineering approaches have been used to enhance the beneficial traits of PBMs, such as nutrient acquisition and drought resistance.
Our work focuses on establishing robust biocontainment strategies in soil microbes at the DNA sequence, cellular, and population levels without sacrificing microbial fitness. Leveraging LLNL’s high-performance computing (HPC) and high-throughput gene-editing capabilities, we will advance a synthetic gene entanglement strategy for containment. In this method, two genes are encoded as overlapping sequences within the same DNA molecule to protect engineered functions against mutational inactivation and mitigate the potential transfer of engineered genes to naturally occurring microbes.
Building on this layer of genetic stability, our team will incorporate additional strategies that control cellular physiology and direct population coordination to increase the overall system robustness to environmental fluctuations. With the help of the “Microbes Persist” Soil Microbiome SFA at LLNL and their rich experience in soil microbial ecology, we will evaluate the ecological benefits of these containment mechanisms in soil and rhizosphere environments. Ultimately, our biosystem designs—created to improve gene fitness, function, and evolutionary stability—will provide a knowledge base to enable safer use of GEMS in distinct applications like plant probiotics, carbon sequestration, metal recovery, nuclear-activity detection, and biomaterial production.
Capabilities
Our core capabilities in genome editing and analysis enable effective GEM containment.
Extending the DropSynth method for pooled synthesis of gene entanglement libraries and parallel testing in cells. Multiplex reactions across hundreds of thousands of emulsion droplets enable simultaneously assembling tens of thousands of genes, which can be released from the droplets and subsequently amplified by PCR or cloned into shuttle vectors en masse.
Synthesizing genes in a cheap, fast, and parallel fashion enables large-scale genetic design testing that is essential for various synthetic biology and biocontainment applications. Commercial oligonucleotide synthesis manufacturers deliver tens of thousands of short oligonucleotides (for example, 300 nucleotides in length) that can only test very short sequence designs. However, our work requires synthesis capabilities that generate genes with an average length of 1000 nucleotides, which has remained challenging and expensive.
The DropSynth method leverages picoliter emulsion-based gene synthesis to produce gene libraries with thousands of designs of up to 700 base pairs at prices cheaper than commercial gene synthesis. We use this method to produce libraries of hundreds to thousands of gene-sized sequences to test the sequence designs from our computational design pipelines for sequence entanglement studies. We are also developing compatible protocols that leverage DropSynth for a variety of downstream high-throughput screening applications. Our team is generating hundreds to thousands of entangled sequences to be tested simultaneously through homologous recombination en masse in our target organism of interest.
Our project requires gene-editing tools that enable stable, efficient, and rapid genome modification to enhance native capabilities, introduce new capabilities, and implement biocontainment approaches. We will develop genome engineering tools for the environmental Pseudomonas species, including classic (transposon mutagenesis and gene removal/replacement using homologous recombination) and modern (recombineering, CRISPRi, and CRISPR/Cas) approaches. CRISPR-based gene-editing tools have been developed for P. aeruginosa and P. putida, but have yet to be successfully applied in P. fluorescens.
Our team’s experience places us at the forefront of gene editing and circuit design in environmental microorganisms, which is critical for realizing our layered containment approach. Co-principal investigator Jeffrey Gralnick’s laboratory developed recombineering and CRISPR-Cas counter-selection tools in Shewanella oneidensis, a close relative of environmental Pseudomonads. In addition, the LLNL team has extensive experience in genome editing in a diverse range of Gram-negative bacteria, including E. coli, Caulobacter crescentus, Rhodopseudomonas palustris, and Ralstonia eutropha.
Identifying regulatory elements, such as promoters and ribosome binding sites (RBS), that function in a broad or host-specific manner enables the design of gene regulatory circuits that produce species-/genus-specific outputs, thereby providing an additional framework for safeguarding against horizontal gene transfer. We developed in vivo and in vitro platforms that enable more than ten thousand metagenomically mined regulatory sequences to be quantitatively assayed for transcriptional and translational activity in multiplexed format in a diverse range of microorganisms.
For in vivo assays, regulatory sequences from annotated genomes are synthesized as an oligo library that includes unique barcodes, flanking restriction sites, and amplification sequences. The oligo library is amplified and cloned as a pool into species-specific vectors for transformation into recipients. RNA-seq, DNA-seq, and FACS-seq enable us to quantify transcription and translation activity. This approach is useful in biocontainment, as is evidenced by the finding of differences in the capacity of B. subtilis, E. coli, and P. aeruginosa to utilize exogenous regulatory sequences. This knowledge provides a proof-of-concept for circuit design with species-specific output patterns.
To extend the regulatory element characterization to microbes without an established transformation protocol, we developed a high-throughput cell-free framework—the DNA regulatory element analysis by cell-free transcription and sequencing (DRAFTS)—that produces transcription measurements highly correlated with those from the in vivo platform. DRAFTS enabled thousands of regulatory elements to be characterized across ten diverse species relevant to biotechnology and the gut microbiome. Leveraging LLNL’s extensive expertise in cell-free systems and in vivo screening, we are extending the regulatory element characterization to a range of Pseudomonads under conditions relevant to soils. Our goal is to identify promoters and RBS that function robustly in narrow or broad host ranges.
FAST is an LLNL-developed, open source, deep learning software toolkit for learning experimentally derived protein-ligand binding affinity values from 3D structures. (SG=spatial graph, CNN=convolutional neural network. Nodes in the spatial graph are heavy atoms.) We will extend this modelling approach to learn 3D-protein structure configurations associated with the positive outcomes of successfully engineered sequences for entanglement.
We leveraged LLNL’s experience in structural and computational biology to augment the Constraining Adaptive Mutations using Engineered Overlapping Sequences (CAMEOS) algorithm with protein structural constraints and design specifications obtained from available crystal structures. However, representing 3D structural features for modeling can be challenging given the complexity of protein structures. A potential unique value of deep learning is to simultaneously learn the relevant features and the experimentally derived response function.
The LLNL-developed, open source, deep learning software toolkit called fusion models for atomic and molecular structures (FAST) learns experimentally derived binding affinity values from the 3D atomistic representation of a protein-small molecule interaction. We will extend this modelling approach to learn 3D protein-structure configurations associated with the positive outcomes of successfully engineered entangled sequences. Input is the 3D homology model of the essential or functional protein and the algorithm encodes a spatial graph to capture pairwise interactions and a 3D voxel cube to capture additional spatial relationships. The model learns a compressed protein representation to predict viable or non-viable protein configurations.
A platform that builds on LLNL’s recent advances in machine learning, design optimization, simulations, and experimental assays to support LLNL’s high-performance computing (HPC) mission: building new methods for applying HPC to solve scientific problems using state-of-the-art machine learning algorithms. We will use this platform to develop an iterative active-learning framework for sequence entanglement design.
LLNL established a framework for applying HPC to solve scientific problems using machine learning (ML) algorithms with feedback from wet-lab experiments and computational simulations. We demonstrated an example of this framework with a drug discovery pipeline, which searches through chemical space to find molecules that satisfy the design criteria for anti-cancer therapies and then evaluates candidates using property-prediction models. The loop represents the search process as an iterative search through chemical space, with break points to collect feedback from physical- or simulation-based experiments. One of the innovations of this approach is to evaluate whether a more efficient search can be done when chemical space is converted into a continuous feature space that is amenable to gradient-based optimization algorithms.
Collecting data on new molecules is a critical component, with a ML loop guiding the collection of new experimental feedback and validation for new molecules that fall outside of the domain of applicability of the initial training data. LLNL’s HPC clusters support every aspect of this platform, including the extensive hyperparameter search for deep learning model training, the training of autoencoders for the Generative Molecular Design loop on tens to hundreds of millions of compounds, and the iterative multi-property prediction design search—which can scale to search the design space in parallel on a large cluster.
We are developing an iterative active-learning pipeline by which ML models acquire and guide new experimental data on entanglement. By iteratively including newly generated data (fitness, expression), we augment and refine the generative models that form the basis of CAMEOS to increase the success of double-encoding entanglement designs. Additionally, these methods can be adapted to work without the acquisition of new data, enabling in silico screening of designed variants, thereby further improving the likelihood of synthesizing functional entangled sequences as experimental validation data is collected.
Leveraging the extensive expertise of our co-principal investigators William Bentley and Gregory Payne, we will employ and advance microfluidic platforms to meet various conditions and needs in evaluating the effectiveness of our containment circuits. These platforms provide a quantitative means to study spatially and temporally heterogenous environments that are characteristic of soils and the rhizosphere. Coupled with real-time, high-resolution microscopy, we will be able to examine how steady-state, non-steady state, and fluctuating concentration gradients of environmental variables affect containment in a spatially resolved fashion.
The use of transparent soils, multiple hydrogel layers, and 2D fluidic systems, will allow us to further examine how different spatial arrangements influence containment. We can track cell migration and quantify swimming parameters using simple CCD camera-based approaches developed by our team. Furthermore, our microfluidic devices can modulate redox context and detect electrochemical signals that can be correlated with synthetic biology inputs and outputs as a novel non-optical imaging modality to probe sensing and containment dynamics in real time.
Publications
Comparison of phage-derived recombinases for genetic manipulation of Pseudomonas species | BioRvix, 2023
M. J. Kalb, A.W. Grenfell, A. Jain, J. Fenske-Newbart, J.A. Gralnick
Electrogenetic signaling and information propagation for controlling microbial consortia via programmed lysis | Biotechnol. Bioeng., 2023
E. VanArsdale, A. Navid, M.J. Chu, T.M. Halvorsen, G.F. Payne, Y. Jiao, W.E. Bentley, M.C. Yung
Fast and efficient template-mediated synthesis of genetic variants | Nature Methods, 2023
L. Liu, Y. Huang, H.H. Wang
Prolonging genetic circuit stability through adaptive evolution of overlapping genes | Nucleic Acids Research, 2023
J.L. Chlebek, S.P. Leonard, C.S. Kang-Yun, M.C. Yung, D.P. Ricci, Y. Jiao, D.M. Park
Comparison of kill switch toxins in plant-beneficial Pseudomonas fluorescens reveals drivers of lethality, stability, and escape | ACS Synth. Biol., 2022
T.M. Halvorsen, D.P. Ricci, D.M. Park., Y. Jiao, and M.C. Yung
Electrogenetic signal transmission and propagation in coculture to guide production of a small molecule, tyrosine | ACS Synth. Biol., 2022
E. VanArsdale, J. Pitzer, S. Wang, K. Stephens, C-Y. Chen, G.F. Payne, and W.E. Bentley
Learning protein fitness models from evolutionary and assay-labeled data | Nat. Biotechnol., 2022
C. Hsu, H. Nisonoff, C. Fannjiang, and J. Listgarten
Mediated electrochemistry for redox-based biological targeting: entangling sensing and actuation for maximizing information transfer (Review) | Curr. Opin. Biotechnol., 2021
D. Motabar, J. Li, G.F. Payne, and W.E. Bentley
Electronic signals are electrogenetically relayed to control cell growth and co-culture composition | Met. Eng. Comm., 2021
K. Stephens, F.R. Zakaria, E. VanArsdale, G.F. Payne, and W.E. Bentley
Mediated Electrochemical Probing: A Systems-Level Tool for Redox Biology | ACS Chem. Biol., 2021
Z. Zhao, E.E. Ozcan, E. VanArsdale, J. Li, E. Kim, A.D. Sandler, D.L. Kelly, W.E. Bentley, and G.F. Payne
Interactive Materials for Bidirectional Redox‐Based Communication | Adv. Mater., 2021
J. Li, S.P. Wang, G. Zong, E. Kim, C.-Y. Tsao, E. VanArsdale, L.‐X. Wang, W.E. Bentley, and G.F. Payne
Synthetic sequence entanglement augments stability and containment of genetic information in cells | Science, 2019
T. Blazejewski, H-I. Ho, and H.H. Wang
Team
Our multi-institutional team includes experts in bioscience and biotechnology.
LLNL

Jonathan Bethke

Chenling Xu

Bentley Lim
Columbia University
University of California, Berkeley
University of Pittsburgh
Previous members
- Jennifer Pett-Ridge (LLNL)
- Adam Zemla (LLNL)
- Ali Navid (LLNL)
- Matt Coleman (LLNL)
- Jose Manuel Marti Martinez (LLNL)
- Tiffany Halvorsen (LLNL)
- Fangchao Song (LLNL)
- Tomasz Blazejewski (Columbia University)
- Guillaume Urtecho (Columbia University)
- Carlotta Ronda (Columbia University)
- Charlotte Rochereau (Columbia University)
- Jinyang Li (University of Maryland)
- Eric VanArsdale (University of Maryland)
- William Bentley (University of Maryland)
- Gregory Payne (University of Maryland)
- Rahma Zakaria (University of Maryland)
- Jeffrey Gralnick (University of Minnesota)
- Madison Kalb (University of Minnesota)
- Chloe Hsu (UC Berkeley)
- Akosua Busia (UC Berkeley)