Main 

Performing computation in a living cell will revolutionize biotechnology by improving existing processes and enabling new applications. In the short term, the production of bio-based chemicals can be improved by timing gene expression at different stages of fermentation or by turning on an enzyme only under particular conditions (e.g., high cell density)1,2,3,4,5,6. As circuits become more advanced, entire algorithms from control theory could be implemented to improve biochemical production7,8,9,10,11,12,13,14,15,16 (Fig. 1a). Synthetic regulation is also an important tool for the discovery of natural products including pharmaceuticals, insecticides and entirely new classes of chemicals. Accessing these products may require synthetic regulation because many of the relevant gene clusters are 'silent', meaning that the conditions under which they are induced are unknown17,18,19,20,21,22. Outside of the fermenter, living cells could be programmed to serve as therapeutic agents that correct genetic disease (Fig. 1b) or colonize niches in the human microbiome to perform a therapeutic function23,24,25,26,27,28,29,30,31,32,33,34,35 (Fig. 1c). Longer-term applications include 'smart' plants that sense and adapt to environmental challenges (Fig. 1d) and bacteria that organize to weave functional materials with nanoscale features36,37,38,39,40,41,42.
在活细胞中进行计算将彻底改变生物技术,通过改善现有过程并启用新应用。在短期内,通过在发酵的不同阶段调控基因表达或仅在特定条件下(例如,高细胞密度)激活酶,可以提高生物基化学品的生产 1,2,3,4,5,6。随着电路的不断进步,控制理论中的整个算法可以被实施,以改善生化生产 7,8,9,10,11,12,13,14,15,16(图 1a)。合成调控也是发现天然产物的重要工具,包括药物、杀虫剂和全新类别的化学品。获取这些产品可能需要合成调控,因为许多相关的基因簇是“沉默”的,这意味着诱导它们的条件尚不清楚 17,18,19,20,21,22。在发酵罐外,活细胞可以被编程为治疗剂,以纠正遗传疾病(图 1b)或在人体微生物组中定殖特定生态位以执行治疗功能 23,24,25,26,27,28,29,30,31,32,33,34,35(图 1c)。 长期应用包括能够感知并适应环境挑战的“智能”植物(图 1d)以及能够组织起来编织具有纳米尺度特征的功能材料的细菌 36,37,38,39,40,41,42。

Figure 1: Potential uses of synthetic genetic circuits.
图 1:合成基因电路的潜在用途。
figure 1

All of these examples are hypothetical and have not yet been realized. (a) A circuit controlling the production of a diesel-fuel alternative (bisabolane221). The circuit reduces accumulation of a toxic intermediate (HMG-CoA) by sensing the bisabolane sugar precursor and oscillating the production of HMGR222. (b) Gene therapy circuits based on CRISPRi technology. The circuit detects two single-nucleotide polymorphisms associated with colon cancer susceptibility (rs4444235 and rs9929218)223 and uses a tissue-specific promoter (pAMUC2) to control the expression of Cas9 and misregulated genes (DLGAP5, NO3 (NBL1) and DDX28). (c) Commensal bacteria programmed to stabilize pH in the human stomach to treat gastoesophageal acid reflux. The bacteria use a proportional integral controller224, whose output is a proton pump inhibitor, to achieve set-point control over stomach pH. The circuit also restricts acid regulation to the stomach by terminating the bacterium via an irreversible switch that turns on a bacterial toxin (CcdB) when it leaves this organ225. (d) Genetic circuits in 'smart' plants that sense environmental stimuli and implement a response. The circuit, built into chloroplasts, integrates sensors for drought (pSpark), temperature (pCBF) and plant maturity (pSAG12) to control pesticide (Bt) production and drought tolerance (IPT).
所有这些例子都是假设性的,尚未实现。(a) 控制柴油燃料替代品(bisabolane221)生产的电路。该电路通过感知 bisabolane 糖前体并调节 HMGR222 的生产,减少有毒中间体(HMG-CoA)的积累。(b) 基于 CRISPRi 技术的基因治疗电路。该电路检测与结肠癌易感性相关的两个单核苷酸多态性(rs4444235 和 rs9929218)223,并使用组织特异性启动子(pAMUC2)来控制 Cas9 和失调基因(DLGAP5、NO3(NBL1)和 DDX28)的表达。(c) 程序化的共生细菌,用于稳定人类胃中的 pH 值以治疗胃食管酸 reflux。细菌使用比例积分控制器 224,其输出为质子泵抑制剂,以实现对胃 pH 值的设定点控制。该电路还通过在细菌离开该器官时通过不可逆开关终止细菌,限制酸的调节到胃中,该开关会激活细菌毒素(CcdB)225。(d) 在“智能”植物中,基因电路感知环境刺激并实施响应。 该电路嵌入在叶绿体中,集成了用于干旱(pSpark)、温度(pCBF)和植物成熟(pSAG12)的传感器,以控制农药(Bt)生产和抗旱性(IPT)。

Despite its potential, genetic circuit design remains one of the most challenging aspects of genetic engineering43. The earlier fields of protein and metabolic engineering have yielded tools to optimize enzymes and fluxes through a metabolic network. These tools include computational methods that can predict the impact of an amino acid substitution on protein thermostability44 or the distribution of flux through modified metabolic networks45. Biotech companies often have research groups dedicated to protein and metabolic engineering that have specialized training in these tools. However, industrial groups dedicated to building synthetic regulation are rare, and even simple tasks, such as building a switch or inducible system, tend to be one-off projects performed by a nonspecialist.

Several features of genetic circuits make them challenging to work with, relative to other areas of genetic engineering. First, circuits require the precise balancing of their component regulators to generate the proper response46,47. Computational tools and part libraries that enable the tuning of expression levels have been developed only recently48,49,50,51. Before this, only course-grained control was achievable with small sets of parts46,47,52. Second, many circuits are difficult to screen in directed-evolution experiments for correct performance. Digital logic has clear ON and OFF states that can form the basis for a screen12,53,54,55,56,57,58,59. However, screening for dynamic circuits, such as oscillators, is significantly more complex60, and it is hard to imagine how screens would be established for more sophisticated functions, such as a PID (proportional integral derivative) controller with proscribed response properties. Third, there are few tools to measure circuit performance. Typically, a fluorescent reporter is used to measure the output, but fluorescence detection requires artificially high expression, and fluorescent protein degradation rates can limit the ability to measure dynamics. Fourth, synthetic circuits are very sensitive to environment, growth conditions and genetic context in ways that are poorly understood61. Finally, the process of building a large genetic circuit requires the assembly of many DNA parts, and this process has been both technically challenging (until recently) and fraught with its own sources of errors58,62,63,64,65,66,67.

The purpose of this Review is to serve as a guide to designing a prokaryotic transcriptional circuit, in which both the inputs and outputs are promoters53,55,68,69,70,71. Transcriptional circuits maintain a common signal carrier, which simplifies the connection of circuits to build more sophisticated operations72. Post-transcriptional circuits, including those based on protein and RNA interactions, are covered in other excellent reviews73,74,75. Although the majority of this guide is dedicated to bacterial circuits, many of the principles, albeit not the details, are relevant for eukaryotes, including human cells and plants76,77.

Genetic circuit design based on different regulator classes

Transcriptional circuits function by changing the flow of RNA polymerase (RNAP) on DNA. There are a number of regulators that influence this flux that have been used as the basis for building synthetic circuits (Fig. 2). For example, DNA-binding proteins can recruit or block RNAP to increase or decrease the flux, respectively. Analogously, the CRISPRi system uses the Cas9 protein to bind to the DNA and alter transcription78,79. RNAP flux can also be altered with invertases that change the orientation of promoters, terminators or gene sequences. Additionally, RNA translational repressors, such as RNA-IN/OUT, can be converted to control RNAP flux80,81. In this section, we describe recent advances in these methods and analyze the impact that each regulator has on circuit response.

Figure 2: Logic gates built on the basis of different regulator types.
figure 2

(ad) All gates are transcriptional, with two input promoters (PIN1 and PIN2) and one output promoter (POUT). The graphs show how the gates respond to inputs introduced at the same time (center) or sequentially (right). In all panels, the ON state is assumed to generate tenfold higher response than the OFF state. (a) Top, NOR gate based on a repressor that binds DNA110. The response curves are based on measured induction (τ1/2 ≈ 36 min) and relaxation (τ1/2 ≈ 35 min) half-lives226. Bottom, AND gate based on an activator that requires a second protein to be active55. The responses are based on measured induction (τ1/2 ≈ 36 min) and approximate relaxation (τ1/2 ≈ 35 min) half-lives55. (b) Top, NOR gate based on integrases that flip two terminators to turn off the output122,123. The responses are based on an on rate of 1.8 h (refs. 119,121,122). Bottom, AND gate based on integrases122. (c) Hypothetical NOR gate based on CRISPRi. Cas9 is expressed constitutively, and two input promoters drive the expression of two sgRNAs. The lines are based on measured induction (τ1/2 = 35 min) and relaxation (τ1/2 = 47 min) half-lives79. (d) NOR gate based on the RNA-IN/RNA-OUT system80. RNA-OUT represses translation of tnaC, which allows Rho to bind the mRNA and repress transcription of the output. The response lines are based on theoretical induction (τ1/2 ≈ 30 min) and relaxation (τ1/2 ≈ 35 min) half-lives162,226.

DNA-binding proteins. Many families of proteins can bind to specific DNA sequences (operators). The simplest way to use these proteins as regulators is to design promoters with operators that block the binding or progression of RNAP. Such repressors have been built out of zinc-finger proteins82, transcription activator–like effectors83,84, TetR homologs71, phage repressors85,86 and LacI homologs87. A core set of three repressors were used to build many of the first synthetic circuits (CI, TetR, LacI)47,53,88,89,90,91. However, recently there have been efforts to expand the number of DNA-binding proteins that are available for circuit design54,92,93,94,95,96,97,98,99. Expanding protein libraries can be challenging because each repressor has to be orthogonal; i.e., only interact with their operators and not the others in the set. Because of their simple function, repressors are relatively easy to move between species, including to eukaryotes92,93,94,95,96,97. DNA-binding proteins can also function as activators that increase the flux of RNAP on DNA. Recent efforts have increased the number of such proteins that are available for constructing circuits54,98,99,100.

Many logic gates have been constructed with DNA-binding proteins71,101,102,103,104,105,106,107,108,109. For example, NOT and NOR gates have been built by connecting input promoter(s) to a repressor that turns off an output promoter47,53,71,88,110 (Fig. 2a). Other types of transcriptional logic gates have been built using pairs of proteins in which one either activates or inhibits the other. For example, AND gates have been built with artificially split proteins111 and activators that require chaperones55,101 (Fig. 2a). Similarly, NAND gates can be built with proteins that block the activity of an activator, such as anti-σ factors, which inhibit σ factors100.

DNA-binding proteins have also been used to build circuits that incorporate positive and negative feedback loops, which form the basis for dynamic circuits, such as pulse generators112, bistable switches47,53,113 and oscillators70,88,114,115,116. Analog circuits, which allow complex computational functions to be generated with fewer regulators, have also been built with DNA-binding proteins. For example, two or three transcription factors can be used to build an adder or a ratiometer103.

There are also several challenges in using DNA-binding proteins to build circuits. Individual transcription factors may appear nontoxic, but often a combination of multiple regulators can lead to acute toxicity. The circuits can also be very dependent on growth rate because differences in the dilution rate change how quickly regulators accumulate or degrade, which alters their steady-state concentration, ultimately affecting their response. Finally, the response functions are often suboptimal and difficult to control because they have high OFF states (meaning they generate significant transcriptional signals in the OFF state) and low dynamic ranges.

Recombinases. Recombinases are proteins that can facilitate the inversion of DNA segments between binding sites117. Site specific recombinases often mediate 'cut-and-paste' recombination, during which DNA is looped, cleaved and religated118. Two types of recombinases have been used to build genetic circuits. The first is tyrosine recombinases, such as Cre, Flp and FimBE, which require host-specific factors69,119,120,121. These recombinases can be reversible and flip the DNA in both directions, or irreversible and flip in only a single direction. The second class of recombinases is serine integrases, which catalyze unidirectional reactions that rely on double-strand breaks to invert DNA. Serine integrases typically do not require host factors and often have cognate excisionases that can be expressed independently to return the DNA to its original orientation.

Recombinases have been used to build switches119, memory circuits120,121, counters69 and logic gates122,123. These proteins are ideal for memory storage because they flip DNA permanently, and once the DNA is flipped, its new orientation is maintained without the continuous input of materials or energy. In recombinase logic gates, these discrete physical states of the DNA can correspond to ON and OFF states (1 and 0). However, using recombinases can be challenging because their reactions are slow (requiring 2–6 h) and often generate mixed populations when targeting a multicopy plasmid121. Reversible recombinases can also generate mixed populations; however, this limitation was overcome recently when a unidirectional serine integrase was used to flip DNA in one direction and an integrase-excisionase pair was used to return it to the original state124.

All two-input gates, including AND and NOR logic, have been constructed using orthogonal serine integrases122,123 (Fig. 2b). The gates are organized such that two input promoters express a pair of orthogonal recombinases, which change RNAP flux by inverting unidirectional terminators, promoters or entire genes. These gates are based on unidirectional serine integrases without excisionases and therefore operate as memory circuits that record exposure to two input signals. Once flipped, the circuits cannot be returned to their original state; therefore, the gates do not distinguish the order in which they were exposed to the inputs or even whether the inputs occurred at the same time. To overcome this limitation, rewritable switches could be used to build logic gates that respond transiently to pulses of inputs. To do this, one recombinase is constitutively expressed to maintain the state, and the other is induced in response to an input signal.

CRISPRi. Clustered, regularly interspaced, short palindromic repeat (CRISPR) arrays function as a bacterial 'immune system' that targets specific DNA sequence motifs for degradation125. CRISPR systems use a Cas (CRISPR-associated) nuclease and guide RNA to introduce double-strand breaks to specific DNA sequences126. Mutant Cas proteins (such as dCas9 (ref. 79) and Cas9N− (ref. 127)) that do not have nuclease activity have been developed and used as transcription factors that knock down gene expression by forming a DNA bubble that interferes with RNAP activity78,79. CRISPR can also activate transcription by fusing an RNAP recruiting domain to catalytically inactive Cas9 (refs. 78,127,128,129,130,131). One advantage of CRISPR interference (CRISPRi) is the designability of the RNA-DNA complex. It is possible to imagine creating a very large set of orthogonal guide sequences that target different promoters. This set would enable the construction of large genetic circuits, but it would need to be experimentally screened because predicting guide RNA orthogonality is complicated132,133,134,135.

CRISPRi is still relatively new, and NOT gates are the most complex circuits built to date79. The NOT gates induce synthetic guide RNA (sgRNA) and dCas9 expression simultaneously to repress transcription at an output promoter. In theory, a NOR gate could be created by introducing a second sgRNA that targets the same output promoter (Fig. 2c). In general, the properties of CRISPRi circuits will probably resemble DNA-binding protein circuits. Circuits based on CRISPRi are expected to operate on timescales similar to those of protein-based circuits because of the stability of the regulatory dCas9-sgRNA-DNA complex79.

A current challenge in implementing CRISPRi circuits is toxicity, which is difficult to control. Toxicity could be the result of Cas9 binding to the host genome at protospacer-adjacent motifs (such as NGG), forming bubbles that deleteriously affect host gene expression136,137. It appears that this nonspecific binding occurs when a guide RNA is absent; therefore, one of the roles of the RNA is to repel Cas9 from off-target sequences. Toxicity is less noticeable when Cas9 is used as a nuclease because the RNA is in excess, but in a circuit Cas9 would need to be able to be carried in an RNA-free state before the gate is turned on. Another consideration for building CRISPRi circuits is retroactivity138, which could arise from using Cas9 as a shared resource (see “Common failure modes from connecting circuits” below). One way to circumvent retroactivity would be to express multiple orthogonal Cas9 homologs132,139.

Adapted RNA-IN/OUT. The RNA-IN/OUT system from Escherichia coli represses translation of a target protein when a short noncoding RNA (RNA-OUT) is expressed. In the natural system, RNA-OUT binds to a specific sequence at the 5′ end of an mRNA (RNA-IN) to occlude ribosome binding and increase mRNA degradation140,141,142. Arkin and coworkers retooled this system to repress transcription, instead of translation, using a transcriptional adaptor from the tna operon80. The tna regulatory element is composed of a ribosome-binding site (RBS), the coding sequence for a short peptide called TnaC, a Rho factor–binding site and an RNAP pause site that facilitates Rho-mediated transcription termination. Translation of tnaC causes ribosomal stalling, which blocks Rho factor binding and allows RNAP to transcribe genes downstream of tnaC. However, when translation of tnaC is prohibited by the RNA-IN/OUT system, Rho binds the growing mRNA and knocks off RNAP, thereby inhibiting transcription elongation. As with CRISPRi, the adapted RNA-IN/OUT system could be used to generate a large set of orthogonal regulators because it is based on designable RNA-RNA interactions. To date, more than 150 different families of at least seven orthogonal RNA-IN/OUT mutants have been designed using the RNA-IN/OUT model, and all of the mutants tested experimentally have been functional and orthogonal81.

Adapted RNA-IN/OUT has been used to build two-, three- and four-input NOR gates80 (Fig. 2d). In these systems, orthogonal RNA-IN variants were connected such that expression of any cognate RNA-OUT represses transcription of the output gene. Additional layers of regulation could be engineered into the adapted RNA-IN/OUT system with ligand-responsive aptamers that regulate RNA-OUT activity143 or tRNAs that control ribosomal pausing in tnaC144. A challenge in building larger RNA-IN/OUT circuits is that each transcriptional regulator requires the same tna regulatory element (290 bp). The reuse of this part in multiple circuits could lead to homologous recombination (see below). Engineering tnaC to reduce the length of the repeated sequence80 or using homologs from other organisms and alternative Rho-binding sites could potentially attenuate recombination.

Selecting parts to tune the circuit response

Genetic circuits need to be tuned to meet the specifications required for a particular application. For example, a large dynamic range may be required to strongly activate a pathway. Similarly, low OFF states are desirable when expressing toxic proteins145. When the first synthetic circuits were built, there were few options available for tuning circuits and only course-grained changes were possible46,47. New libraries of well-characterized parts and computational tools have made it easier to design and tune genetic circuits. Moreover, new classes of insulators improve the reliability of these parts when they are placed in the local genetic context of a circuit. Additional biochemical tools, such as small RNA (sRNA), have been incorporated into circuits in order to provide more tuning knobs. In a prior review, we detailed advances in part design and tools that allow engineers to obtain reliable expression levels146. Here we show how the selection or modification of different parts affects the response of a circuit.

Two circuits are used as model systems to demonstrate the effects of various tuning knobs. The first, a NOT gate, represents a simple logic operation46,53 (Fig. 3a). Logic gates are often characterized by their response function, which captures how the steady-state output changes as a function of input. The shape of this function is defined by: (i) the ON and OFF states, which define the circuit's dynamic range, (ii) the amount of input required to reach the half-maximum output (also referred to as the threshold) and (iii) the cooperativity of the switch147,148. We selected an oscillator as an example of a dynamic circuit (Fig. 3h). These types of circuits can be very difficult to tune because they need to be balanced in a narrow region of parameter space in order to function properly90,149,150. For an oscillator, tuning will affect the period, amplitude and shape of the oscillations. Tuning can also force the system out of the oscillating parameter space and cause the circuit to fail90.

Figure 3: Methods of modifying circuit behavior.
figure 3

(an) Ordinary differential equation models were used to simulate a NOT gate (a) and an oscillator (h) (for model equations and parameters in SBML format205,227, see Supplementary Notes 1 and 2); parameters were adjusted (bg and in) to show the effect of each tuning knob on circuit performance. Black, original parameters; gray, tuning knob variations. Inputs in af are isopropyl β-d-1-thiogalactopyranoside (IPTG). (b,i) Promoter strength is increased (dashed gray line) or decreased (solid gray line) by a factor of 2. (c,j) Enzymatic degradation of the reporter protein was modeled as a fivefold increase in the protein degradation rate228. (d,k) Gene dosage. The circuits are moved between a high-copy plasmid ten times more abundant than in the original circuit (dashed gray line) and the genome (solid gray line) to tune expression. (e,l) RBS strength. Repressor RBSs (RBS 1) are increased (dashed gray line) or decreased (solid gray line) by a factor of 5. (f,m) Small RNAs designed to bind repressor mRNA with the same affinity as a ribosome (this value was chosen arbitrarily and can be modulated to change circuit dynamics). In this model, sRNAs are produced constitutively, and sRNA-mRNA duplexes are degraded faster than either RNA alone. (g,n) Decoy operators that bind repressor proteins with the same dissociation constant Kd as the repressible promoter.

The response function of a digital logic gate can be shifted up or down by changing promoter strengths151 (Fig. 3b), RBS strengths or the proteins' degradation rates152 (Fig. 3c). Promoter strength can be altered with mutations in the promoter sequence153 or by selecting new promoters from a characterized library49,154. Increased degradation can be achieved with protease tags or N-terminal degrons152. Circuit components are often distributed between multiple plasmids at different copy numbers in order to synthesize each component at the necessary level. However, when entire circuits are expressed on one plasmid, copy number can be shifted to simultaneously alter the circuit's dynamic range and threshold155 (Fig. 3d).

The threshold of the gate can be changed via several methods. Selecting a stronger or weaker RBS, adding multiple operators or changing operator positions within a promoter can change the threshold59,71,156,157 (Fig. 3e). The threshold of a gate becomes steeper and more switch-like when small changes in the input have a large effect on the output158. Increased cooperativity makes connecting gates easier by decreasing the range of input needed from an upstream circuit to span the induction threshold of the next circuit in the series. One way to make a gate more switch-like is to change the cooperativity of repressor binding to the promoter or to introduce DNA looping159,160. Another approach is to express a sequestering molecule that binds a circuit component and prevents it from functioning. Sequestration has been achieved using sRNAs that bind to mRNA161,162 (Fig. 3f), proteins that bind to transcription factors113,163,164, and decoy DNA operators that titrate the transcription factor away from the output promoter165 (Fig. 3g).

In an oscillator, parts that affect the rate of gene expression change the amplitude of the response and can shift the period (Fig. 3i,l). Rapid protein degradation is critical for dynamic circuits to function correctly. If proteins are slow to degrade, then the circuit may slow down or stop functioning altogether166 (Fig. 3j). Protease tags can be used to decrease the degradation rate from several hours to 20 min, which will increase the rate at which a gate switches70,89,112,152. Changing plasmid copy number can affect the amplitude of oscillations (Fig. 3k). Cooperativity is critical for obtaining robust oscillators because it increases the region of phase space that produces oscillations159. Therefore, sequestration approaches are predicted to have a large impact on the period and amplitude of oscillations167 (Fig. 3m,n).

Common failure modes from connecting circuits

Gates can be combined to build larger circuits that implement more sophisticated computational operations. Transcriptional gates can be connected by using the output promoter of one circuit as the input promoter to the next. This method applies for all transcriptional circuits, including digital, analog and dynamic circuits or a combination of types. To be connected, circuits have to be broken up into their component parts and then combined in a particular order (Fig. 4a). Reorganizing the parts places them in new local contexts that are different from those where they were characterized. This can be problematic because circuit components can behave differently in new genetic contexts, and small circuits may have identical component parts (e.g., terminators) that interfere with each other in the larger circuit. In this section, we discuss failure modes that can arise when building larger circuits, show the impact that each failure has on circuit function, and discuss engineering approaches to mitigate these problems.

Figure 4: Common failure modes and their impact on circuit dynamics.
figure 4

(a) Parts are assembled to build complex circuits. The AND gate71 is built with two inducible input promoters (PIN1 and PIN2), one output promoter (PR3) and two internal repressor-promoter pairs (PR1 is repressed by R1). The steady-state response to different combinations of inputs is shown as a bar graph, where the OFF states are gray and the ON state is black. The oscillator70 is built with PA-R promoters, which are repressed by R and activated by A. (bh) The impact of various failures (red) are shown for the AND gate (left) and oscillator (right). R2 or R3 expression as a function of input 2 (unless indicated otherwise) is also shown in each panel to explain the AND gate failures. Ordinary differential equation models were used to simulate repressor induction (buffer gate) and oscillator behavior (parameters and model equations used for the oscillator simulation is included in Supplementary Note 2 in SBML format205,227). Outputs from the buffer gate simulations were fed into a separate AND gate model71. Figure from ref. 71, Nature Publishing Group. (b) Mismatched response functions. In the AND gate, R3 was modeled as a different repressor: BET1 (dissociation constant kd = 0.2, Hill coefficient n = 2.4, max = 13, min = 0.4) instead of ORF2 (kd = 0.4, n = 6.1, max = 16, min = 0.2)71. R2 is the input for the R3 transfer function. In the oscillator, the R translation rate is increased tenfold. (c) Strength of the indicated promoters is reduced by 50% in both circuits. (d) Translation rates of R2 (AND gate) and R (oscillator) are set to 0. (e) 30% read-through from upstream operons through the red terminator is simulated in both circuits. Here the R2 expression is shown as a function of input 1 (instead of input 2). (f) Part-junction interference. A new constitutive promoter (AND gate) is simulated as having approximately 20% of the strength of PIN2. The new terminator (oscillator) decreases transcription 40%. (g) Orthogonality. R3max is set as R2min to simulate repression of PR3 by R2. Additional equations are added to the oscillator model to simulate repressor-activator complex formation. (h) Recombination. R2 and R were removed from the AND gate and oscillator models, respectively.

A common problem when connecting circuits is that the upstream circuit's output does not span the dynamic range required to stimulate next circuit in series (Fig. 4b). In digital logic, this mismatch manifests as either a decrease in the dynamic range of the complete circuit or a loss of function. Connectivity mismatches can be corrected by selecting parts that shift the thresholds of individual gates. For example, RBSs can be mutated to force the threshold of a gate to fall within the dynamic range produced from an upstream circuit46,102. Mismatches in an oscillator can dampen oscillations or force the system outside the functional parameter space (Fig. 4b). Mathematical models can be used to streamline circuit design by predicting the functional parameter space and selecting appropriate RBSs and promoters to achieve the required expression levels48,68,157.

Genetic parts are often context dependent, meaning their functions change when the DNA sequences on either side of the part are altered168,169. Context dependencies complicate part substitutions because part characterizations are often carried out in isolation and their activity in a new context may not match the measured strength. For example, promoters that are defined as DNA sequences of <50 bp may behave differently in new contexts because the α-domain of E. coli RNAP can contact the DNA 100 bp upstream of the transcription start site153. In a digital circuit, reducing promoter efficiency attenuates the response of individual gates and reduces the output of the complete circuit (Fig. 4c). Promoter attenuation can increase the amplitude of an oscillator and elongate the period by reducing repressor expression. Insulator sequences can relieve some compositional context effects by standardizing the DNA sequences flanking promoters169,170.

Context effects can also occur when promoters are fused to different RBSs. Promoters are sensitive to the DNA sequences near the transcription start site because that region can alter promoter melting and polymerase escape frequency154. Transcription start sites can also fluctuate according to the local sequence context171,172, which can affect RBS strength by altering the length of the 5′ untranslated region (UTR) and changing mRNA secondary structure. Tandem promoters can generate especially long 5′ UTRs that exacerbate this effect by base pairing with the RBS or sequences in the open reading frame173,174,175. Circuits can fail completely when mutations in the 5′ UTRs cause hairpins to completely occlude the RBSs and prohibit translation (Fig. 4d). As a solution, the 5′ UTR can be cleaved with ribozymes or CRISPR processing to standardize RBS accessibility170,176. Catalytic insulator elements serve dual functions by standardizing both the 5′ end of mRNA and the promoter region downstream of the transcription start site. RBSs can be further insulated from the local context using bicistronic designs, which prime the mRNA for translation with an upstream RBS that keeps the mRNA unfolded49.

Transcriptional read-through can be a problem in genetic circuits with monocistronic designs, in which every gene has its own promoter and terminator. These designs require strong terminators to insulate against read-through from neighboring promoters. Failure to fully insulate each cistron can link the expression of genes that are supposed to be regulated independently (Fig. 4e) and can contribute to the leaky expression of uninduced genes. Strong, tandem terminators can be placed on either side of each gene to ensure isolated expression of individual operons177. Large libraries of Rho-independent terminators were recently built and characterized to enable the construction of large circuits that are robust with respect to read-through and homologous recombination (described below)50,177.

DNA sequences are information rich; therefore, connecting two parts can create a new functional sequence at the junction178. New regulatory elements, such as promoters or terminators, can be generated at a part junction if the combination creates a sequence of DNA that resembles a regulatory element. For large circuits, many parts have to be combined in a new order, and unexpected parts that interfere with gene expression can be generated (Fig. 4f). One way to scan for unintended functional sequences is to use computer algorithms that search for various regulatory elements48,177,179,180,181,182,183,184,185.

Cross-talk, which occurs when regulators interact with each other's targets, can change the topology of a circuit and can lead to errors in the desired operation55. For example, cross-talk between a repressor and noncognate promoter can inappropriately decrease expression of a gene and cause a circuit to fail (Fig. 4g). Avoiding cross-talk requires that parts be screened for orthogonality via combinatorial experiments that test every combination of promoter and regulatory element71,81,83,98,100,186.

Many of the circuits built to date reuse the same regulatory parts, which can lead to homologous recombination. Homologous recombination deletes DNA between repeated sequences and can result in the loss of circuit components and circuit failure177 (Fig. 4h). In general, the rate of recombination increases with circuit toxicity187 and homologous DNA length, with the threshold occurring between 20 and 30 bp (ref. 188). Homologous recombination can be avoided with large libraries of parts with redundant functions that have enough sequence diversity to avoid recombination177,189.

Interactions between synthetic circuits and the host organism

Genetic circuits are based on biochemical interactions within living cells. Most circuits use host resources to function, including transcription and translation machinery (e.g., ribosomes and RNAP), DNA-replication equipment and metabolites (e.g., amino acids). The availability of these resources and the details of the intracellular environment change significantly in different strain backgrounds, environmental conditions and media, and they also depend on cell density and growth rate. When the first synthetic circuits were built, they were fragile, and it was unclear why they would work only in specific conditions20,21. Now there is a more precise understanding of the ways in which circuits break owing to interactions with the host61. A better understanding of what these failure modes are and of the methods that natural systems use to overcome them will lead to new design rules for composing synthetic circuits.

A common observation is that some synthetic regulators can cause growth defects. Yet it remains unclear why certain regulators can be expressed at high levels with no noticeable impact whereas others in the same class are very toxic. This was evident in analyses of large libraries of TetR and σ-factor homologs sourced from diverse organisms and transferred into E. coli71,100. Expression of some regulators slowed E. coli growth, but the origin of this effect is unclear as it does not correlate with the number of predicted binding sites in the genome or off-target gene expression measured using RNA-seq. T7 RNAP is another part that can be very toxic when combined with a strong T7 promoter102. It is also unclear how this toxicity arises, but it could be due to the difficulty terminating T7 RNAP, which could cause excessive transcription around a plasmid or expose mRNA by decoupling RNAP and ribosome progression. Circuits based on protein-protein interactions can also exhibit toxicity when the proteins bind to off-target partners. We observed this with anti-σ factors, which appear to bind and titrate native σ factors100. Small RNA with RBS-like sequences can also cause toxicity by titrating ribosomes, increasing expression variability and reducing growth145 (Fig. 5a). Larger circuits are particularly sensitive to the toxicity that can arise from individual regulators because their effects are compounded when they are expressed together190.

Figure 5: Circuit performance within the context of a living cell.
figure 5

(a) Synthetic sRNAs compete with mRNA for ribosomes. When sRNAs are produced (left graph, gray bar), ribosomes are titrated away from fluorescent protein mRNA and observed fluorescence is reduced relative to no sRNA (white bar)229. AU, arbitrary units. Center graph, colored circles represent the overexpression of different proteins in E. coli (blue, Pu promoter β-galactosidase; red, T7 promoter β-galactosidase; black, tac promoter ΔEF-Tu; green, bla promoter β-lactamase)230. Right graph, colored circles represent growth of different bacterial and yeast strains plotted against rRNA supply (blue, E. coli 30 °C; green, Aerobacter aerogenes 37 °C, red; Candida utilis 25 °C; orange, C. utilis 30 °C; black, Neurospora crassa 30 °C)230. Left graph adapted from ref. 229 with permission of The Royal Society of Chemistry. Center and right graphs from Scott, M., Gunderson, C.W., Mateescu, E.M., Zhang, Z. & Hwa, T. Interdependence of cell growth and gene expression: origins and consequences. Science 330, 1099–1102 (2010). Reprinted with permission from AAAS. (b) Queuing as a result of overloading the ClpXP protease machinery with proteins from a synthetic oscillator. The graph shows the difference between expected (black) and measured (red) dynamics for an oscillator affected by queuing166. Adapted with permission from ref. 166, Wiley. (c) An additional output (PR2) on a high-copy plasmid is added to the NOT gate, which alters the activation dynamics of the original output (PR1) (black line, original dynamic response; orange line, retroactive effect)195. Adapted with permission from Jayanthi, S., Nilgiriwala, K.S. & Del Vecchio, D. Retroactivity controls the temporal dynamics of gene transcription. ACS Synth. Biol. 2, 431–441 (2013). Copyright 2013 American Chemical Society. (d) One plasmid with two reporter proteins is transformed into different E. coli strains. The ratio of expression varies in some strains (center: wild-type E. coli strains; right, KEIO collection knockouts)196. Adapted with permission from ref. 196, Elsevier. (e) Different media affect the performance of an AND gate based on T7 RNAP2,102. Data are shown for the circuit in the absence (white) and presence (black) of both inputs in different media (Min, minimal medium; #T and/or #Y, minimal medium supplemented with tryptone and/or yeast extract, where # indicates the grams of tryptone and/or yeast extract per liter). Reprinted with permission from Moser, F. et al. Genetic circuit performance under conditions relevant for industrial bioreactors. ACS Synth. Biol. 1, 555–564 (2012). Copyright 2012 American Chemical Society.

Circuits can also decrease growth rate by monopolizing host resources and slowing production of essential protein and RNAs191 (Fig. 5a). A small reduction in the growth rate can be a problem when using a circuit for industrial applications that rely on high product yields. A decrease in growth rate can reduce the dilution rate of circuit components and lead to unintended buildup of proteins or RNA that can cause a circuit to fail. In fact, circuits can appear to function better when growth is impeded because slow dilution increases the observed concentration of transcription factors and reporters. Slow growth can also put pressure on the host organism to evolve away the burdensome circuit, via either homologous recombination, point mutations, deletions or copy-number reduction.

Circuits can diverge from their expected behavior when they overuse a limited resource that is shared with other cellular processes. Overburdening resources causes queuing, which results in a delay or reduction in circuit activity192. For example, when σ factors are overexpressed, they can occupy the entire pool of free core RNAP. When this happens, σ factors must compete to bind to the core, which indirectly couples their activity and can disrupt host processes193. Native σ factors are able to avoid queuing by pulsing their expression such that they alternate the usage of core RNAP over time194. A similar coupling effect has been observed when the ClpXP protease is shared by regulators that have been modified to contain C-terminal tags for fast degradation. If too many proteins are targeted for degradation, the enzymatic machinery can become overwhelmed and force substrates to wait for processing166. The rapid degradation of regulators is important for dynamic circuits, such as oscillators, which will fail if the regulatory proteins accumulate (Fig. 5b).

Retroactivity can also interfere with circuit activity. Retroactivity is defined as the influence that a downstream genetic element can have on an upstream one, and it describes the changes in circuit behavior that result from connecting new downstream modules to a circuit138. For example, connecting a second output to a NOT gate may cause retroactivity by titrating the repressor away from the original output promoter (Fig. 5c). Retroactivity will affect the NOT gate's dynamics by increasing the time it takes to build up an adequate amount of protein to repress promoter activity195. Retroactivity that delays a circuit's response to input stimulation can be alleviated by increasing expression of the problematic circuit component; however, increasing expression can lead to other trade-offs, including toxicity.

Strain variation can affect circuit performance in different ways. Differences in growth rate, ribosome concentration and induction lag time have been identified as the main contributors to strain-dependent variations in circuit performance196. In recent reports, these phenotypes have been correlated with specific genes by studying growth and circuit performance across single-gene knockouts196,197 (Fig. 5d). Media and growth conditions can also influence circuit performance by altering promoter activity, protein stability and regulator dilution198,199. These effects can be so severe that switching from LB to minimal medium can cause circuits to fail2 (Fig. 5e).

One approach to reduce strain- and medium-based variation is to use reference standards to report circuit performance. To this end, the relative expression unit (REU) was introduced as a standard for reporting promoter activity2,200. REUs report the promoter activity by normalizing measurements to a constitutive promoter standard in an identical strain. REU measurements have yielded reliable, reproducible data when compared across labs, strains and media, which is important for transcriptional circuits that use promoters as inputs and outputs. In the future, this will facilitate the computer-aided design of large circuits.

Conclusions

The first circuits were built by repurposing a small number of regulators and genetic parts from other areas of genetic engineering. After early success47,88, these parts were put together in different combinations to explore the range of circuit functions that could be performed in the cell. We are now in a phase where there are >100 new regulators55,71,78,79,80,82,83,101,146,180 that are orthogonal and could theoretically be used to build synthetic regulatory networks at the scale of natural networks in bacteria201. The challenge is to be able to design and construct synthetic regulatory networks at this scale.

There are several key advances that need to happen before we can build and debug genetic circuits this large. First, computational tools have to be developed to aid the design process. These programs must be able to simulate the dynamics of a circuit and convert the designs into a linear assembly of genetic parts68,202,203,204,205. Insulating DNA sequences will be critical in future circuits because the majority of parts will be in new contexts206,207. Second, new approaches to whole-cell omics measurements have to be integrated into the debugging cycle. Currently, there is an overreliance on fluorescent proteins as the output of circuits. However, transcriptomics is now sufficiently inexpensive that it could be used to infer polymerase flux on many of the parts internal to a circuit208. Other single-molecule approaches, such as ribosome and RNAP mapping, will become powerful when the experiments become more routine209,210. Third, new approaches need to be developed that can rapidly test circuits under conditions that are difficult to control in the cell. Circuits are sensitive to parameters such as the number of ribosomes, the number of available RNAP, the redox state of the cell, the growth temperature and the ATP concentration, all of which change in different cell types and conditions. However, these parameters are difficult to measure in the cell without broadly affecting the host. To this end, the development of in vitro cell-free methods to debug circuits will be valuable for designing circuits that are robust to these changes211,212,213,214,215,216,217,218,219,220.

New biochemistries, tuning knobs and troubleshooting methods are now converging for the sophisticated design and construction of genetic circuits. Different classes of regulators can be used in a single circuit to fulfill specialized functions. In this vision, each regulator has found a niche within the larger circuit that exploits its strengths. For example, digital circuits can be used to integrate sensors and respond to environmental conditions, whereas analog circuitry can perform arithmetic functions with a small number of regulators103. Integrases can store memory or cause an irreversible commitment. CRISPRi can regulate essentially any gene in the genome. A vision of this marriage is shown in Figure 6, which is an example of a commensal bacterium that has been engineered to produce a drug while colonizing the gut. In it, repressor-based logic gates respond dynamically to environmental states, and invertases record these observations. Analog circuits can be used to calculate a dosage rate, and, if the drug dosage is surpassed, CRISPRi knocks down specific host genes to arrest growth and avoid overmedication. Collectively, these new circuits and the tools and knowledge to connect and debug them will enable a new era of cellular programming and the applications that come with this capability.

Figure 6: Conceptual circuit for a therapeutic bacterium that colonizes a niche in the human microbiome and delivers a drug.
figure 6

This circuit demonstrates how the different classes of regulators and circuits described in this Review could be combined into a single system. The left panel shows genetically modified bacteria that have colonized the interior of a human gastrointestinal tract. The top right panel focuses on the conceptual circuit that the bacteria use to regulate their growth and deliver drugs to the human patient. Bottom, an analog circuit103 (left) and irreversible recombinases (right) are highlighted to emphasize the diverse biochemistries used to build this circuit.