Learn more: PMC Disclaimer | PMC Copyright Notice
Engineering Escherichia coli for Increased Productivity of Serine-Rich Proteins Based on Proteome Profiling
Abstract
Variations in proteome profiles of Escherichia coli in response to the overproduction of human leptin, a serine-rich (11.6% of total amino acids) protein, were examined by two-dimensional gel electrophoresis. The levels of heat shock proteins increased, while those of protein elongation factors, 30S ribosomal protein, and some enzymes involved in amino acid biosynthesis decreased, after leptin overproduction. Most notably, the levels of enzymes involved in the biosynthesis of serine family amino acids significantly decreased. Based on this information, we designed a strategy to enhance the leptin productivity by manipulating the cysK gene, encoding cysteine synthase A. By coexpression of the cysK gene, we were able to increase the cell growth rate by approximately twofold. Also, the specific leptin productivity could be increased by fourfold. In addition, we found that cysK coexpression can improve the production of another serine-rich protein, interleukin-12 β chain, suggesting that this strategy may be useful for the production of other serine-rich proteins as well. The approach taken in this study should be useful in designing a strategy for improving recombinant protein production.
Escherichia coli has been the most widely used host for the production of various recombinant proteins. The yield of a recombinant protein is generally proportional to both the final cell density and the specific protein productivity (25). Therefore, various strategies have been developed to cultivate E. coli to high densities and/or to optimize expression systems for increased specific protein productivity. However, overexpression of recombinant proteins results in a rapid stress response and changes in E. coli metabolism (10, 33). These cellular responses may lead to plasmid instability, ribosome destruction, growth inhibition, or even cell lysis (6, 11, 21), all of which negatively affect recombinant protein production.
Recent advances in genomics and transcriptomics have allowed many functional genomic-based approaches to be taken toward understanding global metabolic changes caused by genotypic and/or environmental changes (26, 32, 36, 38, 39). Similarly, proteome profiling can also be employed to examine the changes in the expression levels of many proteins under particular conditions to be compared (3, 29, 39). Even though transcriptome profiling is truly useful, as it can generate a full spectrum of data on the expression levels of all of the genes at the mRNA level, there is growing evidence showing poor correlation between mRNA and protein abundances for a number of genes (2, 14). However, proteome profiling is presently limited by the fact that many fewer proteins than genes can be identified on a two-dimensional (2D) gel. Nonetheless, it is possible to identify protein spots showing altered expression levels, which may help us to understand metabolic and physiological changes and thereby to design metabolic and cellular engineering strategies. Several groups have carried out proteome analysis of E. coli cells overproducing recombinant proteins and have described metabolic changes under these conditions (19, 33). However, the results obtained from these studies have not been extended to actual engineering of cells to achieve enhanced recombinant protein production.
Here we report profiling of the proteome of recombinant E. coli during the overproduction of human leptin, identification of a target gene to be manipulated, and engineering of metabolic pathways to achieve increased leptin productivity. In order to understand how the overproduction of foreign proteins influences the synthesis of critical cellular proteins and thus to identify relevant target genes to be engineered, proteome analyses were performed (i) in the presence of the control plasmid only, (ii) before and after induction, and (iii) on inclusion body (IB) fractions.
MATERIALS AND METHODS
Bacterial strains and plasmids.
The bacterial strains and plasmids used in this study are shown in Table Table1.1. E. coli XL1-Blue was used as a host strain for cloning and maintenance of plasmids. E. coli BL21(DE3) was used as a host strain for the production of recombinant proteins. For the expression of the E. coli BL21(DE3) cysK gene, pAC104CysK (Fig. (Fig.1A)1A) was constructed as follows. The forward primer 5′-GCGAATTCATGAGTAAGATTTTTGAAGATAA-3′ was designed to contain an EcoRI site (underlined) immediately upstream of the start codon (ATG) of the cysK gene. The reverse primer 5′-GCGAATTCTATATACTGTTGCAATTCTTTCTC-3′ was designed to contain an EcoRI site (underlined). PCR was performed in a PCR thermal cycler (Takara Shuzo Co., Shiga, Japan) by using the High Fidelity PCR system (Boehringer, Mannheim, Germany) according to the manufacturer's instruction. The PCR product was digested with EcoRI prior to being cloned into the same restriction enzyme site in p10499A (30). The resulting plasmid, p104CysK, was digested with EcoRV and ScaI, and the fragment containing the cysK expression module was cloned into the EcoRV site of pACYC184 to yield pAC104CysK. The cysK gene is constitutively expressed by using this plasmid. Plasmids used for the production of human leptin and human granulocyte colony-stimulating factor (G-CSF) were pEDOb5 and pEDCSFmII, respectively, and were described previously (17, 18).
Schematic diagrams of plasmids pAC104CysK (A) and pEDIL-12p40 (B). The 966-bp PCR product which encodes the E. coli BL21(DE3) cysK gene was digested with EcoRV and ScaI and cloned into pACYC184 at the EcoRV site to make pAC104CysK. The 918-bp PCR product which encodes mature human interleukin-12 β chain was digested with AseI and BamHI and cloned into pET21c at the NdeI and BamHI sites to make pEDIL-12p40.
TABLE 1.
Bacterial strains and plasmids used in this study
Strain or plasmid | Relevant characteristics | Reference or source |
---|---|---|
E. coli strains | ||
XL1-Blue | supE44 hsdR17 recA1 endA1 gyrA96 thi relA1 | Stratagenea |
BL21 (DE3) | lac F′[proAB+laclqlacZΔM15 Tn10(Tetr)] F−ompT hsdSB (rB− mB−) gal dcm (DE3) | Novagenb |
Plasmids | ||
p10499A | 4.20 kb, Apr, gntT104promoter | Lab stock |
p104CysK | 5.20 kb, Apr, gntT104promoter | This study |
pACYC184 | 4.30 kb, Cmr, Tcr | New England Biolabsc |
pAC104CysK | 6.20 kb, Cmr Tcr, gntT104promoter | This study |
pEDCSFmII | 5.90 kb, human G-CSF gene, T7 promoter | 18 |
pEDOb5 | 5.75 kb, Apr, human obese gene, T7 promoter | 17 |
pUC18/p40 | 3.67 kb, Apr, human IL-12p40 gene | BCRd |
pEDIL-12p40 | 6.1 kb, Apr, human IL-12p40 gene, T7 promoter | This study |
For the expression of the mature interleukin-12 β chain gene, plasmid pIL-12p40 (Fig. (Fig.1B)1B) was constructed as follows: The forward primer 5′-GGCTAGCATTAATGATATGGGAACTGAAGAAAGAT-3′ was designed to contain an AseI site (underlined) immediately upstream of the codon for isoleucine (ATA), which is the first amino acid of the mature interleukin-12 β chain. The reverse primer 5′-GCCGGATCCTTATTAACTGCAGGGCACAGA-3′ was designed to contain a BamHI site (underlined) and tandem TAA stop codons immediately after the final serine codon (AGT). The PCR product was digested with NdeI and BamHI and was cloned into the same restriction sites of pET21c. The interleukin-12 β chain gene was expressed from the strong T7 promoter by induction with isopropyl-β-d-thiogalactopyranoside (IPTG) (Sigma Chemical Co., St. Louis, Mo.). All DNA manipulations, including restriction digestion, ligation, and agarose gel electrophoresis, were carried out as described by Sambrook et al. (34).
Fed-batch culture conditions.
Fed-batch cultures were carried out in a 6.6-liter jar fermentor (Bioflo 3000; New Brunswick Scientific Co., Edison, N.J.) containing 1.8 liters of R/2 medium plus 20 g of glucose per liter. The R/2 medium (pH 6.8) contains, per liter, 2 g of (NH4)2HPO4, 6.75 g of KH2PO4, 0.85 g of citric acid, 0.7 g of MgSO4 · 7H2O, and 5 ml of a trace metal solution. The trace metal solution contains, per liter of 5 M HCl, 10 g of FeSO4 · 7H2O, 2.25 g of ZnSO4 · 7H2O, 1 g of CuSO4 · 5H2O, 0.5 g of MnSO4 · 5H2O, 0.23 g of Na2B4O7 · 10H2O, 2 g of CaCl2 · 2H2O, and 0.1 g of (NH4)6MO7O24. A seed culture was prepared in a 1-liter flask containing 200 ml of R/2 medium. Except for periods when the pH increased due to glucose depletion, it was kept at 6.8 by adding 28% (vol/vol) ammonia water. The dissolved oxygen concentration was kept at 40% of air saturation by automatically increasing the agitation speed to 1,000 rpm and by changing the percentage of pure oxygen. A nutrient feeding solution was added by using the pH-stat (with high limit) feeding strategy (25). The feeding solution contained 800 g of glucose per liter and 20 g of MgSO4 · 7H2O per liter. When the pH rose to a value of 0.08 greater than its set point (pH 6.8) due to the depletion of glucose, the appropriate volume of feeding solution was automatically added to increase the glucose concentration in the culture broth to 0.7 g/liter. Cell growth was monitored by measuring the optical density at 600 nm (OD600) (DU series 600 spectrophotometer; Beckman, Fullerton, Calif.). For the expression of leptin, G-CSF, and interleukin-12 β chain, cells were induced with 1 mM IPTG at an OD600 of 30 or 90.
One-dimensional gel electrophoresis.
For protein quantification, cells at the same concentrations were harvested by centrifugation at 3,500 × g for 5 min at 4°C. Protein samples were analyzed by electrophoresis on sodium dodecyl sulfate-12% (wt/vol) polyacrylamide gels as described by Laemmli (22). The gels were stained with Coomassie brilliant blue R250 (Bio-Rad, Hercules, Calif.), and the protein bands were quantified with a GS-710 calibrated imaging densitometer (Bio-Rad).
2D gel electrophoresis and peptide mass fingerprinting.
2D gel electrophoresis experiments were carried out with a Protean II xi 2-D cell (Bio-Rad) by procedures described previously (39). Culture broth was centrifuged for 5 min at 3,500 × g and 4°C. The pellet was washed four times with TE solution (10 mM Tris-HCl, 1 mM EDTA [pH 8.0]) and suspended in double-distilled water, followed by four cycles of sonication (each for 10 s at 10% of maximum output with high-intensity ultrasonic liquid processors [Sonics & Material Inc., Newtown, Conn.]). By centrifugation of the cell extract at 10,000 × g and 4°C for 20 min, the supernatant containing soluble proteins and the pellet containing inclusion bodies were obtained. The pellet was washed twice with 1% (vol/vol) Triton X-100 and once with double-distilled water to remove contaminants. After protein quantification by the Bradford assay with bovine serum albumin as a standard (8), protein samples (200 μg) were dried by vacuum centrifugation, resuspended in 340 μl of isoelectric focusing denaturation buffer {9 M urea, 0.5% 3-[(3-cholamidopropyl)-dimethylammonio]-1-propanesulfonate [CHAPS], 10 mM dithiothreitol, 0.2% [wt/vol] Bio-lyte pH 3 to 10, 0.001% [wt/vol] bromophenol blue), and carefully loaded on immobilized pH gradient strips (pH 3 to 10; 17 cm) (Bio-Rad). The loaded immobilized pH gradient strips were rehydrated for 12 h and focused at 20°C for 15 min at 250 V, followed by 60,000 V · h with a Protean isoelectric focusing cell (Bio-Rad). The strips were exchanged in equilibration buffer and then were placed on sodium dodecyl sulfate-12% polyacrylamide gels prepared by the standard protocol (22). Protein spots were visualized with a silver staining kit (Amersham Biosciences, Uppsala, Sweden), and the stained gels were scanned with a GS-710 calibrated imaging densitometer (Bio-Rad). Melanie II software (Bio-Rad) was used to identify spots and to quantify spot densities on a volume basis (i.e., integration of spot optical intensity over the spot area). Matrix-assisted laser desorption ionization-time-of-flight (MALDI-TOF) mass spectrometer analysis was carried out as described previously (15).
RESULTS AND DISCUSSION
Fed-batch culture and proteome profiles.
pH-stat fed-batch culture of E. coli BL21(DE3) was first carried out. E. coli BL21(DE3) was grown to an OD600 of 148 (74 g [dry weight] of cells/liter), with an apparent specific growth rate of 0.49 h−1 (Fig. (Fig.2A).2A). When the OD600 reached 30, sample S1 was taken for proteome analysis. pH-stat fed-batch culture of recombinant E. coli BL21(DE3)(pEDOb5) was carried out next. The apparent specific growth rate was 0.15 h−1, which is less than a half of that obtained with the strain without the plasmid. Just before induction at an OD600 of 30, sample S2 was taken. The dry cell weight and the maximum leptin content at 8 h after induction were 24 ± 0.5 g/liter and 41% ± 1.8% of the total protein, respectively (Fig. (Fig.2C).2C). At this point, sample S3 was taken. The proteome profiles of S1, S2, and S3 were analyzed and compared. From over 1,000 spots on the 2D gels, we identified 88 total proteins by comparing them with the E. coli SWISS 2D polyacrylamide gel electrophoresis database (http://kr.expasy.org/cgi-bin/map2/def?ECOLI) or by MALDI-TOF mass spectrometry analysis (Table (Table2).2). Considering the error range of detection, we accepted quantitative differences of above a 1.5-fold change as an increase or of below a 0.6-fold change as a decrease.
Time profiles of cell growth and recombinant protein production during the fed-batch cultures. (A) Cell densities (OD600) of E. coli BL21(DE3) (○) and E. coli BL21(DE3)(pACYC184) (▿); (B) cell density (OD600) of E. coli BL21(DE3)(pAC104CysK); (C to H) cell density (OD600) (□), dry cell weight (DCW) (○), and recombinant protein content (▴) without (C, E, and G) and with (D, F, and H) cysK coexpression for the production of leptin (C, D, E, and F) and interleukin-12 β chain (G and H). The dashed lines indicate the time of induction. S1, S2, S3, S4, and S5 are the sampling points for proteome analyses.
TABLE 2.
Proteins identified from 2D electrophoresis
Effect of plasmid presence.
To examine the influence of plasmid presence on proteome variation, we compared the proteome of S2 with that of S1 (Fig. 3A and B). The levels of about 33% (29 proteins) of the total identified proteins were altered in the presence of the plasmid (Table (Table2).2). Among the 20 proteins that showed higher expression levels were stationary-phase-responsive proteins, including GrpE, HdeB, AhpC, SspA, UspA, and Hns. These proteins are known to protect cells from stressful conditions such as heat shock, ethanol, H2O2, and NaCl. The sigma factor σS, encoded by the rpoS gene, is a central regulator for the expression of many stationary-phase-responsive genes (23). However, the above-mentioned proteins, except for HdeB, are not regulated by σS. These stationary-phase-responsive proteins are known to be induced under conditions of slow growth (16). Therefore, it seems that the presence of a plasmid itself resulted in slower growth, as shown by the observation that the growth rate of E. coli BL21(DE3)(pEDOb5) was less than half of that of E. coli BL21(DE3) and consequently induced expression of the stationary-phase-responsive proteins. To examine if this is truly due to the presence of a plasmid itself, we carried out pH-stat fed-batch culture of E. coli BL21(DE3)(pACYC184). The apparent specific growth rate was 0.22 h−1, which is less than a half of that of E. coli BL21(DE3) without the plasmid (Fig. (Fig.2A).2A). These results suggest that plasmid presence itself can induce stationary-phase-responsive proteins during fed-batch culture. Other than these proteins, the levels of 14 proteins (AceF, NusA, RfaD, PotD, TalB, Bla, HisJ, FliY, AccB, DksA, SgaH, AroK, GreA, and YjgF) increased in the presence of the plasmid (Table (Table22).
2D gel electrophoresis of samples for Fig. Fig.2.2. The soluble fractions of samples S1 (A), S2 (B), S3 (C), S4 (E), and S5 (F), as well as the insoluble fraction of sample S3 (D), were analyzed. Identified proteins shown by numbers corresponding to those in Table Table2.2. Proteins showing increased and decreased levels are indicated by circles and rectangles, respectively.
The levels of nine proteins (EF-G, EF-Tu, GlyA, CarA, Pgk, CysK, Sbp, SodA, and RplI) decreased in the presence of the plasmid. Since the levels of protein elongation factors (EF-G and EF-Tu) and ribosomal proteins (RplI) decreased, the protein translational activity seems to be negatively affected in the presence of the plasmid. This result is consistent with previous reports showing that plasmid presence decreased the levels of proteins involved in translation as well as ribosomal subunit pools (4, 7). The levels of most of the identified proteins involved in amino acid biosynthesis, including GlnA, IlvC, LeuC, TrpA, TrpD, SerC, and AroG, did not change much in the presence of the plasmid. However, the level of shikimate kinase I (AroK), which is involved in a common pathway of aromatic amino acid biosynthesis, increased by twofold in the presence of the plasmid. On the other hand, the levels of GlyA and CysK, which are involved in the synthesis of serine family amino acids, decreased in the presence of the plasmid. Taken together, these results suggest that the presence of the plasmid negatively affects the cellular capacity of protein synthesis and biosynthesis of serine family amino acids. At present, it is not clear why the levels of enzymes for serine family amino acid synthesis decreased in the presence of the plasmid.
Effect of leptin overproduction.
In order to investigate the effect of overproducing leptin on the host cell physiology, we compared the proteomes of S2 and S3 (Fig. 3B, C, and D). Overproduction of leptin altered the levels of 47% (41 proteins) of the total identified proteins (Table (Table2).2). In agreement with previous studies (10, 19, 33), the levels of heat shock proteins such as DnaK and GroEL increased upon overproduction of leptin. Also, relatively large amounts of the small heat shock proteins IbpA and IbpB, which are good indicators for the formation of IBs (1, 24), were associated with the IB fraction (Fig. (Fig.3B).3B). Production of these proteins is known to be regulated by σ32 (12). This indicates that overproduction of recombinant protein (leptin) acts as a stress to the cells. The levels of nine proteins (AcnB, AceF, AtpD, SdhA, MalE, HdhA, AtpC, UspA, and OmpF) increased after leptin production (Table (Table22).
Twenty-eight proteins were repressed by the overproduction of leptin. The levels of proteins involved in protein synthesis, such as protein elongation factors (EF-Tu and EF-Ts) and 30S ribosomal protein (RpsF), decreased. It was interesting that the protein elongation factor EF-Tu was associated with IBs (Fig. (Fig.3D).3D). This coincided with the decreased level of soluble EF-Tu, indicating that the association of EF-Tu with IBs may be one reason for the observed reduction in overall protein synthesis. This in turn shows that the overproduction of recombinant proteins negatively influences the capacity of the cellular translational machinery (37).
We also observed decreased levels of enzymes involved in the biosynthesis of serine family (GlyA and CysK), aromatic family (AroG, TrpA, and TrpD), and branched-chain family (LeuC) amino acids following induction of leptin overproduction. This suggests that the overproduction of leptin leads to an imbalance in free amino acid stocks and translational machinery (5, 13, 31). The levels of enzymes involved in the synthesis of serine family amino acids were most strongly downregulated by leptin overproduction. This can be explained by the fact that leptin contains much more serine (11.6% of total amino acids) than common E. coli proteins do (average of 5.6%). It should be remembered that the presence of a plasmid itself decreased the levels of GlyA and CysK by 40%. Leptin production resulted in a further decrease of their levels by ca. 60%.
Interestingly, the levels of OmpF and MalE were upregulated after overproduction of leptin, while those of GlyA, OppA, LivJ, and LivK were downregulated. All of these proteins are regulated by leucine-responsive regulatory protein (Lrp). Lrp activates transcription of the ompF and malE genes but represses that of the glyA, oppA, livJ, and livK genes (9, 28). Some metabolic pathways affected by Lrp are shown in Fig. Fig.4A.4A. Lrp activates 3-phosphoglycerate dehydrogenase (encoded by serA), which converts the glycolytic intermediate 3-phosphoglycerate to serine, while it represses serine hydroxymethyltransferase (encoded by glyA) and serine deaminase (encoded by sdaA) of the serine degradation pathway. It seems that leptin could not be produced immediately after induction due to the insufficient pool of available serine (Fig. (Fig.4A).4A). As time passed, Lrp activated the pathways for the biosynthesis of serine and other amino acids. This led to an increase in the amino acid pool required for protein synthesis and consequently to leptin production that reached the maximum level in 8 h.
Metabolic pathway of recombinant E. coli overproducing leptin, quantified from the S3/S2 (A), S4/S2 (B), and S5/S2 (C) protein level ratios. Boldface arrows indicate that the protein levels were increased by more than 1.5-fold, while dotted arrows indicate that the protein levels were decreased to less than 0.6-fold. Some metabolic pathways affected by Lrp are also indicated in panel A. Reactions regulated by Lrp enzymes are marked + and − for positive and negative regulation, respectively. Abbreviations: 6PGL, 6-phosphogluconolactone; 6PG, 6-phosphogluconate; 2K3D6PG, 2-dehydro-3-deoxy-6-phosphogluconate; R5P, ribose-5-phosphate; RL5P, ribulose-5-phosphate; X5P, xylulose-5-phosphate; S7P, sedoheptulose-7-phosphate; E4P, erythrose-4-phosphate; F6P, fructose-6-phosphate; F1,6P2, fructose-1,6-diphosphate; G3P, glyceraldehydes-3-phosphate; DHAP, dihydroxyacetone-phosphate; G1,3P2, glycerate-1,3-diphosphate; 3PG, 3-phosphate-glycerate; 2PG, 2-phosphate-glycerate; PEP, phosphoenolpyruvate; PYR, pyruvate; AcCoA, acetyl coenzyme A; CIT, citrate; ICT, isocitrate; α-KG, α-ketoglutarate; Suc-CoA, succinyl coenzyme A; SUC, succinic acid; FUM, fumaric acid; MAL, malic acid; OAA, oxaloacetate.
Enhanced productivity of human leptin by cysK coexpression.
We made an effort to search for a target protein from proteome analysis to shorten the long period required to reach the maximum leptin content. Developing a strategy for increasing the serine pool is difficult due to the complex nature of metabolic pathway regulation. During growth on glucose, 15% of the carbon assimilated in E. coli is converted to serine or its intermediates (35). As described above, the expression of the glyA gene was negatively regulated by Lrp during leptin production. The decreased GlyA has a positive effect on increasing the serine pool and consequently on leptin production. These results led us to examine the effects of the cysK gene on the biosynthesis of serine family amino acids. When the cysK gene was coexpressed in leptin-producing cells, the dry cell weight and the maximal leptin content obtained 2 h after induction were 26 ± 0.8 g/liter and 39% ± 1.1% of total protein, respectively (Fig. (Fig.2D),2D), which are nearly identical to the levels obtained in a strain without cysK coexpression. However, it should be noted that the maximal leptin content was reached at only 2 h after induction by coexpression of the cysK gene, compared to the 8 h required without cysK coexpression. Therefore, cysK coexpression increased leptin productivity by fourfold. A similar phenomenon was observed when cells were induced at an OD600 of 90 (Fig. 2E and F). When the cysK gene was coexpressed, the dry cell weight and the maximal leptin content obtained 3 h after induction were 51 ± 0.5 g/liter and 35% ± 1.5% of total protein, respectively, which are nearly identical to the levels obtained at 8 h without cysK coexpression.
Metabolic pathways affected by cysK coexpression.
In order to examine the changes in metabolism caused by cysK coexpression in leptin-producing cells, we analyzed proteomes of S4 (before induction) and S5 (after induction) of the cysK-coexpressing strain (Fig. 2G and H and 3E and F) and compared the profiles with that of S2 (Table (Table2).2). Two beneficial effects of cysK coexpression were evident. First, EF-Tu was no longer trapped in the IBs, and consequently more EF-Tu existed in soluble form (Table (Table2;2; Fig. 3E and F). This may be due to the increased levels of chaperones, including DnaK, GroEL, and HtpG, which helped correct folding of EF-Tu and prevented trapping of EF-Tu within IBs. The availability of more EF-Tu would have strengthened the protein biosynthesis capacity. Second, metabolic fluxes seemed to be altered to allow more efficient production of leptin. Before induction, the levels of TpiA, GapA, Pgk, and AceF were increased in cysK-coexpressing cells compared with control cells, while that of GpmA decreased. As shown in Fig. Fig.4B,4B, TpiA, GapA, and Pgk catalyze the reactions leading to be formation of 3-phosphoglycerate, which is further converted to serine. Through these metabolic changes, the cysK coexpression activated the serine biosynthetic pathway even prior to induction, which led to immediate production of leptin. After induction, the levels of these proteins were similar in cells with and without cysK coexpression (Fig. 4A and C). In summary, cysK coexpression resulted in the central metabolic pathway fluxes being adjusted to allow efficient production of leptin, a serine-rich protein. In addition to this positive effect, cysK coexpression made cells maintain a high protein biosynthetic capacity. All of these factors contributed to achieve high productivity and specific productivity of leptin. Even though the reasons for this phenomenon could not be explained, these results clearly demonstrate the effectiveness of the approach taken to improve the leptin productivity.
Enhanced productivity of another serine-rich protein by cysK coexpression.
To examine whether cysK coexpression can increase the specific productivity of other serine-rich proteins, interleukin-12 β chain (11.1% serine) was selected as another model protein. The amino acid compositions of E. coli proteins show no dramatic deviations from those in the complete protein sequence database, with leucine and alanine being most abundant and cysteine and tryptophan being least abundant (20). We therefore selected G-CSF (18.84% leucine and 11.59% alanine) as a control protein having an amino acids composition similar to that of E coli proteins.
As expected, production of interleukin-12 β chain was improved by cysK coexpression, while production of G-CSF was not affected. When recombinant E. coli BL21(DE3)(pEDIL-12p40) was induced at an OD600 of 30, the dry cell weight and the maximum interleukin-12 β chain content reached at 7 h after induction were 23 ± 0.7 g/liter and 9% ± 0.5% of the total protein, respectively (Fig. (Fig.2E).2E). On the other hand, when BL21(DE3)(pEDIL-12p40)(pAC104CysK) was induced at an OD600 of 30, the dry cell weight and the maximum interleukin-12 β chain content at 2 h after induction were 24 ± 0.8 g/liter and 8% ± 0.3% of the total protein, respectively (Fig. (Fig.2F).2F). The maximum G-CSF content (35% of the total protein) was reached at 2 and 3 h after induction with and without cysK coexpression, respectively. Therefore, cysK coexpression increased interleukin-12 β chain productivity by threefold. These results suggest that cysK coexpression may improve production of any serine-rich protein.
Improved cell growth by cysK coexpression during high-cell-density culture.
Consistent with other studies (4, 7), the introduction of the plasmid caused changes in the levels of cellular proteins and adversely affected cell growth. Interestingly, it was found from proteome profiling that the levels of proteins involved in the biosynthesis of serine family amino acids decreased during the high-cell-density culture of E. coli BL21(DE3). If this is one of the factors negatively affecting cell growth, cysK coexpression should cause recovery of the growth rate. Therefore, we compared the growth rates of wild-type E. coli BL21(DE3), E. coli BL21(DE3)(pACYC184), and E. coli BL21(DE3)(pAC104CysK)(Fig. 2A and B). The presence of the plasmid decreased the growth rate by 50% compared with that of the wild-type strain. Surprisingly, cysK coexpression restored the growth rate of the plasmid-carrying strain to nearly that of the wild type, indicating that the reduced biosynthesis of serine family amino acids caused a lower growth rate during the high-cell-density culture of E. coli BL21(DE3). To see if this is a universal phenomenon in E. coli, we carried out high-cell-density cultures of E. coli W3110 harboring pACYC184 or pAC104CysK. It was found that cysK coexpression did not increase the specific growth rate of recombinant W3110. Therefore, the beneficial effect of cysK coexpression on the growth rate may be specific for E. coli BL21(DE3).
In summary, the growth of recombinant BL21(DE3) and leptin production could be enhanced by cysK coexpression. Coexpression of the cysK gene could increase the biosynthetic flux of serine family amino acids and indirectly repress EF-Tu aggregation by inducing the expression of heat shock proteins, leading to improved cell growth and a three- to fourfold increase in the productivity of serine-rich recombinant proteins. It should be mentioned that the high productivities of serine-rich proteins achieved by cysK coexpression are not solely due to the improved cell growth. As seen above, the growth rate was increased by approximately twofold by cysK coexpression, while the productivities of recombinant proteins increased by three- to fourfold. This is the first report on improving recombinant protein productivity by engineering the metabolic pathways based on the results of proteome analysis. Consequently, we propose a strategy for the rational engineering of metabolic pathways and cellular properties based on the results of proteome profiling. The procedure is to (i) obtain the proteome profiles of E. coli (or recombinant E. coli) under different conditions of interest, (ii) identify potentially limiting enzymes in the biosynthetic pathways, (iii) examine theoretically and/or experientially the possible flux changes that can be achieved by amplifying (or knocking out) the activities of the enzymes identified, (iv) select the final candidate enzymes to be amplified (or knocked out), (v) examine the effects of this metabolic and cellular engineering on achieving the desired objectives, and (vi) repeat steps ii to iv until the objectives are accomplished. This strategy may be extended beyond serine-rich proteins to increase the yield and productivity of other recombinant proteins in industrial bioprocesses.
Acknowledgments
This work was supported by the National Research Laboratory Program (grant 2000-N-NL-01-C-237) of the Ministry of Science and Technology; the Basic Industrial Research Project of the Ministry of Commerce, Industry and Energy; and the Brain Korea 21 project. Hardware for the analysis of proteomes was supported by the IBM-SUR program.