Pulse Brain · Growing Health Evidence Index
Tier 3 — Observational / field trialPreprint

Hybrid genome-scale modeling and machine learning reveal cost-efficient strategies for phototrophic PHB production in Rhodopseudomonas palustris

Hernandez Gonzalez, H. A.; Buitron, G.

bioRxiv · 2026

Read source ↗ All evidence

Summary

Plastic pollution from fossil-based materials is a major global environmental challenge. Microbial-derived bioplastics, such as polyhydroxybutyrate (PHB), offer a promising biodegradable alternative. However, the high substrate and operational costs of PHB production remain a major barrier to large-scale deployment. Optimizing PHB synthesis requires navigating a multidimensional design space of metabolic, nutritional, and operational variables that is impractical to explore experimentally. Here, we developed an integrated computational framework that combines a genome-scale metabolic model (GEM) of Rhodopseudomonas palustris, machine-learning surrogate modeling, Pareto multi-objective optimization, and thermodynamics-based flux analysis (TFA) to identify cost-efficient and biologically feasible PHB production strategies. Experimental and literature-derived medium compositions were translated into mechanistic constraints, enabling the GEM to generate metabolically coherent synthetic datasets that augmented sparse experimental observations. CatBoost surrogate models trained on this hybrid dataset accurately predicted PHB synthesis across thousands of hypothetical conditions, and Pareto optimization revealed operating regimes that balance PHB productivity with nutrient cost. TFA validated the thermodynamic feasibility of these strategies and refined pathway usage, reinforcing thiolase-initiated routing into PHB biosynthesis and suppressing infeasible {beta}-oxidation-like redox loops. Overall, this hybrid GEM-ML-TFA framework identifies metabolic bottlenecks, engineering targets, and cost-optimal nutrient regimes for phototrophic PHB production, providing a scalable blueprint for rational process and strain design.

Outcomes reported

Plastic pollution from fossil-based materials is a major global environmental challenge. Microbial-derived bioplastics, such as polyhydroxybutyrate (PHB), offer a promising biodegradable alternative. However, the high substrate and operational costs of PHB production remain a major barrier to large-scale deployment. Optimizing PHB synthesis requires navigating a multidimensional design space of metabolic, nutritional, and operational variables that is impractical to explore experimentally. Here, we developed an integrated computational framework that combines a genome-scale metabolic model (GEM) of Rhodopseudomonas palustris, machine-learning surrogate modeling, Pareto multi-objective optimization, and thermodynamics-based flux analysis (TFA) to identify cost-efficient and biologically feasible PHB production strategies. Experimental and literature-derived medium compositions were translated into mechanistic constraints, enabling the GEM to generate metabolically coherent synthetic datasets that augmented sparse experimental observations. CatBoost surrogate models trained on this hybrid dataset accurately predicted PHB synthesis across thousands of hypothetical conditions, and Pareto optimization revealed operating regimes that balance PHB productivity with nutrient cost. TFA validated the thermodynamic feasibility of these strategies and refined pathway usage, reinforcing thiolase-initiated routing into PHB biosynthesis and suppressing infeasible {beta}-oxidation-like redox loops. Overall, this hybrid GEM-ML-TFA framework identifies metabolic bottlenecks, engineering targets, and cost-optimal nutrient regimes for phototrophic PHB production, providing a scalable blueprint for rational process and strain design.

Theme
Farming systems, soils & land use
Subject
Measurement methods & nutrient profiling
Study type
Research
Source type
Preprint
Status
Preprint
Geography
United Kingdom
System type
Other
DOI
10.64898/2026.04.24.720730
Catalogue ID
IRmoq8418w-5d6b68
Pulse AI · ask about this record

Dig deeper with Pulse AI.

Pulse AI has read the whole catalogue. Ask about this record, its theme, or how the findings apply to UK farming and policy — every answer cites the underlying studies.