Optimization of multi-omic genome-scale models
Metabolism is the set of biochemical reactions in a cell that maintain its living state. As these reactions are essential, it is vital that metabolic networks in all living organisms are well-characterized. This can be achieved by constructing a genome-scale metabolic model, a computational representation of every metabolic reaction taking place within a living system and the chemical compounds which are involved in these reactions (metabolites). Being downstream of gene expression, metabolism is being increasingly used as an indicator of the outcome for drugs and therapies, as well as for cancer studies.
Flux balance analysis (FBA) is commonly used to mathematically express the flow of metabolites (flux) through a network of biochemical pathways. FBA predicts rates to represent amounts of metabolites that are consumed or produced in every reaction in the metabolic network. Constraints can be imposed on the system to identify a range of values representing the entire distribution of fluxes under a specific set of conditions. A multi-omic model introduces additional constraints in the form of data spanning entire biological fields: for example, genes (genome), RNA (transcriptome), proteins (proteome), metabolites (metabolome), etc.
The integration of multi-omic data from existing literature and in-vivo experiments generates condition-specific models that represent the metabolic state more precisely. To extract the most meaning from multi-omic models, fusion of multiple data types into a single, cohesive network is essential for measuring responses at multiple omic levels.
Simulation of growth under different conditions allows for condition-specific optimization of each of the omic layers, which can be combined to form a multi-omic network (Fig. 1). This enables detection of coordinated responses shared between different data types as well as variation in responses across different growth conditions.
In many cases, an optimal flux value is sought that best satisfies a single metabolic objective (e.g. maximization of cellular growth rate) by using a process known as linear optimization. Owing to the fact that organisms often have multiple objectives to satisfy, multi-objective optimization presents a valuable alternative. Trade-offs between conflicting metabolic objectives can be resolved through simulating a series of non-dominated vectors, which are lists of values where there is no better solution that exists for a given objective without sacrificing the performance of another. A series of non-dominated vectors is called a Pareto front and enables consideration of multiple conditions and constraints affecting each objective in a multi-objective optimization problem (Fig. 2).
METRADE is a method for mapping transcriptomic or proteomic data onto a multi-omic metabolic model, and allows performing multi-objective optimization to identify optimal states through the comparison of predicted flux rates for multiple objectives. This is achieved by constructing a Pareto front displaying these reaction flux rates in a space where each profile is associated with a growth condition. The comparison of objectives is used to identify the best trade-off, where the maximal number of cellular objectives are simultaneously optimized.
The fusion of multiple data types into a single, cohesive network is a challenge, for which many factors must be considered. Generation of intermediate data structures or correlation of multi-omic data with model outputs can identify the circumstances under which certain variables are important and what their effects are. Regression and machine learning techniques will be increasingly used to predict the effect sizes of the multiple omic variables involved (genes, transcripts, proteins and reaction fluxes), towards optimization of the cellular goal.
Supreeta Vijayakumar 1, Max Conway 2, Pietro Lio’ 2, Claudio Angione 1
1Department of Computer Science and Information Systems, Teesside University, United Kingdom
2Department of Computer Science and Technology, University of Cambridge, United Kingdom
Optimization of Multi-Omic Genome-Scale Models: Methodologies, Hands-on Tutorial, and Perspectives.
Vijayakumar S, Conway M, Lió P, Angione C
Methods Mol Biol. 2018
|Constructing a Google-Earth-like functional brain atlas Brain, the final frontier. These are the voyages of the brain cartographers. Their continuing mission: To explore strange new maps, to seek out new areas and new functions, to BOLD-ly…|
|Targeting cholesterol metabolism to suppress… Metastatic prostate cancer is the second leading cause of cancer death in men worldwide. Prostate cancer itself is one of the most common cancers in men. At the early stages,…|
|Achieving synergistic biopolymer functions using… Biology succeeds by maintaining intricate networks of chemical and structural information far from equilibrium. The guiding premise of the highlighted manuscript1 asks whether synthetic and materials chemistry will be able…|
|Capturing hydrogen transfer from modeling: a small… All the matter of the world around us are made of atoms that are joined to form molecules. Chemists can create new molecules by changing connectivity of atoms in molecules…|
|Bayesian sparse neural networks for biomarker… Recent advances in high-throughput biotechnologies, such as microarray and sequencing technologies, have provided an unprecedented opportunity for biomarker discovery, which potentially enhances the development of precision medicine. From a statistical…|
|When the immune system is away, biliary cancer cells… Tumours of the biliary tract (BTC) develop from the epithelium of bile ducts located inside and outside the liver and in the gallbladder. Surgery is the only curative treatment available.…|