Skip to content

Integrative Scalable Computing Laboratory

A research group at the Department of Information Technology, Uppsala Universtity.

  • Home
  • Projects
  • People
  • Publications
  • Teaching
  • Software
  • Recruitment
  • About us
  • Toggle search form

Machine learning-assisted analysis of stochastic biochemical reaction networks

Posted on October 15, 2018October 16, 2020 By Prashant Singh No Comments on Machine learning-assisted analysis of stochastic biochemical reaction networks

Biochemical reaction networks represent complex cellular regulatory mechanisms. These networks are typically analyzed using discrete stochastic simulation models. The models typically involve numerous reactions involving a large number of chemical species, governed by highly uncertain parameters.

Likelihood-free parameter inference

Given existing data pertaining to a biochemical reaction network, one is often interested in inferring the values of the model parameters that likely generated the data. The data itself may come from models simulated in the past, or physical experiments. Approximate Bayesian Computation (ABC) is a proven approach that effectively solves such parameter inference problems by using simulation models as a tool to find the region in the parameter space corresponding to least deviation from given data.

The rejection sampling algorithm forms the basis of the ABC framework. Samples are drawn from a specified prior distribution, and subsequently simulated. The simulated responses are compared to existing data by means of a distance function and appropriate summary statistics. Samples that result in distance function values below a specified tolerance threshold are accepted, and the rest rejected. The sampling algorithm proceeds until the desired number of accepted samples have been obtained. The inferred parameters are then reported as the mean parameter values corresponding to the accepted samples.

Design choices such as selection of distance functions, summary statistics and acquisition function for the inference process have a deep impact on the solution quality. Furthermore, increasing problem complexity often leads to impractically high inference times using rejection sampling.

Our research explores methods to accelerate high-quality parameter inference by leveraging state-of-the-art methods from the fields of computational biology, machine learning, optimization and statistics. Some of our active research topics include investigating intelligent construction of priors, methods for automated large-scale summary statistic selection,  and training fast local and global approximations or surrogate models of computationally expensive simulators.

Model exploration

The exploration of a system described by a non-linear, high-dimensional and stochastic computational model is a fundamental problem in all scientific disciplines relying on modeling and simulation.  In this project we are interested in the scenario where a modeler has no or very limited prior knowledge about what type of qualitative interesting behavior the model can display over the large parameter space. The tools we develop should help the modeler discover those behaviors with a small computational budget, and as little manual work as possible. By utilizing human-in-the-loop machine learning  we are developing a smart parameter sweep workflow. An example is shown in the image below, where a high-dimensional parameter sweep application is augmented with automated feature extraction and clustering, followed by training a model for classification based on user-defined labels (such as interesting or non-interesting realizations). With this model, the smart sweep application will learn to more efficiently explore areas of interestingness in the parameter space.  

ReserachSlider

Post navigation

Previous Post: MSc projects available in multicellular systems biology
Next Post: Highly Scalable Federated Machine Learning

More Related Articles

Highly Scalable Federated Machine Learning ReserachSlider
StochSS: Stochastic Simulation Service ReserachSlider
Hierarchical Analysis of Spatial and Temporal Data Data Science
Multiscale simulations of chemical kinetics ReserachSlider
Scalable simulation of stochastic multicellular systems ReserachSlider

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Data-and simulation-driven life science. Much of our work in eScience and applied ML has applications in life science, and in Systems Biology in particular. We aim to enable data-and simulation-driven scientific discovery.

HASTE - a cloud native framework for intelligent processing of image streams: http://haste.research.it.uu.se/

Follow us on twitter

Andreas HellanderFollow

Andreas Hellander
Retweet on TwitterAndreas Hellander Retweeted
SciLifeLab_DCSciLifeLab_DataCentre@SciLifeLab_DC·
3 Nov

Join our great team at @SciLifeLab_DC!

We are now looking for IT-ansvarig SciLifeLab
👉Apply by Dec 12th.
👉More & apply here: https://www.kth.se/om/work-at-kth/lediga-jobb/what:job/jobID:546469/where:4/
👉More about @SciLifeLab_DC here: https://scilifelab.se/data

@scilifelab @KTHuniversity

Reply on Twitter 1588187309098295298Retweet on Twitter 15881873090982952983Like on Twitter 15881873090982952983Twitter 1588187309098295298
A_HellanderAndreas Hellander@A_Hellander·
25 Oct

Starting in 30mins :-)

Prashant Singh@prashant_rsingh

Join us tomorrow for an exciting seminar by @uPicchini on “guided sequential ABC schemes for intractable Bayesian models”. The seminar starts at 13.15 until 14.00 CEST in Room 101127, Ångströmlaboratoriet, Uppsala University & online: https://uu-se.zoom.us/j/65354024469. Warmly welcome!

Reply on Twitter 1584856493882757121Retweet on Twitter 1584856493882757121Like on Twitter 1584856493882757121Twitter 1584856493882757121
A_HellanderAndreas Hellander@A_Hellander·
6 Oct

eSSENCE, SERC and Chalmers e-science Centre are providing core e-science education to PhD students from the SeSE platform: https://sese.nu/

Researchers - get funding to develop and give a PhD course!
@uppsalauni @lunduniversity @umeauniversitet

Reply on Twitter 1577921378514337793Retweet on Twitter 1577921378514337793Like on Twitter 15779213785143377933Twitter 1577921378514337793
A_HellanderAndreas Hellander@A_Hellander·
6 Oct

Day two of the Swedish eScience Academy organized by eSSENCE.

Interesting to learn from Sverker Holmgren of Chalmers eCommons about the holistic approach to infrastructure and support for data centric research at Chalmers!

@UmeaUniversity @UU_University @lunduniversity

Reply on Twitter 1577917568035201024Retweet on Twitter 1577917568035201024Like on Twitter 15779175680352010243Twitter 1577917568035201024
A_HellanderAndreas Hellander@A_Hellander·
6 Oct

So great to be at the Swedish e-science Academy organized by #essenceofescience! Two days of scientific exchange between colleagues nationally, and in particular from the partner universities @UU_University @UmeaUniversity @lunduniversity.

Keynote day one by Kersti Hermansson.

Reply on Twitter 1577915269770625025Retweet on Twitter 1577915269770625025Like on Twitter 15779152697706250253Twitter 1577915269770625025
Load More...

Decentralized AI, Federated Learning. One focus area of the group is development of methods and software to address decentralized and privacy-preserving AI. We are core contributors to the FEDn open source framework for scalable federated machine learning:

https://github.com/scaleoutsystems/fedn
Introduction to Federated Learning by Andreas Hellander
Join the discussion on Decentralized AI:

Scaleout Systems is a spin-out from ISCL on a mission to enable decentralized AI and federated learning to production.

https://www.scaleoutsystems.com/

Copyright © 2023 Integrative Scalable Computing Laboratory.

Powered by PressBook Blog WordPress theme