Skip to content

Integrative Scalable Computing Laboratory

A research group at the Department of Information Technology, Uppsala Universtity.

  • Home
  • Projects
  • People
  • Publications
  • Teaching
  • Software
  • Recruitment
  • About us
  • Toggle search form

MSc thesis opportunities in privacy-preserving Machine Learning

Posted on November 10, 2019November 10, 2019 By admin

We have opportunities for a number of MSc thesis students to work with the group in the spring semester 2020.

Artificial intelligence is rapidly transforming our society. Machine learning models will be in every digital system we use, and it is imperative that we protect the integrity of data owners. In this project we work on training schemes, scalable implementations, and applications of Federated Learning – a recent approach to training ML models while keeping input data privacy of data owners.

Federated Machine Learning 

Federated machine learning has recently attracted a lot of attention both in industry and academia. Simply speaking, training proceeds by model updates on private data nodes, then weights are averaged by a server forming a  global model (schematic figure inline). While simple in concept, care needs to be taken to balance local model training with global synchronization to avoid poor convergence and to minimize communication rounds. FedML differs from standard distributed learning/optimization in that data cannot be assumed to be balanced across nodes, data may not be i.i.d., and we cannot assume consistent node uptime nor low-latency high-throughput networking between nodes. During 2017 and 2018, Google Research presented an approach to FedML based on TensorFlow targeting mobile devices [2,3]. Other prominent efforts include the open source project  OpenMinded (https://www.openmined.org/) and the latest API extension of Tensorflow federated [4]. Intel in collaboration with the University of Pennsylvania recently demonstrated a real-world case for FedML based on biomedical imaging [5]. Machine learning models that has been demonstrated in the FedML case include CNNs, LSTMs and conformal predictors [6]. In our group we are currently working on various aspects of FedML such as new federated ensemble methods and schemes to measure individual member contributions in a scalable fashion.

Potential thesis topics 

We have opportunities for MSc thesis students in a number of areas in privacy-preserving learning, such as: 

  1. Performance evaluation and optimization of federated learning algorithms for new application areas and/or models.      
  2. Development of new FedML schemes. 
  3. Development of scalable computing backends.  
  4. Decentralized implementations to enable FedML without a trusted-third party.
  5. Privacy-enhancing techniques such as differential privacy and secure multiparty computation. 

Research environment

The work will be conducted as part of the research group Integrative Scalable Computing Laboratory. ISCL is an interdisciplinary team working on the interface of scientific computing, machine learning and distributed systems. The group runs a number of eScience projects with funding from eSSENCE, SSF, VR and NIH. The MSc student will get the opportunity to participate in the work of the group during the semester the thesis is written, gaining insight into the academic work culture.  

Contact

Reach out to Andreas Hellander or Salman Toor to discuss opportunities:

Andreas: andreas.hellander@it.uu.se

Salman:  salman.toor@it.uu.se 

References

  1. Feng X., Qing, K., Meyer CH. and Chen Q., Deep convolutional neural  network for segmentation of thoracic organ-at-risk using cropped 3D images, Med. Phys., 46(5), 2019.  
  2. Konečný J,, Brendan McMahan H., X. Yu F., Richtárik P.,, Theertha Suresh A., Bacon D., Federated Learning: Strategies for imporving communication efficiency, ArXiv 1610.05492, 2016.
  3. K. Bonawitz et al., Towards Federated Learning at Scale: System Design,  ArXiv 1902.01046, 2019.
  4. Tensorflow federated, https://www.tensorflow.org/federated 
  5. Sheller MJ, Reina GA, Edwards B, Martin J, Bakas S., Multi-institutional Deep Learning Modeling Without Sharing Patient Data: A Feasibility Study on Brain Tumor Segmentation,, Lecture Notes in Computer Science book series (Volume 11383). 2019.
  6. Gauraha, N. and Spjuth, O. Synergy Conformal Prediction DiVA preprint. 360504 (2018). URL: urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-360504. 
  7. How to Backdoor Federated Learning, E. Bagdasaryan, A. Veit, Y. Hua, D. Estrin, V. Shmatikov, ArXiv, 1807.00459, 2017


Applied Cloud Computing, FedML, News, Open Positions

Post navigation

Previous Post: Challenges
Next Post: Addi Ait-Mlouk joins the lab as postdoc focusing on FedML

More Related Articles

FedQAS: Federated machine reading comprehension based on FEDn Data-Intensive Computing
StochSS 1.6 is now officially released! News
Sadi Alawadi joins the lab as a postdoc focusing on FedML FedML
We welcome Ben Blamey to the group! HASTE
Open PhD student position Multiscale methods
Postdoc in Scientific Computing/Applied Cloud Computing Applied Cloud Computing

Data-and simulation-driven life science. Much of our work in eScience and applied ML has applications in life science, and in Systems Biology in particular. We aim to enable data-and simulation-driven scientific discovery.

HASTE - a cloud native framework for intelligent processing of image streams: http://haste.research.it.uu.se/

Follow us on twitter

Andreas HellanderFollow

Andreas Hellander
Retweet on TwitterAndreas Hellander Retweeted
SciLifeLab_DCSciLifeLab_DataCentre@SciLifeLab_DC·
3 Nov

Join our great team at @SciLifeLab_DC!

We are now looking for IT-ansvarig SciLifeLab
👉Apply by Dec 12th.
👉More & apply here: https://www.kth.se/om/work-at-kth/lediga-jobb/what:job/jobID:546469/where:4/
👉More about @SciLifeLab_DC here: https://scilifelab.se/data

@scilifelab @KTHuniversity

Reply on Twitter 1588187309098295298Retweet on Twitter 15881873090982952983Like on Twitter 15881873090982952983Twitter 1588187309098295298
A_HellanderAndreas Hellander@A_Hellander·
25 Oct

Starting in 30mins :-)

Prashant Singh@prashant_rsingh

Join us tomorrow for an exciting seminar by @uPicchini on “guided sequential ABC schemes for intractable Bayesian models”. The seminar starts at 13.15 until 14.00 CEST in Room 101127, Ångströmlaboratoriet, Uppsala University & online: https://uu-se.zoom.us/j/65354024469. Warmly welcome!

Reply on Twitter 1584856493882757121Retweet on Twitter 1584856493882757121Like on Twitter 1584856493882757121Twitter 1584856493882757121
A_HellanderAndreas Hellander@A_Hellander·
6 Oct

eSSENCE, SERC and Chalmers e-science Centre are providing core e-science education to PhD students from the SeSE platform: https://sese.nu/

Researchers - get funding to develop and give a PhD course!
@uppsalauni @lunduniversity @umeauniversitet

Reply on Twitter 1577921378514337793Retweet on Twitter 1577921378514337793Like on Twitter 15779213785143377933Twitter 1577921378514337793
A_HellanderAndreas Hellander@A_Hellander·
6 Oct

Day two of the Swedish eScience Academy organized by eSSENCE.

Interesting to learn from Sverker Holmgren of Chalmers eCommons about the holistic approach to infrastructure and support for data centric research at Chalmers!

@UmeaUniversity @UU_University @lunduniversity

Reply on Twitter 1577917568035201024Retweet on Twitter 1577917568035201024Like on Twitter 15779175680352010243Twitter 1577917568035201024
A_HellanderAndreas Hellander@A_Hellander·
6 Oct

So great to be at the Swedish e-science Academy organized by #essenceofescience! Two days of scientific exchange between colleagues nationally, and in particular from the partner universities @UU_University @UmeaUniversity @lunduniversity.

Keynote day one by Kersti Hermansson.

Reply on Twitter 1577915269770625025Retweet on Twitter 1577915269770625025Like on Twitter 15779152697706250253Twitter 1577915269770625025
Load More...

Decentralized AI, Federated Learning. One focus area of the group is development of methods and software to address decentralized and privacy-preserving AI. We are core contributors to the FEDn open source framework for scalable federated machine learning:

https://github.com/scaleoutsystems/fedn
Introduction to Federated Learning by Andreas Hellander
Join the discussion on Decentralized AI:

Scaleout Systems is a spin-out from ISCL on a mission to enable decentralized AI and federated learning to production.

https://www.scaleoutsystems.com/

Copyright © 2023 Integrative Scalable Computing Laboratory.

Powered by PressBook Blog WordPress theme