Finalists Announced for the 2024 ARDC Eureka Prize for Excellence in Research Software
As part of our Research Software Agenda for Australia, the ARDC is working with the research community to shape better research software in order to recognise it as a first-class output of research. This interview is the tenth in a series about research software engineers in Australia. Each month we talk to a leading research software engineer about their experiences and best-practice tips in creating, sustaining and improving software for research.
Continuing the series, we spoke with Dr Marcel Keller, a Senior Research Scientist at CSIRO’s Data61. Dr Keller is a cryptographer working on multi-party computation and the maintainer of MP-SPDZ, a framework implementing multi-party computation. MP-SPDZ is written in C++ and Python.
Multi-party computation (MPC) is one of the main privacy-enhancing techniques. It allows computing on secret data without revealing it. An example of how it can be used is the millionaires' problem, where 2 people want to find out who is richer without revealing their actual wealth. There is clearly more information in the wealth difference (several bits of information) than there is in the answer to who is richer (one bit of information). With MPC, the 2 parties can run a protocol that reveals the answer without revealing either of the inputs. It therefore provides a property called input privacy. In a more practical example, MPC could be used by a number of healthcare providers to find the markers of disease by pooling patient data without revealing the patient data to any other party.
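To make input privacy concrete, here is a minimal sketch of additive secret sharing, one common building block of MPC. It is an illustration only and is not MP-SPDZ code or its API: the names `share`, `reconstruct` and `secure_sum`, the modulus and the example counts are all assumptions made for this example. It covers the simpler pooling case (a sum across providers) rather than the comparison needed for the millionaires' problem, and it assumes the parties follow the protocol and do not collude.

```python
import secrets

# Hypothetical public modulus agreed on by all parties (an assumption for this sketch).
MODULUS = 2**61 - 1


def share(value, n_parties):
    """Split `value` into n random-looking shares that add up to it modulo MODULUS."""
    shares = [secrets.randbelow(MODULUS) for _ in range(n_parties - 1)]
    last = (value - sum(shares)) % MODULUS
    return shares + [last]


def reconstruct(shares):
    """Recombine shares; any proper subset of the shares reveals nothing about the value."""
    return sum(shares) % MODULUS


def secure_sum(inputs):
    """Simulate n parties computing the sum of their private inputs.

    All parties are simulated in one process here; in a real deployment
    each share would travel over a private channel to a different party.
    """
    n = len(inputs)
    # Party i splits its input and sends share j to party j.
    distributed = [share(x, n) for x in inputs]
    # Party j adds up the shares it received (one from every party).
    partial_sums = [
        sum(distributed[i][j] for i in range(n)) % MODULUS for j in range(n)
    ]
    # Only the partial sums are published; together they reveal just the total.
    return reconstruct(partial_sums)


# Made-up example: three providers pool patient counts without disclosing them.
total = secure_sum([120, 75, 301])
print(total)
```

Each provider only ever sees uniformly random shares from the others, so the total is the only thing learned. Comparisons such as the one in the millionaires' problem require more involved protocols, which is the kind of functionality a general MPC framework provides.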
I used to work as a software engineer and I felt there was something missing. My duties as a software engineer were at times mundane, such as transcribing code from one language to another. Then I started a PhD, where, in contrast, I sometimes missed hands-on work with software. In my postdoc at the University of Bristol, I could finally combine the two by focussing on practical applications of MPC and working on the predecessor of MP-SPDZ. In my current role at CSIRO’s Data61, I have been given the opportunity to expand this to a broader project where I am wearing even more hats: developer, maintainer, dev-ops engineer, and application support, to name a few. It is this diversity of tasks that I really enjoy.
I think that the issue of privacy, and thus the importance of privacy-enhancing technologies (PETs), will only grow in the coming years. In the last year there has been a flurry of documents highlighting the importance of PETs and MPC, be it by the United Nations, the Royal Society of London, the OECD, or a joint competition by the UK and US governments. In Australia, the Australian Transaction Reports and Analysis Centre (AUSTRAC) is exploring privacy-preserving analytics to combat financial crime.
There are a number of frameworks in the space, but I find them rather limited in one way or another. Many of them do not provide a platform for general computation but simply insert MPC functionality into machine learning frameworks like PyTorch, which limits the potential applications. Most frameworks also limit themselves to one or a few MPC protocols (of which there are many, catering to a number of settings in terms of the number of participants and security assumptions). MP-SPDZ is the only framework that offers more than 40 protocol variants and an interface that is both general and relatively easy to use. I believe that this infrastructure approach and the fact that it is open source are the reasons for the rapid increase in its usage and citations.
The most high-profile use is by a research lab at Bosch, which has built a cloud-native framework around MP-SPDZ. However, I think the breadth of use is more important than any specific user, because of the flexibility of the infrastructure I mentioned earlier. On Google Scholar, there are about 90 papers mentioning the use of the software, of which 60 were added over the last 12 months alone.
In terms of formal networks, I’m a member of the International Association for Cryptologic Research. It is a volunteer-run organisation that organises a number of conferences in cryptography. However, I would put more emphasis on the informal networks that come out of attending conferences and participating in program committees. In particular, I would highlight the Privacy Enhancing Technologies Symposium, where I have been a member of the program committee for over 2 years.
If you’d like to be part of the growing community of research software engineers in Australia, become a member of the RSE Association of Australia and New Zealand (RSE-AUNZ) (it’s free!).
You can connect with Marcel via LinkedIn, GitHub, Twitter and Mastodon.
Eureka Prize for Excellence in Research Software:
Entries to the 2023 Australian Research Data Commons Eureka Prize for Excellence in Research Software are now open, closing on 14 April 2023.
For Statistics:
The 2023 Statistical Society of Australia (SSA) Bill Venables Award for new developers of open source software for data analytics, sponsored by the ARDC, is now open. Applications close on 22 April 2023.
Stay tuned for our next interview in the Shaping Research Software series, coming out in May.
Learn more about the ARDC’s Research Software Agenda for Australia.
The ARDC is funded through the National Collaborative Research Infrastructure Strategy (NCRIS) to support national digital research infrastructure for Australian researchers.
Dr Marcel Keller was interviewed by Dr Paula Andrea Martinez, ARDC.