As part of our Research Software Agenda for Australia, the ARDC is working with the research community to shape better research software in order to recognise it as a first-class output of research.
This interview is the seventh in a series about research software engineers in Australia. Each month we will talk to a leading research software engineer about their experiences and best-practice tips in creating, sustaining and improving software for research.
Continuing the series, we talked with the winner of the Ecological Society of Australia Early-career Ecologist Award for New Developers of Open Source Software in Ecology. This award is sponsored by the ARDC in recognition of the valuable contribution made by software to ecological research. The winning team is the {austraits}: R package which allows users to access, explore and wrangle data from the AusTraits database in R.
AusTraits is a national data asset quantifying the traits of Australia’s flora, which comprises approximately 26,000 different plant species. It is a co-investment project of the ARDC through the Australian Data Partnerships program, and the University of New South Wales, Western Sydney University and Royal Botanic Gardens, plus 15 others. The {austraits}: R package has been developed over the last 2 years and its code is open source. One of the core maintainers is Dr Fonti Kar.
Meet Dr Fonti Kar, a Postdoctoral Research Associate at the Evolution & Ecology Research Centre based at the University of New South Wales, Sydney. Fonti is also a teaching assistant of Biostatistics, a Data Analyst of the Atlas of Living Australia (ALA), a Software Carpentry Instructor, a co-founder of the UNSWcodeRs and engages in a variety of science outreach activities.
Fonti, tell us about your role as the main {austraits} R package developer.
The most exciting part of developing {austraits} is seeing people use the package. I enjoy hearing people’s experiences with the package as this ultimately leads to improvements to the software. It’s been a really fulfilling experience to work on a research tool that can facilitate the research of others and allow the community to better understand plant ecology at a broader scale.
As a biologist myself, it’s been an insightful process to think about how a typical user would want to interact with our package. This has guided many package development decisions from the tone of the documentation; to the functionality of the package; and even the naming of functions. It is important to me to help create a tool that can be used by many people from students to budding ecologists.
Who do you work with day to day?
Maintaining an R package like {austraits} is a constant work-in-progress! More often than not, solving one problem will lead me to make adjustments elsewhere. When I am not addressing bug reports flagged by myself and other users, I am writing tests or combing through the package to streamline processes. This sometimes involves improving documentation and the overall ‘look’ of the package.
Having said that, I think I have it quite easy compared to the rest of the AusTraits team. I work mostly with Dr Daniel Falster, a core contributor to the R package and one of the masterminds that was inspired to start compiling data for AusTraits, and Dr Elizabeth Wenk, the resident expert and custodian of the AusTraits database. Dr Falster offers coding support and big-picture feedback on my implementations, whereas Dr Wenk supports me with essential knowledge to help me develop code to navigate the AusTraits database. It takes a village to create a high-quality, relational database; maintaining this R package is one small part of it.
What does winning the ESA Open Source Software in Ecology award mean to you and your team?
Open access to an extensive and unified plant trait database was not possible prior to the release of the AusTraits database. One considerable benefit of the {austraits} R package is the ability for users to seamlessly download specific versions of the AusTraits database so that the entire workflow from data processing, to analysis and visualisation can be fully integrated and reproducible with R code. I hope my contributions to the R package will facilitate curious scientists to ask more ambitious questions and deepen our understanding of Australian plant ecology.
Receiving the award is a massive confidence boost! It reaffirms that I should continue following my interests in R programming and facilitating research as a software developer. It will allow me to upskill certain aspects of package development and provide me opportunities to connect with other R developers to share ideas and experiences.
Could you mention what best practices for quality software have you implemented in your work?
I consider myself a novice R package developer. There is so much to learn and every R package requires very different needs. I’m very lucky to work with a code-savvy team and have access to open source resources such as The R Packages book by Hadley Wickham and Jenny Bryan and the rOpenSci’s Packages Book. These reference texts have helped me create {austraits} and other packages to the best current practices in the R community.
One thing I am proud of learning and implementing for {austraits} is GitHub Actions. It ensures that an R package is stable and up-to-date as contributors make changes to the code. GitHub Actions has automated many aspects of my package development workflow such as testing and website deployment. At the moment, resources for implementing GitHub Actions are a little spread out, I’ve consolidated what I’ve learned in a recent workshop to help aspiring R package developers get started.
What communities or professional networks would you recommend?
The R programming community is vast. I try to stay up-to-date with the latest R developments via Twitter. One account I really like that amplifies the voices of people from all walks of life is @WeAreRLadies.
I have really enjoyed participating in rOpenSci’s community activities. I’ve learned about many package development best practices from watching their community calls, including tips on writing tests and how to set up a R package to welcome community contributions and engagement. If timing permits, I try to join in on rOpenSci’s social coworking hours. This is where I’ve connected with fellow R package developers who have shared valuable perspectives about their roles and projects.
Empowering Others
It has been an intimidating and humbling experience entering the research software development space with no computer science or software engineering background. I came into my role with a PhD in evolutionary biology and an aptitude for R programming; I didn’t feel particularly qualified to help write research software. I second-guessed myself a lot and took coding blockages quite personally. My outlook shifted for the better when I approached my role with a ‘learning on the job’ and ‘work in progress’ mentality.
There is a dismaying lack of diversity in the research community, and the research software development community is no exception. I found connecting with R programmers that have been historically or systematically excluded from this space has helped me build a sense of belonging. My experience has encouraged me to foster a welcoming space in the UNSW R user group I co-founded and in the workshops I run. I hope these efforts can empower others to see themselves in the research software development community.
Keep In Touch
You can connect with Fonti via Twitter, LinkedIn and GitHub.
If you’d like to be part of the growing community of research software engineers in Australia, become a member of the RSE Association of Australia and New Zealand (RSE-AUNZ) (it’s free!).
In our last RSE Story, we talked with Varvara Efremova from opendata.fit. Stay tuned for our next interview in the Shaping Research Software series, coming out in February 2023.
Do you create or maintain research software for ecology? Applications for the ARDC-sponsored award for New Developers of Open Source Software in Ecology open on 1 February 2023! Learn how to apply.
Learn more about the ARDC’s Research Software Agenda for Australia.
The ARDC is funded through the National Collaborative Research Infrastructure Strategy (NCRIS) to support national digital research infrastructure for Australian researchers.
Author
Reviewed by
Categories
Research Topic
Related Program
Related Projects
Related News & Events
- Plant Trait Data Guides Australia’s Bushfire Recovery
- Shaping Research Software: An Interview with Anthony Truskinger
- Shaping Research Software: An Interview with Varvara Efremova
- Shaping Research Software: An Interview with Johanna Bayer
- Shaping Research Software: An Interview with Lydia Lucchesi and Sam Nelson
- Shaping Research Software: An Interview with Dr Juan Nunez-Iglesias
- Shaping Research Software: An Interview with Dr Emily Kahl
- Dr Daniel Falster, Expert in Understanding How Forests Work
- Shaping Research Software: An Interview with Michael Roach
- Shaping Research Software: An Interview with Melina Vidoni
- Shaping Research Software: An Interview with Dr Marcel Keller
- Shaping Research Software: An Interview with Cynthia Huang
- Shaping Research Software: An Interview with Catherine Bromhead
- Global Knowledge Inequality Threatens Ecosystems Across the Global South
- Shaping Research Software: An Interview with Dr Adam Sparks
- Shaping Research Software: An Interview with Matthew Sainsbury-Dale and Andrew Zammit-Mangion
- Shaping Research Software: An Interview with Manodeep Sinha
- Shaping Research Software: An Interview with Roozbeh Valavi
- Shaping Research Software: An Interview with the predictNMB Team
- Shaping Research Software: An Interview with Ryan Wick
- Shaping Research Software: An Interview with Ben Foley
- Shaping Research Software: An Interview with Dr Kate Harborne
- Shaping Research Software: An Interview with Sharon Tickell
- Shaping Research Software: An Interview with Dr Saras Windecker
- Ecological Society of Australia (ESA) Conference 2024
- Shaping Research Software: An Interview with Dr Robert Smith