The Challenge
Genomic data of some species is publicly available but stored in a multitude of largely disconnected online databases, and only parts are known to people. Data for many species is being actively generated, and a significant amount is stored in herbaria, museums and other organisational repositories that have limited discoverability.
To realise the vision where genomic breeding approaches are applied widely across agriculture and conservation, data from as many Australian-relevant species as possible needs to be findable and combinable for subsequent analyses.
The Approach
The overall goal of the project was to establish the critical infrastructure to make genomics data available and discoverable. For genomic approaches to be applied widely across agriculture and biodiversity in Australia, this infrastructure needs to:
- locate and aggregate descriptions of all relevant genomic data in one place
- enable searching across these data based on a variety of contextual aspects around the organism, functional classification, geographical location and altitude
- enable comparative analysis of genomic data from organisms that have been selected based on the taxonomic or functional classifications.
To this end, the project has established the Australian Reference Genome Atlas (ARGA) platform.
The Outcomes
Genetic information has a substantial impact on improving conservation outcomes by providing answers to important questions on species populations recovery and rebound after major catastrophic events.
ARGA will locate and aggregate descriptions of all relevant genomic data – such as genome assemblies, genome annotations, barcodes, raw data and other omic data – to create tools that allow conservation managers to protect species and their genetic diversity. These tools will enable searching across these genomic data based on a variety of contextual aspects around the organism, such as:
- taxonomic grouping
- functional classifications (e.g. fire tolerance, drought, salt, conservation status)
- geographical location.
Use the Australian Reference Genome Atlas (ARGA).
Also read the following journal articles from the project:
- dos Remedios, N., Richmond, S., Christiansen, J., Ward, N., Holewa, H., & Hall, K. (2022). Building an Australian Reference Genome Atlas. Biodiversity Information Science and Standards. https://doi.org/10.3897/biss.6.91415
- Christiansen, J., & Hall, K. (2023). Biodata Infrastructure within Australia and Beyond: Landscapes and horizons. Biodiversity Information Science and Standards, 7. https://doi.org/10.3897/biss.7.112274
- Hall, K., Andrews, M., Connolly, K., Kankanamge, Y., Mangion, C., Mok, W., … & Brenton, P. (2023). The Australian Reference Genome Atlas (ARGA): Finding, sharing and reusing Australian genomics data in an occurrence-driven context. Biodiversity Information Science and Standards, 7. https://doi.org/10.3897/biss.7.112129
Who Will Benefit
The Partners
- Atlas of Living Australia (ALA)
- BioPlatforms Australia
- Australian BioCommons
Further Resources
- View a presentation by Kathryn Hall (DC011) about ARGA at the November 2023 Bushfire Data Challenges Forum.
- Read the ARGA Project Community Engagement Report.
- Access all code produced by ARGA on GitHub.
Contact the ARDC
Timeframe
Current Phase
ARDC Co-investment
Project lead
Categories
Related Resources
Related Projects
- Bushfire Research Data Management Plans
- Assessing the Impact of Bushfire Smoke on Health
- Aggregating and Integrating Data on Health Outcomes Associated with Bushfires at a National Scale
- Curated Biodiversity Data for Rapid Assessment of Bushfire Impacts
- Bushfire Data Access and Impact Modelling Platform