24 Apr

TechTalk 26: Techniques and Software Framework for Extracting Metadata

Learn about software architecture and techniques that attempt to alleviate the burden of metadata collation and curation in geoscience.

About the Event

Like many other government and research institutions, Australia’s geological institutions house datasets with metadata of varying quality. This creates challenges for data providers and aggregators trying to maintain a consistent standard of FAIR (Findable, Accessible, Interoperable, Reusable) compliance across all their offerings. For example, when the metadata for an offering is very poor, more research and manual data entry may be required. Aggregators face the added problem of extracting metadata of a consistent standard from a wide variety of catalogue systems.

This TechTalk will outline a software architecture and techniques that attempt to alleviate the burden of metadata collation and curation. The software is designed to homogenise metadata harvested from a variety of common metadata catalogue applications. For metadata-poor sources, it extracts metadata from associated technical reports using textual analysis and machine learning models. The talk will also discuss the limitations and viability of such techniques. At the end of the transformation process, the software creates ISO-compliant metadata records suitable for importing into a GeoNetwork geospatial catalogue.
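To make the homogenisation idea concrete, here is a minimal sketch in Python. It is not the speaker's software. The `CommonRecord` schema, the adapter functions, and the two input shapes (a CKAN-style package dict and a flat Dublin Core-style dict) are assumptions chosen for illustration, and a real ISO-compliant record would carry many more fields. The sketch maps each source format onto one common record and reports which fields are empty, which is roughly where extraction from technical reports would have to fill the gaps.

```python
from dataclasses import dataclass, field, asdict
from typing import Callable, Dict, List, Optional


@dataclass
class CommonRecord:
    """Hypothetical common schema; far smaller than a real ISO metadata record."""
    identifier: str
    title: str
    abstract: Optional[str] = None
    keywords: List[str] = field(default_factory=list)

    def missing_fields(self) -> List[str]:
        """Return the names of empty fields (candidates for extraction from reports)."""
        return [name for name, value in asdict(self).items() if not value]


def from_ckan_like(raw: dict) -> CommonRecord:
    # Assumed input shape: CKAN-style package dict ("id", "title", "notes", "tags").
    return CommonRecord(
        identifier=raw["id"],
        title=raw.get("title", "").strip(),
        abstract=raw.get("notes") or None,
        keywords=[tag["name"] for tag in raw.get("tags", [])],
    )


def from_dublin_core_like(raw: dict) -> CommonRecord:
    # Assumed input shape: flat Dublin Core-style keys; subjects separated by ";".
    subjects = raw.get("dc:subject", "")
    return CommonRecord(
        identifier=raw["dc:identifier"],
        title=raw.get("dc:title", "").strip(),
        abstract=raw.get("dc:description") or None,
        keywords=[s.strip() for s in subjects.split(";") if s.strip()],
    )


# One adapter per catalogue type; the keys are illustrative labels only.
ADAPTERS: Dict[str, Callable[[dict], CommonRecord]] = {
    "ckan": from_ckan_like,
    "dublin_core": from_dublin_core_like,
}


def homogenise(source_type: str, raw: dict) -> CommonRecord:
    """Convert a harvested record into the common schema."""
    return ADAPTERS[source_type](raw)


# A metadata-rich record and a metadata-poor one (both invented examples).
rich = homogenise("ckan", {
    "id": "a1",
    "title": " Drill core survey ",
    "notes": "Assay results.",
    "tags": [{"name": "geochemistry"}],
})
poor = homogenise("dublin_core", {"dc:identifier": "b2", "dc:title": "Report 42"})

print(rich.missing_fields())  # []
print(poor.missing_fields())  # ['abstract', 'keywords']
```

Keeping one adapter per catalogue type means that supporting a new source only requires a new function and a new entry in `ADAPTERS`, while later steps such as gap filling and export to a catalogue format work on a single record type.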

Speaker

  • Vincent Fazio, Senior Engineer (Minerals), CSIRO

Who Should Attend

  • Research software engineers
  • Academics
  • Coders
  • Other interested parties

Recording

This session will be recorded, and the recording will be provided to all registrants. Please register even if you are unable to attend the live session.

Learn More

TechTalks are forums for sharing technical experience and expertise in digital research. Access presentation slides and free resources from previous talks.

Do you have questions about this event? Contact us.