News

Subscribe to the Australian BioCommons monthly newsletter or read previous editions  

Christina Hall Christina Hall

Strategic talks bring BioCommons Hub team together in Melbourne

The BioCommons Hub team will soon meet in Melbourne in October to reflect on our rapid ‘growing up’ and plan for our busy future.

While Australian BioCommons co-funds dozens of roles hosted by partner organisations and in the last 12 months has coordinated projects and activities involving over 200 people across Australia, its Hub is made up of 20 dedicated and dispersed (and busy!) team members. Hub staff work from their homes across 5 states, and have offices at Melbourne Bioinformatics, QCIF, University of Newcastle, University of Adelaide, and CSIRO.

We regularly have ‘retreats’ to strengthen the team bonds, work through strategic approaches and to develop a shared vision for Australian BioCommons. We are so happy that we can now meet for these intensive sessions in person.

Earlier in the year we spent two days at the Melbourne Zoo, meeting new team members, workshopping ideas and planning our future - all under the watchful eyes of honorary participants in the enclosures outside our glass walls!

Next week we are gathering for two days in Melbourne to discuss the rapid developments in our team and projects, and to catch our breath and reflect on what ‘growing up’ has been like for our team. You may notice a lull in our activity for a couple of days, but rest assured we’ll be back on deck with fresh ideas and motivation soon after!

Read More
Christina Hall Christina Hall

BioCommons team and projects well represented at eResearch Australasia 2022

The BioCommons team will be attending the 2022 eResearch Australasia Conference and sharing details of a wide range of our collaborative current activities.

The 2022 eResearch Australasia Conference which will take place in Brisbane from 17 – 20 October. Offering delegates the opportunity to re-engage, reconnect, and share ideas, eResearch 2022 will investigate information-centric research capabilities and how information and communication technologies help researchers to collaborate, collect, manage, share, process, analyse, store, find, understand and re-use information.

A large group of the BioCommons team will be attending the conference and sharing details of a diverse range of our current activities, along with many of our collaborators. Please join us at the presentations below!

Tue 18 Oct

11.40 Supporting the discovery and reuse of workflows in bioinformatics (Gareth Price)

12.00 The Biocommons Nextflow Tower Service: A user-friendly and infrastructure-flexible platform to run and monitor Nextflow pipelines (Nigel Ward and Ziad Al Bkhetan)

14.55 DReSA: Building momentum in digital research skills uptake (Melissa Burke co-presents with other members of the Digital Research Skills Australasia Working Group)

Wed 19 Oct

10.25 Galaxy Australia as a vehicle for cutting-edge biotechnology (Cameron Hyde)

10.45 Building community data assets for life sciences through ABLeS – the Australian BioCommons Leadership Share (Ziad Al Bkhetan)

15.50 The importance of requirements elicitation and analysis in setting projects up for success: a case study from the Human Genomes Platform Project (Kylie Davies, Jess Holliday)

16.30 User-led platform design - the Australian BioCommons journey (Winnie Mok)

16.50 A FAIRer future for genomics research in Australia: creating an Australian Genome Reference Atlas (Kathryn Hall)

Thu 20 Oct

9.45 Visible Research Software Interest Group (Johan Gustafsson co-leads BoF session with Paula Andrea Martinez)

Posters

Enhancing Australia’s capability for secure and responsible sharing of human genome research data: The Human Genomes Platform Project (Marion Shadbolt)

Bioinformatics training “with a little help from my friends” (Melissa Burke)

We look forward to catching up with many of you in person at the conference!

To register for the conference visit: https://conference.eresearch.edu.au/home/2022-registration/

Read More
Christina Hall Christina Hall

Enabling reproducible and portable workflows: Janis

As part of the Australian BioCommons ‘Bring Your Own Data’ Expansion Project, a specialised framework is under development that creates simple workflow definitions that enables researchers and clinicians to work with different workflow languages.

Being able to repeat analytical workflows consistently and accurately is critical in transferring and scaling methodologies, whether between research groups or for large-scale clinical use.

As part of the Australian BioCommons ‘Bring Your Own Data’ Expansion Project, a specialised framework is under development that creates simple workflow definitions that enable researchers and clinicians to work with different workflow languages.

‘Janis’ provides both a consistent language for describing workflows and a framework that can translate workflows between existing languages such as CWL and WDL. Originating from the Portable Pipelines Project between Melbourne Bioinformatics, the Peter MacCallum Cancer Centre, and the Walter and Eliza Hall Institute of Medical Research, BioCommons quickly saw the potential of supporting such a useful tool.

Janis will help make the work that biomedical researchers do more portable and less dependent on any particular technology.

A/Prof Bernie Pope
Australian BioCommons A/Director: Human Genome Informatics
Melbourne Bioinformatics Human Genomics Lead
Victorian Health and Medical Research Fellow

Janis is already being used by a number of groups, such as the Molecular Pathology Laboratory at the Peter MacCallum Cancer Centre where it helps manage workflows for the analysis of clinical cancer data on a significant scale. Melbourne Bioinformatics software engineer, Grace Hall, is currently working to extend Janis's ability to convert workflows from the popular Galaxy platform. Grace points out that research groups currently invest a lot of time and effort in creating workflows; translating them with Janis will make them easier to share, and maintain with minimal effort into the future. Richard Lupat, a bioinformatics software engineer from the Peter MacCallum Cancer Centre and one of the original Janis developers, says that translating to and from Nextflow, another widely used workflow language, will be the next goal.

More information about Janis can be found here.

This story was based on a news item first published in the Melbourne Bioinformatics newsletter.

The Australian BioCommons BYOD Expansion Project project is funded through NCRIS investments from Bioplatforms Australia and the Australian Research Data Commons (http://doi.org/10.47486/PL105) that are matched with co-investments from AARNet, Melbourne Bioinformatics, NCI, Pawsey, QCIF via the Queensland Government RICF fund, The University of Sydney, AGRF, Griffith University and Monash University.

Read More
Christina Hall Christina Hall

Global solutions are built on strong international connections

The Galaxy Australia team has been taking up opportunities to join the international Galaxy community in-person around the world. After years of working intensively online at all hours of the night, it is a joy to be able to meet up with global team members.

The Galaxy Australia team has been taking up opportunities to join the international Galaxy community in-person around the world. After years of working intensively online at all hours of the night, it is a joy to be able to meet up with global team members. The excellent progress made through focussing our combined efforts in a shared time and place is evidence that the benefits of working in the same room often outweigh the unfortunate costs of travel between the core Galaxy groups in Australia, Europe and the USA.

Galaxy Australia Lead Engineer, Simon Gladman, from Melbourne Bioinformatics at the University of Melbourne, will jet off today to Freiburg, Germany to contribute to the European Galaxy Days. The events offer an opportunity to drive the future progress of Galaxy, and Simon will present his recent work, discuss future improvements with other Galaxy developers and help to coordinate their efforts to fast track new features.

Several Galaxy Australia team members presented their work at GCC2022 and participated in the following hands-on “CoFest” and deep discussions about Galaxy’s development in Minneapolis, USA in July. Cameron Hyde who works in the Galaxy Australia team from QCIF at University of the Sunshine Coast, was invited by the conference hosts to stay on after the conference to discuss the GalaxyP project in detail. GalaxyP is a multi-omics informatics platform for integrative analysis of genomics, transcriptomics and mass spectrometry-based proteomics and metabolomics data that is under active development at the University of Minnesota in collaboration with the Minnesota Supercomputing Institute. The expansion into the proteomics domain is a current focus for Galaxy Australia to address the need to support Australian researchers in their proteomics analysis, as identified in Australian BioCommons’ Proteomics Analysis Infrastructure Roadmap for Australia. The insights shared between these global collaborators will help the community effort move forward to constantly improve the Galaxy platform for everyone’s benefit.

We are also thrilled to be hosting next year's international Galaxy Community Conference in Australia. The international team who develop Galaxy will be there, along with colleagues from throughout our local region who manage Galaxy instances for their own local research communities. Combining the interests of both developers and users of Galaxy, GCC2023 will be a friendly and fun meeting that is also well suited to researchers who would like to learn more about using Galaxy for their own analyses.

Keep in touch with updates about GCC2023: register your interest here.

Read More
Christina Hall Christina Hall

Launching the Australian AlphaFold Service

A new service is now offering researchers the chance to accelerate their research through access to AlphaFold. The Australian AlphaFold Service delivers AlphaFold through Galaxy Australia, offering an easy to use interface coupled with the required computational resources at no cost to Australian researchers.

A new Australian BioCommons service is now offering researchers the chance to accelerate their research through access to AlphaFold. By providing this new capability through Galaxy Australia, the powerful tool is backed by the required computational resources at no cost to Australian researchers. After an intensive period of testing across many fields of biology, access is now open to the new Australian AlphaFold Service.

When the AI system, AlphaFold, was released by DeepMind it promised to reduce painstakingly laborious and time consuming experimental techniques for determining protein structures down to accurate predictions that can take minutes. Researchers jumped at the opportunity to investigate the 3D structure predictions built using only their primary amino acid sequences.

In July 2022, Deepmind partnered with EMBL’s European Bioinformatics Institute (EMBL-EBI) to release the predicted structures for nearly all catalogued proteins known to science, expanding the AlphaFold Database to over 200 million structures. With accuracy rivalling real-life experiments and the results shared freely the impact of the tool on research was felt internationally.  

Galaxy Australia seized the opportunity to facilitate access to the tool for even more researchers. They made AlphaFold 2 available in their easy-to-use interface, and paired it with the GPU clusters required to power AlphaFold.

To design the best possible service, Galaxy Australia worked with a group of beta testers who provided feedback on how to improve the experience for their particular research application. A diverse group of Australian researchers who brought with them a wide range of experience with AlphaFold and Galaxy.

Dr Kate Michie runs the Structural Biology Facility at the Mark Wainwright Analytical Centre, UNSW, and has incorporated AlphaFold into her standard practice. The protein structure prediction speeds up laborious and sometimes frustrating protein crystallisation, saving time and money in the wetlab process.

“As structural biologists, we can use AlphaFold for both designing constructs and solving protein structures. It is quickly becoming the first thing you do, to inform what happens in the lab.”

Kate is currently looking at ways to sustain her ongoing use of AlphaFold in the long term. The computationally intensive jobs are expensive to run, so she was keen to sign up to become a beta tester for the new service. These types of jobs may not run as fast as on a dedicated private facility, but once submitted to the Australian AlphaFold Service they conveniently run in the background at no cost to researchers. 

Another benefit of using AlphaFold in this way comes from the easy to navigate interface that is provided by Galaxy Australia. Like many of the beta testers, Kate is an experienced user already, and found using the tool in Galaxy Australia was easier.

“AlphaFold is enormously helpful for lots of different projects, and they are often led by researchers who are not structural biologists, who come to me asking “what can I understand from this esoteric protein we found in some weird organism?” So in order to really jump things forward and trim down which experiment is likely to work and which one isn’t, I become this mediator who helps interpret what AlphaFold can do for them and run their structures. 

Ideally researchers could be using the Australian AlphaFold Service themselves. It is easy to navigate and it would be awesome if I don’t have to perform the AlphaFold runs for them.”

Dr Frank Sainsbury from the Griffith Institute for Drug Discovery at Griffith University had not used Galaxy Australia before, and was concerned about whether he would have the computational skills to be one of the beta testers for the new AlphaFold service. But with Galaxy Australia removing any requirement to install and compile obscure software, he had such success that he is looking at integrating it into his future research.

“We are looking at how a particular genus of plant viruses evolved to only infect plants. We make virus-like particles in order to determine the structure of viruses and also for drug discovery and biomedical use. AlphaFold was used to check for evidence of a core structural domain of a putative coat protein and the fact that it was there gave us the confidence to go on and make virus-like particles.”

The beta testers also identified other potentially powerful capabilities they would like to see, like using Alphafold to visualise and predict interactions between homo- and hetero- polymer proteins. This functionality is in the Alphafold Multimer tool, so Galaxy Australia is working to make it available in the future. The team is always open to finding new ways to support researchers, in this case to enable even greater insights into the 3D structural relationships between proteins.

If you are an Australian researcher who is keen to use AlphaFold to investigate protein structure, apply for access to the Australian AlphaFold Service now.

The Australian AlphaFold Service is part of Galaxy Australia which is managed by QCIF, Melbourne Bioinformatics and AARNet. The AlphaFold Service is specifically underpinned by scalable computational resources procured from Microsoft Azure. This service is supported by funding from the Queensland Government’s Research Infrastructure Co-investment Fund (RICF), Bioplatforms Australia (BPA) and Australian Research Data Commons (ARDC). BPA and ARDC are enabled by NCRIS.

Read More
Christina Hall Christina Hall

Uncovering the secrets of parasites through AI and Cloud computing

A partnership between Australian BioCommons and Microsoft has resulted in University of Melbourne researchers being able to quickly leverage new methods through a large grant to use the Azure Cloud, with significant implications for the research community worldwide.

Adult Schistosoma haematobium


Adult Schistosoma haematobium. Image credit: Sara Li, NIH NIAID Schistosomiasis Resource Center, Biomedical Research Institute, Rockville, MD.

An exciting novel approach to investigating human and animal parasites is underway. New software and computational capabilities enabling automated structure-based genome annotation are overcoming some of the challenges of working with non-model organisms. A partnership between Australian BioCommons and Microsoft has resulted in University of Melbourne researchers being able to quickly leverage new methods through a large grant to use the Azure Cloud, with significant implications for the research community worldwide.

Conventional, primary sequence-based methods of annotation leave a large proportion of non-model organisms’ genes and proteins “unannotatable” because of the lack of known homologs in other species. But a new world of possibilities opens up by using structure to guide the search using AlphaFold, the AI system that accurately predicts the 3D structure of proteins.

Parasites that cause major human and animal suffering are prime candidates for this new structure-based approach. The research of Dr Neil Young from Prof Robin B Gasser’s laboratory at The University of Melbourne’s Faculty of Veterinary and Agricultural Sciences focuses on a range of socioeconomically important parasites. They are poised to fast track our understanding of how these complex organisms work by utilising AlphaFold to annotate all proteins encoded in the genomes of parasitic flatworms and their intermediate hosts (snails).

AlphaFold 2 is now available via the easy-to-use Galaxy Australia interface, with the platform also providing access to a rich catalogue of computational resources including the GPU clusters required to power AlphaFold. Further, Australian BioCommons has been working to deploy Galaxy Australia on commercial cloud resources, to enable massive scale-up of the platform and access specialised resources. The new Australian AlphaFold Service brings these significant developments together: researchers’ jobs are now running on the Azure Cloud thanks to an Australian BioCommons collaboration with BizData and Microsoft on Azure.

Connecting the Microsoft team with researchers who could immediately get to work, the proposal from the University of Melbourne Team  to perform automated structure-based proteome annotation on a genome-wide scale was granted a massive computational boost. Jobs have already begun capitalising on the generous 12,000-hour A100 GPGPU allocation on Microsoft’s Azure cloud (equivalent to approximately $65,000 AUD). 

By applying new capabilities in structure-based homology, this project will overcome the problems associated with primary sequence-based methods that have long dogged researchers seeking to annotate protein-coding genes and their products in silico. A substantial improvement in the accuracy of the functional annotation of nine parasite proteomes is expected, but this approach can be immediately applied to any other groups of eukaryotic organisms by researchers worldwide.

Read more about the work of Robin B Gasser’s Lab.

All Australian researchers can now access AlphaFold 2 through Galaxy Australia by simply applying through the Australian AlphaFold Service

Australian BioCommons’ Australian AlphaFold Service is part of Galaxy Australia which is managed by QCIF, Melbourne Bioinformatics and AARNet. The AlphaFold Service is specifically underpinned by scalable computational resources procured from Microsoft Azure. This service is supported by funding from the Queensland Government’s Research Infrastructure Co-investment Fund (RICF), Bioplatforms Australia (BPA) and Australian Research Data Commons (ARDC). 


Galaxy Australia is underpinned by computational resources provided by AARNet, Nectar Research Cloud, University of Melbourne, QCIF, National Computational Infrastructure and the Pawsey Supercomputing Research Centre. These efforts are supported by funding from The University of Melbourne, the Queensland Government’s Research Infrastructure Co-investment Fund (RICF), Bioplatforms Australia (BPA) and Australian Research Data Commons (ARDC). BPA and ARDC are enabled by NCRIS.

Read More
Christina Hall Christina Hall

Join a community of data wranglers: large and complex research datasets in the cloud

To ensure bioscientists have access to essential data and tools, we've partnered with the Center for Translational Data Science, University of Chicago, USA. Working with the team behind the open-source data platform, Gen3, several data commons are currently being tailor-made for use by large numbers of Australian researchers working across national consortia. We're co-hosting the inaugural online Gen3 Community Forum 11-13 Oct and you're invited!

The ability of biologists to manage, analyse and share research data is key to making new discoveries and understanding the molecular basis of life. To ensure bioscientists have access to essential data and tools, Australian BioCommons has partnered with the Center for Translational Data Science, University of Chicago, USA. Working with the team behind the open-source data platform, Gen3, several data commons are currently being tailor-made for use by large numbers of Australian researchers working across national consortia.

The Gen3 platform consists of open-source software services that support the emergence of healthy data ecosystems by enabling the interoperation and creation of cloud-based data resources, including data commons and analysis workspaces. Gen3 aims to accelerate and democratise the process of scientific discovery by making it easy to manage, analyse, harmonise, and share large and complex datasets in the cloud.

Gen3 is now a key component of the large-scale translational medicine data ecosystem in the USA, having been used to build over 15 data commons by organisations including several NIH Institutes and Centers. Australian BioCommons is also supporting multiple deployments, helping to establish new Gen3 data commons for management and secure sharing of human omics data holdings between research collaborators. One project that supports Australian cardiovascular research is designing a system for the Precision Medicine Flagship of the Australian Cardiovascular Alliance (ACvA) that allows data to be received, managed, described, quality controlled and shared with authorised and authenticated individuals. Another project resulted in a webinar featuring partners from The University of Melbourne Centre for Cancer Research and the Zero Childhood Cancer Program who shared their experiences and progress towards Establishing Gen3 to enable better human genome data sharing in Australia.

With Gen3 use spreading globally, there is a demand to coalesce shared knowledge and activities into a community. A new Gen3 Community will meet for the first time at a virtual forum co-hosted by the University of Chicago and the Australian BioCommons. Everyone is invited to the Gen3 Community Forum 2022, taking place online 4-7 p.m. 10-12 October (CDT) / 8-11 a.m. 11-13 October (AEDT).

The inaugural Gen3 Community Forum will share knowledge about Gen3, its architecture, and the Gen3 roadmaps and priorities; strengthen the connection between the core team and those developing, operating and using Gen3 platforms; design a set of ongoing community engagement activities; and discuss and agree on key shared development priorities between the Gen3 core team and the community. The first day will showcase different examples of Gen3 data commons, and participants are welcome to attend some or all sessions.

Further details of the program and free registration is now available.

Read More
Galaxy Australia news Christina Hall Galaxy Australia news Christina Hall

Data management solutions designed for Australian bioscience

Life sciences researchers are benefiting from the nifty integration that allows secure data transfer between Galaxy Australia and CloudStor. The streamlined movement of data between the two freely-available platforms demonstrates how collaboration between services are meeting the needs of Australian life science researchers.

Life sciences researchers are benefiting from the nifty integration that allows secure data transfer between Galaxy Australia and CloudStor. The streamlined movement of data between the two freely-available platforms demonstrates how collaboration between services are meeting the needs of Australian life science researchers.

The 20,500 registered users of the Galaxy Australia data analysis platform can now easily and securely move their data to and from the AARNet CloudStor research data storage platform. The integration was introduced to allow users of Galaxy Australia to more easily share data with colleagues and collaborators through CloudStor. The ability to seamlessly move data into Galaxy for analysis and back out for long term storage avoids messy local uploads and downloads. 

This powerful combination of secure scalable cloud storage with a ready-to-use analysis platform comprising all the necessary compute, tools and workflows is easily accessed. CloudStor can be linked to a Galaxy user's account through a simple configuration, with data security at both CloudStor and Galaxy. Users simply navigate to their folders and files and import data directly through Galaxy’s Upload Data / Choose remote files option. Similarly, when an analysis is finished, Galaxy users simply invoke the “Export datasets to remote files source” where they can select single, multiple or dataset collections for export to CloudStor.

If you are interested in using the Galaxy Australia and CloudStor integration, further information is available to support you to make the connection.

Galaxy Australia is an Australian BioCommons service, jointly supported by the Australian Government’s National Collaborative Research Infrastructure Strategy (NCRIS) through the Australian Research Data Commons and Bioplatforms Australia; the Queensland Government’s Research Infrastructure Co-investment Fund; and The University of Melbourne.

Managed by QCIF, Melbourne Bioinformatics and AARNet, Galaxy Australia is underpinned by computational resources provided by AARNet, the ARDC, The University of Melbourne, The University of Queensland, QCIF, National Computational Infrastructure, and the Pawsey Supercomputing Centre.

The BioCommons BYOD [Bring Your Own Data] Expansion Project received investment (doi.org/10.47486/PL105) from the Australian Research Data Commons (ARDC). The ARDC is funded by the National Collaborative Research Infrastructure Strategy.

Read More
Galaxy Australia news Christina Hall Galaxy Australia news Christina Hall

Popularity of Galaxy Australia prompts move of core services to AARNet

Galaxy Australia has moved its head node and associated services to Australia’s Academic and Research Network (AARNet) to allow increased on-demand support for more users. This move provides a long-term, high-performing and reliable hosting environment for Galaxy Australia infrastructure.

As a key web-based platform for bioinformatics analysis in Australia, Galaxy Australia is focused on maintaining a robust front-end web presence with the scalable capacity and high performance expected by researchers. A continuous improvement approach is in place to ensure the needs of a growing cohort of researchers registering for the service are met. 

Galaxy Australia has moved its head node and associated services to Australia’s Academic and Research Network (AARNet) to allow increased on-demand support for more users. This move provides a long-term, high-performing and reliable hosting environment for Galaxy Australia infrastructure. Importantly, the move also enables the Pawsey Supercomputing Research Centre (Pawsey), a tier one national infrastructure available to all researchers in Australia, to focus on providing back-end compute services to power Galaxy Australia’s more than 1,800 installed tools, covering genomics, proteomics and metabolomics, statistics and data visualisations.

AARNet is a national resource owned by Australian universities and national science agency CSIRO and has provided ultra-high-speed telecommunications and collaboration services specifically for research and education for more than three decades. A trusted sector partner renowned for an exceptionally high level of service delivery, AARNet will provide Galaxy Australia with 24/7 operational monitoring and response services, seamless network configuration and failover management, and the hardware capacity to support user and data growth projections.

With the AARNet team taking care of all the front-end physical infrastructure operations, the Galaxy Australia team can focus on using computational resources at Pawsey, University of Melbourne, QCIF, and Azure to meet the growing needs of the more than 19,500 registered users of the service. 

Prior to the deployment to AARNet, an integrated team working across AARNet, Pawsey, Queensland Cyber Infrastructure Foundation (QCIF) and Melbourne Bioinformatics undertook many months of exhaustive preparation and testing. This all paid off, with little service downtime experienced during the deployment and Galaxy Australia jobs now running successfully from AARNet. 

We are delighted to be providing a high-performing long-term hosting solution that will support the growth and development of Galaxy Australia and help life sciences researchers with their important work. This is a great example of how AARNet works closely with sector partners to solve complex technical problems with infrastructure and make it easier for researchers to analyse data and collaborate.

Chris Hancock, AARNet CEO

AARNet joins QCIF and Melbourne Bioinformatics in the collective responsibility for the management of the Australia BioCommons’ Galaxy Australia service. 

The move to AARNet means our existing and new users will experience fast response times across all aspects of their Galaxy experience - homepage loading, history refreshes, and workflow execution to name a few. On top of the performance improvements, we add new tools weekly, have annotated tools to aid in discovery, and updated our support options. If it's been a while since you last visited Galaxy Australia I recommend coming back for a visit.

Gareth Price, Galaxy Australia’s Science Lead

Galaxy Australia is an Australian BioCommons service, jointly supported by the Australian Government’s National Collaborative Research Infrastructure Strategy (NCRIS) through the Australian Research Data Commons and Bioplatforms Australia; the Queensland Government’s Research Infrastructure Co-investment Fund; and The University of Melbourne.

Managed by QCIF, Melbourne Bioinformatics and AARNet, Galaxy Australia is underpinned by computational resources provided by AARNet, the ARDC, The University of Melbourne, The University of Queensland, QCIF, National Computational Infrastructure, and the Pawsey Supercomputing Centre.

The BioCommons BYOD [Bring Your Own Data] Expansion Project received investment (doi.org/10.47486/PL105) from the Australian Research Data Commons (ARDC). The ARDC is funded by the National Collaborative Research Infrastructure Strategy.

This announcement was co-published with AARNet.

Read More
Christina Hall Christina Hall

New reports provide insights into what's needed for better sharing of human genomics research data in Australia

Insights from the Human Genomes Platform Project (HGPP) knowledge discovery phase have recently been published online. This ensures that knowledge is shared with others who will follow on the complicated path of sharing human genomics research data.

The HGPP will enhance capability for securely and responsibly sharing human genomics research data to maximise the utility of these sensitive and valuable assets.

Insights from the Human Genomes Platform Project (HGPP) knowledge discovery phase have recently been published online. This will ensure that knowledge is shared with others who will follow on the complicated path of sharing human genomics research data. Updates will be shared publicly throughout the life of the project, and the most recent reports detail the project’s Discovery Phase.

Representing many of the largest human genome sequencing and analysis efforts in Australia, the HGPP team is working with global peers to learn from other initiatives with similar ambitions to share sensitive research data. Consulting GA4GH and ELIXIR’s standards to better understand the international contexts, the project has described Australia’s own unique requirements. 

At the heart of any technology platform is Identity and Access Management (IAM): a collection of standards, policies and technologies that enable a platform to determine whether to permit access to a user. In a federated environment IAM is the glue that enables loosely coupled systems to establish strong trust relationships for the purposes of data sharing. Trust relies on technologies such as cryptography but also on coordinated policies outlining shared expectations between federation participants.

The initial focus of the Federated IAM sub-project team was a discovery and recording phase to define the current state of identity and access management in the community, the set of problems that need to be addressed, and key stakeholders and their (likely) requirements. To learn more about the current state of processes and tools for identity and access management across the community, national community needs, gap analysis, and identification of international projects with components suitable to canvas and potentially pilot, read the Human Genomes Platform Project: Federated Identity and Access Management (IAM) Discovery Phase Report.

A major challenge for human genome data sharing is navigating restrictions on secondary use. Decisions on granting access to data require significant effort by Data Access Committees (DAC). This manual approach is slow and burdensome. The aims of the DAC Automation sub-project are to explore semi-automated data access request and approval processes for the national human genome research community. For an Australian genomics federation to be successful, widespread adoption of new processes and systems will be needed. To learn more about the current state of relevant processes and tools, national community needs, gap analysis, and identification of international projects with potential solution components for piloting in later project stages, read the Human Genomes Platform Project: DAC Automation Discovery Phase Report.


The Human Genomes Platform Project is a partnership between Australian Access Federation, Australian BioCommons, Australian Genomics, Garvan Institute of Medical Research, National Computational Infrastructure, QIMR Berghofer Medical Research Institute, University of Melbourne Centre for Cancer Research and Zero Childhood Cancer. The project forms part of the Australian BioCommons’ Human Genome Informatics  theme and is funded through NCRIS via the Australian Research Data Commons (https://doi.org/10.47486/PL032) and Bioplatforms Australia.

Read More