Global genomic data sharing enabled by purpose-built platform

A new secure data sharing platform, called SeqHaven, is facilitating the transfer of large volumes of human genomics research data in Australia. Delivered by the QIMR Berghofer Genome Informatics and Medical Genomics teams as part of the GUARDIANS program of work, the new service has streamlined the way massive genomic datasets are shared.

What does the data sharing platform do?

The platform is a dedicated, scalable system that provides an environment for safely and efficiently transferring large volumes of genomic data generated by QIMR Berghofer's researchers. Significantly, it recently shared the first major dataset using technology built for GUARDIANS, and will go on to underpin the sharing of resources such as the International Cancer Genome Consortium’s (ICGC) datasets on melanoma and pancreatic cancers. 

Stock image of dna and networks

The tailored infrastructure is helping to relieve a technical bottleneck in collaborative research, which has enabled QIMR Berghofer to share data with collaborators in Australia and globally. As with all components of the national GUARDIANS project, strict privacy and confidentiality requirements have been a priority. Access and data governance are supported by the QIMR Berghofer Data Access Committee, who ensure responsible transfer of genomic data that always respects the wishes of research participants.

The platform uses Globus to securely transfer large files between research organisations in Australia and overseas, enabling genomic research data to be made available to verified researchers via controlled access. This demonstration of a scalable solution to a common challenge used AARNet’s Globus service to share large-scale data across organisational boundaries and international borders.

How will this impact researchers?

The ability to move large volumes of data without compromising security is an integral requirement for genomics. Increased data sharing improves statistical power, supports new insights from existing datasets, and promotes collaboration across institutions to accelerate discovery and improve health outcomes. The new platform allows researchers to leverage valuable datasets, with data custodians being confident that rigorous data governance policies are in place.

‘The SeqHaven platform is a game-changer in how we, as researchers, manage collaborations and data sharing,’ says John Pearson, Project Lead at QIMR Berghofer. ‘It allows us to move big data while protecting our systems and, ultimately, our participants.’

This sentiment is echoed by BioCommons A/Director (Human Genome Informatics) and GUARDIANS program lead, Prof Bernie Pope:

‘This national collaboration is a prime example of how the GUARDIANS program is uplifting our national research infrastructure to help researchers easily discover, access and analyse human genomics research data.’’

Learn more

QIMR Berghofer welcomes further enquiries from the Australian genomics community about sharing their data by contacting the QIMR Berghofer Data Access Committee.

You can hear more from John Pearson, Manager, Genome Informatics Group, QIMR Berghofer Medical Research Institute, about the development of the SeqHaven platform and the integration of Globus software, along with other life science applications in the upcoming BioCommons webinar: Globus: moving large volumes of research data with ease.

Next
Next

Key molecular structure prediction datasets now available in the new Structural Biology AI Reference Collection